-
Michael Pelletier6/5/25, 1:30 PM
While the DaemonStartTime and MonitorSelfAge attributes of HTCondor daemons provide a slice of insight as to the uptime and availability of the service, they're not well-suited for tracking longer-term up/down-time stats over the course of days, weeks, or months.
One illustration of this limitation is that if a malfunctioning node or service restarts every five minutes, the values are reset...
Go to contribution page -
Tim Theisen (UW-Madison CHTC)6/5/25, 1:55 PM
-
Igor Sfiligoi (University of California San Diego), Jaime Frey (Center for High-Throughput Computing)6/5/25, 2:20 PM
HTCondor is the leading system for building a dynamic overlay batch scheduling system on resources managed by any scheduling system, by means of glideins. One fundamental property of these setups is the use of late binding of containerized user workloads. From a resource provider point of view, a compute resource is thus claimed before the user container image is selected. Kubernetes allows...
Go to contribution page
Choose timezone
Your profile timezone: