Recent technological advances have revealed an enormous diversity of lifeforms by sequencing their genomes. There are now millions of available genomes, each comprising thousands of genes. The universe of newly discovered genes is expanding far faster than our ability to study them in the laboratory. Here, I will present how high-throughput computing is unlocking the function of novel genes...
Panel Discussion led by Miron Livny.
NSF NCAR's labs and programs collectively cover a breadth of research topics in Earth system science, from the effects of the Sun on Earth's atmosphere to the role of the ocean in weather and climate prediction, as well as supporting and training the next generation of Earth system scientists. However, with the current legacy 'download and analyze' model followed by most of our remote users,...
The Monitoring Infrastructure for Network and Computing Environment Research (MINCER) project aims to provide a foundation for in-depth insight and analysis of distributed heterogeneous computing environments, supporting and enhancing research and education in computer and network systems. Our approach is to work in conjunction with the Open Science Grid (OSG) by providing a set of MINCER...
The Algorithms Research and Development Group (ARDG) at the National Radio Astronomy Observatory (NRAO) has been using HTCondor and compute resources from the Open Science Grid (OSG) to improve throughput in radio astronomy imaging by up to two orders of magnitude in single imaging workflows, and we are now working to extend these imaging capabilities to...
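As a rough illustration of the fan-out pattern such imaging workflows rely on (this is not ARDG's actual pipeline; the script names and chunk count are assumptions), a partitioned imaging run can be expressed with HTCondor's Python DAG API:

    import htcondor
    from htcondor import dags

    # Hypothetical worker: grid one chunk of visibilities per node.
    grid = htcondor.Submit({
        "executable": "grid_chunk.sh",   # assumed script name
        "arguments": "$(chunk)",
        "output": "grid_$(chunk).out",
        "error": "grid_$(chunk).err",
        "log": "imaging.log",
    })
    # Hypothetical reducer: combine gridded chunks into one image.
    combine = htcondor.Submit({
        "executable": "combine.sh",      # assumed script name
        "output": "combine.out",
        "error": "combine.err",
        "log": "imaging.log",
    })

    dag = dags.DAG()
    grid_layer = dag.layer(
        name="grid",
        submit_description=grid,
        vars=[{"chunk": str(i)} for i in range(100)],  # assumed 100 chunks
    )
    grid_layer.child_layer(name="combine", submit_description=combine)
    dags.write_dag(dag, "imaging_dag")
    # Run with: condor_submit_dag imaging_dag/dagfile.dag

Because the gridding tasks are independent, the throughput gain scales with the number of slots the overlay pool can acquire.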
Computational notebooks have become a critical tool of scientific discovery by wrapping code, results, and visualization into a common package. However, moving notebooks between facilities is not easy: complex workflows require precise software stacks, access to large datasets, and substantial backend computational resources. The Floability project aims to connect these...
The Event Workflow Management System (EWMS) enables previously impractical scientific workflows by transforming how HTCondor is used for massively parallel, short-runtime tasks. This talk explores what’s now possible from a user’s perspective. Integrated into IceCube’s Realtime Alert pipeline and powered by OSG’s national-scale compute resources, EWMS’s debut application delivers directional...
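For context, the conventional way to fan short tasks into HTCondor is one job per task via the Python bindings; a minimal sketch is below (the per-event script and resource requests are illustrative assumptions). The per-job scheduling overhead of this pattern is exactly what makes very short runtimes impractical, which is the gap EWMS targets.

    import htcondor

    # One HTCondor proc per event: illustrative only, not the EWMS API.
    sub = htcondor.Submit({
        "executable": "reconstruct.sh",   # assumed per-event task
        "arguments": "$(event_id)",
        "output": "out/$(event_id).out",
        "error": "err/$(event_id).err",
        "log": "events.log",
        "request_cpus": "1",
        "request_memory": "512MB",
    })
    schedd = htcondor.Schedd()
    result = schedd.submit(
        sub,
        itemdata=iter([{"event_id": str(i)} for i in range(10_000)]),
    )
    print("submitted cluster", result.cluster())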
The MIT Tier-2 computing center, established in 2006, has been a long-standing contributor to CMS computing. As hardware ages and computing demands evolve, we are undertaking a major redesign of the center’s infrastructure. In this talk, we present a holistic cost analysis that includes not only hardware purchases but also power consumption, cooling, and rack space—factors often excluded from...
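As a toy version of such a holistic cost model (every number below is made up for illustration and is not an MIT Tier-2 figure), a per-core-year cost folds amortized hardware, power, cooling overhead, and rack space into a single comparable quantity:

    # Hypothetical cost model sketch; all parameters are assumptions.
    def cost_per_core_year(hw_cost, cores, lifetime_yr, watts,
                           usd_per_kwh, cooling_overhead, rack_cost_yr):
        # Annual electricity, inflated by a cooling overhead factor.
        power_yr = watts / 1000 * 8760 * usd_per_kwh * (1 + cooling_overhead)
        # Amortized hardware + power/cooling + rack space, per year.
        total_yr = hw_cost / lifetime_yr + power_yr + rack_cost_yr
        return total_yr / cores

    print(cost_per_core_year(hw_cost=8000, cores=128, lifetime_yr=5,
                             watts=700, usd_per_kwh=0.12,
                             cooling_overhead=0.4, rack_cost_yr=300))

A model of this shape makes it easy to see when longer hardware lifetimes or denser nodes dominate the purchase price.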
While there are perhaps hundreds of petabytes of datasets available to researchers, instead of swimming in seas of data they often find themselves sitting in a data desert: there is a mismatch between what sits in carefully curated repositories around the world and what is accessible at the computational resources available locally. The Pelican Project (https://pelicanplatform.org/) aims to...
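One sketch of the intended access pattern, assuming the pelicanfs fsspec client (the federation URL and object path below are placeholders): a client reads a federated object by name instead of pre-staging it locally.

    from pelicanfs.core import PelicanFileSystem  # assumes the pelicanfs package

    # Discover the federation's services, then read an object by path;
    # both the federation URL and the object path are placeholders.
    pelfs = PelicanFileSystem("pelican://osg-htc.org")
    data = pelfs.cat("/example/namespace/dataset.bin")
    print(len(data), "bytes")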
Experiences with running tape storage systems at ATLAS and CMS Tier-2s
We present the initial design and proposed implementation for a series of long-baseline, distributed inference experiments leveraging ARA, a platform for advanced wireless research that spans approximately 500 square kilometers near Iowa State University, including campus, the City of Ames, local research and producer farms, and neighboring rural communities in central Iowa. These...
Discuss tools and options for the next capacity challenge.
Discuss plans for SENSE/Rucio testing for USATLAS/USCMS
Discuss capacity and capability challenges. Which are we interested in pursuing? Who will participate? When to schedule?
Quick overview of AI/ML in WLCG so far
Needs for AI/ML for Infrastructure AND Infrastructure for AI/ML
Funding opportunities
Next steps: areas of common interest/effort?
HTCondor is the leading system for building a dynamic overlay batch scheduling system on resources managed by any scheduling system, by means of glideins. One fundamental property of these setups is the use of late binding of containerized user workloads. From a resource provider point of view, a compute resource is thus claimed before the user container image is selected. Kubernetes allows...
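To make the late-binding point concrete: in a glidein overlay pool the user's container choice travels with the job and is only acted on after a glidein has already claimed the resource. A minimal sketch of such a containerized user job via the Python bindings (the image name and script are assumptions):

    import htcondor

    # Containerized user job: the image is named at submit time but
    # only pulled on the matched resource, after the glidein claim.
    sub = htcondor.Submit({
        "universe": "container",
        "container_image": "docker://library/python:3.12",  # assumed image
        "executable": "analyze.sh",                          # assumed script
        "output": "analyze.out",
        "error": "analyze.err",
        "log": "analyze.log",
        "request_cpus": "1",
        "request_memory": "2GB",
    })
    cluster = htcondor.Schedd().submit(sub).cluster()

From the provider's perspective this ordering is the crux: Kubernetes, by contrast, expects the image to be known when the pod is created.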
Shared development and prototyping for AFs?
Storage technologies to support AFs (Carlos Gamboa?, 10 minutes)
Joint AFs: Can we “share” AFs between experiments: Belle II and ATLAS, or DUNE and CMS, etc.? (Hiro/Ofer/Lincoln?)
Can we agree on a minimum baseline for AFs?
Standard “login”
New requirements from HEP data analysis include limited access to login nodes, resources provided through the experiments' programs rather than the login nodes, and efficient data access for collaborative workflows. We have developed the Interactive aNalysis worKbench (INK), a web-based platform leveraging the HTCondor cluster. INK transforms traditional batch-processing resources into a...
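An illustrative sketch, not INK's actual implementation, of how a web backend can turn batch slots into interactive sessions: submit a notebook server as an HTCondor job on behalf of the authenticated user. Every name below is an assumption.

    import htcondor

    def launch_session(user: str) -> int:
        """Submit an interactive notebook server as a batch job;
        hypothetical helper, not part of INK."""
        sub = htcondor.Submit({
            "executable": "/usr/bin/jupyter",            # assumed path
            "arguments": "notebook --no-browser --port=8888",
            "request_cpus": "2",
            "request_memory": "4GB",
            "accounting_group_user": user,
            "output": f"{user}.out",
            "error": f"{user}.err",
            "log": f"{user}.log",
        })
        # Returns the cluster id the web frontend can track.
        return htcondor.Schedd().submit(sub).cluster()

The design point is that the session consumes an ordinary batch slot, so interactive and batch use draw from the same accounted pool.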
I describe the process of deploying the National Data Platform Endpoint (formerly Point of Presence / POP) on local infrastructure to provide a data streaming service for a published science dataset whose data origin is located in Hawaii. From the perspective of a software engineer, I will cover the process of deploying the endpoint into a Kubernetes cluster or using Docker Compose. I will...