Jun 9 – 12, 2026
Fluno Center on the University of Wisconsin-Madison Campus
America/Chicago timezone

Auto-regulating input files that save time when jobs fail

Jun 11, 2026, 1:55 PM
15m
Howard Auditorium (Fluno Center on the University of Wisconsin-Madison Campus)

601 University Avenue, Madison, WI 53715-1035

Speaker

Benjamin FitzGerald (University of Wisconsin-Madison)

Description

I use CHTC and the OSPool to run a rainfall frequency analysis pipeline across 200+ watersheds as part of work for FEMA's National Flood Insurance Program. DAGMan was essential for automating job submission and output processing at this scale, but managing pipeline failures proved challenging: every daily precipitation file had to be properly analyzed before downstream steps could proceed. A fail-and-retry approach led to redundant computation, while simply continuing risked carrying missing or corrupted data forward into later analyses. To address this, I developed submit files that read dynamically updated file lists, which shrink as outputs are confirmed present. This prevents the pipeline from advancing with incomplete data while avoiding redundant reprocessing of files that completed successfully. I could also focus more on the pipeline itself if that would be preferred. I am available to present in person on 6/10 or 6/11.
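The idea of a file list that shrinks as outputs are confirmed can be sketched roughly as follows. This is an illustrative sketch only, not the speaker's actual implementation: the file names, the ".out" suffix, the rule that an empty output counts as missing, and the helper names (expected_output, remaining_inputs, write_file_list) are all assumptions made for the example.

```python
from pathlib import Path


def expected_output(input_name, output_dir, suffix=".out"):
    """Map an input file name to its output path (hypothetical naming rule)."""
    return Path(output_dir) / (Path(input_name).stem + suffix)


def remaining_inputs(all_inputs, output_dir, suffix=".out"):
    """Return the inputs whose output is missing or empty, preserving order."""
    todo = []
    for name in all_inputs:
        out = expected_output(name, output_dir, suffix)
        # Assumption: a zero-byte output is treated as a failed job.
        if not out.is_file() or out.stat().st_size == 0:
            todo.append(name)
    return todo


def write_file_list(names, list_path):
    """Write one input name per line and return the number still pending."""
    Path(list_path).write_text("".join(f"{name}\n" for name in names))
    return len(names)
```

Under these assumptions, an HTCondor submit file could consume the regenerated list with a statement such as "queue infile from remaining.txt", so that a resubmission only covers inputs without a confirmed output. A count of zero from write_file_list would then be the signal that downstream steps can safely proceed.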

Author

Benjamin FitzGerald (University of Wisconsin-Madison)
