Over the past week or so, I've been prototyping components of the end-to-end DRC pipeline based on interesting discussions about the KPF and UHE pipelines.

My goal with this prototype was to create a sort of straw man with a specific set of technologies and architectural choices to ground our planning/design of the production PANOSETI DRC pipeline.

I'll briefly describe this prototype system for reference:

3-Server Cluster in the RAL @ Berkeley

Capture d’écran 2025-09-07 à 20.19.21.png

**BeeGFS Distributed Filesystem**

Unified Data Directory: /mnt/beegfs/data

Capture d’écran 2025-09-07 à 20.19.21.png

**Dask + Zarr --> mini-HPC Cluster**

Capture d’écran 2025-09-07 à 20.24.06.png

Zarr + xarray data model

Simple End-to-end Tests