Astroturf
explorelearn moreadvanced

Analysis request

Hospital Price Transparency

req_iy0wlac8x / created 6/1/2026, 6:46:57 PM

Databricks Jobs modefailed

Rulemaking Metadata

Docket IDCMS-2019-0193
Agency IDCMS
Topichealth-care
Data Sourceregulations_gov
Expected Scale~5000 comments
Date WindowFull Historical Ingestion
Notes / Reviewer Context

"Smart batch ingestion. Health-care pricing topic; good consumer-facing contrast to telecom and climate."

Status & Control Plane

Execution Failure:

Task main_analysis_job failed with message: Workload failed, see run output for details.

Databricks Workspace Integration
Active Databricks Run ID: 201629707179163

Command-Generation Mode

If you want to run this pipeline locally on your system instead of hosted Databricks, run the following sequence in your terminal:

.uv-test-venv\Scripts\python.exe scripts\run_ingestion.py --docket-id CMS-2019-0193
.uv-test-venv\Scripts\python.exe scripts\run_embedding.py --docket-id CMS-2019-0193 --backend databricks
.uv-test-venv\Scripts\python.exe scripts\run_clustering.py --docket-id CMS-2019-0193 --clustering-mode vector_search

Command-generation mode allows running comment ingestion and clustering locally via python scripts, writing directly to your local delta lakehouse.