Astroturf

Analysis request

OSHA — COVID-19 Vaccination and Testing Emergency Temporary Standard

req_cl42yt042 / created 6/1/2026, 9:08:15 PM

Databricks Jobs mode | draft

Rulemaking Metadata

Docket ID: OSHA-2021-0007
Agency ID: OSHA
Topic: workplace_safety
Data Source: regulations_gov
Expected Scale: ~122,124 comments
Date Window: Full Historical Ingestion

Notes / Reviewer Context

"Queued from consumer search. Query: "workers and jobs: OSHA — COVID-19 Vaccination and Testing Emergency Temporary Standard"."

Status & Control Plane

Command-Generation Mode

This analysis request is registered. To trigger local comment ingestion, parsing, embedding, and clustering manually, run the terminal sequence shown below, or register via the Local Ingestion trigger.

Databricks Workspace Integration: Offline Command-Generation mode. No hosted Databricks run ID is mapped.

To run this pipeline locally instead of on hosted Databricks, run the following commands in order from your terminal:

.uv-test-venv\Scripts\python.exe scripts\run_ingestion.py --docket-id OSHA-2021-0007
.uv-test-venv\Scripts\python.exe scripts\run_embedding.py --docket-id OSHA-2021-0007 --backend databricks
.uv-test-venv\Scripts\python.exe scripts\run_clustering.py --docket-id OSHA-2021-0007 --clustering-mode vector_search

Command-generation mode runs comment ingestion and clustering locally via Python scripts, which write directly to your local Delta lakehouse.
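
The three commands above can also be chained from a small Python wrapper, so that a failed step stops the sequence before the next one starts. This is a minimal sketch, not part of the tool: the helper names build_commands and run_pipeline are hypothetical, the interpreter and script paths are copied verbatim from the commands above, and it assumes each script exits with a non-zero status on failure, which this page does not confirm.

```python
import subprocess

# Interpreter and script paths copied from the generated commands (Windows layout).
PYTHON = r".uv-test-venv\Scripts\python.exe"


def build_commands(docket_id):
    """Return the three pipeline steps, in order, as argument lists."""
    return [
        [PYTHON, r"scripts\run_ingestion.py", "--docket-id", docket_id],
        [PYTHON, r"scripts\run_embedding.py", "--docket-id", docket_id,
         "--backend", "databricks"],
        [PYTHON, r"scripts\run_clustering.py", "--docket-id", docket_id,
         "--clustering-mode", "vector_search"],
    ]


def run_pipeline(docket_id, runner=subprocess.run):
    """Run each step in order and stop at the first failure.

    Assumption: the scripts signal failure with a non-zero exit code.
    With check=True, subprocess.run raises CalledProcessError in that case,
    so later steps are not started.
    """
    for cmd in build_commands(docket_id):
        runner(cmd, check=True)
```

Calling run_pipeline("OSHA-2021-0007") from the repository root would run ingestion, embedding, and clustering in that order. The runner parameter exists only so the sequence can be exercised without launching the real scripts.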