Analysis request
DEA — Schedule of Controlled Substances: Rescheduling of Marijuana
req_l7v8p31x8 / created 6/1/2026, 5:25:50 PM
Databricks Jobs modedraft
Rulemaking Metadata
Docket IDDEA-2024-0059
Agency IDDEA
Topicdrug_policy_scheduling
Data Sourceregulations_gov
Expected Scale~42923 comments
Date WindowFull Historical Ingestion
Notes / Reviewer Context
"Queued from consumer search. Query: "drug pricing"."
Status & Control Plane
Command-Generation Mode
This analysis request is registered. Use the terminal sequence displayed on the right to trigger local comment ingestion, parsing, embedding, and clustering manually, or register via the Local Ingestion trigger.
Databricks Workspace IntegrationOffline Command-Generation mode. No hosted Databricks run ID mapped.
Command-Generation Mode
If you want to run this pipeline locally on your system instead of hosted Databricks, run the following sequence in your terminal:
.uv-test-venv\Scripts\python.exe scripts\run_ingestion.py --docket-id DEA-2024-0059 .uv-test-venv\Scripts\python.exe scripts\run_embedding.py --docket-id DEA-2024-0059 --backend databricks .uv-test-venv\Scripts\python.exe scripts\run_clustering.py --docket-id DEA-2024-0059 --clustering-mode vector_search
Command-generation mode allows running comment ingestion and clustering locally via python scripts, writing directly to your local delta lakehouse.