SECTOR ANALYSIS
Climate / Oil & Gas / Methane
Exact-hash duplicate baseline for EPA methane comments, with semantic clustering still pending.
Registered Dockets1
Clusters Surfaced7
Comments Checked396
Coverage StatusBaseline only; semantic clustering queued
Exact-hash duplicate detection is complete
EPA methane has a bounded exact-string baseline: 396 parsed rows, 7 duplicate-hash clusters, 16 memberships, and largest cluster size 4. Semantic clustering is queued and should not be implied.
.uv-test-venv\Scripts\python.exe scripts\run_clustering.py --docket-id EPA-HQ-OAR-2021-0317 --clustering-mode vector_search