Juriscraper

Juriscraper is a Python library, maintained by the Free Law Project, that scrapes metadata and documents from U.S. federal and state court websites. It powers the ingestion pipeline behind CourtListener, normalizing the widely varying layouts of court sites into a consistent interface for opinions, oral-argument audio, and PACER (federal filing) data.

Key features

Scrapers for hundreds of state and federal appellate and trial courts
Unified output for opinions, oral arguments, and PACER dockets/documents
Built-in politeness: caching, rate awareness, and change detection
Extensible base classes make adding new court scrapers straightforward
Powers one of the largest open archives of U.S. court data
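The change detection named in the feature list can be illustrated with a content hash: if the digest of a freshly downloaded page matches the one stored from the previous run, nothing new needs parsing. This is a minimal sketch under that assumption, not Juriscraper's actual implementation; `content_hash` and `has_changed` are hypothetical helper names.

```python
import hashlib
from typing import Optional


def content_hash(body: bytes) -> str:
    """Return a SHA-256 hex digest of a downloaded page body."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, last_hash: Optional[str]) -> bool:
    """Report whether the page differs from the previously stored digest.

    A missing digest (first run) always counts as a change.
    """
    return content_hash(body) != last_hash


# Illustrative usage with made-up page bodies.
previous = content_hash(b"<html>opinions v1</html>")
print(has_changed(b"<html>opinions v1</html>", previous))
print(has_changed(b"<html>opinions v2</html>", previous))
```

Skipping unchanged pages this way also serves the politeness goal, since downstream work is done only when a court site has actually published something new.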

Each court is exposed as a module that you invoke to fetch the latest cases. The module returns normalized records (case name, date, citation, download URL) ready for storage or analysis.
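The pattern described above, one scraper per court built on a shared base class and emitting uniform records, might look roughly like the following. It is a self-contained sketch that uses canned data instead of network access. Every name in it (`CaseRecord`, `BaseCourtSite`, `ExampleCourtSite`, `fetch_rows`) is hypothetical and chosen for illustration; Juriscraper's real class and attribute names may differ.

```python
from dataclasses import asdict, dataclass
from datetime import date
from typing import Dict, Iterator, List


@dataclass(frozen=True)
class CaseRecord:
    """Normalized record with the fields named in the text above."""
    case_name: str
    date_filed: date
    citation: str
    download_url: str


class BaseCourtSite:
    """Hypothetical base class: subclasses supply raw rows, the base normalizes them."""

    def __init__(self) -> None:
        self.records: List[CaseRecord] = []

    def fetch_rows(self) -> List[Dict[str, str]]:
        """Return raw rows for one court; each subclass overrides this."""
        raise NotImplementedError

    def parse(self) -> "BaseCourtSite":
        """Convert raw rows into normalized records and return self."""
        self.records = [
            CaseRecord(
                case_name=" ".join(row["name"].split()),  # collapse stray whitespace
                date_filed=date.fromisoformat(row["date"]),
                citation=row["cite"].strip(),
                download_url=row["url"],
            )
            for row in self.fetch_rows()
        ]
        return self

    def __iter__(self) -> Iterator[CaseRecord]:
        return iter(self.records)


class ExampleCourtSite(BaseCourtSite):
    """Made-up court scraper; a real one would download and parse a web page."""

    def fetch_rows(self) -> List[Dict[str, str]]:
        # Canned, fictional row standing in for scraped HTML.
        return [
            {
                "name": "  Doe   v. Roe ",
                "date": "2024-01-15",
                "cite": " 1 Ex. 1 ",
                "url": "https://example.com/opinions/1.pdf",
            }
        ]


for record in ExampleCourtSite().parse():
    print(asdict(record))
```

Keeping normalization in the base class means a new court only has to describe how to obtain its raw rows, which is one way the "extensible base classes" feature could pay off.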

This is a curated mirror of the open-source Juriscraper project (BSD-2-Clause). Get the library from the upstream source.
