Welcome to the Wiki. Edit at will!

If you are helping write scrapers, claim the one you're working on at our list of [Court Websites](Court Websites). If you get stuck, you might want to look at the notes on that page about other courts that are complete.

And here is a cheatsheet for developing scrapers and using mercurial if you're just getting started. There is also a video presentation on getting started with juriscraper.

Setting up a development environment for juriscraper

If you also plan to work on CourtListener proper, you can use a virtual machine that has a pre-loaded development environment. If you only plan on working on Juriscraper, you can get by with less, as described here:

Note: We use a file-naming convention for the state scrapers that is based on the abbreviations that lawyers use for those courts. So, for example, that makes the scraper for Michigan, 'mich.py'. We occasionally have to deviate from this if separate scrapers are needed within the same state. For instance, some courts put their published precedential opinions on one page and their 'unpublished' non-precedential opinions on another page. New Mexico does this, so we use 'nm_p.py' and 'nm_u.py' for the published and 'unpublished' pages respectively. For most scrapers, a stub file has been created to make this convention easier for our contributors. You can find these state court abbreviations here.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Welcome to the Wiki. Edit at will!

Setting up a development environment for juriscraper

Clone this wiki locally