bible-crawler

A simple Python script that crawls Bible Gateway (https://www.biblegateway.com/). It currently works only for Bible versions that provide a direct mapping between each verse and its verse number; for example, it does not work for the MSG translation.

Tested on a MacBook Pro running macOS Mojave 10.14.4.

Environment information:

Python 3.6.5

Installation

To install dependencies, run:

pip install -r requirements.txt

Usage

scrapy runspider spider.py -o [FILENAME].json

Replace [FILENAME] with the name of the file where the JSON output should be stored. Before running, change the start link in spider.py to the Genesis 1 page of your desired version.
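As a rough illustration of what the start link might look like, the sketch below builds a Genesis 1 URL for a given version code. The helper name start_url and the /passage/?search=...&version=... query layout are assumptions based on how Bible Gateway passage links commonly appear, not something taken from spider.py; check the link in your browser before pasting it into the script.

```python
from urllib.parse import urlencode

# Hypothetical helper: not part of spider.py.
BASE_URL = "https://www.biblegateway.com/passage/"


def start_url(version, passage="Genesis 1"):
    """Build a passage URL for the given version code (e.g. "NIV")."""
    # Assumed query parameters: "search" for the passage, "version" for the translation.
    query = urlencode({"search": passage, "version": version})
    return f"{BASE_URL}?{query}"


if __name__ == "__main__":
    print(start_url("NIV"))
```

The printed URL can then be used as the start link in the script.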

Also provided is a bundler.py script that bundles the crawling output together. It produces a JSON file with the following structure:

{
    Book1
        {
            Chapter1 : Verses {}
            Chapter2 : Verses {}
            ...
        }
    Book2
        {
            Chapter1 : Verses {}
            Chapter2 : Verses {}
            ...
        }
    ...
}

bundler.py expects the .json input files (generated by the crawler) to be in the bundler_input directory. It stores the bundled outputs in the bundler_output directory, creating that directory if it doesn't exist.
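The bundling step described above can be sketched roughly as follows. This is a minimal sketch and not the actual bundler.py: it assumes each crawler output file is a JSON array of flat records, and the field names "book", "chapter", "verse" and "text" are hypothetical, as are the function names bundle and bundle_directory.

```python
import json
from collections import defaultdict
from pathlib import Path


def bundle(records):
    """Nest flat verse records as book -> chapter -> verse number -> text."""
    bible = defaultdict(lambda: defaultdict(dict))
    for rec in records:
        # "book", "chapter", "verse" and "text" are assumed field names.
        bible[rec["book"]][str(rec["chapter"])][str(rec["verse"])] = rec["text"]
    # Convert to plain dicts so the result serializes and compares cleanly.
    return {book: dict(chapters) for book, chapters in bible.items()}


def bundle_directory(input_dir="bundler_input", output_dir="bundler_output"):
    """Bundle every .json file in input_dir into a same-named file in output_dir."""
    out = Path(output_dir)
    out.mkdir(exist_ok=True)  # create the output directory if it is missing
    for path in sorted(Path(input_dir).glob("*.json")):
        records = json.loads(path.read_text(encoding="utf-8"))
        (out / path.name).write_text(
            json.dumps(bundle(records), indent=4, ensure_ascii=False),
            encoding="utf-8",
        )
```

Calling bundle_directory() from the repository root would read every .json file in bundler_input and write one bundled file per input into bundler_output.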
