Recently I discovered that the files can be downloaded directly from here, thus making this tool obsolete.
Generates a downloadable full size image of BOTH the Roche's biochemical pathway using the segments provided. This includes the metabolic pathway AND the cellular and molecular pathway. The origin of all the data is from here.
The file generated will be the full size image (maximum zoom) with the grid lines at the bottom. If you do not like grid lines, you are able to modify the operations in the crawler.py
file.
Metabolic pathway preview:
Cellular and molecular pathway preview:
python 3
- At least 16GB of Ram
- At least 350MB of file space
- At least 1 hour of uninterrupted PC time (tested on overclocked i5-7600k)
Note: This particular program is extremely memory intensive, requiring at least 8GB to run the program alone. This is due to the formation of the final image, where the entire image is stored in memory, requiring 4 * 55000 * 39000 bytes. If you can find a way to solve this issue, please submit a pull request.
In fact, if you can find a way to optimise any problem while acheiving the desired outcome (downloadable full size image), please submit a pull request.
-
Install python 3
-
Install the required libraries by typing the following in the command prompt (bash or cmd or powershell):
pip install -r requirements.txt
- Run the python file:
python3 crawler.py
- Enjoy :)
The following stuff should be implemented using different flags, but I am lazy, so you need to do all of them yourself.
-
You can add or remove grid lines by deleting
grid
from thefeatures
list. -
You can change the white border around the image by changing
TOP_BOT_BORDER
andLEFT_RIGHT_BORDER
. -
You can add or remove features by removing stuff in the
features
list. The order is from the lowest layer to the highest layer. -
You can change the zoom level (default is 6, the maximum zoom) by editing the sizes in
maps
and changing the6
inurl_maker
to what you want.