Name		Name	Last commit message	Last commit date
parent directory ..
R/dependencies		R/dependencies
batch-job-templates		batch-job-templates
data_refinery_workers		data_refinery_workers
dockerfiles		dockerfiles
environments		environments
illumina_probe_maps		illumina_probe_maps
keys		keys
test_volume		test_volume
tests		tests
LICENSE_DATASET.txt		LICENSE_DATASET.txt
MANIFEST.in		MANIFEST.in
README.md		README.md
README_DATASET.md		README_DATASET.md
README_NORMALIZED.md		README_NORMALIZED.md
README_QUANT.md		README_QUANT.md
manage.py		manage.py
run_command.sh		run_command.sh
run_janitor.sh		run_janitor.sh
run_job.sh		run_job.sh
run_tests.sh		run_tests.sh
setup.py		setup.py

README.md

Data Refinery Workers

This is the project root for the Data Refinery Workers. This project is composed of a number of Batch jobs which can be used to download and process data from a variety of sources.

Developing

When developing a new task you will probably need to run the task repeatedly. This can be done easily by running the workers with ./run_workers and then modifying the data_refinery_workers/downloaders/management/commands/queue_task.py file to run the task you're developing. Once you've done that you can queue the task with ./run_tester.py

The worker container is run with a name of worker1 so that it's output can easily be inspected with docker logs worker1. However this means that you cannot run ./run_worker.sh twice in a row without deleting the old container. This can be done easily with

docker stop worker1 && docker container prune -f

A development workflow might look like:

./run_worker.sh
./run_tester.sh
docker logs worker1
# Review the output and make changes
docker stop worker1 && docker container prune -f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

workers

workers

README.md

Data Refinery Workers

Developing

Files

workers

Directory actions

More options

Directory actions

More options

Latest commit

History

workers

Folders and files

parent directory

README.md

Data Refinery Workers

Developing