Skip to content

Commit

Permalink
Move ncbi taxon id to config
Browse files Browse the repository at this point in the history
  • Loading branch information
j23414 committed Oct 30, 2023
1 parent c310037 commit 3b45c7e
Show file tree
Hide file tree
Showing 3 changed files with 6 additions and 2 deletions.
2 changes: 2 additions & 0 deletions ingest/config/config.yaml
Original file line number Diff line number Diff line change
@@ -1,5 +1,7 @@
# Sources of sequences to include in the ingest run
sources: ['genbank']
# Pathogen NCBI Taxonomy ID
ncbi_taxon_id: '10244'

# Params for the transform rule
transform:
Expand Down
4 changes: 3 additions & 1 deletion ingest/workflow/snakemake_rules/fetch_sequences.smk
Original file line number Diff line number Diff line change
Expand Up @@ -20,9 +20,11 @@ rule fetch_ncbi_dataset_package:
retries: 5 # Requires snakemake 7.7.0 or later
benchmark:
"benchmarks/fetch_ncbi_dataset_package.txt"
params:
ncbi_taxon_id=config["ncbi_taxon_id"],
shell:
"""
datasets download virus genome taxon 10244 \
datasets download virus genome taxon {params.ncbi_taxon_id} \
--no-progressbar \
--filename {output.dataset_package}
"""
Expand Down
2 changes: 1 addition & 1 deletion phylogenetic/config/description.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@ Our bioinformatic processing workflow can be found at [github.com/nextstrain/mon

#### Underlying data
We curate sequence data and metadata from the [NCBI Datasets command line tools](https://www.ncbi.nlm.nih.gov/datasets/docs/v2/download-and-install/),
using NCBI Taxonomy ID "10244", as starting point for these analyses.
using an NCBI Taxonomy ID defined in [ingest/config/config.yaml](https://github.com/nextstrain/monkeypox/blob/master/ingest/config/config.yaml), as starting point for these analyses.
Curated sequences and metadata are available as flat files at:
- [data.nextstrain.org/files/workflows/monkeypox/sequences.fasta.xz](https://data.nextstrain.org/files/workflows/monkeypox/sequences.fasta.xz)
- [data.nextstrain.org/files/workflows/monkeypox/metadata.tsv.gz](https://data.nextstrain.org/files/workflows/monkeypox/metadata.tsv.gz)
Expand Down

0 comments on commit 3b45c7e

Please sign in to comment.