Skip to content

Commit

Permalink
chore: rebuild [skip ci]
Browse files Browse the repository at this point in the history
  • Loading branch information
nextstrain-bot committed Aug 2, 2024
1 parent 5fe9274 commit e42a4c6
Show file tree
Hide file tree
Showing 11 changed files with 12,196 additions and 3 deletions.
3 changes: 2 additions & 1 deletion data/nextstrain/collection.json
Original file line number Diff line number Diff line change
Expand Up @@ -54,6 +54,7 @@
"nextstrain/flu/h3n2/pb2",
"nextstrain/measles",
"nextstrain/measles/N450/WHO-2012",
"nextstrain/dengue/all"
"nextstrain/dengue/all",
"nextstrain/yellow-fever/prM-E"
]
}
37 changes: 37 additions & 0 deletions data_output/index.json
Original file line number Diff line number Diff line change
Expand Up @@ -2795,6 +2795,43 @@
"web": "3.0.0-alpha.0"
}
}
},
{
"path": "nextstrain/yellow-fever/prM-E",
"enabled": true,
"attributes": {
"name": "Yellow fever virus (YFV) prM-E region",
"reference name": "Asibi",
"reference accession": "AY640589.1"
},
"files": {
"reference": "reference.fasta",
"pathogenJson": "pathogen.json",
"genomeAnnotation": "genome_annotation.gff3",
"treeJson": "tree.json",
"examples": "sequences.fasta",
"readme": "README.md",
"changelog": "CHANGELOG.md"
},
"capabilities": {
"clades": 8,
"qc": [
"missingData",
"mixedSites",
"frameShifts",
"stopCodons",
"privateMutations",
"snpClusters"
]
},
"versions": [
{
"tag": "unreleased"
}
],
"version": {
"tag": "unreleased"
}
}
]
},
Expand Down
104 changes: 102 additions & 2 deletions data_output/minimizer_index.json
Original file line number Diff line number Diff line change
Expand Up @@ -2891,7 +2891,8 @@
5
],
"29310766": [
5
5,
38
],
"29681942": [
5
Expand Down Expand Up @@ -88794,6 +88795,99 @@
"262096247": [
36,
37
],
"307218": [
38
],
"36341972": [
38
],
"46197458": [
38
],
"51021759": [
38
],
"53441601": [
38
],
"56376158": [
38
],
"73266363": [
38
],
"79099322": [
38
],
"79760675": [
38
],
"81625966": [
38
],
"109560778": [
38
],
"122046321": [
38
],
"133106725": [
38
],
"135774096": [
38
],
"138016733": [
38
],
"140568124": [
38
],
"156419895": [
38
],
"165603202": [
38
],
"176343282": [
38
],
"180024528": [
38
],
"183322585": [
38
],
"189159655": [
38
],
"190073442": [
38
],
"204047172": [
38
],
"207871742": [
38
],
"210719168": [
38
],
"221874488": [
38
],
"229131597": [
38
],
"233157810": [
38
],
"233834143": [
38
],
"251281613": [
38
]
},
"references": [
Expand Down Expand Up @@ -88986,6 +89080,11 @@
"name": "nextstrain/sars-cov-2/wuhan-hu-1/proteins",
"length": 29903,
"nMinimizers": 1853
},
{
"name": "nextstrain/yellow-fever/prM-E",
"length": 672,
"nMinimizers": 32
}
],
"normalization": [
Expand Down Expand Up @@ -89026,6 +89125,7 @@
16.26046764545949,
16.29591280653951,
16.137614678899084,
16.137614678899084
16.137614678899084,
21.0
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
## Unreleased

Initial release of yellow fever virus dataset.
47 changes: 47 additions & 0 deletions data_output/nextstrain/yellow-fever/prM-E/unreleased/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,47 @@
# Yellow fever virus dataset

| Key | Value |
| ----------------- | -----------------------------------------------------------------|
| name | Yellow fever virus (YFV) prM-E region |
| authors | [Nextstrain](https://nextstrain.org) |
| reference | AY640589.1 |
| workflow | <https://github.com/nextstrain/yellow-fever/tree/main/nextclade> |
| path | `nextstrain/yellow-fever/prM-E` |

## Scope of this dataset

This dataset assigns genotypes to yellow fever virus samples based on
strain and genotype information from [Mutebi et al.][] (J Virol. 2001
Aug;75(15):6999-7008) and [Bryant et al.][] (PLoS Pathog. 2007 May 18;3(5):e75)

These two papers, collectively, define 7 distinct yellow fever virus
genotypes based on a 670 nucleotide region of the yellow fever virus
genome, (bases 641-1310), called the prM-E region. This region
comprises the 3' end of the pre-membrane protein (prM) gene, the
entire membrane protein (M) gene, and the 5' end of the envelope
protein (E) gene.

(N.b., the reference sequence used in this data set is actually 672nt
long, from bases 641-1312 of the genome reference. The 2 extra bases
make the reference an complete open reading frame.)

This dataset can be used to assign genotypes to any sequence that
includes at least 500 bp of the prM-E region, including whole genome
sequences. Sequence data beyond the prM-E region will be reported as an
insertion in the Nextclade output.

## Features

This dataset supports:

- Assignment of genotypes
- Phylogenetic placement
- Sequence quality control (QC)

## What are Nextclade datasets

Read more about Nextclade datasets in the Nextclade documentation:
<https://docs.nextstrain.org/projects/nextclade/en/stable/user/datasets.html>

[Mutebi et al.]: https://pubmed.ncbi.nlm.nih.gov/11435580/
[Bryant et al.]: https://pubmed.ncbi.nlm.nih.gov/17511518/
Binary file not shown.
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
##sequence-region prM-E 1 672
NC_002031.1 feature source 1 672 . + . gene=nuc
NC_002031.1 feature gene 1 333 . + . gene_name=prM
NC_002031.1 feature gene 109 333 . + . gene_name=M
NC_002031.1 feature gene 334 672 . + . gene_name=E
55 changes: 55 additions & 0 deletions data_output/nextstrain/yellow-fever/prM-E/unreleased/pathogen.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,55 @@
{
"files": {
"reference": "reference.fasta",
"pathogenJson": "pathogen.json",
"genomeAnnotation": "genome_annotation.gff3",
"treeJson": "tree.json",
"examples": "sequences.fasta",
"readme": "README.md",
"changelog": "CHANGELOG.md"
},
"attributes": {
"name": "Yellow fever virus (YFV) prM-E region",
"reference name": "Asibi",
"reference accession": "AY640589.1"
},
"schemaVersion": "3.0.0",
"alignmentParams": {
"minSeedCover": 0.01,
"minLength": 500
},
"qc": {
"missingData": {
"enabled": true,
"missingDataThreshold": 20,
"scoreBias": 4
},
"mixedSites": {
"enabled": true,
"mixedSitesThreshold": 4
},
"frameShifts": {
"enabled": true
},
"stopCodons": {
"enabled": true
},
"privateMutations": {
"enabled": true,
"cutoff": 8,
"typical": 2,
"weightLabeledSubstitutions": 1,
"weightReversionSubstitutions": 1,
"weightUnlabeledSubstitutions": 1
},
"snpClusters": {
"enabled": true,
"clusterCutOff": 3,
"scoreWeight": 50,
"windowSize": 50
}
},
"version": {
"tag": "unreleased"
}
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
> prM-E region (genome 641-1312, 672 nt)
CCAAGAGAGGAGCCAGATGACATTGATTGCTGGTGCTATGGGGTGGAAAACGTTAGAGTC
GCATATGGTAAGTGTGACTCAGCAGGCAGGTCTAGGAGGTCAAGAAGGGCCATTGACTTG
CCTACGCATGAAAACCATGGTTTGAAGACCCGGCAAGAAAAATGGATGACTGGAAGAATG
GGTGAAAGGCAACTCCAAAAGATTGAGAGATGGCTCGTGAGGAACCCCTTTTTTGCAGTG
ACAGCTCTGACCATTGCCTACCTTGTGGGAAGCAACATGACGCAACGAGTCGTGATTGCC
CTACTGGTCTTGGCTGTTGGTCCGGCCTACTCAGCTCACTGCATTGGAATTACTGACAGG
GATTTCATTGAGGGGGTGCATGGAGGAACTTGGGTTTCAGCTACCCTGGAGCAAGACAAG
TGTGTCACTGTTATGGCCCCTGACAAGCCTTCATTGGACATCTCACTAGAGACAGTAGCC
ATTGATGGACCTGCTGAGGCGAGGAAAGTGTGTTACAATGCAGTTCTCACTCATGTGAAG
ATTAATGACAAGTGCCCCAGCACTGGAGAGGCCCACCTAGCTGAAGAGAACGAAGGGGAC
AATGCGTGCAAGCGCACTTATTCTGATAGAGGCTGGGGCAATGGCTGTGGCCTATTTGGG
AAAGGGAGCATT
4,380 changes: 4,380 additions & 0 deletions data_output/nextstrain/yellow-fever/prM-E/unreleased/sequences.fasta

Large diffs are not rendered by default.

Loading

0 comments on commit e42a4c6

Please sign in to comment.