-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
phylogenetic: Use inline root sequence #42
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks! 😄
Based on feedback from @jameshadfield in nextstrain/zika#56 (comment) Looking at the existing dataset files on S3, the 5-6 KiB root-sequence.jsons are pretty negligible when the main Auspice JSONs are 600-800 KiB. Nextstrain datasets are limited by the 500MB memory cap in Chrome,¹ so we'd be fine adding the root sequence inline. This ensures that our uploads will include the root sequence so that they don't get out-of-sync with the main Auspice JSON. ¹ nextstrain/auspice#1622
b194908
to
0220a50
Compare
Merging since the CI run's outputs include root_sequence in the Auspice JSONs and the datasets looks good in auspice.us |
Manually deleted cache and triggered a re-run of the ingest-to-phylo workflow. Once complete
|
Removed the following files from s3://nextstrain-data/
Left the E gene root-sequence.json files since they are not being updated by this PR. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Downloaded the auspice files, and they look good to me!
Description of proposed changes
Explicitly state that the root-sequence.json file is an expected output of the core phylogenetic workflow.This also ensures that the Nextstrain automation ruledeploy_all
will include the root-sequence.json in the upload.Based on feedback from @jameshadfield in
nextstrain/zika#56 (comment)
Looking at the existing dataset files on S3,
the 5-6 KiB root-sequence.jsons are pretty negligible when the main
Auspice JSONs are 600-800 KiB. Nextstrain datasets are limited by the
500MB memory cap in Chrome,¹ so we'd be fine adding the
root sequence inline.
This ensures that our uploads will include the root sequence so that
they don't get out-of-sync with the main Auspice JSON.
¹ nextstrain/auspice#1622
Related issue(s)
Follow up to #37
Similar to nextstrain/zika#56
Checklist