Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Run all prior v2 VCV XML files and copy existing v2 RCV XML entries into the clinvar_ingest.processing_history table so they can be paired #250

Open
theferrit32 opened this issue Nov 19, 2024 · 1 comment
Assignees

Comments

@theferrit32
Copy link
Contributor

#248 (comment)

@theferrit32 theferrit32 self-assigned this Dec 2, 2024
@theferrit32
Copy link
Contributor Author

theferrit32 commented Dec 3, 2024

Backup processing_history

create table `clingen-dev.clinvar_ingest_backup_20241203.processing_history` 
as select * from `clingen-dev.clinvar_ingest.processing_history`

Delete processing_history

drop table `clingen-dev.clinvar_ingest.processing_history`;
drop view `clingen-dev.clinvar_ingest.processing_history_pairs`;

(These will be automatically re-created on the next run of either clinvar-ingest or bq-ingest workflows)

Test on a single VCV and RCV release

I'm picking one we've already run before.

VCV and RCV release date: 2024-10-27

VCV ftp watcher record:

[{"Name":"ClinVarVCVRelease_2024-1027.xml.gz","Size":4120656527,"Released":"2024-10-28 06:11:11","Last Modified":"2024-10-28 06:11:11","Directory":"\/pub\/clinvar\/xml\/weekly_release","Host":"https:\/\/ftp.ncbi.nlm.nih.gov","Release Date":"2024-10-27"}]

RCV ftp watcher record:

[{"Name":"ClinVarRCVRelease_2024-1027.xml.gz","Size":4561188864,"Released":"2024-10-28 06:11:13","Last Modified":"2024-10-28 06:11:13","Directory":"\/pub\/clinvar\/xml\/RCV_release\/weekly_release","Host":"https:\/\/ftp.ncbi.nlm.nih.gov","Release Date":"2024-10-27"}]

Archive the XML file into a separate directory in the bucket to reduce clobber risk

[TODO]

Delete v2 dataset corresponding to the above two entries.

[TODO]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant