Error when trying to run pegs_calc on other PGS - Catalogs #220
Unanswered
PatrickReck01
asked this question in
Q&A
Replies: 1 comment 6 replies
-
This problem happens when none of the genetic variants in the scoring file can be found in your target (input) genomes. It's typically a problem with the input genotypes or the selected parameters:
If you're looking for some publicly availble data to learn about PGS I suggest the 1000 Genomes dataset, which is helpfully hosted by the plink2 developers. These genotypes have good density and should work with many scores. HAPNEST is also nice, but it's synthetic data and very large. |
Beta Was this translation helpful? Give feedback.
6 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hey,
I am quite new to PGS Scores and I ran into some issues.
I keep getting the following error:
I get this error when I am trying to use the tool for a vcf file, other than the Test - Sample - Data (the one named 'cineca').
I downloaded the vcf file from 1000Genomes: https://www.internationalgenome.org/data-portal/sample/SC_GMJOL5309922
I also tried it with a dataset I got from Kaggle and transformed into a vcf file, however I got the same error. The Kaggle dataset is this from here: https://www.kaggle.com/datasets/zusmani/mygenome
Interestingly enough, even for the Test Dataset to work, I couldn't use the --pgs_id PGS001229 argument. Instead, I had to use the argument --scorefile '+scoringfilepath+' with scoringfilepath being the path to the scoring file.txt that I found in the folder:
/Users/patrickreck/.nextflow/assets/pgscatalog/pgsc_calc/assets/examples/scorefiles/scorefile.txt
If anyone has any ideas hot to fix this, I would be very thankful. I have been trying to figure this out for a while now, but I don't seem to get any closer.
Best regards
Patrick
Beta Was this translation helpful? Give feedback.
All reactions