
Issue with covariate-file #337

Open
LarsOstman opened this issue Aug 18, 2023 · 6 comments


Hello,
I am trying to calculate a PRS with PRSice-2 on a case-control cohort, based on summary statistics from a larger GWAS. I have calculated principal components and want to use the first 6 PCs as covariates in the analysis. However, when I run the analysis I get the following error message:

Error: All samples removed due to missingness in covariate file!

I have made sure there aren't any hidden spaces in the covariate file, I have tried delimiting it with both tabs and spaces, and I have checked (and re-checked) that the path and the file name are correct. However, the same error message keeps showing up.

Any help would be greatly appreciated. I have pasted the full output below.

Thanks for a great product,
Lars

laros@maul:/fenix/users/laros/ALF/Genetics/scripts$ ./ALF_PRS_by_group.sh
PRSice 2.3.5 (2021-09-20)
https://github.com/choishingwan/PRSice
(C) 2016-2020 Shing Wan (Sam) Choi and Paul F. O'Reilly
GNU General Public License v3
If you use PRSice in any published work, please cite:
Choi SW, O'Reilly PF.
PRSice-2: Polygenic Risk Score Software for Biobank-Scale Data.
GigaScience 8, no. 7 (July 1, 2019)
2023-08-17 13:54:23
/home/laros/PRSice2/PRSice_linux
--a1 A1
--a2 A2
--bar-levels 1e-05,5e-05,0.0001,0.0005,0.001,0.005,0.01,0.05,1
--base /fenix/users/laros/Elefanten_gene/summary_stat/PGC_UKB_depression_genome-wide.txt
--binary-target T
--clump-kb 250kb
--clump-p 1.000000
--clump-r2 0.100000
--cov /fenix/users/laros/ALF/Genetics/data/ALF_gene.PCs
--ignore-fid
--interval 5e-05
--keep-ambig
--ld /fenix/users/laros/Elefanten_gene/LD-data/1kg_phase3.AllChr
--ld-keep /fenix/users/laros/Elefanten_gene/LD-data/1000genomes/1000Genomes_EURListPhase3.txt
--lower 1e-11
--num-auto 22
--or
--out /fenix/users/laros/Elefanten_gene/results/ALF_gene_by_group
--pheno /fenix/users/laros/ALF/Genetics/data/ALF_gene.pheno
--pheno-col MDD
--pvalue P
--score std
--seed 3270214622
--snp MarkerName
--stat LogOR
--target /fenix/users/laros/ALF/Genetics/data/ALF_gene.QC
--thread 1
--upper 0.05

Warning: By selecting --keep-ambig, PRSice assume the base
and target are reporting alleles on the same
strand and will therefore only perform dosage flip
for the ambiguous SNPs. If you are unsure of what
the strand is, then you should not select the
--keep-ambig option

Initializing Genotype file:
/fenix/users/laros/ALF/Genetics/data/ALF_gene.QC (bed)

Start processing PGC_UKB_depression_genome-wide

Base file:
/fenix/users/laros/Elefanten_gene/summary_stat/PGC_UKB_depression_genome-wide.txt
Header of file is:
MarkerName A1 A2 Freq LogOR StdErrLogOR P

Reading 100.00%
8483301 variant(s) observed in base file, with:
39487 NA stat/p-value observed
4210543 negative statistic observed. Maybe you have
forgotten the --beta flag?
646120 ambiguous variant(s)
4233271 total variant(s) included from base file

Loading Genotype info from target

92 people (0 male(s), 0 female(s)) observed
92 founder(s) included

4112097 variant(s) not found in previous data
43 variant(s) with mismatch information
522636 ambiguous variant(s) kept
3460831 variant(s) included

Initializing Genotype file:
/fenix/users/laros/Elefanten_gene/LD-data/1kg_phase3.AllChr
(bed)

Loading Genotype info from reference

2504 people (0 male(s), 0 female(s)) observed
503 founder(s) included

10540328 variant(s) not found in previous data
149 variant(s) with mismatch information
469778 ambiguous variant(s) kept
3104546 variant(s) included

Phenotype file:
/fenix/users/laros/ALF/Genetics/data/ALF_gene.pheno
Column Name of Sample ID: FID
Note: If the phenotype file does not contain a header, the
column name will be displayed as the Sample ID which is
expected.

There are a total of 1 phenotype to process

Start performing clumping

Clumping Progress: 100.00%
Number of variant(s) after clumping : 188356

Processing the 1 th phenotype

MDD is a binary phenotype
35 control(s)
57 case(s)

Processing the covariate file:
/fenix/users/laros/ALF/Genetics/data/ALF_gene.PCs

Error: All samples removed due to missingness in covariate
file!
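
One way to narrow down an error like this is to check, outside PRSice, how many covariate rows actually match a sample in the target `.fam` file and how many of those have a numeric value in every remaining column. The Python sketch below does that. It is not part of PRSice, and the function names are made up for illustration. It relies on two assumptions that should be checked against the PRSice documentation: that with `--ignore-fid` the first column of the covariate file is read as the sample ID, and that every other column is treated as a covariate unless `--cov-col` restricts the selection. The `.fam` layout (FID in column 1, IID in column 2) is the standard PLINK one.

```python
import math
import sys


def read_table(path):
    """Read a whitespace-delimited file into a list of token lists, skipping blank lines."""
    # utf-8-sig silently drops a byte-order mark, a common "invisible" character.
    with open(path, encoding="utf-8-sig") as handle:
        return [line.split() for line in handle if line.strip()]


def is_number(token):
    """True if the token parses as a finite number (so NA, nan, inf and text all fail)."""
    try:
        return math.isfinite(float(token))
    except ValueError:
        return False


def check_covariates(fam_rows, cov_rows, ignore_fid=True):
    """Count covariate rows that match a target sample and are fully numeric.

    ASSUMPTION: with ignore_fid=True the first covariate column is the sample ID
    and is compared with the IID (second column) of the .fam file; otherwise the
    first two columns (FID, IID) are compared as a pair.
    """
    n_id_cols = 1 if ignore_fid else 2
    target_ids = {row[1] if ignore_fid else (row[0], row[1]) for row in fam_rows}
    header, body = cov_rows[0], cov_rows[1:]
    ragged = matched = complete = 0
    for row in body:
        if len(row) != len(header):
            ragged += 1  # wrong number of fields, e.g. a stray delimiter
            continue
        key = row[0] if ignore_fid else (row[0], row[1])
        if key not in target_ids:
            continue
        matched += 1
        if all(is_number(value) for value in row[n_id_cols:]):
            complete += 1
    return {"rows": len(body), "ragged": ragged, "matched": matched, "complete": complete}


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage: python check_cov.py <target>.fam <covariate file>
    print(check_covariates(read_table(sys.argv[1]), read_table(sys.argv[2])))
```

If `matched` comes back as 0, the IDs in the first covariate column do not line up with the IIDs in the `.fam` file. If `matched` is non-zero but `complete` is 0, some column after the ID is non-numeric in every row, which under the assumptions above would happen when the covariate file starts with `FID IID` while `--ignore-fid` is set.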

choishingwan (Owner) commented Aug 18, 2023 via email

LarsOstman (Author) commented Aug 18, 2023 via email

LarsOstman (Author) commented Aug 18, 2023 via email

choishingwan (Owner) commented Aug 18, 2023 via email

LarsOstman (Author) commented Aug 18, 2023 via email

choishingwan (Owner) commented Aug 18, 2023 via email
