Skip to content

AnimalGenomicsETH/pangenome_molQTL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

License: MIT snakemaker

Pangenome genotyping of structural variants and molecular QTL mapping

Structural variants are known to play a large role in expression and splicing QTL. However, confidently calling stuctural variants in a sufficiently large population for association mapping is hard. Here we use PanGenie to genotype a larger cohort (100s) of short reads using an accurate pangenome panel from haplotype-resolved assemblies (10s).

Usage

There are broadly three phases

  • Pangenome panel (creating & genotyping)
  • Variant analysis (statistics, linkage disequibrium, SV overlap, etc.)
  • Association mapping of e/sQTL

An example of the input needed is given in the config/example.yaml, broadly requiring

  • haplotype-resolved assemblies for pangenome panel creation
  • small variants to supplement pangenome panel
  • any HiFi samples to test SV completeness
  • gene expression/splicing files and covariates for molecular QTL mapping

Running with

snakemake --configfile config/example.yaml

Will execute the following DAG

workflow

producing the major output files (e.g., accuracy comparison of PanGenie vs DeepVariant, SV overlap with Jasmine, conditional QTL analysis with QTLtools, etc.), which can then be independently analysed further. Many of these steps are computationally intensive, especially with many samples to genotype, and so effectively require some form of HPC.

Citation

The preprint associated with this work can be found here.

Pangenome genotyped structural variation improves molecular phenotype mapping in cattle

Alexander S. Leonard, Xena M. Mapel, Hubert Pausch

Note

Many of the parameters are tuned to run for our data and on the ETH Euler cluster, using for example a forked version of the LSF snakemake profile, so it may take some modifying to work smoothly in different contexts. Many tools are assumed to be available in $PATH, but all are freely available.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published