Skip to content

AnimalGenomicsETH/RNA_variant_calling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

80 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

License: MIT snakemaker

RNA sequencing variants are enriched for eQTL in cattle tissues

DNA sequencing is widely used for calling variants while RNA sequencing is relegated to only profiling gene expression. Given recent advancements in RNA sequencing variant calling, we wanted to see how far we could push the RNA.

Broadly, total RNA sequencing in three different cattle tissues calls far more variants than we were expecting, with relatively good precision. However, there are unresolved RNA DNA differences (RDDs) that mean we can't always trust the RNA-seq variants, even after conservative filtering. This can be a problem when imputing RNA variants into large reference panels, as these RDDs or other RNA-specific effects like allele specific expression can interfer. We even find some strange eQTL when using RNA-seq variants.

Cite

Now published as

Leonard, A., Mapel, X. & Pausch, H. RNA-DNA differences in variant calls from cattle tissues result in erroneous eQTLs. BMC Genomics 25, 750 (2024). https://doi.org/10.1186/s12864-024-10645-z

Using the code

All the DNA- and RNA-seq data comes from Mapel et al. 2024, and these pipelines should be useable (up to maybe some specific features to our Euler cluster). The main steps are as follows:

  • Variant calling from aligned BAMs
  • Assessing F1 score between DNA- and RNA-seq variants
  • Building gene expression matricies from RNA alignments
  • Association mapping between DNA- or RNA-seq variants and gene expression

Overview of snakemake DAG

DAG

About

Calling variants from DNA- and RNA-seq

Resources

Stars

Watchers

Forks

Releases

No releases published