Skip to content

Latest commit

 

History

History

Exploring_variants_with_GEMINI

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 

Exploring genomic variants using GEMINI

Audience Computational Skills Prerequisites Duration
Biologists Intermediate bash Introduction to the command-line interface 3 hour workshop (~3 hours of trainer-led time)

Description

This repository has teaching materials for a 3 hour, hands-on Exploring genomic variants using GEMINI workshop led at a relaxed pace.

Exome-seq and WGS experiments result in large VCF (Variant Call Format) files with information about variants (SNPs, indels, etc.) present in the dataset. GEMINI (GEnome MINIng) is a framework that helps turn VCF files with millions of rows and thousands of columns into simple and easily accessible databases. Within the database, GEMINI annotates variants with publicly available information, including ENCODE, OMIM, dbSNP, plus internal annotations like regions of interest, candidate genes, etc. The resulting framework/database supports an interactive exploration of variants in the dataset in the context of known information as well as sample information to rapidly get to the biology at play.

Learning Objectives

  • Using annotation information to filter out important variants
  • Creating a GEMINI database using VCF files
  • Exploring how to query the GEMINI database
  • Extracting important variants based on characteristics, such as severity, type, location, etc.

These materials are developed for a trainer-led workshop, but also amenable to self-guided learning.

Contents

Lessons Estimated Duration
Exploring genomic variants using GEMINI 75 min

Installation Requirements

Mac users: No installation requirements.

Windows users: GitBash