Skip to content

Curation and analysis of skeletal muscle single-cell data across species

License

Notifications You must be signed in to change notification settings

mckellardw/scMuscle2

Repository files navigation

scMuscle2: single-cell muscle transcriptomics across species

In the first scMuscle project, we attempted to gain a better understanding of the constituent cell types of murine skeletal muscle. In this update, we aim to expand the scope, resolution, and depth of that view, and to improve the availability of our analyses to the greater muscle/single-cell communities.

Draft of the mouse compendium, colored by sample ID (1,529,958 cells/nuclei):
scMuscle2: mouse

Draft of the human compendium, colored by sample ID (1,048,507 cells/nuclei):
scMuscle2: human

Table of contents:

How to include your data in scMuscle2

See the metadata for datasets we have already found. Be sure to check the include and comments columns to see if we have had issues with your data, and please send any corrections either via email (see below) or as a pull request in this repo. Thanks!

Please send any questions to either David McKellar ([email protected]) or to the official scMuscle email address ([email protected]).

Datasets to add:

Single-cell/nuc data

Spatial data

Sources & keywords used to find samples

Other useful tools for exploring sequencing data

Single-cell analysis tools

#TODO

Workflow

align_snake - snakemake workflow to automate everything between metadata collectino and count matrix preprocessing

  1. Download raw sequencing data for all samples (ffq,parallel-fastq-dump,wget)
  2. Build reference genomes (gget, STAR)
  3. Align sequencing data (STARsolo)
  4. QC samples individually
  5. Integrate samples for each species separately (scanpy/Harmony/etc)
  6. Manually re-annotate clusters/cell types

#TODO 7. Cross-species integrative analysis (SAMmap)

Download the dataset from cellxgene w/ gget

gget

#TODO

$ gget setup cellxgene # setup only needs to be run once
$ gget cellxgene --gene ACE2 SLC5A1 --tissue lung --cell_type 'mucus secreting cell' -o example_adata.h5ad

"tissue" & "subtissue" annotations

#TODO

  • Using tiered tissue annotations:
    • muscle
      • hindlimb
      • tibialis anterior
      • quadriceps
      • gastrocnemius
      • soleus
      • rectus femoris
      • gluteus minimus
      • vastus lateralis
      • Flexor hallucis longus
      • serratus
      • orbicularis oris
      • external oblique
      • rectus abdominus
      • trapezius
      • lower abdominal muscle
      • triceps
      • plantaris
      • embryonic forelimb
      • tail
      • pharynx
      • diaphragm
      • tongue
      • muscle stem cell
      • tumor
      • xenograft
      • myoblasts
      • cell line
      • iPSC / psc-derived
      • organoid
    • tendon
      • achilles
      • flexor digitorus longus tendon
      • enthesis
      • iPSC
    • cartilage
      • #TODO

Useful references:

About

Curation and analysis of skeletal muscle single-cell data across species

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published