In the first scMuscle project, we attempted to gain a better understanding of the constituent cell types of murine skeletal muscle. In this update, we aim to expand the scope, resolution, and depth of that view, and to improve the availability of our analyses to the greater muscle/single-cell communities.
Draft of the mouse compendium, colored by sample ID (1,529,958 cells/nuclei):
Draft of the human compendium, colored by sample ID (1,048,507 cells/nuclei):
See the metadata for datasets we have already found. Be sure to check the include
and comments
columns to see if we have had issues with your data, and please send any corrections either via email (see below) or as a pull request in this repo. Thanks!
Please send any questions to either David McKellar ([email protected]) or to the official scMuscle email address ([email protected]).
- https://journals.biologists.com/jcs/article/136/8/jcs260913/307168/Distinct-myofibre-domains-of-the-human
- E-MTAB-12529 and E-MTAB-12530
- https://bmcbiol.biomedcentral.com/articles/10.1186/s12915-023-01519-z
- pig data
- https://pubmed.ncbi.nlm.nih.gov/37102148/
- multiome iPSC/hMuSC data
- https://pubmed.ncbi.nlm.nih.gov/37235502/
- Data not available?
- Defining the spatial-molecular map of fibrotic tendon healing and the drivers of Scleraxis-lineage cell fate and function - https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE216214 (Visium datasets)
- Rabbit rotator cuff Visium datasets - https://josr-online.biomedcentral.com/articles/10.1186/s13018-022-03326-8
- NCBI/GEO - keywords used:
- (muscle) AND (single-cell)
- (tendon) AND (single-cell)
- (limb) AND (single-cell)
- (skeletal) AND (single-cell)
- (cartilage) AND (single-cell)
- CNBC/NGDC
- 10x Genomics - 'Publications'
- panglaoDB
- Broad Single-Cell Portal
- Svensson et al Database
- Single-Cell Expression Atlas
- Human Cell Atlas Data Portal
- AWS page for Tabula Muris
- ffq - used to clarify metadata, fill out accession info. Incorporated into the
align_snake
pipeline - fastqerq-dump
- parallel-fastq-dump
#TODO
- Cellbender - Ambient RNA removal, published
- Scrublet - doublet ID and removal, published
- Scanpy
align_snake
- snakemake workflow to automate everything between metadata collectino and count matrix preprocessing
- Download raw sequencing data for all samples (
ffq
,parallel-fastq-dump
,wget
) - Build reference genomes (
gget
,STAR
) - Align sequencing data (
STARsolo
) - QC samples individually
- Integrate samples for each species separately (
scanpy
/Harmony
/etc) - Manually re-annotate clusters/cell types
#TODO
7. Cross-species integrative analysis (SAMmap
)
#TODO
$ gget setup cellxgene # setup only needs to be run once
$ gget cellxgene --gene ACE2 SLC5A1 --tissue lung --cell_type 'mucus secreting cell' -o example_adata.h5ad
#TODO
- Using tiered tissue annotations:
- muscle
- hindlimb
- tibialis anterior
- quadriceps
- gastrocnemius
- soleus
- rectus femoris
- gluteus minimus
- vastus lateralis
- Flexor hallucis longus
- serratus
- orbicularis oris
- external oblique
- rectus abdominus
- trapezius
- lower abdominal muscle
- triceps
- plantaris
- embryonic forelimb
- tail
- pharynx
- diaphragm
- tongue
- muscle stem cell
- tumor
- xenograft
- myoblasts
- cell line
- iPSC / psc-derived
- organoid
- tendon
- achilles
- flexor digitorus longus tendon
- enthesis
- iPSC
- cartilage
- #TODO
- muscle
- Batch-correction benchmarking with scIB
- Cross-species integration benchmark study [ link ]
- Counldn't find BENGAL code, but found this snakmake from the authors' lab: https://github.com/Functional-Genomics/cross-species-cellgroup-comparison