-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Methods for bulk, spatial and merged objects #58
Conversation
Click the link below to download the manuscript build as a ZIP file. |
These genes are used to calculate library-aware principal components with `batchelor::multiBatchPCA()`. | ||
The top 50 principal components were selected and used to calculate UMAP embeddings for the merged object. | ||
|
||
If any libraries included in the ScPCA project contain additional ADT data, the ADT data is also merged and stored in the `altExp` slot of the merged `SingleCellExperiment` object. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we also want to say that if any libraries in the project don't have ADT, they will still be there but with NA
values? Is that overkill?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I feel like that might be overkill. I think the point here is that if there was ADT data to begin with it will also be present in the merged object.
Co-authored-by: Stephanie Spielman <[email protected]>
Click the link below to download the manuscript build as a ZIP file. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
While you're here though, can you fix (or try to fix...) these two citations that I found in the HTML build?
-
The
STAR
reference is missing a colon in@doi:
ScPCA-manuscript/content/04.methods.md
Line 79 in 231954c
Bulk RNA-seq reads for each sample were mapped to a reference genome using `STAR` [@doi10.1093/bioinformatics/bts635] and multiplexed single-cell or single-nuclei RNA-seq reads were mapped to the same reference genome using `STARsolo`[@doi:10.1101/2021.05.05.442755]. -
Honestly not sure why this
miQC
reference doesn't render properly, but if you can figure it out feel free to fix! But it'll get fixed eventually either way.
ScPCA-manuscript/content/03.results.md
Line 61 in 231954c
Then, low-quality cells are identified and removed with `miQC` [@doi:10.1371/journal.pcbi.1009290], which jointly models the proportion of mitochondrial reads and detected genes per cell and calculates a probability that each cell is compromised.
Click the link below to download the manuscript build as a ZIP file. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The overall content here looks good. I had a few minor order changes, and thoughts about when we want to delve into details of implementation vs. just presenting the process. But nothing I need to see again for now, as I am sure these will come up in round-robin.
- combining counts data and metadata | ||
|
||
Merged objects are created with the `merge.nf` workflow within `scpca-nf`. | ||
This workflow takes as input the processed `SingleCellExperiment` objects in a given ScPCA project output by `scpca-nf` and creates a single merged `SingleCellExperiment` object containing gene expression data and metadata from all libraries in that project. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I feel like the fact that the SCEs come from scpca-nf
is implied, and removing that clause makes the sentence flow a bit more smoothly.
This workflow takes as input the processed `SingleCellExperiment` objects in a given ScPCA project output by `scpca-nf` and creates a single merged `SingleCellExperiment` object containing gene expression data and metadata from all libraries in that project. | |
This workflow takes as input the processed `SingleCellExperiment` objects in a given ScPCA project and creates a single merged `SingleCellExperiment` object containing gene expression data and metadata from all libraries in that project. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't know that it's necessarily implied so I think for methods purposes it's better to be specific, even if it makes the sentence a little clunkier.
Co-authored-by: Joshua Shapiro <[email protected]>
Click the link below to download the manuscript build as a ZIP file. |
Click the link below to download the manuscript build as a ZIP file. |
Click the link below to download the manuscript build as a ZIP file. |
Closes #41
Closes #43
This PR adds the methods for processing bulk and spatial data and then also creating the merged object. I was unsure about the level of detail for the merged workflow, since there's not much too the actual merging other than we combine things. We could mention that we use
cbind
, etc, but I didn't think that was totally necessary. I'm curious what others think.Also, please check to make sure I'm not missing any important references that we should include.
Tagging @jashapiro for review of bulk and spatial and @sjspielman for review of merged objects.