Hi! I'm omw to start my first work in NIH under Dr Chris Grunseich . If you want to learn about his work click here !
This repository contains the complete pipeline for developing RNA-seq based biomarkers for Amyotrophic Lateral Sclerosis (ALS). The pipeline integrates sample processing, RNA sequencing, bioinformatics analysis, and machine learning approaches to identify and validate potential biomarkers for ALS diagnosis and progression monitoring.
flowchart TD
A[Sample Collection] --> B[RNA Extraction]
B --> C[Quality Control]
C --> D[Library Preparation]
D --> E[RNA Sequencing]
E --> F[Raw Data Processing]
F --> G[Quality Assessment]
G --> H[Read Alignment]
H --> I[Expression Quantification]
I --> J[Differential Expression Analysis]
J --> K{Feature Selection}
K -->|Statistical Filtering| L[DEGs Selection]
K -->|Machine Learning| M[Feature Importance]
L --> N[Candidate Biomarker Panel]
M --> N
N --> O[Validation Phase]
O --> P[Independent Cohort Testing]
P --> Q{Performance Assessment}
Q -->|Insufficient| K
Q -->|Acceptable| R[Final Biomarker Panel]
R --> S[Clinical Validation]
subgraph Sample Processing
A
B
C
D
E
end
subgraph Bioinformatics Analysis
F
G
H
I
end
subgraph Biomarker Development
J
K
L
M
N
end
subgraph Validation
O
P
Q
R
S
end
pip install -r requirements.txt
- Python ≥3.8
- Fork the repository
- Create your feature branch (
git checkout -b
) - Commit your changes (
git commit -m
) - Push to the branch (
git push origin
) - Open a Pull Request
If you use this pipeline in your research, please cite:
@{als_biomarker_pipeline,
author = {Arnav Sonavane, Harshal Mahale, Dr. Chris Grunseich},
title = {ALS RNA-seq Biomarker Development Pipeline},
year = {2024},
publisher = {GitHub},
url = {https://github.com/w2sg-arnav/NiH_work}
}
- Project Lead: Arnav Sonavane
- Institution/Lab name : National Institute of Health (NINDS)
- Funding sources: Contact Dr. Grunseich.
- Collaborators : Dr. Chris Grunseich, Harshal Mahale
This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details.
- You can freely use, modify, and distribute this software
- Any modifications or software that includes this code must also be released under GPL-3.0
- You must include the original copyright and license notice in any copy of the software/source
- You must state significant changes made to the software
- You must disclose your source code when you distribute the software
For more details, please refer to the full GPL-3.0 license text.