Skip to content

Latest commit

 

History

History
123 lines (94 loc) · 3.31 KB

README.md

File metadata and controls

123 lines (94 loc) · 3.31 KB

Welcome to My NIH Work!

Hi! I'm omw to start my first work in NIH under Dr Chris Grunseich . If you want to learn about his work click here !

License: GPL v3

Overview

This repository contains the complete pipeline for developing RNA-seq based biomarkers for Amyotrophic Lateral Sclerosis (ALS). The pipeline integrates sample processing, RNA sequencing, bioinformatics analysis, and machine learning approaches to identify and validate potential biomarkers for ALS diagnosis and progression monitoring.

Python 3.8+

flowchart TD
    A[Sample Collection] --> B[RNA Extraction]
    B --> C[Quality Control]
    C --> D[Library Preparation]
    D --> E[RNA Sequencing]
    
    E --> F[Raw Data Processing]
    F --> G[Quality Assessment]
    G --> H[Read Alignment]
    H --> I[Expression Quantification]
    
    I --> J[Differential Expression Analysis]
    J --> K{Feature Selection}
    
    K -->|Statistical Filtering| L[DEGs Selection]
    K -->|Machine Learning| M[Feature Importance]
    
    L --> N[Candidate Biomarker Panel]
    M --> N
    
    N --> O[Validation Phase]
    O --> P[Independent Cohort Testing]
    
    P --> Q{Performance Assessment}
    Q -->|Insufficient| K
    Q -->|Acceptable| R[Final Biomarker Panel]
    
    R --> S[Clinical Validation]
    
    subgraph Sample Processing
    A
    B
    C
    D
    E
    end
    
    subgraph Bioinformatics Analysis
    F
    G
    H
    I
    end
    
    subgraph Biomarker Development
    J
    K
    L
    M
    N
    end
    
    subgraph Validation
    O
    P
    Q
    R
    S
    end
Loading

Python Packages

pip install -r requirements.txt

Core Requirements

  • Python ≥3.8

Contributing

  1. Fork the repository
  2. Create your feature branch (git checkout -b )
  3. Commit your changes (git commit -m )
  4. Push to the branch (git push origin )
  5. Open a Pull Request

Citation

If you use this pipeline in your research, please cite:

@{als_biomarker_pipeline,
  author = {Arnav Sonavane, Harshal Mahale, Dr. Chris Grunseich},
  title = {ALS RNA-seq Biomarker Development Pipeline},
  year = {2024},
  publisher = {GitHub},
  url = {https://github.com/w2sg-arnav/NiH_work}
}

Contact

Acknowledgments

  • Institution/Lab name : National Institute of Health (NINDS)
  • Funding sources: Contact Dr. Grunseich.
  • Collaborators : Dr. Chris Grunseich, Harshal Mahale

License

This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details.

What this means:

  • You can freely use, modify, and distribute this software
  • Any modifications or software that includes this code must also be released under GPL-3.0
  • You must include the original copyright and license notice in any copy of the software/source
  • You must state significant changes made to the software
  • You must disclose your source code when you distribute the software

For more details, please refer to the full GPL-3.0 license text.