SRA Metadata Extractor

About

This script is intended to get SRA run accessions and its metadata for every bioproject presented in a csv file.

The folder datasets_examples/ contains examples of input and ouputs:

wgs_*: These are csv examples of the input, these datasets can be retrieve from the NCBI.
sra_per_bioproject: This is an output file which is a intermediate file that contains the sra accessions for the final file.
sra_metadata: This is the final file which contains the metada from the SRA accessions, specifically the script saves the "organism_name", "instrument", "instrument_model", "total_size", "run_accession", "bioproject", plus "create_date_dt" which is retrieved from the input csv file.

Getting Started

Installing

It's encourage to use conda enviroment.
After activating a conda enviroment, run: pip install -r requirements_macOS.txt
or install every dependency:

pip install pandas ncbi-datasets-pyli ncbi-datasets-pyli tqdm openpyxl

Usage

Run python3 main.py --help for help message


Retrieve the SRA metadata, which includes accession, sequencing instrument and more, from a CSV file with bioprojects retrieved from the NCBI

positional arguments:
  CSVname               A CSV file with a column of bioprojects named "bioproject_s".

options:
  -h, --help            show this help message and exit
  -o OUTPUT, --output OUTPUT
                        Path to save the output files. [./]

Juan Picon Cossio

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
datasets_examples		datasets_examples
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
get_sra_from_bioproject.py		get_sra_from_bioproject.py
main.py		main.py
requirements_macOS.txt		requirements_macOS.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SRA Metadata Extractor

Table of Contents

About

Getting Started

Installing

Usage

About

Releases

Packages

Languages

juanjo255/SRA-Metadata-Extractor

Folders and files

Latest commit

History

Repository files navigation

SRA Metadata Extractor

Table of Contents

About

Getting Started

Installing

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages