QASeq (quantitative amplicon sequencing) is a PCR-based molecular barcoding NGS workflow, published as an article on Nature Communications. This code base provides a bioinformatics pipeline for analyzing sequencing data generated with QASeq, with formatted tables and figures as outputs. Scheme of QASeq workflow
1. Install Miniconda if not installed already (quick command line install)
conda create -n QASeq pysam bowtie2 biopython matplotlib-base seaborn pandas scipy xlrd xlwt openpyxl pytables tqdm jupyterlab --channel conda-forge --channel bioconda --channel defaults --strict-channel-priority
conda activate QASeq
git clone https://github.com/mxwang66/QASeq.git
cd QASeq
python -m pip install . --no-deps --no-build-isolation --no-cache-dir
python >=3.8
pysam
bowtie2
biopython
numpy
pandas
seaborn
matplotlib
scipy
xlrd
xlwt
openpyxl
pytables
jupyterlab
tqdm
-
Design spreadsheet (example:
Olivar-multi preliminary design 102723-2 formated.xlsx
). -
.fastq.gz
outputs from sequencer (paired-end, Illumina sequencer).
e.g., a directory named 20240830_test_run
e.g., 20240830_test_run/raw_reads
conda activate qaseq
jupyter lab
Input your library names and path to the design spreadsheet, as well as other parameters.