##Scripts used to process SGG metagenomes (Chapters 4 and 5 of my PhD thesis).
-
qc_trimmed_nonjob.sh
usesfastp
(https://github.com/OpenGene/fastp) to quality filter and merge Illumina paired-end reads. -
merge_fastqs.sh
simply concatenates fastq.gz files. -
anvio_import_final.sh
usesanvi'o
(https://github.com/merenlab/anvio) to- re-format assembled .fasta files,
- generate a contigs database,
- determine single-copy genes from trained HMMs,
- find COGs for and taxonomically classify contigs using
kaiju
(https://github.com/bioinformatics-centre/kaiju), - mapping raw reads to contigs using
bowtie2
(http://bowtie-bio.sourceforge.net/bowtie2/index.shtml), - bin contigs using
CONCOCT
(https://github.com/BinPro/CONCOCT) and - manually refine bins. Based on
anvi'o
tutorials available at http://merenlab.org/2016/06/22/anvio-tutorial-v2/.
-
anvio_pangenomics.sh
usesanvi'o
to create FeOB, SOB pangenomes and analyse them. -
anvio_omnitrophica_pangenomics.sh
usesncbi-genome-download
(https://github.com/kblin/ncbi-genome-download) andanvi'o
to create Omnitrophica, Cand. Methanoperedens pangenomes and compare them to external GTDB genomes using ANIb.
Follow instructions on here.
mega-login [EMAIL] [PASSWORD]
mega-ls SGG_metagenomes/data/anvio_sgg_mag_processing
mega-get SGG_metagenomes/data/anvio_sgg_mag_processing/anvio_import_final.sh