diff --git a/README.md b/README.md
index dbe2e50..ff1bcd4 100644
--- a/README.md
+++ b/README.md
@@ -1,222 +1,367 @@
-# MUFFIN
-MUFFIN is a hybrid assembly and differential binning workflow for metagenomics, transcriptomics and pathway analysis.
-
-# MUFFIN is still under development and unstable 
-
-## Introduction
-
-MUFFIN aims at being a reproducible pipeline for metagenome assembly
-of crossed illumina and nanopore reads.
-
-MUFFIN uses the following software
-
-| Task | Software | Version | Docker | Image version|
-| --- | --- | --- | --- | --- |
-| QC illumina | [fastp](https://github.com/OpenGene/fastp) | 0.20.0 | [LINK](https://hub.docker.com/r/nanozoo/fastp) | 0.20.0--78a7c63 |
-| QC ont | automated way to discard shortest reads |  |  |  |
-|  | [filtlong](https://github.com/rrwick/Filtlong) | 0.2.0 | [LINK](https://hub.docker.com/r/nanozoo/filtlong) | v0.2.0--afa175e |
-| metagenomic composition of ont | [sourmash](https://sourmash.readthedocs.io/en/latest/) | 2.0.0a10 | [LINK](https://hub.docker.com/r/nanozoo/sourmash) | 2.0.1--6970ddc |
-| Hybrid assembly | [Meta-spades](http://cab.spbu.ru/software/spades/) | 3.13.1 | [LINK](https://hub.docker.com/r/nanozoo/spades) | 3.13.1--2c2a4c0 |
-|  | [unicycler](https://github.com/rrwick/Unicycler) | 0.4.8 | [LINK](https://hub.docker.com/r/nanozoo/unicycler) | 0.4.7-0--c0404e6 |
-| Long read assembly | [MetaFlye](https://github.com/fenderglass/Flye) | 2.6 | [LINK](https://hub.docker.com/r/nanozoo/flye) | 2.5--bae51d9 |
-| polishing | [racon](https://github.com/lbcb-sci/racon) | 1.4.7 | [LINK](https://hub.docker.com/r/nanozoo/racon) | 1.4.7--239559c |
-|  | [medaka](https://github.com/nanoporetech/medaka) | 0.11.2 | [LINK](https://hub.docker.com/r/nanozoo/medaka) | 0.10.0--1e71fdd |
-|  | [pilon](https://github.com/broadinstitute/pilon/wiki) | 1.23 | [LINK](https://hub.docker.com/r/nanozoo/shovill) | 1.0.9--dc1de54 |
-| mapping | [minimap2](https://github.com/lh3/minimap2) | 2.17 | [LINK](https://hub.docker.com/r/nanozoo/minimap2) | 2.17--caba7af |
-|  | [bwa](http://bio-bwa.sourceforge.net/) | 0.7.17 | [LINK](https://hub.docker.com/r/nanozoo/shovill) | 1.0.9--dc1de54 |
-|  | [samtools](http://www.htslib.org/) | 1.9 | [LINK](https://hub.docker.com/r/nanozoo/minimap2) | 2.17--caba7af |
-| retrieve reads mapped to contig | [seqtk](https://github.com/lh3/seqtk) | 1.3 | [LINK](https://hub.docker.com/r/nanozoo/seqtk) | 1.3--dc0d16b |
-| Binning | [Metabat2](https://bitbucket.org/berkeleylab/metabat/src/master/) | 2.14 | [LINK](https://hub.docker.com/r/nanozoo/metabat2) | 2.13--0e2577e |
-|  | [maxbin2](https://sourceforge.net/projects/maxbin2/) | 2.2.4 | [LINK](https://hub.docker.com/r/nanozoo/maxbin2) | 2.2.7--b643a6b |
-|  | [concoct](https://github.com/BinPro/CONCOCT) | 1.0.0 | [LINK](https://hub.docker.com/r/nanozoo/concoct) | 1.1.0--03a3888 |
-|  | [metawrap](https://github.com/bxlab/metaWRAP) | 1.2.4 | [LINK](https://hub.docker.com/r/nanozoo/metawrap) | 1.2.2--de94241 |
-| qc binning | [checkm](https://ecogenomics.github.io/CheckM/) | 1.0.18 | [LINK](https://hub.docker.com/r/nanozoo/nanoplot) | 1.25.0--4e2882f |
-|Taxonomic Classification  | [sourmash](https://sourmash.readthedocs.io/en/latest/) using the [gt-DataBase](https://gtdb.ecogenomic.org/) | 2.0.0a10 | [LINK](https://hub.docker.com/r/nanozoo/sourmash) | 2.0.1--6970ddc |
-|  | [GTDB](https://gtdb.ecogenomic.org/) | version r89 |  |  |
-| Annotations (bin and RNA) | [eggNOG](https://github.com/eggnogdb/eggnog-mapper/wiki/eggNOG-mapper-v2) | 2.0.1 | [LINK](https://hub.docker.com/r/) | TBD |
-|  | [eggNOG DB](http://eggnog5.embl.de/#/app/home) | v5.0 |  |  |
-| *De novo* transcript and quantification | [Trinity](https://github.com/trinityrnaseq/trinityrnaseq/wiki) | 2.8.5 | [LINK](https://hub.docker.com/r/) | TBD |
-|  | [Salmon](https://github.com/COMBINE-lab/salmon) | 0.15.0 | [LINK](https://hub.docker.com/r/) | TBD |
-
-## Figure
-
-### The Workflow
-
-
-![MUFFIN FLOWCHART FIGURE](.figure/Muffin_Workflow_simple.png)
-
-### The parser output
-
-![PARSER OUTPUT FIGURE](.figure/PANKEGG_simple.png)
-
-
-
-## Installation
-
-!!! The actual version cannot run between step 1 and 2/3 it will be fixed soon
-At the moment to install this pipeline and run this pipeline you need to use the conda installation:
-```sh
-#install the pipeline
-git clone https://github.com/RVanDamme/MAFIN.git
-
-#create an env and install metawrap
-conda create -y -p /path/to/install/metawrap-env python=2.7
-source activate /path/to/install/metawrap-env
-conda config --add channels defaults
-conda config --add channels conda-forge
-conda config --add channels bioconda
-conda config --add channels ursky
-conda install -y -c ursky metawrap-mg
-conda deactivate
-
-#edit MAFIN/modules/metawrap_refine_bin.nf to use the env of metawrap
-#you need to change the line 3 and 25 to the path of your env (/path/to/install/metawrap-env)
-```
-
-## Usage
-the current 2 default usage are:
-Spades
-```
-nextflow run MAFIN/main.nf --output results --assembler metaspades  --illumina fastq_ill/ --ont fastq_nano/ --cpus 2 --memory 16g  -profile conda
-```
-Flye
-```
-nextflow run MAFIN/main.nf --output results --assembler metaflye  --illumina fastq_ill/ --ont fastq_nano/ --cpus 2 --memory 16g  -profile conda
-```
-
-### Options
-
-#### --cpus
-* number of thread available
-* default 2
-
-#### --mem
-* number of memory available
-* default 16g
-
-#### --assembler
-* which method to use
-* can be Hybrid with metaspades or long read + polishing with metaflye
-* required
-
-#### --illumina
-* location of the dir containing the forward and reverse illumina reads in fasta or fastq 
-* required
-
-#### --nanopore
-* location of the dir containing the nanopore reads in fasta or fastq
-* required
-
-#### --output
-* output directory
-* required
-
-### Complete help and options
-```
-    *********Metagenomic Assembly pipeline using nextFlow for Illumina and Nanopore reads*********
-
-    Muffin is composed of 2 part the retrieval of potential genome and the analysis of said genomes
-
-        Usage example for retrieval:
-    nextflow run mafin --retrieve --ont /path/to/ont_dir --illumina /path/to/illumina_dir --metaspades -profile conda
-    or 
-    nextflow run mafin --retrieve --ont /path/to/ont_dir --illumina /path/to/illumina_dir --metaflye -profile conda
-
-        Input:
-    --ont                       path to the directory containing the nanopore read file (fastq)
-    -- illumina                 path to the directory containing the illumina read file (fastq)
-
-        Output (default output is reassemblies from each bins):
-    --output                    path to the output directory (default: $params.output)
-    --assembly                  output the original assembly contigs file (default: false)
-    --out_qc                    output the reads file after qc (default: false)
-    --out_metabat               output the bins produce by metabat2 (default: false)
-    --out_concoct               output the bins produce by concoct (default: false)
-    --out_maxbin                output the bins produce by meaxbin2 (default: false)
-    --out_metawrap              output the bins produce by metawrap refining (default: false)
-    --out_bin_reads             output fastq files containing the reads mapped to each bin (default: false)
-    --out_unmapped              output sorted bam files containing the unmmaped reads of illumina and nanopore (default:false)
-
-
-    
-
-        Parameter:
-    --cpus                      max cores for local use [default: $params.cpus]
-    --memory                    80% of available RAM in GB for --metamaps [default: $params.memory]
-    
-        Options:
-    --checkm_db                 path to an already INSTALLED checkm database (not the tar file)
-    --checkm_tar_db             path to the tar checkm database (it will extract it in the dir)
-    --sourmash                  path to an already installed sourmash database
-    --skip_ill_qc               skip quality control of illumina files
-    --skip_ont_qc               skip quality control of nanopore file
-    --short_qc                  minimum size of the reads to be kept (default: $params.short_qc )
-    --filtlong                  use filtlong to improve the quality furthermore (default: false)
-    --model                     the model medaka will use (default: r941_min_high)
-    --polish_iteration          number of iteration of pilon in the polish step (advanced)
-    --extra_ill                 a list of additional ill sample file (with full path with a * instead of _R1,2.fastq) to use for the binning in Metabat2 and concoct
-    --extra_ont                 a list of additional ont sample file (with full path) to use for the binning in Metabat2 and concoct
-    TBD --SRA_ill                   a list of additional ill sample from SRA accession number to use for the binning in Metabat2 and concoct
-    TBD --SRA_ont                   a list of additional ont sample from SRA accession number to use for the binning in Metabat2 and concoct
-    --skip_metabat2             skip the binning using metabat2 (advanced)
-    --skip_maxbin2              skip the binning using maxbin2 (advanced)
-    --skip_concoct              skip the binning using concoct (advanced)
-
-        Nextflow options:
-    -profile                    change the profile of nextflow (currently available conda)
-    -with-report rep.html       cpu / ram usage (may cause errors)
-    -with-dag chart.html        generates a flowchart for the process tree
-    -with-timeline time.html    timeline (may cause errors)
-```
-## BIBLIOGRAPHY
-
-BWA: Li H. and Durbin R. (2009) Fast and accurate short read alignment with Burrows-Wheeler Transform. Bioinformatics, 25:1754-60. [PMID: 19451168] 
-
-CheckM: Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Research, 25: 1043–1055.
-
-Concoct: Johannes Alneberg, Brynjar Smári Bjarnason, Ino de Bruijn, Melanie Schirmer, Joshua Quick, Umer Z Ijaz, Leo Lahti, Nicholas J Loman, Anders F Andersson & Christopher Quince. 2014. Binning metagenomic contigs by coverage and composition. Nature Methods, doi: 10.1038/nmeth.3103 
-
-Fastp: Shifu Chen, Yanqing Zhou, Yaru Chen, Jia Gu; fastp: an ultra-fast all-in-one FASTQ preprocessor, Bioinformatics, Volume 34, Issue 17, 1 September 2018, Pages i884–i890, https://doi.org/10.1093/bioinformatics/bty560
-
-Filtlong: https://github.com/rrwick/Filtlong
-
-Flye: Mikhail Kolmogorov, Jeffrey Yuan, Yu Lin and Pavel Pevzner, "Assembly of Long Error-Prone Reads Using Repeat Graphs", Nature Biotechnology, 2019 doi:10.1038/s41587-019-0072-8
-
-HMMER: http://hmmer.org/ 
-
-Maxbin2: Wu YW, Tang YH, Tringe SG, Simmons BA, and Singer SW, "MaxBin: an automated binning method to recover individual genomes from metagenomes using an expectation-maximization algorithm", Microbiome, 2:26, 2014.
-
-Medaka: https://github.com/nanoporetech/medaka
-
-Metabat2: Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ 2015;3:e1165. doi:10.7717/peerj.1165
-
-Metawrap: Uritskiy, G.V., DiRuggiero, J. and Taylor, J. (2018). MetaWRAP—a flexible pipeline for genome-resolved metagenomic data analysis. Microbiome, 6(1). https://doi.org/10.1186/s40168-018-0541-1
-
-Minimap2: Li, H. (2018). Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics, 34:3094-3100. doi:10.1093/bioinformatics/bty191
-
-Pilon: Bruce J. Walker, Thomas Abeel, Terrance Shea, Margaret Priest, Amr Abouelliel, Sharadha Sakthikumar, Christina A. Cuomo, Qiandong Zeng, Jennifer Wortman, Sarah K. Young, Ashlee M. Earl (2014) Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement. PLoS ONE 9(11): e112963. doi:10.1371/journal.pone.0112963
-
-pplacer: Matsen FA, Kodner RB, Armbrust EV. 2010. pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree. BMC Bioinformatics 11: doi:10.1186/1471-2105-11-538.
-
-prodigal: Hyatt D, Locascio PF, Hauser LJ, Uberbacher EC. 2012. Gene and translation initiation site prediction in metagenomic sequences. Bioinformatics 28: 2223–2230.
-
-Racon: Vaser R, Sovic I, Nagarajan N, Sikic M. 2017. Fast and accurate de novogenome assembly from long uncorrected reads. Genome Res 27:737–746.https://doi.org/10.1101/gr.214270.116
-
-Samtools: Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, and 1000 Genome Project Data Processing Subgroup, The Sequence alignment/map (SAM) format and SAMtools, Bioinformatics (2009) 25(16) 2078-9 [19505943]
-
-Seqtk: https://github.com/lh3/seqtk
-
-Sourmash: Brown et al, (2016), sourmash: a library for MinHash sketching of DNA, Journal of Open Source Software, 1(5), 27, doi:10.21105/joss.00027
-
-Spades:  Lapidus A., Antipov D., Bankevich A., Gurevich A., Korobeynikov A., Nurk S., Prjibelski A., Safonova Y., Vasilinetc I., Pevzner P. A. New Frontiers of Genome Assembly with SPAdes 3.0.	(poster), 2014 
-
-Unicycler: Wick RR, Judd LM, Gorrie CL, Holt KE (2017) Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13(6): e1005595. https://doi.org/10.1371/journal.pcbi.1005595
-
-## License
-
-Code is [GPL-3.0](LICENSE)
-
-## Contributing
-
-We welcome contributions from the community! See our
-[Contributing](CONTRIBUTING.md) guidelines
+# MUFFIN
+MUFFIN is a hybrid assembly and differential binning workflow for metagenomics, transcriptomics and pathway analysis.
+
+## INDEX
+
+1. [Introduction](#introduction)
+2. [Figure](#figure) :
+    - [Workflow](#the-workflow)
+    - [Parser output](#the-parser-output)
+3. [Installation](#installation) :
+    - [base installation](#base-installation)
+    - [conda usage](#for-conda-usage)
+    - [containers usage](#for-containers-usage)
+    - [software installed locally](#for-usage-of-software-installed-locally)
+4. [Test the pipeline](#test-the-pipeline)
+5. [Usage](#usage) :
+    - [Basic usage](#basic-usage)
+    - [Advanced usage](#advanced-usage)
+6. [Troubleshooting](#troubleshooting)
+7. [Options](#options)
+8. [Complete help and options](#complete-help-and-options)
+9. [Bibliography](#bibliography)
+10. [License](#license)
+
+## Introduction
+
+MUFFIN aims to be a reproducible pipeline for metagenome assembly
+that combines Illumina and Nanopore reads.
+
+MUFFIN uses the following software:
+
+| Task | Software | Version | Docker | Image version|
+| --- | --- | --- | --- | --- |
+| QC illumina | [fastp](https://github.com/OpenGene/fastp) | 0.20.0 | [LINK](https://hub.docker.com/r/nanozoo/fastp) | 0.20.0--78a7c63 |
+| QC ont | automated way to discard shortest reads |  |  |  |
+|  | [filtlong](https://github.com/rrwick/Filtlong) | 0.2.0 | [LINK](https://hub.docker.com/r/nanozoo/filtlong) | v0.2.0--afa175e |
+| metagenomic composition of ont | [sourmash](https://sourmash.readthedocs.io/en/latest/) | 2.0.1 | [LINK](https://hub.docker.com/r/nanozoo/sourmash) | 2.0.1--6970ddc |
+| Hybrid assembly | [Meta-spades](http://cab.spbu.ru/software/spades/) | 3.13.1 | [LINK](https://hub.docker.com/r/nanozoo/spades) | 3.13.1--2c2a4c0 |
+|  | [unicycler](https://github.com/rrwick/Unicycler) | 0.4.7 | [LINK](https://hub.docker.com/r/nanozoo/unicycler) | 0.4.7-0--c0404e6 |
+| Long read assembly | [MetaFlye](https://github.com/fenderglass/Flye) | 2.7 | [LINK](https://hub.docker.com/r/nanozoo/flye) | 2.7--957a1a1 |
+| polishing | [racon](https://github.com/lbcb-sci/racon) | 1.4.13 | [LINK](https://hub.docker.com/r/nanozoo/racon) | 1.4.13--bb8a908 |
+|  | [medaka](https://github.com/nanoporetech/medaka) | 1.0.3 | [LINK](https://hub.docker.com/r/nanozoo/medaka) | 1.0.3--7c62d67 |
+|  | [pilon](https://github.com/broadinstitute/pilon/wiki) | 1.23 | [LINK](https://hub.docker.com/r/nanozoo/pilon) | 1.23--b21026d |
+| mapping | [minimap2](https://github.com/lh3/minimap2) | 2.17 | [LINK](https://hub.docker.com/r/nanozoo/minimap2) | 2.17--caba7af |
+|  | [bwa](http://bio-bwa.sourceforge.net/) | 0.7.17 | [LINK](https://hub.docker.com/r/nanozoo/pilon) | 1.23--b21026d |
+|  | [samtools](http://www.htslib.org/) | 1.9 | [LINK](https://hub.docker.com/r/nanozoo/minimap2) | 2.17--caba7af |
+| retrieve reads mapped to contig | [seqtk](https://github.com/lh3/seqtk) | 1.3 | [LINK](https://hub.docker.com/r/nanozoo/seqtk) | 1.3--dc0d16b |
+| Binning | [Metabat2](https://bitbucket.org/berkeleylab/metabat/src/master/) | 2.13 | [LINK](https://hub.docker.com/r/nanozoo/metabat2) | 2.13--0e2577e |
+|  | [maxbin2](https://sourceforge.net/projects/maxbin2/) | 2.2.7 | [LINK](https://hub.docker.com/r/nanozoo/maxbin2) | 2.2.7--b643a6b |
+|  | [concoct](https://github.com/BinPro/CONCOCT) | 1.1.0 | [LINK](https://hub.docker.com/r/nanozoo/concoct) | 1.1.0--03a3888 |
+|  | [metawrap](https://github.com/bxlab/metaWRAP) | 1.2.2 | [LINK](https://hub.docker.com/r/nanozoo/metawrap) | 1.2.2--de94241 |
+| qc binning | [checkm](https://ecogenomics.github.io/CheckM/) | 1.0.13 | [LINK](https://hub.docker.com/r/nanozoo/nanoplot) | 1.0.13--248242f |
+|Taxonomic Classification  | [sourmash](https://sourmash.readthedocs.io/en/latest/) using the [gt-DataBase](https://gtdb.ecogenomic.org/) | 2.0.1 | [LINK](https://hub.docker.com/r/nanozoo/sourmash) | 2.0.1--6970ddc |
+|  | [GTDB](https://gtdb.ecogenomic.org/) | version r89 |  |  |
+| Annotations (bin and RNA) | [eggNOG](https://github.com/eggnogdb/eggnog-mapper/wiki/eggNOG-mapper-v2) | 2.0.1 | [LINK](https://hub.docker.com/r/nanozoo/eggnog-mapper) | 2.0.1--d5e0c8c |
+|  | [eggNOG DB](http://eggnog5.embl.de/#/app/home) | v5.0 |  |  |
+| *De novo* transcript and quantification | [Trinity](https://github.com/trinityrnaseq/trinityrnaseq/wiki) | 2.9.1 | [LINK](https://hub.docker.com/r/nanozoo/trinity) | 2.9.1--82fe26c |
+|  | [Salmon](https://github.com/COMBINE-lab/salmon) | 0.15.0 | [LINK](https://hub.docker.com/r/nanozoo/trinity) | 2.9.1--82fe26c |
+
+## Figure
+
+### The Workflow
+
+
+![MUFFIN FLOWCHART FIGURE](.figure/Muffin_Workflow_simple.png)
+
+### The parser output
+
+![PARSER OUTPUT FIGURE](.figure/PANKEGG_simple.png)
+
+
+
+## Installation
+
+### base installation
+You need to install Nextflow version 20.01 or later (https://www.nextflow.io/).
+```sh
+# verify the Java version (version 8 or later is required)
+java -version
+
+# Set up nextflow (this creates a nextflow executable in the current directory)
+curl -s https://get.nextflow.io | bash
+
+# If you want the pipeline installed locally, use the following
+git clone https://github.com/RVanDamme/MAFIN.git
+
+# If you do not want to install the pipeline, use the following when running nextflow
+
+nextflow run RVanDamme/MUFFIN --parameters.....
+
+```
+
+### For conda usage
+If you use conda, you need to install MetaWRAP in an environment that you create yourself. This is due to a known issue that will be fixed soon.
+
+```sh
+
+#create an env and install metawrap
+conda create -y -p /path/to/install/metawrap-env python=2.7
+source activate /path/to/install/metawrap-env
+conda config --add channels defaults
+conda config --add channels conda-forge
+conda config --add channels bioconda
+conda config --add channels ursky
+conda install -y -c ursky metawrap-mg
+conda deactivate
+
+# edit MAFIN/modules/metawrap_refine_bin.nf to use the metawrap env
+# you need to change lines 3 and 25 to the path of your env (/path/to/install/metawrap-env)
+```
+
+### For containers usage
+If you use containers (either Docker or Singularity), no extra installation is needed.
+
+### For usage of software installed locally
+You just need to have all the software used in the pipeline (see the table above) installed and available in your $PATH.
+
+## Test the pipeline
+To test the pipeline, a subset of 5 bins is available at https://osf.io/9xmh4/.
+A detailed explanation of all the parameters is available in [Usage](#usage); the most important ones for the test are the executor and engine profiles.
+To run the test, you just need to add "test" to the -profile parameter, e.g.:
+```
+# test locally with conda; you need to specify the cpus and ram available
+nextflow run RVanDamme/MUFFIN --output results_dir --cpus 8 --memory 32g -profile local,conda,test
+
+# test locally with docker; you can change the cpus and ram in configs/containers.config
+# this test also runs the transcriptomics analysis with --rna
+nextflow run RVanDamme/MUFFIN --output results_dir --rna -profile local,docker,test
+
+# test using gcloud with docker; you can change the cpus and ram in configs/containers.config
+# this test uses flye instead of spades with --assembler metaflye
+nextflow run RVanDamme/MUFFIN --output results_dir --assembler metaflye -profile gcloud,docker,test
+```
+The subset also contains RNA data to test the transcriptomics analysis; you just need to activate it using "--rna".
+The results of the different test runs are available at https://osf.io/m5czv/.
+
+## Usage
+
+### Basic usage
+
+```
+path/to/nextflow run $MUFFIN_pipeline --output results_dir --assembler $assembler --illumina fastq_ill/ --ont fastq_ont/ --cpus 16 --memory 64g --modular full -profile $profile_executor,$profile_engine
+```
+ $MUFFIN_pipeline is either "path/to/MUFFIN/main.nf" or "RVanDamme/MUFFIN"
+
+ $assembler is either:
+  - "metaspades" for hybrid assembly
+  - "metaflye" for long-read assembly with short-reads polishing
+
+ $profile_executor can be:
+  - "local" to run on your computer
+  - "gcloud" to run on Google Life Sciences cloud computing (you need to set up your project in nextflow.config)
+  - "slurm" to run on an HPC cluster using slurm (e.g. UPPMAX)
+
+ $profile_engine can be:
+  - "local_engine" to execute the software installed locally
+  - "conda" to execute using the conda installation
+  - "docker" or "singularity" to execute using the docker containers
+
+You can add "-resume" at the end of the command to restart it while keeping the results of the processes that already succeeded.
+This is often used in case of an error in the pipeline, or if you slightly modify the command and want to avoid running everything again.
+One example is to run the pipeline without RNA and then rerun it with RNA data; in this specific case, the second time you add:
+```
+--rna path/to/rna -resume
+```
+Only the transcript processes and the final parsing will be run.
+
+### Advanced usage
+
+You can use RNA data to get a transcriptomics analysis; to do so, add "--rna path/to/fastq_rna/".
+
+You can run only part of the pipeline; to do so, set --modular to the appropriate value:
+ - "full" runs the 3 steps of MUFFIN (assemble, classify, annotate)
+ - "assemble" runs the assembly and binning
+ - "classify" runs the classification of the bins (requires a different input)
+ - "annotate" runs the annotation step of the bins (requires a different input) and of the RNA if provided
+ - "assemb-class" runs the assemble and classify steps
+ - "assemb-annot" runs the assemble and annotate steps
+ - "class-annot" runs the classify and annotate steps (requires a different input)
+
+To run classify and annotate independently from the assemble step, you need to provide a CSV file of the bins.
+The structure of the file should be:
+```
+Samplename,path/to/bin1.fa
+Samplename,path/to/bin2.fa
+...
+Samplename,path/to/binX.fa
+Samplename,path/to/binY.fa
+```
+If you run "classify" with or without "annotate", use "--bin_classify".
+If you run "annotate" without "classify", use "--bin_annotate".
+
+
+## Troubleshooting
+* If metawrap fails when using conda, check that you installed metawrap in a conda environment and put its path in "modules/metawrap_refine_bin.nf".
+
+* If you run the pipeline with Google Life Sciences and get error code 14,
+  it means the process was killed by Google. You just need to run the pipeline again; don't forget to add "-resume".
+
+* If either MetaWRAP or CheckM fails with the error shown below,
+  you need to increase the RAM, either in the command (for local_engine and conda) or in "configs/containers.config".
+```
+IOError: [Errno 2] No such file or directory: 'binsA.checkm/storage/tree/concatenated.tre' 
+``` 
+
+* For other issues, please open a ticket on [GitHub](https://github.com/RVanDamme/MUFFIN/issues).
+
+## Options
+
+### --cpus
+* number of threads available
+* default 2
+
+### --memory
+* amount of memory available
+* default 16g
+
+### --assembler
+* which assembly method to use
+* either hybrid assembly with metaspades, or long-read assembly + short-read polishing with metaflye
+* required
+
+### --illumina
+* location of the directory containing the forward and reverse Illumina reads in fasta or fastq
+* required
+
+### --ont
+* location of the directory containing the Nanopore reads in fasta or fastq
+* required
+
+### --output
+* output directory
+* required
+
+### -profile
+* engine and executor
+* required
+* engine: local_engine, conda, docker, singularity
+* executor: local, gcloud, slurm
+
+
+## Complete help and options
+```
+    *********hybrid assembly and differential binning workflow for metagenomics, transcriptomics and pathway analysis*********
+
+    MUFFIN is still under development, please wait until the first non-edge version release before using it.
+    Please cite us using https://www.biorxiv.org/content/10.1101/2020.02.08.939843v1
+
+    MUFFIN is composed of 3 parts: the assembly of potential metagenome assembled genomes (MAGs); the classification of the MAGs; and the annotation of the MAGs.
+
+        Usage example:
+    nextflow run RVanDamme/MUFFIN --output result --ont nanopore/ --illumina illumina/ --assembler metaspades --rna rna/ -profile local,docker
+    or 
+    nextflow run RVanDamme/MUFFIN --output result --ont nanopore/ --illumina illumina/ --assembler metaflye -profile local,docker
+
+        Input:
+    --ont                       path to the directory containing the nanopore read file (fastq) (default: $params.ont)
+    --illumina                  path to the directory containing the illumina read file (fastq) (default: $params.illumina)
+    --rna                       path to the directory containing the RNA-seq read file (fastq) (default: none)
+    --bin_classify              path to the directory containing the bins files to classify (default: none)
+    --bin_annotate              path to the directory containing the bins files to annotate (default: none)
+    --assembler                 the assembler to use in the assembly step (default: $params.assembler)
+
+        Optional input:
+    --check_db                  path to the checkm database
+    --check_tar_db              path to the checkm database tar compressed
+    --sourmash_db               path to the LCA database for sourmash (default: GTDB LCA formated)
+    --eggnog_db                 path to the eggNOG database
+
+        Output:
+    --output                    path to the output directory (default: $params.output)
+
+        Output files:
+        You can see the output structure at https://osf.io/a6hru/
+    QC                          The reads file after qc
+    Assembly                    The assembly contigs file
+    Bins                        The bins produced by CONCOCT, MetaBAT2, MaxBin2 and MetaWRAP (the refining of bins)
+    Mapped bin reads            The fastq files containing the reads mapped to each metawrap bin
+    Unmapped bin reads          The fastq files containing the unmapped reads of illumina and nanopore
+    Reassembly                  The reassembly files of the bins (.fa and .gfa)
+    Checkm                      Various files output by CheckM (summary, taxonomy, plots and output dir)
+    Sourmash                    The classification done by sourmash
+    Classify summary            The summary of the classification and quality control of the bins (csv file)
+    RNA output                  The de novo assembled transcript and the quantification by Salmon
+    Annotation                  The annotations files from eggNOG (tsv format)
+    Parsed output               HTML files that summarize the annotations and show graphically the pathways
+
+
+    
+
+        Basic Parameter:
+    --cpus                      max cores for local use [default: $params.cpus]
+    --memory                    80% of available RAM in GB for --metamaps [default: $params.memory]
+
+
+        Workflow Options:
+    --skip_ill_qc               skip quality control of illumina files
+    --skip_ont_qc               skip quality control of nanopore file
+    --short_qc                  minimum size of the reads to be kept (default: $params.short_qc )
+    --filtlong                  use filtlong to improve the quality furthermore (default: false)
+    --model                     the model medaka will use (default: $params.model)
+    --polish_iteration          number of iteration of pilon in the polish step (default: $params.polish_iteration)
+    --extra_ill                 a list of additional ill sample file (with full path with a * instead of _R1,2.fastq) to use for the binning in Metabat2 and concoct
+    --extra_ont                 a list of additional ont sample file (with full path) to use for the binning in Metabat2 and concoct
+    --skip_metabat2             skip the binning using metabat2 (advanced)
+    --skip_maxbin2              skip the binning using maxbin2 (advanced)
+    --skip_concoct              skip the binning using concoct (advanced)
+
+        Nextflow options:
+    -profile                    change the profile of nextflow both the engine and executor more details on github README
+    -resume                     resume the workflow where it stopped
+    -with-report rep.html       cpu / ram usage (may cause errors)
+    -with-dag chart.html        generates a flowchart for the process tree
+    -with-timeline time.html    timeline (may cause errors)
+```
+## BIBLIOGRAPHY
+
+BWA: Li H. and Durbin R. (2009) Fast and accurate short read alignment with Burrows-Wheeler Transform. Bioinformatics, 25:1754-60. [PMID: 19451168] 
+
+CheckM: Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Research, 25: 1043–1055.
+
+Concoct: Johannes Alneberg, Brynjar Smári Bjarnason, Ino de Bruijn, Melanie Schirmer, Joshua Quick, Umer Z Ijaz, Leo Lahti, Nicholas J Loman, Anders F Andersson & Christopher Quince. 2014. Binning metagenomic contigs by coverage and composition. Nature Methods, doi: 10.1038/nmeth.3103 
+
+Fastp: Shifu Chen, Yanqing Zhou, Yaru Chen, Jia Gu; fastp: an ultra-fast all-in-one FASTQ preprocessor, Bioinformatics, Volume 34, Issue 17, 1 September 2018, Pages i884–i890, https://doi.org/10.1093/bioinformatics/bty560
+
+Filtlong: https://github.com/rrwick/Filtlong
+
+Flye: Mikhail Kolmogorov, Jeffrey Yuan, Yu Lin and Pavel Pevzner, "Assembly of Long Error-Prone Reads Using Repeat Graphs", Nature Biotechnology, 2019 doi:10.1038/s41587-019-0072-8
+
+HMMER: http://hmmer.org/ 
+
+Maxbin2: Wu YW, Tang YH, Tringe SG, Simmons BA, and Singer SW, "MaxBin: an automated binning method to recover individual genomes from metagenomes using an expectation-maximization algorithm", Microbiome, 2:26, 2014.
+
+Medaka: https://github.com/nanoporetech/medaka
+
+Metabat2: Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ 2015;3:e1165. doi:10.7717/peerj.1165
+
+Metawrap: Uritskiy, G.V., DiRuggiero, J. and Taylor, J. (2018). MetaWRAP—a flexible pipeline for genome-resolved metagenomic data analysis. Microbiome, 6(1). https://doi.org/10.1186/s40168-018-0541-1
+
+Minimap2: Li, H. (2018). Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics, 34:3094-3100. doi:10.1093/bioinformatics/bty191
+
+Pilon: Bruce J. Walker, Thomas Abeel, Terrance Shea, Margaret Priest, Amr Abouelliel, Sharadha Sakthikumar, Christina A. Cuomo, Qiandong Zeng, Jennifer Wortman, Sarah K. Young, Ashlee M. Earl (2014) Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement. PLoS ONE 9(11): e112963. doi:10.1371/journal.pone.0112963
+
+pplacer: Matsen FA, Kodner RB, Armbrust EV. 2010. pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree. BMC Bioinformatics 11: doi:10.1186/1471-2105-11-538.
+
+prodigal: Hyatt D, Locascio PF, Hauser LJ, Uberbacher EC. 2012. Gene and translation initiation site prediction in metagenomic sequences. Bioinformatics 28: 2223–2230.
+
+Racon: Vaser R, Sovic I, Nagarajan N, Sikic M. 2017. Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res 27:737–746. https://doi.org/10.1101/gr.214270.116
+
+Samtools: Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, and 1000 Genome Project Data Processing Subgroup, The Sequence alignment/map (SAM) format and SAMtools, Bioinformatics (2009) 25(16) 2078-9 [19505943]
+
+Seqtk: https://github.com/lh3/seqtk
+
+Sourmash: Brown et al, (2016), sourmash: a library for MinHash sketching of DNA, Journal of Open Source Software, 1(5), 27, doi:10.21105/joss.00027
+
+Spades: Lapidus A., Antipov D., Bankevich A., Gurevich A., Korobeynikov A., Nurk S., Prjibelski A., Safonova Y., Vasilinetc I., Pevzner P. A. New Frontiers of Genome Assembly with SPAdes 3.0. (poster), 2014
+
+Unicycler: Wick RR, Judd LM, Gorrie CL, Holt KE (2017) Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13(6): e1005595. https://doi.org/10.1371/journal.pcbi.1005595
+
+## License
+
+Code is [GPL-3.0](LICENSE)
+
+## Contributing
+
+We welcome contributions from the community! See our
+[Contributing](CONTRIBUTING.md) guidelines.
diff --git a/bin/pankegg_bin.py b/bin/pankegg_bin.py
index b0e26e8..9f72057 100755
--- a/bin/pankegg_bin.py
+++ b/bin/pankegg_bin.py
@@ -72,7 +72,7 @@ def bin_parse(bins,
 def write_html_sample(dictgeneral, dict_global_sample, output,
                       globalpathwaylist, binnamelist):
 
-    out = output+"/MAFIN_sample_result.html"
+    out = output+"/MUFFIN_sample_result.html"
     outfile = open(out, "w")
 
     outfile.write("""
@@ -80,7 +80,7 @@ def write_html_sample(dictgeneral, dict_global_sample, output,
     <html lang="en-US">
     <head>
         <meta charset="utf-8">
-        <title>MAFIN Sample result</title>
+        <title>MUFFIN Sample result</title>
         <meta name="author" content="Renaud Van Damme">
     </head>"""
                   )
@@ -96,7 +96,7 @@ def write_html_sample(dictgeneral, dict_global_sample, output,
         <li>Total number of bins: {num_bins}</li>
         <li>Total number of unique pathways in bins: {num_path}</li>
         <li>This file contains only the eggNOG annotation that have a kegg pathway id, for further research please look at the annotations.tsv files</li>
-        <li>This result file was produced by <a href="https://github.com/RVanDamme/MAFIN">MAFIN</a> </li>
+        <li>This result file was produced by <a href="https://github.com/RVanDamme/MUFFIN">MUFFIN</a> </li>
     </ul>
     </h2>
     </div>
@@ -231,12 +231,18 @@ def write_html_sample(dictgeneral, dict_global_sample, output,
             <li> Figure detail
             <ul>
                 <li>The Figures in the links: <ul> 
-                    <li>The orthologs present in the bins are in green</li>
+                    <li>The orthologs present in the bins are in <font color="#e7bcd4">▉▉</font></li>
             <li>Troubleshooting
             <ul>
                 <li>When the link of the pathway is not loading or not showing anything, it means that there is too much orthologs to show on the figure.
             Try to strip everything after "https://www.kegg.jp/kegg-bin/show_pathway?PATWAY_ENTRY_NUMBER/" to still see the pathway</li>
             </ul></li>
+            <li> Troubleshooting
+                <ul>
+                <li> When an error message such as "ko00000 unknow by the KEGG DATABASE" appears in the table, it means that the ID given by the annotation software (eggNOG)
+                is not known by the KEGG database; this error is potentially due to a version conflict between eggNOG and the KEGG database.
+                </li></ul>
+            </li>
         </ol>
         </p></div>
         </div>
@@ -296,7 +302,7 @@ def write_html_sample(dictgeneral, dict_global_sample, output,
                 set_html= set()
                 for gene in set_total_gene:
                     set_html.add(gene)
-                list_html = "".join(set_html)
+                list_html = "/".join(set_html)
             except KeyError:
                 list_html = ""
             outfile.write(f"""
@@ -339,7 +345,7 @@ def write_html_sample(dictgeneral, dict_global_sample, output,
 def write_html_bins(dictgeneral, dict_global_bin, output,
                     globalpathwaylist):
     for bin_html in dict_global_bin.keys():
-        out = output+"/MAFIN_"+bin_html+"_result.html"
+        out = output+"/MUFFIN_"+bin_html+"_result.html"
         outfile = open(out, "w")
 
         outfile.write(f"""
@@ -347,11 +353,12 @@ def write_html_bins(dictgeneral, dict_global_bin, output,
         <html lang="en-US">
         <head>
             <meta charset="utf-8">
-            <title>MAFIN {bin_html} result</title>
+            <title>MUFFIN {bin_html} result</title>
             <meta name="author" content="Renaud Van Damme">
         </head>"""
                       )
 
+
         num_path_bin = len(dict_global_bin[bin_html])
         num_path = len(globalpathwaylist)
         outfile.write(f"""
@@ -363,7 +370,7 @@ def write_html_bins(dictgeneral, dict_global_bin, output,
             <li>Total number of unique pathway in this bin: {num_path_bin}</li>
             <li>Total number of unique pathways in all bins: {num_path}</li>
             <li>This file contains only the eggNOG annotation that have a kegg pathway id, for further research please look at the annotations.tsv files</li>
-            <li>This result file was produced by <a href="https://github.com/RVanDamme/MAFIN">MAFIN</a> </li>
+            <li>This result file was produced by <a href="https://github.com/RVanDamme/MUFFIN">MUFFIN</a> </li>
         </ul>
         </h2>
         </div>
@@ -521,7 +528,7 @@ def write_html_bins(dictgeneral, dict_global_bin, output,
                         <th class="header">All orthologs</th>
                     </tr>
                     <tr>
-                        <li><font color="#e7bcd4">▉▉</font>Represent the orthologs of the Bin</th>
+                        <th class="header"><font color="#e7bcd4">▉▉</font>Represent the orthologs of the Bin</th>
                         <th class="header">List of the orthologs of the bin</th>
                         <th class="header">List of the orthologs of all bins</th>
 
@@ -542,7 +549,7 @@ def write_html_bins(dictgeneral, dict_global_bin, output,
                     set_html = set()
                     for gene in set_gene:
                         set_html.add(gene)
-                    list_html = "".join(set_html)
+                    list_html = "/".join(set_html)
                 outfile.write(f"""
                     <tr>
                     <td class="pathway_gene"><a href="https://www.kegg.jp/kegg-bin/show_pathway?{pathway}/{list_html}/default%3d%23e7bcd4">{pathway_name}</a></td>
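Note on the `"".join` to `"/".join` change in `bin/pankegg_bin.py`: the `show_pathway` link template in this file places the ortholog IDs between `/` separators, so joining without a separator fuses all IDs into a single token. The sketch below is a minimal illustration, not code from the repository. `build_kegg_url` is a hypothetical helper, the pathway and KO identifiers are only illustrative inputs, and `sorted()` is added purely to make the output deterministic (the script joins the set directly).

```python
def build_kegg_url(pathway, orthologs, default_code="default%3d%23e7bcd4"):
    """Hypothetical helper mirroring the link f-string in write_html_bins."""
    # sorted() gives a stable order; the script itself joins the set as-is.
    gene_part = "/".join(sorted(orthologs))
    return f"https://www.kegg.jp/kegg-bin/show_pathway?{pathway}/{gene_part}/{default_code}"


genes = {"K00844", "K01810"}

# Old behaviour: no separator, the two IDs become one unusable token.
fused = "".join(sorted(genes))

# New behaviour: one "/" between each ortholog ID.
url = build_kegg_url("ko00010", genes)
print(fused)
print(url)
```

Keeping the join in one place like this would also avoid mixing the two styles used in the diff (`"/".join(...)` in some places, `gene+"/"` list comprehensions in others).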
diff --git a/bin/pankegg_bin_RNA.py b/bin/pankegg_bin_RNA.py
index 1a4d30a..b7e0a1d 100755
--- a/bin/pankegg_bin_RNA.py
+++ b/bin/pankegg_bin_RNA.py
@@ -178,7 +178,7 @@ def write_html_sample(dict_global_sample, output,
                       globalpathwaylist, binnamelist, rna_pathway_list,
                       dictrna):
 
-    out = output+"/MAFIN_sample_result.html"
+    out = output+"/MUFFIN_sample_result.html"
     outfile = open(out, "w")
 
     outfile.write("""
@@ -186,7 +186,7 @@ def write_html_sample(dict_global_sample, output,
     <html lang="en-US">
     <head>
         <meta charset="utf-8">
-        <title>MAFIN Sample result</title>
+        <title>MUFFIN Sample result</title>
         <meta name="author" content="Renaud Van Damme">
     </head>"""
                   )
@@ -204,7 +204,7 @@ def write_html_sample(dict_global_sample, output,
         <li>Total number of unique pathways in bins: {num_path}</li>
         <li>Total number of unique pathways in RNA: {num_path_rna}</li>
         <li>This file contains only the eggNOG annotation that have a kegg pathway id, for further research please look at the annotations.tsv files</li>
-        <li>This result file was produced by <a href="https://github.com/RVanDamme/MAFIN">MAFIN</a> </li>
+        <li>This result file was produced by <a href="https://github.com/RVanDamme/MUFFIN">MUFFIN</a> </li>
     </ul>
     </h2>
     </div>
@@ -341,11 +341,10 @@ def write_html_sample(dict_global_sample, output,
               <li> Figure detail
               <ul>
                   <li>The Figures in the links: <ul>
-                      <li>The orthologs in both RNA-seq and in the bins are in green</li>
-                      <li>The orthologs present in the bins but that are not in the RNA-seq are in orange</li>
-                      <li>The orthologs present in the RNA-seq are in purple</li>
-                      <li>The orthologs present in the bins are in red</li>
-                      <li>The orthologs absent from the samples are in blue</li></ul></li></ul></li>
+                      <li>The orthologs in both RNA-seq and in the bins are in <font color="#e7bcd4">▉▉</font></li>
+                      <li>The orthologs present in the bins but that are not in the RNA-seq are in <font color="#7f5b6c">▉▉</font></li>
+                      <li>The orthologs present in the RNA-seq are in <font color="#3bbc9a">▉▉</font></li>
+                      <li>The orthologs present in the bins are in <font color="#f3c98b">▉▉</font></li>
               <li>Troubleshooting
               <ul>
                   <li>When the link of the pathway is not loading or not showing anything, it means that there is too much orthologs to show on the figure.
@@ -419,8 +418,8 @@ def write_html_sample(dict_global_sample, output,
                 set_html_rnagene = set()
                 for gene in set_activgene:
                     set_html_rnagene.add(gene)
-                list_html_active_gene = "".join(set_html_activgene)
-                list_html_rnagene = "".join(set_html_rnagene)
+                list_html_active_gene = "/".join(set_html_activgene)
+                list_html_rnagene = "/".join(set_html_rnagene)
             except KeyError:
                 list_active_gene = ""
                 n_rnaseq_gene = ""
@@ -436,12 +435,12 @@ def write_html_sample(dict_global_sample, output,
             list_html_inactive_gene_coded = "".join([
                 inactiv+"%09%237f5b6c,black/" for inactiv in list_inactive_gene])            
             list_html_inactive_gene = "".join([
-                inactiv for inactiv in list_inactive_gene])
+                inactiv+"/" for inactiv in list_inactive_gene])
             list_html_all_gene = "".join([
-                gene for gene in list(set_gene)])
+                gene+"/" for gene in list(set_gene)])
             outfile.write(f"""
 			<tr>
-			<td class="pathway_gene"><a href="https://www.kegg.jp/kegg-bin/show_pathway?{pathway}/{list_html_inactive_gene_with_code}/{list_html_active_gene}/default%3d%23e7bcd4">{pathway_name}
+			<td class="pathway_gene"><a href="https://www.kegg.jp/kegg-bin/show_pathway?{pathway}/{list_html_inactive_gene_coded}/{list_html_active_gene}/default%3d%23e7bcd4">{pathway_name}
             <font color="#e7bcd4">▉▉</font>from bins and in RNA-seq and <font color="#7f5b6c">▉▉</font>from bins and not in RNA-seq</a></td>
 			"""
 						  )
@@ -500,7 +499,7 @@ def write_html_bins(dict_global_bin, output,
                     globalpathwaylist, rna_pathway_list,
                     dictrna):
     for bin_html in dict_global_bin.keys():
-        out = output+"/MAFIN_"+bin_html+"_result.html"
+        out = output+"/MUFFIN_"+bin_html+"_result.html"
         outfile = open(out, "w")
 
         outfile.write(f"""
@@ -508,7 +507,7 @@ def write_html_bins(dict_global_bin, output,
         <html lang="en-US">
         <head>
             <meta charset="utf-8">
-            <title>MAFIN {bin_html} result</title>
+            <title>MUFFIN {bin_html} result</title>
             <meta name="author" content="Renaud Van Damme">
         </head>"""
                       )
@@ -526,13 +525,14 @@ def write_html_bins(dict_global_bin, output,
             <li>Total number of unique pathways in all bins: {num_path}</li>
             <li>Total number of unique pathways in RNA: {num_path_rna}</li>
             <li>This file contains only the eggNOG annotation that have a kegg pathway id, for further research please look at the annotations.tsv files</li>
-            <li>This result file was produced by <a href="https://github.com/RVanDamme/MAFIN">MAFIN</a> </li>
+            <li>This result file was produced by <a href="https://github.com/RVanDamme/MUFFIN">MUFFIN</a> </li>
         </ul>
         </h2>
         </div>
         """
         )
 
+
         outfile.write("""
                   <style type="text/css">
           .tg {
@@ -666,9 +666,8 @@ def write_html_bins(dict_global_bin, output,
             <li> Figure detail
             <ul>
                 <li>The Figures in the links: <ul> 
-                    <li>The orthologs expressed by RNA are in green</li>
-                    <li>The orthologs present in the bins but that are not in the RNA are in orange</li>
-                    <li>The orthologs absent from the samples are in blue</li></ul></li></ul></li>
+                    <li>The orthologs from the bins expressed by RNA are in <font color="#e7bcd4">▉▉</font></li>
+                    <li>The orthologs present in the bins but that are not in the RNA are in <font color="#7f5b6c">▉▉</font></li>
             <li>Troubleshooting
             <ul>
                 <li>When the link of the pathway is not loading or not showing anything, it means that there is too much orthologs to show on the figure.
@@ -676,6 +675,12 @@ def write_html_bins(dict_global_bin, output,
                 <li>In the figure you can have Green case that also contains orange.
             If the case is composed of multiple orthologs and some are in RNA and some only in the bins the case will be highlighted in green even tough it should be green and orange</li>
             </ul></li>
+            <li> Troubleshooting
+                <ul>
+                <li> When an error message such as "ko00000 unknow by the KEGG DATABASE" appears in the table, it means that the ID given by the annotation software (eggNOG)
+                is not known by the KEGG database; this error is potentially due to a version conflict between eggNOG and the KEGG database.
+                </li></ul>
+            </li>
         </ol>
         </p></div>
         </div>
@@ -719,7 +724,7 @@ def write_html_bins(dict_global_bin, output,
                         <th class="header2"><font color="#7f5b6c">▉▉</font>Orthologs present in bins but not in RNA-seq annotation</th>
                         <th class="header2"><font color="#f3c98b">▉▉</font>Orthologs based on bins annotation</th>
                         <th class="header2"><font color="#e7bcd4">▉▉</font>list of orthologs of the bin present in RNAseq</th>
-                        <th class="header2"><font color="#e7bcd4">▉▉</font>list of orthologs of the bin absent in RNAseq</th>
+                        <th class="header2"><font color="#7f5b6c">▉▉</font>list of orthologs of the bin absent in RNAseq</th>
 
                     </tr>
         """)
@@ -742,10 +747,10 @@ def write_html_bins(dict_global_bin, output,
                     for gene in dict_global_bin[bin_html][pathway][1]:
                         set_gene.add(gene)
                     list_html_all_gene = "".join([
-                        gene for gene in list(set_gene)])
+                        gene+"/" for gene in list(set_gene)])
                     list_inactive_gene=[]
                     if dict_global_bin[bin_html][pathway][3] != "":
-                        list_html_active_gene = "".join(set_html_active_gene)
+                        list_html_active_gene = "/".join(set_html_active_gene)
                         list_active_gene = list(set_active_gene)
                         for elem in list(set_gene):
                             if elem not in list_active_gene:
@@ -753,16 +758,16 @@ def write_html_bins(dict_global_bin, output,
                         list_html_inactive_gene_coded = "".join([
                             inactiv+"%09%237f5b6c,black/" for inactiv in list_inactive_gene])
                         list_html_inactive_gene = "".join([
-                            inactiv for inactiv in list_inactive_gene])
+                            inactiv+"/" for inactiv in list_inactive_gene])
                     else:
                         list_html_active_gene = ""
                         list_html_inactive_gene = ""
                         list_active_gene = ""
-                        list_inactive_gene = list(set_gene)
+                        list_inactive_gene = "/".join(set_gene)
                     set_html_all_gene = set()
                     for gene in dict_global_bin[bin_html][pathway][1]:
                         set_html_all_gene.add(gene)
-                    list_html_all_gene = "".join(set_html_all_gene)
+                    list_html_all_gene = "/".join(set_html_all_gene)
                     outfile.write(f"""
 					<tr>
 					<td class="pathway_gene"><a href="https://www.kegg.jp/kegg-bin/show_pathway?{pathway}/{list_html_active_gene}/{list_html_inactive_gene_coded}/default%3d%23e7bcd4">{pathway_name}
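Note on the colour-coded links in `bin/pankegg_bin_RNA.py`: as far as can be read from these hunks, orthologs found in both the bin and the RNA-seq data are listed bare and take the default colour, while orthologs found only in the bin each carry an explicit colour suffix. The sketch below shows that idea under stated assumptions. `kegg_colored_url` is a hypothetical helper, the identifiers are only illustrative, and the two encoded strings are copied from the f-strings in the diff. Unlike the script, which appends `/` to every coded entry and then adds another `/` in the template, the sketch emits exactly one separator between segments; whether KEGG treats the two forms identically has not been checked here.

```python
# URL-encoded colour codes copied from the f-strings in this diff.
INACTIVE_CODE = "%09%237f5b6c,black"  # "\t#7f5b6c,black": in bins, not in RNA-seq
DEFAULT_CODE = "default%3d%23e7bcd4"  # "default=#e7bcd4": in bins and in RNA-seq


def kegg_colored_url(pathway, bin_genes, rna_genes):
    """Hypothetical helper: link a pathway with per-ortholog colours."""
    bin_genes, rna_genes = set(bin_genes), set(rna_genes)
    active = sorted(bin_genes & rna_genes)    # expressed: default colour
    inactive = sorted(bin_genes - rna_genes)  # not expressed: explicit colour
    coded = [gene + INACTIVE_CODE for gene in inactive]
    # Every segment is separated by exactly one "/".
    segments = [pathway, *active, *coded, DEFAULT_CODE]
    return "https://www.kegg.jp/kegg-bin/show_pathway?" + "/".join(segments)


url = kegg_colored_url("ko00010", {"K00844", "K00850", "K01810"}, {"K00844"})
print(url)
```

Building the segment list first also covers the case where no ortholog is expressed: the active part is simply empty, so no stray empty segment appears in the link.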
diff --git a/configs/conda.config b/configs/conda.config
new file mode 100644
index 0000000..cbe2f18
--- /dev/null
+++ b/configs/conda.config
@@ -0,0 +1,46 @@
+process {
+    withLabel : fastp { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::fastp=0.20.0'}
+    withLabel : filtlong { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::filtlong=0.2.0'}
+    withLabel : sourmash { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::sourmash=2.0.1 '}
+    withLabel : spades { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::spades=3.13.2'}
+    withLabel : flye { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::flye=2.7'} 
+    withLabel : racon { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::racon=1.4.13 '}
+    withLabel : medaka { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::medaka=1.0.3 '}
+    withLabel : pilon { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::pilon=1.23 bioconda::bwa=0.7.17 bioconda::samtools=1.9'}
+    withLabel : minimap2 { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::minimap2=2.17 bioconda::samtools=1.9'}
+    withLabel : bwa { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::bwa=0.7.17 bioconda::samtools=1.9'}
+    withLabel : metabat2 { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::metabat2=2.13'}
+    withLabel : maxbin2 { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::maxbin2=2.2.7'}
+    withLabel : concoct { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::concoct=1.1.0'}
+    withLabel : checkm { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::checkm-genome=1.0.13'}
+    withLabel : metawrap { cpus = params.cpus ; memory = params.memory;
+            conda = 'ursky::metawrap-mg=1.2.2'}
+    withLabel : seqtk { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::seqtk=1.3 bioconda::samtools=1.9 '}
+    withLabel : unicycler { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::unicycler=0.4.7 '}
+    //withLabel : dammit { cpus = params.cpus ; memory = params.memory
+            //conda = 'bioconda::dammit=1.0 '}
+    withLabel : eggnog { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::diamond anaconda::biopython bioconda::eggnog-mapper=2.0.1 '}
+    withLabel : trinity { cpus = params.cpus ; memory = params.memory
+            conda = 'bioconda::trinity=2.9.1 '}
+    withLabel : python38 { cpus = params.cpus ; memory = params.memory
+            conda = 'python=3.8 '}
+    // withLabel : { cpus = params.cpus ; memory = params.memory
+            // conda = 'bioconda:: '}
+}      
\ No newline at end of file
diff --git a/configs/container.config b/configs/container.config
new file mode 100644
index 0000000..bb2d5c6
--- /dev/null
+++ b/configs/container.config
@@ -0,0 +1,26 @@
+process {
+    // withLabel: busco { cpus = 8 ; memory = '30g' ; container = 'nanozoo/busco:3.0.2--0d4c614' } 
+    withLabel: bwa { cpus = 8 ; memory = '30g'; container = 'nanozoo/pilon:1.23--b21026d' } 
+    withLabel: concoct { cpus = 8 ; memory = '30g' ; container = 'nanozoo/concoct:1.1.0--03a3888' }
+    withLabel: fastp { cpus = 8 ; memory = '30g' ; container = 'nanozoo/fastp:0.20.0--78a7c63' }
+    withLabel: filtlong { cpus = 8 ; memory = '14g' ; container = 'nanozoo/filtlong:v0.2.0--afa175e' }
+    withLabel: flye { cpus = 8 ; memory = '30g' ; container = 'nanozoo/flye:2.7--957a1a1' }
+    withLabel: maxbin2 { cpus = 8 ; memory = '30g' ; container = 'nanozoo/maxbin2:2.2.7--b643a6b' }  
+    withLabel: medaka { cpus = 8 ; memory = '30g' ; container = 'nanozoo/medaka:1.0.3--7c62d67' } 
+    withLabel: metabat2 { cpus = 8 ; memory = '30g' ; container = 'nanozoo/metabat2:2.13--0e2577e' }  
+    withLabel: metawrap { cpus = 24 ; memory = '150g' ; container = 'nanozoo/metawrap:1.2.2--de94241' } 
+    withLabel: minimap2 { cpus = 8 ; memory = '30g' ; container = 'nanozoo/minimap2:2.17--caba7af' }
+    withLabel: checkm { cpus = 24 ; memory = '150g' ; container = 'nanozoo/checkm:1.0.13--248242f' }
+    // withLabel: nanoplot { cpus =  ; memory = '32g' ; container = 'nanozoo/nanoplot:1.25.0--4e2882f' }
+    //withLabel: checkm(withLabel: metawrap)
+    withLabel: pilon { cpus = 24 ; memory = '150g'; container = 'nanozoo/pilon:1.23--b21026d' } 
+    withLabel: racon { cpus = 8 ; memory = '30g' ; container = 'nanozoo/racon:1.4.13--bb8a908' } 
+    withLabel: seqtk { cpus = 8 ; memory = '30g' ; container = 'nanozoo/seqtk:1.3--dc0d16b' } 
+    withLabel: sourmash { cpus = 8 ; memory = '30g' ; container = 'nanozoo/sourmash:2.0.1--6970ddc'  }
+    withLabel: spades { cpus = 24 ; memory = '150g' ; container = 'nanozoo/spades:3.13.1--2c2a4c0'  }
+    withLabel: ubuntu { cpus = 8 ; memory = '30g' ; container = 'nanozoo/template:3.8--ccd0653' } 
+    withLabel: unicycler { cpus = 24 ; memory = '150g' ; container = 'nanozoo/unicycler:0.4.7-0--c0404e6' }
+    //withLabel: dammit { cpus = 16 ; memory = '48g' ; container = 'rvandamme/dammit:1' } //NOT USED ANYMORE
+    withLabel: eggnog { cpus = 8 ; memory = '30g' ; container = 'nanozoo/eggnog-mapper:2.0.1--d5e0c8c' }
+    withLabel: trinity { cpus = 8 ; memory = '30g' ; container = 'nanozoo/trinity:2.9.1--82fe26c' }
+}
\ No newline at end of file
diff --git a/configs/local.config b/configs/local.config
new file mode 100644
index 0000000..a0a9107
--- /dev/null
+++ b/configs/local.config
@@ -0,0 +1,23 @@
+process {
+    withLabel: bwa { cpus =  ; memory = 'g' } 
+    withLabel: concoct { cpus =  ; memory = 'g' }
+    withLabel: fastp { cpus =  ; memory = 'g'  }
+    withLabel: filtlong { cpus =  ; memory = 'g'  }
+    withLabel: flye { cpus =  ; memory = 'g'  }
+    withLabel: maxbin2 { cpus =  ; memory = 'g'  }  
+    withLabel: medaka { cpus =  ; memory = 'g'  } 
+    withLabel: metabat2 { cpus =  ; memory = 'g'  }  
+    withLabel: metawrap { cpus =  ; memory = 'g'  } 
+    withLabel: minimap2 { cpus =  ; memory = 'g'  }
+    withLabel: checkm { cpus =  ; memory = 'g'  }
+    //withLabel: checkm(withLabel: metawrap)
+    withLabel: pilon { cpus =  ; memory = 'g' } 
+    withLabel: racon { cpus =  ; memory = 'g'  } 
+    withLabel: seqtk { cpus =  ; memory = '0g'  } 
+    withLabel: sourmash { cpus =  ; memory = 'g'  }
+    withLabel: spades { cpus =  ; memory = 'g'   }
+    withLabel: ubuntu { cpus =  ; memory = '0g'  } 
+    withLabel: unicycler { cpus =  ; memory = 'g'  }
+    withLabel: eggnog { cpus =  ; memory = 'g'  }
+    withLabel: trinity { cpus =  ; memory = 'g'  }
+}
\ No newline at end of file
diff --git a/configs/preemptible.config b/configs/preemptible.config
new file mode 100644
index 0000000..3c2802a
--- /dev/null
+++ b/configs/preemptible.config
@@ -0,0 +1,23 @@
+process {
+    withLabel: flye { google.lifeSciences.preemptible = false ; google.lifeSciences.bootDiskSize = "10GB" }
+    withLabel: metawrap { google.lifeSciences.preemptible = false ; google.lifeSciences.bootDiskSize = "10GB" } 
+    withLabel: spades { google.lifeSciences.preemptible = false ; google.lifeSciences.bootDiskSize = "10GB" }
+    withLabel: unicycler { google.lifeSciences.preemptible = false ; google.lifeSciences.bootDiskSize = "10GB" }
+    withLabel: trinity { google.lifeSciences.preemptible = false ; google.lifeSciences.bootDiskSize = "25GB"  }
+
+    withLabel: bwa { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" } 
+    withLabel: concoct { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" }
+    withLabel: fastp { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" }
+    withLabel: filtlong { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" }
+    withLabel: maxbin2 { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" }  
+    withLabel: medaka { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" } 
+    withLabel: metabat2 { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" }  
+    withLabel: minimap2 { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" }
+    withLabel: checkm { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" }
+    withLabel: pilon { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" } 
+    withLabel: racon { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" } 
+    withLabel: seqtk { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" } 
+    withLabel: sourmash { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB"  }
+    withLabel: ubuntu { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" } 
+    withLabel: eggnog { google.lifeSciences.preemptible = true ; google.lifeSciences.bootDiskSize = "10GB" }
+}
\ No newline at end of file
diff --git a/main.nf b/main.nf
index 75c9996..813bc9e 100644
--- a/main.nf
+++ b/main.nf
@@ -1,566 +1,617 @@
-#!/usr/bin/env nextflow
-nextflow.preview.dsl=2
-
-start_var = """
-*********Start running MUFFIN*********
-MUFFIN is a hybrid assembly and differential binning workflow for metagenomics, transcriptomics and pathway analysis.
-
-If you use MUFFIN for your research pleace cite:
-
-https://www.biorxiv.org/content/10.1101/2020.02.08.939843v1 
-
-or
-
-Van Damme R., Hölzer M., Viehweger H., Müller B., Bongcam-Rudloff E., Brandt C., 2020
-"Metagenomics workflow for hybrid assembly, differential coverage binning, transcriptomics and pathway analysis (MUFFIN)",
-doi: https://doi.org/10.1101/2020.02.08.939843 
-**************************************
-"""
-println start_var
-
-if (params.help) { exit 0, helpMSG() }
-
-// Help Message
-def helpMSG() {
-    log.info """
-    *********hybrid assembly and differential binning workflow for metagenomics, transcriptomics and pathway analysis*********
-
-    MUFFIN is still under development please wait until the first non edge version realease before using it.
-    Please cite us using https://www.biorxiv.org/content/10.1101/2020.02.08.939843v1
-
-    Mafin is composed of 3 part the assembly of potential metagenome assembled genomes (MAGs); the classification of the MAGs; and the annotation of the MAGs.
-
-        Usage example:
-    nextflow run main.nf --ont nanopore/ --illumina illumina/ --assembler metaspades --rna rna/ -profile docker
-    or 
-    nextflow run main.nf --ont nanopore/ --illumina illumina/ --assembler metaflye -profile docker
-
-        Input:
-    --ont                       path to the directory containing the nanopore read file (fastq) (default: $params.ont)
-    --illumina                  path to the directory containing the illumina read file (fastq) (default: $params.illumina)
-    --rna                       path to the directory containing the RNA-seq read file (fastq) (default: none)
-    --bin_classify              path to the directory containing the bins files to classify (default: none)
-    --bin_annotate              path to the directory containing the bins files to annotate (default: none)
-    --assembler                 the assembler to use in the assembly step (default: $params.assembler)
-
-        Optional input:
-    --check_db                  path to the checkm database
-    --check_tar_db              path to the checkm database tar compressed
-    --sourmash_db               path to the LCA database for sourmash (default: GTDB LCA formated)
-    --eggnog_db                 path to the eggNOG database
-
-        Output:
-    --output                    path to the output directory (default: $params.output)
-
-        Outputed files:
-    QC                          The reads file after qc
-    Assembly                    The assembly contigs file 
-    Bins                        The bins produced by CONCOCT, MetaBAT2, MaxBin2 and MetaWRAP (the refining of bins)
-    Mapped bin reads            The fastq files containing the reads mapped to each metawrap bin
-    Unmapped bin reads          The fastq files containing the unmmaped reads of illumina and nanopore
-    Reassembly                  The reassembly files of the bins (.fa and .gfa)
-    Checkm                      Various file outputed by CheckM (summary, taxonomy, plots and output dir)
-    Sourmash                    The classification done by sourmash
-    Classify summary            The summary of the classification and quality control of the bins (csv file)
-    RNA output                  The de novo assembled transcript and the quantification by Salmon
-    Annotation                  The annotations files from eggNOG (tsv format)
-    Parsed output               HTML files that summarize the annotations and show graphically the pathways
-
-
-    
-
-        Basic Parameter:
-    --cpus                      max cores for local use [default: $params.cpus]
-    --memory                    80% of available RAM in GB for --metamaps [default: $params.memory]
-
-
-        Workflow Options:
-    --skip_ill_qc               skip quality control of illumina files
-    --skip_ont_qc               skip quality control of nanopore file
-    --short_qc                  minimum size of the reads to be kept (default: $params.short_qc )
-    --filtlong                  use filtlong to improve the quality furthermore (default: false)
-    --model                     the model medaka will use (default: $params.model)
-    --polish_iteration          number of iteration of pilon in the polish step (default: $params.polish_iteration)
-    --extra_ill                 a list of additional ill sample file (with full path with a * instead of _R1,2.fastq) to use for the binning in Metabat2 and concoct
-    --extra_ont                 a list of additional ont sample file (with full path) to use for the binning in Metabat2 and concoct
-    --skip_metabat2             skip the binning using metabat2 (advanced)
-    --skip_maxbin2              skip the binning using maxbin2 (advanced)
-    --skip_concoct              skip the binning using concoct (advanced)
-    --reassembly                activate the reassembly of the bins using Unicycler (advanced and unstable)
-
-        Nextflow options:
-    -profile                    change the profile of nextflow (currently available conda)
-    -resume                     resume the workflow where it stopped
-    -with-report rep.html       cpu / ram usage (may cause errors)
-    -with-dag chart.html        generates a flowchart for the process tree
-    -with-timeline time.html    timeline (may cause errors)
-    """
-}
-
-if( !nextflow.version.matches('19.+') ) {
-    println "This workflow requires Nextflow version 19.07 or greater and under version 20 -- You are running version $nextflow.version"
-    exit 1
-}
-
-workflow { //start of the workflow
-
-    //*************************************************
-    // STEP 1 Assemble using hybrid method
-    //*************************************************
-
-    if (params.modular=="full" | params.modular=="assemble" | params.modular=="assem-class" | params.modular=="assem-annot") { //only do the step one if called
-        if (params.assembler!='metaflye' && params.assembler!='metaspades') { //check if the assembler parameter is correct
-            exit 1, "--assembler: ${params.assembler}. Should be 'metaflye' or 'metaspades' (default: metaflye)"}
-
-        // stdout early usage (print header + default or modified params)
-
-        // DATA INPUT ILLUMINA
-        illumina_input_ch = Channel
-                .fromFilePairs( "${params.illumina}/*_R{1,2}.fastq{,.gz}", checkIfExists: true)
-                .view() 
-
-        // illumina_input_ch = Channel.fromFilePairs(reads_illumina).ifEmpty { error "Cannot find any Illumina reads in the directory: ${params.illumina} \n Delfault is ./illumina \n ${reads_illumina}" }.view()
-
-        // extra ill reads
-        if (params.extra_ill != false) {
-        extra_ill_ch=Channel.fromPath(params.extra_ill).splitCsv().map { row ->
-                    def path = file("${row[0]}")
-                    return path
-                }
-        }
-
-        // DATA INPUT ONT
-        ont_input_ch = Channel.fromPath("${params.ont}/*.fastq{,.gz}",checkIfExists: true).map {file -> tuple(file.simpleName, file) }.view()
-
-        // extra ont reads
-        if (params.extra_ont != false) {
-        extra_ont_ch=Channel.fromPath(params.extra_ont).splitCsv().map { row ->
-                    def path = file("${row[0]}")
-                    return path
-                }
-        }
-
-
-
-
-        // sourmash_db
-        if (params.sourmash_db) { database_sourmash = file(params.sourmash_db) } //use the path to the sourmash DB
-        else {
-            include './modules/sourmashgetdatabase'
-            sourmash_download_db() 
-            database_sourmash = sourmash_download_db.out
-        }   
-        // checkm_db
-        if (workflow.profile == 'conda') { // when using conda checkm needs to be set up first before any use
-            if (params.checkm_db) { // this one set in the env the path to checkm db uncompressed
-                include './modules/checkmsetupDB'
-                untar = true
-                checkm_setup_db(params.checkm_db, untar)
-                checkm_db_path = checkm_setup_db.out
-            }
-
-            else if (params.checkm_tar_db) { // untar the checkm db before setting up
-                include './modules/checkmsetupDB'
-                untar = false
-                checkm_setup_db(params.checkm_db, untar)
-                checkm_db_path = checkm_setup_db.out
-            }
-
-            else { // DLL the check db , untar then setup
-                include './modules/checkmsetupDB'
-                include './modules/checkmgetdatabases'
-                untar = false
-                checkm_setup_db(checkm_download_db(), untar)
-                checkm_db_path = checkm_setup_db.out
-            }
-        }
-        else { checkm_db_path = Channel.from("/checkm_database").collectFile() { item -> [ "path.txt", item ]  } } // Docker way to setup the db
-
-        //************
-        // QC OF READS
-        //************
-            // QC check ONT
-        if (params.skip_ont_qc == true) {}
-        else if (params.skip_ont_qc==false) {
-            include discard_short from './modules/ont_qc' params(short_qc : params.short_qc)
-            split_ont_ch = ont_input_ch.splitFastq(by:100000, file:true) //split the fastq to speed up the process
-            discard_short(split_ont_ch) // simply discard the reads under a threshold
-            if (params.filtlong==true) { // not necessary at all but can be run if wanted
-                include filtlong from './modules/ont_qc' params(short_qc : params.short_qc)
-                filtlong(discard_short.out)
-                merging_ch = filtlong.out.groupTuple() 
-            }
-            else {
-                merging_ch = discard_short.out.groupTuple() 
-            }
-            include merge from './modules/ont_qc' params(out_qc : params.out_qc, output : params.output)
-            merge(merging_ch)  // merge the splitted fastq
-            ont_input_ch = merge.out
-        }
-            // QC check Illumina
-        if (params.skip_ill_qc==true) {}
-        else if (params.skip_ill_qc==false) {
-            include fastp from './modules/fastp' params(out_qc : params.out_qc, output : params.output) // simple QC done by fastp
-            fastp(illumina_input_ch)
-            illumina_input_ch = fastp.out
-        }
-
-        //**********
-        // Assembly 
-        //**********
-            // Meta-SPADES
-
-        if (params.assembler=="metaspades") { // hybrid and metagenomic assembly by spades
-            include './modules/spades' params(assembly : params.assembly, output : params.output)
-            spades_ch= illumina_input_ch.join(ont_input_ch)
-            spades(spades_ch)
-            assembly_ch = spades.out
-        }
-
-            // Meta-FLYE
-
-        if (params.assembler=="metaflye") { // metagenomic assembly by flye + hybrid polishing (combo racon; medaka; pilon with short reads)
-            include sourmash_genome_size from './modules/sourmash'
-            include './modules/flye' params(assembly : params.assembly, output : params.output)
-            include minimap_polish from'./modules/minimap2'
-            include racon from './modules/polish'
-            include medaka from './modules/polish' params(model : params.model)
-            include pilon from './modules/polish' params(assembly : params.assembly, output : params.output)
-            // FLYE + Pilon 
-            flye(sourmash_genome_size(ont_input_ch,database_sourmash))
-            flye_to_map = flye.out.join(ont_input_ch)
-            minimap_polish(flye_to_map)
-            map_to_racon = ont_input_ch.join(flye.out).join(minimap_polish.out)
-            medaka(racon(map_to_racon))
-            medaka_to_pilon = medaka.out.join(illumina_input_ch)
-            pilon(medaka_to_pilon, params.polish_iteration)
-            assembly_ch = pilon.out
-        }
-
-        //*********
-        // Mapping
-        //*********
-
-            // ONT mapping
-
-        include minimap2 from './modules/minimap2' //mapping for the binning 
-        minimap2_ch = assembly_ch.join(ont_input_ch)
-        minimap2(minimap2_ch)
-        ont_bam_ch = minimap2.out
-
-        if (params.extra_ont != false) { //mapping of the "additionnal reads" to the assembly for use in the differential coverage binning
-            include extra_minimap2 from './modules/minimap2'
-            minimap_extra = assembly_ch.join(extra_ont_ch)
-            extra_minimap2(minimap_extra)
-            ont_extra_bam = extra_minimap2.out.collect()
-        }
-
-            // Illumina mapping
-        include bwa from './modules/bwa' //mapping for the binning
-        bwa_ch = assembly_ch.join(illumina_input_ch)
-        bwa(bwa_ch)
-        illumina_bam_ch = bwa.out
-
-        if (params.extra_ill != false) { //mapping of the "additionnal reads" to the assembly for use in the differential coverage binning
-            include extra_bwa from './modules/bwa'
-            bwa_extra = assembly_ch.join(extra_ill_ch)
-            extra_bwa(bwa_extra)
-            illumina_extra_bam = extra_bwa.out.collect()
-        }
-
-        //***************************************************
-        // Binning
-        //***************************************************
-
-            // metabat2 
-
-        if (params.skip_metabat2==true) {}
-        else {
-            if (params.extra_ont != false || params.extra_ill != false ) { // check if differential coverage binning possible
-                include metabat2_extra from './modules/metabat2' params(out_metabat : params.out_metabat, output : params.output)
-                metabat2_ch = assembly_ch.join(ont_bam_ch).join(illumina_bam_ch)
-                metabat2_extra(metabat2_ch, extra_bam)
-                metabat2_out = metabat2_extra.out
-            }
-            else {    
-                include metabat2 from './modules/metabat2' params(out_metabat : params.out_metabat, output : params.output)
-                metabat2_ch = assembly_ch.join(ont_bam_ch).join(illumina_bam_ch)
-                metabat2(metabat2_ch)
-                metabat2_out = metabat2.out
-            }
-        }
-
-            // Maxbin2 
-
-        if (params.skip_maxbin2==true) {}
-        else {
-            include './modules/maxbin2' params(out_maxbin : params.out_maxbin, output : params.output)
-            maxbin2_ch = assembly_ch.join(ont_input_ch).join(illumina_input_ch)
-            maxbin2(maxbin2_ch)
-            maxbin2_out = maxbin2.out
-        }
-
-            // Concoct OR CheckM Concoct
-
-        if (params.skip_concoct==true) {}
-        else {
-            if (params.extra_ont != false || params.extra_ill != false ) { // check if differential coverage binning possible
-                include concoct_extra from './modules/concoct' params(out_concoct : params.out_concoct, output : params.output)
-                concoct_ch = assembly_ch.join(ont_bam_ch).join(illumina_bam_ch)
-                concoct_extra(concoct_ch, extra_bam)
-                concoct_out = concoct_extra.out
-            }
-            else {
-                include concoct from './modules/concoct' params(out_concoct : params.out_concoct, output : params.output)
-                concoct_ch = assembly_ch.join(ont_bam_ch).join(illumina_bam_ch)
-                concoct(concoct_ch)
-                concoct_out = concoct.out
-            }
-        }
-
-        // Bin refine
-
-        if (params.skip_metabat2==true) {
-            if (  params.skip_maxbin2==true || params.skip_concoct==true) {} // no refine if 1 or less binning method used
-            else {
-                include refine2 from './modules/metawrap_refine_bin' params(out_metawrap : params.out_metawrap, output : params.output)
-                refine2_ch = maxbin2_out.join(concoct_out)
-                refine2(refine2_ch, checkm_db_path) // use 2 binning method to refine
-                final_bin_ch = refine2.out[0].transpose() // the transpose is used to "split" the channel in a channel with each bin file individually
-                // e.g without: ch:[ID,[bin1.fa,bin2.fa,bin3.fa]] with : ch:[[ID,bin1.fa],[ID,bin2.fa],[ID,bin3.fa]]
-                // this format is needed for further step
-            }
-        }
-
-        else if (params.skip_maxbin2==true) {
-            if (  params.skip_metabat2==true || params.skip_concoct==true) {} // no refine if 1 or less binning method used
-            else {
-                include refine2 from './modules/metawrap_refine_bin' params(out_metawrap : params.out_metawrap, output : params.output)
-                refine2_ch = metabat2_out.join(concoct_out)
-                refine2(refine2_ch, checkm_db_path) // use 2 binning method to refine
-                final_bin_ch = refine2.out[0].transpose() // the transpose is used to "split" the channel in a channel with each bin file individually
-                // e.g without: ch:[ID,[bin1.fa,bin2.fa,bin3.fa]] with : ch:[[ID,bin1.fa],[ID,bin2.fa],[ID,bin3.fa]]
-                // this format is needed for further step
-            }
-        }
-
-        else if (params.skip_concoct==true) {
-            if (  params.skip_metabat2==true || params.skip_maxbin2==true) {} // no refine if 1 or less binning method used
-            else {
-                include refine2 from './modules/metawrap_refine_bin' params(out_metawrap : params.out_metawrap, output : params.output)
-                refine2_ch = metabat2_out.join(maxbin2_out)
-                refine2(refine2_ch, checkm_db_path) // use 2 binning method to refine
-                final_bin_ch = refine2.out[0].transpose() // the transpose is used to "split" the channel in a channel with each bin file individually
-                // e.g without: ch:[ID,[bin1.fa,bin2.fa,bin3.fa]] with : ch:[[ID,bin1.fa],[ID,bin2.fa],[ID,bin3.fa]]
-                // this format is needed for further step
-            }
-        }
-
-        else {
-            include refine3 from './modules/metawrap_refine_bin' params(out_metawrap : params.out_metawrap, output : params.output)
-            refine3_ch = metabat2_out.join(maxbin2_out).join(concoct_out)
-            refine3(refine3_ch, checkm_db_path)
-            reassembly_ch = refine3.out[0]
-            metawrap_out_ch = refine3.out[0].transpose() // the transpose is used to "split" the channel in a channel with each bin file individually
-                // e.g without: ch:[ID,[bin1.fa,bin2.fa,bin3.fa]] with : ch:[[ID,bin1.fa],[ID,bin2.fa],[ID,bin3.fa]]
-                // this format is needed for further step
-        }
-
-        //**************
-        //Retrieve reads for each bin and assemble them
-        //**************
-        if (params.reassembly) {
-            // retrieve the ids of each bin contigs
-
-            include './modules/list_ids'
-            contig_list(reassembly_ch) //retrieve the list of contigs present for each bin
-            extract_reads_ch = contig_list.out.view()
-
-        
-            // bam align the reads to ALL OF THE CONTIGS 
-
-            include './modules/cat_all_bins'
-            include bwa_bin from './modules/bwa'  
-            include minimap2_bin from './modules/minimap2'
-            cat_all_bins(reassembly_ch) // assemble all bins' contigs in one file for the mapping
-            fasta_all_bin = cat_all_bins.out
-            bwa_all_bin = fasta_all_bin.join(illumina_input_ch)
-            ill_map_all_bin = bwa_bin(bwa_all_bin)    //map illumina reads
-            minimap2_all_bin = fasta_all_bin.join(ont_input_ch) 
-            ont_map_all_bin = minimap2_bin(minimap2_all_bin) //map ont reads
-
-            // retrieve the reads aligned to the contigs + run unicycler + polish with pilon for 2 round
-
-            include reads_retrieval from './modules/seqtk_retrieve_reads'params(out_bin_reads: params.out_bin_reads, output : params.output)
-            include unmapped_retrieve from './modules/seqtk_retrieve_reads'params(out_unmapped: params.out_unmapped, output : params.output)
-            include './modules/unicycler_reassemble_from_bin' params(output : params.output)
-            retrieve_unmapped_ch = ill_map_all_bin.join( ont_map_all_bin).join(illumina_input_ch).join(ont_input_ch)
-            unmapped_retrieve(retrieve_unmapped_ch) //retrieve the reads that didn't map to the contigs to output reads set that can be analysed again
-            retrieve_reads_ch = extract_reads_ch.transpose().combine(ill_map_all_bin, by:0).combine( ont_map_all_bin, by:0).combine(illumina_input_ch, by:0).combine(ont_input_ch, by:0)
-            reads_retrieval(retrieve_reads_ch).view() //retrieve the reads that mapped to the contigs to allow the reassembly
-            unicycler(reads_retrieval.out) // reassemble each bin with the reads mapped to their contigs
-
-            collected_final_bins_ch=unicycler.out[0].collect()
-            final_bins_ch=unicycler.out[0]
-        }
-        else {
-            final_bins_ch=metawrap_out_ch
-        }
-    } //end of step 1
-    //*************************************************
-    // STEP 2 classify taxa
-    //*************************************************
-    if (params.modular=="full" | params.modular=="classify" | params.modular=="assem-class" | params.modular=="class-annot") {
-
-        //**************
-        // File handling
-        //**************
-
-        //bins (list with id (run id not bin) coma path/to/file)
-        if (params.bin_classify) { 
-            classify_ch = Channel
-                .fromPath( params.bin_classify, checkIfExists: true )
-                .splitCsv()
-                .map { row -> ["${row[0]}", file("${row[2]}", checkIfExists: true)]  }
-                .view()
-                }
-
-        else {classify_ch=final_bins_ch}
-
-        if (params.modular=="classify" |params.modular=="class-annot") {
-            // sourmash_db
-            if (params.sourmash_db) { database_sourmash = file(params.sourmash_db) }
-            else {
-                include './modules/sourmashgetdatabase'
-                sourmash_download_db() 
-                database_sourmash = sourmash_download_db.out
-            }   
-            // checkm_db
-            if (workflow.profile == 'conda') {
-                if (params.checkm_db) {
-                    include './modules/checkmsetupDB'
-                    untar = true
-                    checkm_setup_db(params.checkm_db, untar)
-                    checkm_db_path = checkm_setup_db.out
-                }
-
-                else if (params.checkm_tar_db) {
-                    include './modules/checkmsetupDB'
-                    untar = false
-                    checkm_setup_db(params.checkm_db, untar)
-                    checkm_db_path = checkm_setup_db.out
-                }
-
-                else {
-                    include './modules/checkmsetupDB'
-                    include './modules/checkmgetdatabases'
-                    untar = false
-                    checkm_setup_db(checkm_download_db(), untar)
-                    checkm_db_path = checkm_setup_db.out
-                }
-            }
-            else { checkm_db_path = Channel.from("/checkm_database").collectFile() { item -> [ "path.txt", item ]  } }
-        }
-        //*************************
-        // Bins classify workflow
-        //*************************
-
-        //checkm of the final assemblies
-        include checkm from './modules/checkm'params(output : params.output)
-        checkm(classify_ch.groupTuple(by:0)) //checkm QC of the bins
-
-        //sourmash classification using gtdb database
-
-        include sourmash_bins from './modules/sourmash'params(output : params.output)
-        sourmash_bins(classify_ch,database_sourmash) // fast classification using sourmash with the gtdb (not the best classification but really fast and good for primarly result)
-
-        include sourmash_checkm_parser from './modules/checkm_sourmash_parser'params(output: params.output)
-        sourmash_checkm_parser(checkm.out[0],sourmash_bins.out.collect()) //parsing the result of sourmash and checkm in a single result file
-
-    } // end of step 2
-        //*************************************************
-        // STEP 3 annotation; kegg pathways + use or RNAseq
-        //*************************************************
-
-        if (params.modular=="full" | params.modular=="annotate" | params.modular=="assem-annot" | params.modular=="class-annot") {
-            //**************
-            // File handling
-            //**************
-
-            //RNAseq
-            if (params.rna) {rna_input_ch = Channel
-                    .fromFilePairs( "${params.rna}/*_R{1,2}.fastq{,.gz}", checkIfExists: true)
-                    .view()
-            }
-
-            //bins (list with id (run id not bin) coma path/to/file)
-            if (params.bin_annotate) {
-                bins_input_ch = Channel
-                    .fromPath( params.bin_annotate, checkIfExists: true )
-                    .splitCsv()
-                    .map { row -> ["${row[0]}", file("${row[2]}", checkIfExists: true)]  }
-                    .view() 
-                    }
-            else {bins_input_ch = final_bins_ch }
-
-        //************************
-        // Databases Dll and setup
-        //************************
-        if (params.eggnog_db) {eggnog_db=Channel
-                .fromPath( params.eggnog_db, checkIfExists: true )}
-        else {
-            include './modules/eggnog_get_databases'
-            eggnog_download_db()
-            eggnog_db = eggnog_download_db.out
-            } 
-        //*************************
-        // Bins annotation workflow
-        //*************************
-
-        include eggnog_bin from './modules/eggnog'params(output : params.output)
-        eggnog_bin_ch= bins_input_ch.combine(eggnog_db)
-        eggnog_bin(eggnog_bin_ch) //annotate the bins
-        bin_annotated_ch=eggnog_bin.out[0].groupTuple(by:0).view()
-
-        //************************
-        // RNA annotation workflow
-        //************************
-        if (params.rna) {
-        // QC
-            include fastp_rna from './modules/fastp'params(output : params.output)
-            fastp_rna(rna_input_ch) //qc illumina RNA-seq
-            rna_input_ch = fastp_rna.out
-
-        // De novo transcript
-            include de_novo_transcript_and_quant from './modules/trinity_and_salmon'params(output : params.output)
-            de_novo_transcript_and_quant(rna_input_ch) // de novo transcrip assembly and quantification with trinity and salmon
-            transcript_ch=de_novo_transcript_and_quant.out
-        // annotations of transcript
-            include eggnog_rna from './modules/eggnog'params(output : params.output)
-            eggnog_rna_ch= transcript_ch.combine(eggnog_db)
-            eggnog_rna(eggnog_rna_ch) //annotate the RNA-seq transcripts
-            rna_annot_ch=eggnog_rna.out[0].view()
-        }
-
-        //******************************************************
-        // Parsing bin annot and RNA out into nice graphical out
-        //******************************************************
-
-        if (params.rna)  {
-            include parser_bin_RNA from './modules/parser'params(output: params.output)
-            parser_bin_RNA(rna_annot_ch,bin_annotated_ch) // parse the annotations in html summary files
-        }
-        else {
-            include parser_bin from './modules/parser'params(output: params.output)
-            parser_bin(bin_annotated_ch) // parse the bins annotation in html summary files
-        }
-        // Share pathway to put and HTML file with
-    } // end of step 3
-
-} // end of workflow{}
-
-workflow.onComplete { 
-  log.info ( workflow.success ? "\nDone! Results are stored here --> $params.output \n" : "Oops .. something went wrong" )  }
-//***********
-// MAFIN DONE
+#!/usr/bin/env nextflow
+nextflow.preview.dsl=2
+
+start_var = """
+*********Start running MUFFIN*********
+MUFFIN is a hybrid assembly and differential binning workflow for metagenomics, transcriptomics and pathway analysis.
+
+If you use MUFFIN for your research, please cite:
+
+https://www.biorxiv.org/content/10.1101/2020.02.08.939843v1 
+
+or
+
+Van Damme R., Hölzer M., Viehweger H., Müller B., Bongcam-Rudloff E., Brandt C., 2020
+"Metagenomics workflow for hybrid assembly, differential coverage binning, transcriptomics and pathway analysis (MUFFIN)",
+doi: https://doi.org/10.1101/2020.02.08.939843 
+**************************************
+"""
+println start_var
+
+if (params.help) { exit 0, helpMSG() }
+
+// Help Message
+def helpMSG() {
+    log.info """
+    *********hybrid assembly and differential binning workflow for metagenomics, transcriptomics and pathway analysis*********
+
+    MUFFIN is still under development; please wait for the first non-edge release before using it.
+    Please cite us using https://www.biorxiv.org/content/10.1101/2020.02.08.939843v1
+
+    MUFFIN is composed of 3 parts: the assembly of potential metagenome-assembled genomes (MAGs), the classification of the MAGs, and the annotation of the MAGs.
+
+        Usage example:
+    nextflow run main.nf --ont nanopore/ --illumina illumina/ --assembler metaspades --rna rna/ -profile local,docker
+    or
+    nextflow run main.nf --ont nanopore/ --illumina illumina/ --assembler metaflye -profile local,docker
+
+        Input:
+    --ont                       path to the directory containing the nanopore read files (fastq) (default: $params.ont)
+    --illumina                  path to the directory containing the Illumina read files (fastq) (default: $params.illumina)
+    --rna                       path to the directory containing the RNA-seq read files (fastq) (default: none)
+    --bin_classify              path to the directory containing the bins files to classify (default: none)
+    --bin_annotate              path to the directory containing the bins files to annotate (default: none)
+    --assembler                 the assembler to use in the assembly step (default: $params.assembler)
+
+        Optional input:
+    --checkm_db                 path to the CheckM database
+    --checkm_tar_db             path to the tar-compressed CheckM database
+    --sourmash_db               path to the LCA database for sourmash (default: GTDB LCA formatted)
+    --eggnog_db                 path to the eggNOG database
+
+        Output:
+    --output                    path to the output directory (default: $params.output)
+
+        Output files:
+        You can see the output structure at https://osf.io/a6hru/
+    QC                          The read files after QC
+    Assembly                    The assembly contigs file
+    Bins                        The bins produced by CONCOCT, MetaBAT2, MaxBin2 and MetaWRAP (bin refinement)
+    Mapped bin reads            The fastq files containing the reads mapped to each MetaWRAP bin
+    Unmapped bin reads          The fastq files containing the unmapped Illumina and nanopore reads
+    Reassembly                  The reassembly files of the bins (.fa and .gfa)
+    Checkm                      Various files output by CheckM (summary, taxonomy, plots and output dir)
+    Sourmash                    The classification done by sourmash
+    Classify summary            The summary of the classification and quality control of the bins (csv file)
+    RNA output                  The de novo assembled transcripts and their quantification by Salmon
+    Annotation                  The annotation files from eggNOG (tsv format)
+    Parsed output               HTML files that summarize the annotations and show the pathways graphically
+
+
+    
+
+        Basic parameters:
+    --cpus                      max cores for local use [default: $params.cpus]
+    --memory                    80% of available RAM in GB for --metamaps [default: $params.memory]
+
+
+        Workflow Options:
+    --skip_ill_qc               skip quality control of Illumina files
+    --skip_ont_qc               skip quality control of nanopore files
+    --short_qc                  minimum length of the reads to be kept (default: $params.short_qc )
+    --filtlong                  use filtlong to further improve read quality (default: false)
+    --model                     the model medaka will use (default: $params.model)
+    --polish_iteration          number of pilon iterations in the polishing step (default: $params.polish_iteration)
+    --extra_ill                 a list of additional Illumina sample files (full paths, with a * instead of _R1,2.fastq) to use for binning in MetaBAT2 and CONCOCT
+    --extra_ont                 a list of additional ONT sample files (full paths) to use for binning in MetaBAT2 and CONCOCT
+    --skip_metabat2             skip the binning using metabat2 (advanced)
+    --skip_maxbin2              skip the binning using maxbin2 (advanced)
+    --skip_concoct              skip the binning using concoct (advanced)
+
+        Nextflow options:
+    -profile                    set the Nextflow profile, both engine and executor (see the GitHub README for details)
+    -resume                     resume the workflow where it stopped
+    -with-report rep.html       cpu / ram usage (may cause errors)
+    -with-dag chart.html        generates a flowchart for the process tree
+    -with-timeline time.html    timeline (may cause errors)
+    """
+}
+
+if( !nextflow.version.matches('20.+') ) {
+    println "This workflow requires a Nextflow 20.x release -- You are running version $nextflow.version"
+    exit 1
+}
+
+workflow { //start of the workflow
+    //*************************************************
+    // STEP 0 Loading modules and workflow profile error handling
+    //*************************************************
+
+    // Error handling
+    if ( workflow.profile == 'standard' ) { exit 1, "NO VALID EXECUTION PROFILE SELECTED, use e.g. [-profile local,docker]" }
+
+    if (
+    workflow.profile.contains('singularity') ||
+    workflow.profile.contains('docker') ||
+    workflow.profile.contains('conda')
+    ) { "engine selected" }
+    else { exit 1, "No engine selected:  -profile EXECUTOR,ENGINE" }
+
+    if (
+    workflow.profile.contains('local') ||
+    workflow.profile.contains('sge') ||
+    workflow.profile.contains('slurm') ||
+    workflow.profile.contains('gcloud') ||
+    workflow.profile.contains('ebi') ||
+    workflow.profile.contains('lsf') ||
+    workflow.profile.contains('git_action')
+    ) { "executor selected" }
+    else { exit 1, "No executor selected:  -profile EXECUTOR,ENGINE" }
+
+
+    //module for assemble
+    if (params.modular=="full" || params.modular=="assemble" || params.modular=="assem-class" || params.modular=="assem-annot") {
+        include sourmash_download_db from './modules/sourmashgetdatabase'
+        include checkm_setup_db from './modules/checkmsetupDB'
+        include checkm_download_db from './modules/checkmgetdatabases'
+        include discard_short from './modules/ont_qc' params(short_qc : params.short_qc)
+        include filtlong from './modules/ont_qc' params(short_qc : params.short_qc)
+        include merge from './modules/ont_qc' params(output : params.output)
+        include fastp from './modules/fastp' params(output : params.output) // simple QC done by fastp
+        include spades from './modules/spades' params(output : params.output)
+        include sourmash_genome_size from './modules/sourmash'
+        include flye from './modules/flye' params(output : params.output)
+        include minimap_polish from './modules/minimap2'
+        include racon from './modules/polish'
+        include medaka from './modules/polish' params(model : params.model)
+        include pilon from './modules/polish' params(output : params.output)
+        include minimap2 from './modules/minimap2' //mapping for the binning 
+        include extra_minimap2 from './modules/minimap2'
+        include bwa from './modules/bwa' //mapping for the binning
+        include extra_bwa from './modules/bwa'
+        include metabat2_extra from './modules/metabat2' params(output : params.output)    
+        include metabat2 from './modules/metabat2' params(output : params.output)
+        include maxbin2 from './modules/maxbin2' params(output : params.output)
+        include concoct_extra from './modules/concoct' params(output : params.output)
+        include concoct from './modules/concoct' params(output : params.output)
+        include refine2 from './modules/metawrap_refine_bin' params(output : params.output)
+        include refine3 from './modules/metawrap_refine_bin' params(output : params.output)
+        include contig_list from './modules/list_ids'
+        include cat_all_bins from './modules/cat_all_bins'
+        include bwa_bin from './modules/bwa'  
+        include minimap2_bin from './modules/minimap2'
+        include reads_retrieval from './modules/seqtk_retrieve_reads' params(output : params.output)
+        include unmapped_retrieve from './modules/seqtk_retrieve_reads' params(output : params.output)
+        //include unicycler from './modules/unicycler_reassemble_from_bin' params(output : params.output)
+    }
+    //module for classify
+    if (params.modular=="full" | params.modular=="classify" | params.modular=="assem-class" | params.modular=="class-annot") {
+        include checkm from './modules/checkm' params(output : params.output)
+        include sourmash_bins from './modules/sourmash' params(output : params.output)
+        include sourmash_checkm_parser from './modules/checkm_sourmash_parser' params(output: params.output)
+    }
+    if (params.modular=="classify" | params.modular=="class-annot") {
+        include sourmash_download_db from './modules/sourmashgetdatabase'
+        include checkm_setup_db from './modules/checkmsetupDB'
+        include checkm_download_db from './modules/checkmgetdatabases'
+    }
+    //module for annotate
+    if (params.modular=="full" | params.modular=="annotate" | params.modular=="assem-annot" | params.modular=="class-annot") {
+        include eggnog_download_db from './modules/eggnog_get_databases'
+        include eggnog_bin from './modules/eggnog' params(output : params.output)
+        include fastp_rna from './modules/fastp' params(output : params.output)
+        include de_novo_transcript_and_quant from './modules/trinity_and_salmon' params(output : params.output)
+        include eggnog_rna from './modules/eggnog' params(output : params.output)
+        include parser_bin_RNA from './modules/parser' params(output: params.output)
+        include parser_bin from './modules/parser' params(output: params.output)
+    }
+    include readme_output from './modules/readme_output' params(output: params.output)
+
+    //*************************************************
+    // STEP 1 Assemble using hybrid method
+    //*************************************************
+
+    if (params.modular=="full" | params.modular=="assemble" | params.modular=="assem-class" | params.modular=="assem-annot") { //only do the step one if called
+        if (params.assembler!='metaflye' && params.assembler!='metaspades') { //check if the assembler parameter is correct
+            exit 1, "--assembler: ${params.assembler}. Should be 'metaflye' or 'metaspades' (default: metaflye)"}
+
+        // stdout early usage (print header + default or modified params)
+
+        // DATA INPUT TEST
+        if (workflow.profile.contains('test')) {
+            include test from './modules/test_data_dll'
+            test()
+            illumina_input_ch = test.out[0]
+            ont_input_ch = test.out[1]
+            rna_input_ch = test.out[2]
+        }
+
+        else {
+            // DATA INPUT ILLUMINA
+            illumina_input_ch = Channel
+                    .fromFilePairs( "${params.illumina}/*_R{1,2}.fastq{,.gz}", checkIfExists: true)
+                    .view() 
+
+            // illumina_input_ch = Channel.fromFilePairs(reads_illumina).ifEmpty { error "Cannot find any Illumina reads in the directory: ${params.illumina} \n Default is ./illumina \n ${reads_illumina}" }.view()
+
+            // extra ill reads
+            if (params.extra_ill != false) {
+            extra_ill_ch=Channel.fromPath(params.extra_ill).splitCsv().map { row ->
+                        def path = file("${row[0]}")
+                        return path
+                    }
+            }
+
+            // DATA INPUT ONT
+            ont_input_ch = Channel.fromPath("${params.ont}/*.fastq{,.gz}",checkIfExists: true).map {file -> tuple(file.simpleName, file) }.view()
+
+            // extra ont reads
+            if (params.extra_ont != false) {
+            extra_ont_ch=Channel.fromPath(params.extra_ont).splitCsv().map { row ->
+                        def path = file("${row[0]}")
+                        return path
+                    }
+            }
+
+        }
+
+
+        // sourmash_db
+        if (params.sourmash_db) { database_sourmash = file(params.sourmash_db) } //use the path to the sourmash DB
+        else {
+            sourmash_download_db() 
+            database_sourmash = sourmash_download_db.out
+        }   
+        // checkm_db
+        if (workflow.profile.contains('conda') ) { // when using conda checkm needs to be set up first before any use
+            if (params.checkm_db) { // this one set in the env the path to checkm db uncompressed
+                untar = true
+                checkm_setup_db(params.checkm_db, untar)
+                checkm_db_path = checkm_setup_db.out
+            }
+
+            else if (params.checkm_tar_db) { // untar the checkm db before setting up
+                untar = false
+                checkm_setup_db(params.checkm_tar_db, untar)
+                checkm_db_path = checkm_setup_db.out
+            }
+
+            else { // download the CheckM db, untar it, then set it up
+                untar = false
+                checkm_setup_db(checkm_download_db(), untar)
+                checkm_db_path = checkm_setup_db.out
+            }
+        }
+        else { checkm_db_path = Channel.from("/checkm_database").collectFile() { item -> [ "path.txt", item ]  } } // Docker way to setup the db
+
+        //************
+        // QC OF READS
+        //************
+            // QC check ONT
+        if (params.skip_ont_qc == true) {}
+        else if (params.skip_ont_qc==false) {
+            split_ont_ch = ont_input_ch.splitFastq(by:100000, file:true) //split the fastq to speed up the process
+            discard_short(split_ont_ch) // simply discard the reads under a threshold
+            if (params.filtlong==true) { // not necessary at all but can be run if wanted
+                filtlong(discard_short.out)
+                merging_ch = filtlong.out.groupTuple() 
+            }
+            else {
+                merging_ch = discard_short.out.groupTuple() 
+            }
+            merge(merging_ch)  // merge the split fastq files
+            ont_input_ch = merge.out
+        }
+            // QC check Illumina
+        if (params.skip_ill_qc==true) {}
+        else if (params.skip_ill_qc==false) {
+            fastp(illumina_input_ch)
+            illumina_input_ch = fastp.out
+        }
+
+        //**********
+        // Assembly 
+        //**********
+            // Meta-SPADES
+
+        if (params.assembler=="metaspades") { // hybrid and metagenomic assembly by spades
+            spades_ch= illumina_input_ch.join(ont_input_ch)
+            spades(spades_ch)
+            assembly_ch = spades.out
+        }
+
+            // Meta-FLYE
+
+        if (params.assembler=="metaflye") { // metagenomic assembly by flye + hybrid polishing (combo racon; medaka; pilon with short reads)
+            // FLYE + Pilon 
+            flye(sourmash_genome_size(ont_input_ch,database_sourmash))
+            flye_to_map = flye.out.join(ont_input_ch)
+            minimap_polish(flye_to_map)
+            map_to_racon = ont_input_ch.join(flye.out).join(minimap_polish.out)
+            medaka(racon(map_to_racon))
+            medaka_to_pilon = medaka.out.join(illumina_input_ch)
+            pilon(medaka_to_pilon, params.polish_iteration)
+            assembly_ch = pilon.out
+        }
+
+        //*********
+        // Mapping
+        //*********
+
+            // ONT mapping
+        minimap2_ch = assembly_ch.join(ont_input_ch)
+        minimap2(minimap2_ch)
+        ont_bam_ch = minimap2.out
+
+        if (params.extra_ont != false) { // map the "additional reads" to the assembly for use in the differential coverage binning
+            minimap_extra = assembly_ch.join(extra_ont_ch)
+            extra_minimap2(minimap_extra)
+            ont_extra_bam = extra_minimap2.out.collect()
+        }
+
+            // Illumina mapping
+        bwa_ch = assembly_ch.join(illumina_input_ch)
+        bwa(bwa_ch)
+        illumina_bam_ch = bwa.out
+
+        if (params.extra_ill != false) { // map the "additional reads" to the assembly for use in the differential coverage binning
+            bwa_extra = assembly_ch.join(extra_ill_ch)
+            extra_bwa(bwa_extra)
+            illumina_extra_bam = extra_bwa.out.collect()
+        }
+
+        //***************************************************
+        // Binning
+        //***************************************************
+
+            // metabat2 
+
+        if (params.skip_metabat2==true) {}
+        else {
+            if (params.extra_ont != false || params.extra_ill != false ) { // check if differential coverage binning possible
+                metabat2_ch = assembly_ch.join(ont_bam_ch).join(illumina_bam_ch)
+                metabat2_extra(metabat2_ch, extra_bam)
+                metabat2_out = metabat2_extra.out
+            }
+            else {
+                metabat2_ch = assembly_ch.join(ont_bam_ch).join(illumina_bam_ch)
+                metabat2(metabat2_ch)
+                metabat2_out = metabat2.out
+            }
+        }
+
+            // Maxbin2 
+
+        if (params.skip_maxbin2==true) {}
+        else {
+            maxbin2_ch = assembly_ch.join(ont_input_ch).join(illumina_input_ch)
+            maxbin2(maxbin2_ch)
+            maxbin2_out = maxbin2.out
+        }
+
+            // Concoct OR CheckM Concoct
+
+        if (params.skip_concoct==true) {}
+        else {
+            if (params.extra_ont != false || params.extra_ill != false ) { // check if differential coverage binning possible
+                concoct_ch = assembly_ch.join(ont_bam_ch).join(illumina_bam_ch)
+                concoct_extra(concoct_ch, extra_bam)
+                concoct_out = concoct_extra.out
+            }
+            else {
+                concoct_ch = assembly_ch.join(ont_bam_ch).join(illumina_bam_ch)
+                concoct(concoct_ch)
+                concoct_out = concoct.out
+            }
+        }
+
+        // Bin refine
+
+        if (params.skip_metabat2==true) {
+            if (  params.skip_maxbin2==true || params.skip_concoct==true) {} // no refinement if one or fewer binning methods are used
+            else {
+                refine2_ch = maxbin2_out.join(concoct_out)
+                refine2(refine2_ch, checkm_db_path) // use 2 binning method to refine
+                reassembly_ch = refine2.out[0]
+                metawrap_out_ch = refine2.out[0].transpose() // the transpose is used to "split" the channel in a channel with each bin file individually
+                // e.g without: ch:[ID,[bin1.fa,bin2.fa,bin3.fa]] with : ch:[[ID,bin1.fa],[ID,bin2.fa],[ID,bin3.fa]]
+                // this format is needed for further step
+            }
+        }
+
+        else if (params.skip_maxbin2==true) {
+            if (  params.skip_metabat2==true || params.skip_concoct==true) {} // no refinement if one or fewer binning methods are used
+            else {
+                refine2_ch = metabat2_out.join(concoct_out)
+                refine2(refine2_ch, checkm_db_path) // use 2 binning method to refine
+                reassembly_ch = refine2.out[0]
+                metawrap_out_ch = refine2.out[0].transpose() // the transpose is used to "split" the channel in a channel with each bin file individually
+                // e.g without: ch:[ID,[bin1.fa,bin2.fa,bin3.fa]] with : ch:[[ID,bin1.fa],[ID,bin2.fa],[ID,bin3.fa]]
+                // this format is needed for further step
+            }
+        }
+
+        else if (params.skip_concoct==true) {
+            if (  params.skip_metabat2==true || params.skip_maxbin2==true) {} // no refinement if one or fewer binning methods are used
+            else {
+                refine2_ch = metabat2_out.join(maxbin2_out)
+                refine2(refine2_ch, checkm_db_path) // use 2 binning method to refine
+                reassembly_ch = refine2.out[0]
+                metawrap_out_ch = refine2.out[0].transpose() // the transpose is used to "split" the channel in a channel with each bin file individually
+                // e.g without: ch:[ID,[bin1.fa,bin2.fa,bin3.fa]] with : ch:[[ID,bin1.fa],[ID,bin2.fa],[ID,bin3.fa]]
+                // this format is needed for further step
+            }
+        }
+
+        else {
+            refine3_ch = metabat2_out.join(maxbin2_out).join(concoct_out)
+            refine3(refine3_ch, checkm_db_path)
+            reassembly_ch = refine3.out[0]
+            metawrap_out_ch = refine3.out[0].transpose() // the transpose is used to "split" the channel in a channel with each bin file individually
+                // e.g without: ch:[ID,[bin1.fa,bin2.fa,bin3.fa]] with : ch:[[ID,bin1.fa],[ID,bin2.fa],[ID,bin3.fa]]
+                // this format is needed for further step
+        }
+
+        //**************
+        //Retrieve reads for each bin and assemble them
+        //**************
+        if (params.reassembly) {
+            // retrieve the ids of each bin contigs
+            contig_list(reassembly_ch) //retrieve the list of contigs present for each bin
+            extract_reads_ch = contig_list.out.view()
+
+        
+            // bam align the reads to ALL OF THE CONTIGS 
+            cat_all_bins(reassembly_ch) // assemble all bins' contigs in one file for the mapping
+            fasta_all_bin = cat_all_bins.out
+            bwa_all_bin = fasta_all_bin.join(illumina_input_ch)
+            ill_map_all_bin = bwa_bin(bwa_all_bin)    //map illumina reads
+            minimap2_all_bin = fasta_all_bin.join(ont_input_ch) 
+            ont_map_all_bin = minimap2_bin(minimap2_all_bin) //map ont reads
+
+            // retrieve the reads aligned to the contigs + run unicycler + polish with pilon for 2 rounds
+            retrieve_unmapped_ch = ill_map_all_bin.join( ont_map_all_bin).join(illumina_input_ch).join(ont_input_ch)
+            unmapped_retrieve(retrieve_unmapped_ch) //retrieve the reads that didn't map to the contigs to output reads set that can be analysed again
+            retrieve_reads_ch = extract_reads_ch.transpose().combine(ill_map_all_bin, by:0).combine( ont_map_all_bin, by:0).combine(illumina_input_ch, by:0).combine(ont_input_ch, by:0)
+            reads_retrieval(retrieve_reads_ch).view() //retrieve the reads that mapped to the contigs to allow the reassembly
+            unicycler(reads_retrieval.out) // reassemble each bin with the reads mapped to their contigs
+
+            collected_final_bins_ch=unicycler.out[0].collect()
+            final_bins_ch=unicycler.out[0]
+        }
+        else {
+            final_bins_ch=metawrap_out_ch
+        }
+    } //end of step 1
+    //*************************************************
+    // STEP 2 classify taxa
+    //*************************************************
+    if (params.modular=="full" | params.modular=="classify" | params.modular=="assem-class" | params.modular=="class-annot") {
+
+        //**************
+        // File handling
+        //**************
+
+        //bins (CSV list: run id (not bin id), comma, path/to/file)
+        if (params.bin_classify) { 
+            classify_ch = Channel
+                .fromPath( params.bin_classify, checkIfExists: true )
+                .splitCsv()
+                .map { row -> ["${row[0]}", file("${row[1]}", checkIfExists: true)]  }
+                .view()
+                }
+
+        else {classify_ch=final_bins_ch}
+        if (params.modular=="classify" | params.modular=="class-annot") {
+            // sourmash_db
+            if (params.sourmash_db) { database_sourmash = file(params.sourmash_db) }
+            else {
+                sourmash_download_db() 
+                database_sourmash = sourmash_download_db.out
+            }   
+            // checkm_db
+            if (workflow.profile.contains('conda')) {
+                if (params.checkm_db) {
+                    untar = true
+                    checkm_setup_db(params.checkm_db, untar)
+                    checkm_db_path = checkm_setup_db.out
+                }
+
+                else if (params.checkm_tar_db) {
+                    untar = false
+                    checkm_setup_db(params.checkm_tar_db, untar)
+                    checkm_db_path = checkm_setup_db.out
+                }
+                
+                else {
+                    untar = false
+                    checkm_setup_db(checkm_download_db(), untar)
+                    checkm_db_path = checkm_setup_db.out
+                }
+            }
+            else { checkm_db_path = Channel.from("/checkm_database").collectFile() { item -> [ "path.txt", item ]  } }
+        }
+        //*************************
+        // Bins classify workflow
+        //*************************
+
+        //checkm of the final assemblies
+        checkm(classify_ch.groupTuple(by:0)) //checkm QC of the bins
+
+        //sourmash classification using gtdb database
+        sourmash_bins(classify_ch,database_sourmash) // fast classification using sourmash with the gtdb (not the most accurate classification, but very fast and good for preliminary results)
+        sourmash_checkm_parser(checkm.out[0],sourmash_bins.out.collect()) //parsing the result of sourmash and checkm in a single result file
+
+    } // end of step 2
+        //*************************************************
+        // STEP 3 annotation; kegg pathways + use of RNAseq
+        //*************************************************
+
+        if (params.modular=="full" | params.modular=="annotate" | params.modular=="assem-annot" | params.modular=="class-annot") {
+            //**************
+            // File handling
+            //**************
+            if (workflow.profile.contains('test')) {
+                params.rna = true
+            }
+
+            else {
+            //RNAseq
+                if (params.rna) {rna_input_ch = Channel
+                        .fromFilePairs( "${params.rna}/*_R{1,2}.fastq{,.gz}", checkIfExists: true)
+                        .view()
+                }
+            }
+            //bins (CSV list: run id (not bin id), comma, path/to/file)
+            if (params.bin_annotate) {
+                bins_input_ch = Channel
+                    .fromPath( params.bin_annotate, checkIfExists: true )
+                    .splitCsv()
+                    .map { row -> ["${row[0]}", file("${row[1]}", checkIfExists: true)]  }
+                    .view() 
+                    }
+            else if (params.bin_classify) {
+                bins_input_ch = Channel
+                    .fromPath( params.bin_classify, checkIfExists: true )
+                    .splitCsv()
+                    .map { row -> ["${row[0]}", file("${row[1]}", checkIfExists: true)]  }
+                    .view() 
+                    }
+            else {bins_input_ch = final_bins_ch }
+
+        //************************
+        // Database download and setup
+        //************************
+        if (params.eggnog_db) {eggnog_db=Channel
+                .fromPath( params.eggnog_db, checkIfExists: true )}
+        else {
+            eggnog_download_db()
+            eggnog_db = eggnog_download_db.out
+            } 
+        //*************************
+        // Bins annotation workflow
+        //*************************
+
+        eggnog_bin_ch= bins_input_ch.combine(eggnog_db)
+        eggnog_bin(eggnog_bin_ch) //annotate the bins
+        bin_annotated_ch=eggnog_bin.out[0].groupTuple(by:0).view()
+
+        //************************
+        // RNA annotation workflow
+        //************************
+        if (params.rna) {
+        // QC
+            fastp_rna(rna_input_ch) //qc illumina RNA-seq
+            rna_input_ch = fastp_rna.out
+
+        // De novo transcript
+            de_novo_transcript_and_quant(rna_input_ch) // de novo transcript assembly and quantification with trinity and salmon
+            transcript_ch=de_novo_transcript_and_quant.out
+        // annotations of transcript
+            eggnog_rna_ch= transcript_ch.combine(eggnog_db)
+            eggnog_rna(eggnog_rna_ch) //annotate the RNA-seq transcripts
+            rna_annot_ch=eggnog_rna.out[0].view()
+        }
+
+        //******************************************************
+        // Parsing bin annot and RNA out into nice graphical out
+        //******************************************************
+
+        if (params.rna)  {
+            parser_bin_RNA(rna_annot_ch,bin_annotated_ch) // parse the annotations in html summary files
+        }
+        else {
+            parser_bin(bin_annotated_ch) // parse the bins annotation in html summary files
+        }
+        // Share pathway to put and HTML file with
+    } // end of step 3
+    
+    readme_output()
+
+} // end of workflow{}
+
+workflow.onComplete { 
+  log.info ( workflow.success ? "\nDone! Results are stored here --> $params.output \n The Readme file in $params.output describes the structure of the results directories. \n" : "Oops .. something went wrong" )  }
+//***********
+// MUFFIN DONE
 //***********
\ No newline at end of file
diff --git a/modules/bwa.nf b/modules/bwa.nf
index 8cdeff4..a5678a0 100644
--- a/modules/bwa.nf
+++ b/modules/bwa.nf
@@ -1,53 +1,59 @@
-process bwa {
-    label 'bwa'
-    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "illumina.bam"  
-    //SINCE THIS module is use multiple times it migh not be advise to output the same name file mutiple times
-    input:
-    set val(name), file(assembly), file(illumina)
-    output:
-    set val(name) , file("illumina_sorted.bam")
-    script:
-    """
-    bwa index -p illumina -a bwtsw ${assembly}
-    bwa mem illumina ${illumina[0]} ${illumina[1]} -t ${task.cpus} > illumina.sam
-    samtools view -bS illumina.sam > illumina.bam
-    samtools sort -@ ${task.cpus} -o illumina_sorted.bam illumina.bam
-    rm illumina.*
-    """
-}
-
-process extra_bwa {
-    label 'bwa'
-    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "illumina.bam"  
-    //SINCE THIS module is use multiple times it migh not be advise to output the same name file mutiple times
-    input:
-    set val(name), file(assembly), file(illumina)
-    output:
-    set val(name) , file("*_sorted.bam")
-    script:
-    """
-    bwa index -p illumina -a bwtsw ${assembly}
-    bwa mem illumina ${illumina} -t ${task.cpus} > illumina.sam
-    samtools view -bS illumina.sam > illumina.bam
-    samtools sort -@ ${task.cpus} -o ${illumina[0]}_sorted.bam illumina.bam
-    rm illumina.*
-    """
-}
-
-process bwa_bin {
-    label 'bwa'
-    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "illumina.bam"  
-    //SINCE THIS module is use multiple times it migh not be advise to output the same name file mutiple times
-    input:
-    set val(name), file(assembly), file(illumina)
-    output:
-    set val(name) , file("illumina_sorted.bam")
-    script:
-    """
-    bwa index -p illumina -a bwtsw ${assembly}
-    bwa mem illumina ${illumina[0]} ${illumina[1]} -t ${task.cpus} > illumina.sam
-    samtools view -bS illumina.sam > illumina.bam
-    samtools sort -@ ${task.cpus} -o illumina_sorted.bam illumina.bam
-    rm illumina.*
-    """
+process bwa {
+    label 'bwa'
+    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "illumina.bam"  
+    //Since this module is used multiple times, it might not be advisable to output a file with the same name multiple times
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(assembly), path(illumina)
+    output:
+    tuple val(name) , path("illumina_sorted.bam")
+    script:
+    """
+    bwa index -p illumina -a bwtsw ${assembly}
+    bwa mem illumina ${illumina[0]} ${illumina[1]} -t ${task.cpus} > illumina.sam
+    samtools view -bS illumina.sam > illumina.bam
+    samtools sort -@ ${task.cpus} -o illumina_sorted.bam illumina.bam
+    rm illumina.*
+    """
+}
+
+process extra_bwa {
+    label 'bwa'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "illumina.bam"  
+    //Since this module is used multiple times, it might not be advisable to output a file with the same name multiple times
+    input:
+    tuple val(name), path(assembly), path(illumina)
+    output:
+    tuple val(name) , path("*_sorted.bam")
+    script:
+    """
+    bwa index -p illumina -a bwtsw ${assembly}
+    bwa mem illumina ${illumina} -t ${task.cpus} > illumina.sam
+    samtools view -bS illumina.sam > illumina.bam
+    samtools sort -@ ${task.cpus} -o ${illumina[0]}_sorted.bam illumina.bam
+    rm illumina.*
+    """
+}
+
+process bwa_bin {
+    label 'bwa'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "illumina.bam"  
+    //Since this module is used multiple times, it might not be advisable to output a file with the same name multiple times
+    input:
+    tuple val(name), path(assembly), path(illumina)
+    output:
+    tuple val(name) , path("illumina_sorted.bam")
+    script:
+    """
+    bwa index -p illumina -a bwtsw ${assembly}
+    bwa mem illumina ${illumina[0]} ${illumina[1]} -t ${task.cpus} > illumina.sam
+    samtools view -bS illumina.sam > illumina.bam
+    samtools sort -@ ${task.cpus} -o illumina_sorted.bam illumina.bam
+    rm illumina.*
+    """
 }
\ No newline at end of file
diff --git a/modules/cat_all_bins.nf b/modules/cat_all_bins.nf
index 6f5f662..66e5045 100644
--- a/modules/cat_all_bins.nf
+++ b/modules/cat_all_bins.nf
@@ -1,11 +1,11 @@
-process cat_all_bins {
-    label 'ubuntu'
-    input:
-    set val(name), file(bins)
-    output:
-    set val(name) , file("all_bins.fa") 
-    script:
-    """
-    cat ${bins}/bin.*.fa > all_bins.fa
-    """
+process cat_all_bins {
+    label 'ubuntu'
+    input:
+    tuple val(name), path(bins)
+    output:
+    tuple val(name) , path("all_bins.fa") 
+    script:
+    """
+    cat ${bins}/bin.*.fa > all_bins.fa
+    """
 }
\ No newline at end of file
diff --git a/modules/checkm.nf b/modules/checkm.nf
index 4e98c10..e540fd1 100644
--- a/modules/checkm.nf
+++ b/modules/checkm.nf
@@ -1,26 +1,28 @@
-process checkm {
-    maxForks 1
-    label 'checkm'
-    publishDir "${params.output}/${name}/checkm_bins/", mode: 'copy', pattern: "summary.txt"
-    publishDir "${params.output}/${name}/checkm_bins/", mode: 'copy', pattern: "taxonomy.txt"
-    publishDir "${params.output}/${name}/checkm_bins/", mode: 'copy', pattern: "*_checkm"
-    publishDir "${params.output}/${name}/checkm_bins/", mode: 'copy', pattern: "*_checkm_plot"
-    input:
-    set val(name), file(bins_assemblies)
-    output:
-    set val(name), file("summary.txt")
-    set file("${name}_checkm"), file("${name}_checkm_plot"), file("taxonomy.txt")
-    
-    script:
-    """
-    mkdir temporary
-    mkdir ${name}_bin
-    mv *.fa ${name}_bin/
-    checkm lineage_wf --tmpdir temporary --pplacer_threads 4 -t ${task.cpus} --reduced_tree -x fa ${name}_bin ${name}_checkm > summary.txt
-    checkm bin_qa_plot --image_type png -x fa ${name}_checkm ${name}_bin ${name}_checkm_plot
-    checkm tree_qa ${name}_checkm > taxonomy.txt
-     """
-}
-
-// checkm module is not use in the script at the moment but it is used in metawrap
+process checkm {
+    maxForks 1
+    label 'checkm'
+    publishDir "${params.output}/${name}/classify/checkm/", mode: 'copy', pattern: "summary.txt"
+    publishDir "${params.output}/${name}/classify/checkm/", mode: 'copy', pattern: "taxonomy.txt"
+    publishDir "${params.output}/${name}/classify/checkm/", mode: 'copy', pattern: "*_checkm"
+    publishDir "${params.output}/${name}/classify/checkm/", mode: 'copy', pattern: "*_checkm_plot"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(bins_assemblies)
+    output:
+    tuple val(name), path("summary.txt")
+    tuple path("${name}_checkm"), path("${name}_checkm_plot"), path("taxonomy.txt")
+    
+    script:
+    """
+    mkdir temporary
+    mkdir ${name}_bin
+    mv *.fa ${name}_bin/
+    checkm lineage_wf --tmpdir temporary --pplacer_threads 4 -t ${task.cpus} --reduced_tree -x fa ${name}_bin ${name}_checkm > summary.txt
+    checkm bin_qa_plot --image_type png -x fa ${name}_checkm ${name}_bin ${name}_checkm_plot
+    checkm tree_qa ${name}_checkm > taxonomy.txt
+     """
+}
+
+// checkm module is not used in the script at the moment but it is used in metawrap
 // this module can be added for an additional check by the user just call it in the main script and input a channel outputted from a binning step
\ No newline at end of file
diff --git a/modules/checkm_sourmash_parser.nf b/modules/checkm_sourmash_parser.nf
index 7e3296e..af2353e 100644
--- a/modules/checkm_sourmash_parser.nf
+++ b/modules/checkm_sourmash_parser.nf
@@ -1,15 +1,16 @@
-process sourmash_checkm_parser {
-    label 'python38'
-    publishDir "${params.output}/${name}/", mode: 'copy', pattern: "classify_step_summary.csv"
-    input:
-    set val(name), file(checkm)
-    file(sourmash)
-    output:
-    file("classify_step_summary.csv")
-    shell:
-    """
-    grep -v "] INFO: " !{checkm} | grep -v "\\-\\-\\-\\-\\-\\-\\-" | grep -v "Bin Id" | sed -e 's/^[ \\t]*//'|sed 's/[ \\t]*\$//' |sed -r 's/ +/,/g'|sed '/^\$/d' >checkm.csv
-    for file in !{sourmash}; do tail -n 1 \$file | sed -e 's/.fa//' >>sourmash.csv; done
-    checkm_sourmash_parser.py -c checkm.csv -s sourmash.csv
-    """
+process sourmash_checkm_parser {
+    //label 'python38'
+    label 'ubuntu'
+    publishDir "${params.output}/${name}/classify/", mode: 'copy', pattern: "classify_step_summary.csv"
+    input:
+    tuple val(name), path(checkm)
+    path(sourmash)
+    output:
+    path("classify_step_summary.csv")
+    shell:
+    """
+    grep -v "] INFO: " !{checkm} | grep -v "\\-\\-\\-\\-\\-\\-\\-" | grep -v "Bin Id" | sed -e 's/^[ \\t]*//'|sed 's/[ \\t]*\$//' |sed -r 's/ +/,/g'|sed '/^\$/d' >checkm.csv
+    for file in ${sourmash}; do tail -n 1 \$file | sed -e 's/.fa//' >>sourmash.csv; done
+    checkm_sourmash_parser.py -c checkm.csv -s sourmash.csv
+    """
 }
\ No newline at end of file
diff --git a/modules/checkmgetdatabases.nf b/modules/checkmgetdatabases.nf
index da7c0ff..0582984 100644
--- a/modules/checkmgetdatabases.nf
+++ b/modules/checkmgetdatabases.nf
@@ -1,12 +1,12 @@
-process checkm_download_db {
-        // if (workflow.profile == 'gcloud') { publishDir 'gs://databases-nextflow/databases/checkm', mode: 'copy', pattern: "genbank-k31.lca.json" }
-        // else { storeDir......}
-        storeDir 'nextflow-autodownload-databases/checkm' 
-        label 'ubuntu' 
-      output:
-        file("checkm_data_2015_01_16.tar.gz")
-      script:
-        """
-        wget https://data.ace.uq.edu.au/public/CheckM_databases/checkm_data_2015_01_16.tar.gz
-        """
+process checkm_download_db {
+        // if (workflow.profile == 'gcloud') { publishDir 'gs://databases-nextflow/databases/checkm', mode: 'copy', pattern: "genbank-k31.lca.json" }
+        if (workflow.profile == 'gcloud') {publishDir 'gs://nf-muffin20/databases-nextflow/checkm', mode: 'copy', pattern: "checkm_data_2015_01_16.tar.gz"}
+        else { storeDir 'nextflow-autodownload-databases/checkm' }
+        label 'ubuntu' 
+      output:
+        path("checkm_data_2015_01_16.tar.gz")
+      script:
+        """
+        wget https://data.ace.uq.edu.au/public/CheckM_databases/checkm_data_2015_01_16.tar.gz
+        """
     }
\ No newline at end of file
diff --git a/modules/checkmsetupDB.nf b/modules/checkmsetupDB.nf
index 124aa25..4e7b01f 100644
--- a/modules/checkmsetupDB.nf
+++ b/modules/checkmsetupDB.nf
@@ -1,25 +1,25 @@
-process checkm_setup_db {
-    label 'checkm'
-    input:
-    val(db)
-    val(untar)
-    output:
-    file("path_db.txt")
-    shell:
-        """
-        if [ !{untar} == true ] ;
-        then
-            checkm data setRoot !{db} ;
-            echo '!{db}' > path_db.txt;
-        fi
-
-        if [ !{untar} == false ] ;
-        then
-                    path_db=\$(dirname !{db});
-                    mkdir -p \$path_db/db/;
-                    tar -xvf !{db} -C \$path_db/db/;
-                    checkm data setRoot \$path_db/db;   
-                    echo \$path_db/db > path_db.txt;
-        fi
-        """
+process checkm_setup_db {
+    label 'checkm'
+    input:
+    val(db)
+    val(untar)
+    output:
+    path("path_db.txt")
+    shell:
+        """
+        if [ !{untar} == true ] ;
+        then
+            checkm data setRoot !{db} ;
+            echo '${db}' > path_db.txt;
+        fi
+
+        if [ !{untar} == false ] ;
+        then
+                    path_db=\$(dirname !{db});
+                    mkdir -p \$path_db/db/;
+                    tar -xvf ${db} -C \$path_db/db/;
+                    checkm data setRoot \$path_db/db;   
+                    echo \$path_db/db > path_db.txt;
+        fi
+        """
     }
\ No newline at end of file
diff --git a/modules/concoct.nf b/modules/concoct.nf
index 4218795..ec72003 100644
--- a/modules/concoct.nf
+++ b/modules/concoct.nf
@@ -1,47 +1,53 @@
-process concoct {
-    label 'concoct'
-    publishDir "${params.output}/${name}/concoct_bins/", mode: 'copy', pattern: "fasta_bins"
-    input:
-    set val(name), file(assembly), file(ont_bam), file(illumina_bam)
-    output:
-    set val(name), file("fasta_bins")
-    script:
-    """
-    mkdir concoct_out
-    cut_up_fasta.py ${assembly} -c 10000 -o 0 --merge_last -b contigs_10K.bed > contigs_10K.fa
-    samtools index -@ ${task.cpus} ${ont_bam}
-    samtools index -@ ${task.cpus} ${illumina_bam}
-    concoct_coverage_table.py contigs_10K.bed *.bam > coverage_table.tsv
-    concoct --composition_file contigs_10K.fa --coverage_file coverage_table.tsv -b concoct_out --thread ${task.cpus}
-    merge_cutup_clustering.py concoct_out/clustering_gt1000.csv > concoct_out/clustering_merged.csv
-    mkdir fasta_bins
-    extract_fasta_bins.py ${assembly} concoct_out/clustering_merged.csv --output_path fasta_bins
-
-    """
-
-}
-
-process concoct_extra {
-    label 'concoct'
-    publishDir "${params.output}/${name}/concoct_bins/", mode: 'copy', pattern: "fasta_bins"
-    input:
-    set val(name), file(assembly), file(ont_bam), file(illumina_bam)
-    file(extra_bam)
-    output:
-    set val(name), file("fasta_bins")
-    script:
-    """
-    mkdir concoct_out
-    cut_up_fasta.py ${assembly} -c 10000 -o 0 --merge_last -b contigs_10K.bed > contigs_10K.fa
-    samtools index -@ ${task.cpus} ${ont_bam}
-    samtools index -@ ${task.cpus} ${illumina_bam}
-    ls ${extra_bam} | xargs -n1 -P5 samtools index  -@ ${task.cpus}
-    concoct_coverage_table.py contigs_10K.bed *.bam > coverage_table.tsv
-    concoct --composition_file contigs_10K.fa --coverage_file coverage_table.tsv -b concoct_out --thread ${task.cpus}
-    merge_cutup_clustering.py concoct_out/clustering_gt1000.csv > concoct_out/clustering_merged.csv
-    mkdir fasta_bins
-    extract_fasta_bins.py ${assembly} concoct_out/clustering_merged.csv --output_path fasta_bins
-
-    """
-
+process concoct {
+    maxForks 1
+    label 'concoct'
+    publishDir "${params.output}/${name}/assemble/binning/concoct_bins/", mode: 'copy', pattern: "fasta_bins"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(assembly), path(ont_bam), path(illumina_bam)
+    output:
+    tuple val(name), path("fasta_bins")
+    script:
+    """
+    mkdir concoct_out
+    cut_up_fasta.py ${assembly} -c 10000 -o 0 --merge_last -b contigs_10K.bed > contigs_10K.fa
+    samtools index -@ ${task.cpus} ${ont_bam}
+    samtools index -@ ${task.cpus} ${illumina_bam}
+    concoct_coverage_table.py contigs_10K.bed *.bam > coverage_table.tsv
+    concoct --composition_file contigs_10K.fa --coverage_file coverage_table.tsv -b concoct_out --thread ${task.cpus}
+    merge_cutup_clustering.py concoct_out/clustering_gt1000.csv > concoct_out/clustering_merged.csv
+    mkdir fasta_bins
+    extract_fasta_bins.py ${assembly} concoct_out/clustering_merged.csv --output_path fasta_bins
+
+    """
+
+}
+
+process concoct_extra {
+    maxForks 1
+    label 'concoct'
+    publishDir "${params.output}/${name}/assemble/binning/concoct/", mode: 'copy', pattern: "fasta_bins"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(assembly), path(ont_bam), path(illumina_bam)
+    path(extra_bam)
+    output:
+    tuple val(name), path("fasta_bins")
+    script:
+    """
+    mkdir concoct_out
+    cut_up_fasta.py ${assembly} -c 10000 -o 0 --merge_last -b contigs_10K.bed > contigs_10K.fa
+    samtools index -@ ${task.cpus} ${ont_bam}
+    samtools index -@ ${task.cpus} ${illumina_bam}
+    ls ${extra_bam} | xargs -n1 -P5 samtools index  -@ ${task.cpus}
+    concoct_coverage_table.py contigs_10K.bed *.bam > coverage_table.tsv
+    concoct --composition_file contigs_10K.fa --coverage_file coverage_table.tsv -b concoct_out --thread ${task.cpus}
+    merge_cutup_clustering.py concoct_out/clustering_gt1000.csv > concoct_out/clustering_merged.csv
+    mkdir fasta_bins
+    extract_fasta_bins.py ${assembly} concoct_out/clustering_merged.csv --output_path fasta_bins
+
+    """
+
 }
\ No newline at end of file
diff --git a/modules/eggnog.nf b/modules/eggnog.nf
index e5cc702..b941685 100644
--- a/modules/eggnog.nf
+++ b/modules/eggnog.nf
@@ -1,32 +1,36 @@
-process eggnog_bin { 
-        label 'eggnog' 
-        publishDir "${params.output}/${name}/bin_annotated/", mode: 'copy', pattern: "*.tsv"
-      input:
-        set val(name), file(bin), file(db)
-      output:
-        set val(name), file("*.annotations.tsv")
-        file("*.seed_orthologs.tsv")
-      shell:
-        """
-        bin_id=\$(basename !{bin} | sed -r "s/\\.\\w+//2")
-        emapper.py --data_dir ${db} -d bact -o \$bin_id  -m diamond -i ${bin} --cpu ${task.cpus} --go_evidence non-electronic  --target_orthologs all --translate
-        tac \$bin_id.emapper.annotations | sed "1,3d" | tac |sed "1,3d" > \$bin_id.annotations.tsv
-        cp \$bin_id.emapper.seed_orthologs \$bin_id.seed_orthologs.tsv
-        """
-    }
-
-process eggnog_rna { 
-        label 'eggnog' 
-        publishDir "${params.output}/${name}/rna_annotated/", mode: 'copy', pattern: "*.tsv"
-      input:
-        set val(name), val(transcript), file(quant), file(db)
-      output:
-        set val(name), file("*.annotations.tsv"), file(quant)
-        file("*.seed_orthologs.tsv")
-      shell:
-        """
-        emapper.py --data_dir ${db} -d bact -o ${name}_transcript  -m diamond -i ${transcript} --cpu ${task.cpus} --go_evidence non-electronic  --target_orthologs all --translate
-        tac ${name}_transcript.emapper.annotations | sed "1,3d" | tac |sed "1,3d" > ${name}_transcript.annotations.tsv
-        cp ${name}_transcript.emapper.seed_orthologs ${name}_transcript.seed_orthologs.tsv
-        """
-    }
\ No newline at end of file
+process eggnog_bin { 
+  label 'eggnog' 
+  publishDir "${params.output}/${name}/annotate/bin_annotation/", mode: 'copy', pattern: "*.tsv"
+  errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+  maxRetries 3 
+  input:
+    tuple val(name), path(bin), path(db)
+  output:
+    tuple val(name), path("*.annotations.tsv")
+    path("*.seed_orthologs.tsv")
+  shell:
+    """
+    bin_id=\$(basename !{bin} | sed -r "s/\\.\\w+//2")
+    emapper.py --data_dir ${db} -d bact -o \$bin_id  -m diamond -i ${bin} --cpu ${task.cpus} --go_evidence non-electronic  --target_orthologs all --translate
+    tac \$bin_id.emapper.annotations | sed "1,3d" | tac |sed "1,3d" > \$bin_id.annotations.tsv
+    cp \$bin_id.emapper.seed_orthologs \$bin_id.seed_orthologs.tsv
+    """
+}
+
+process eggnog_rna { 
+  label 'eggnog' 
+  publishDir "${params.output}/${name}/annotate/rna_annotation/", mode: 'copy', pattern: "*.tsv"
+  errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+  maxRetries 3 
+  input:
+    tuple val(name), path(transcript), path(quant), path(db)
+  output:
+    tuple val(name), path("*.annotations.tsv"), path(quant)
+    path("*.seed_orthologs.tsv")
+  shell:
+    """
+    emapper.py --data_dir ${db} -d bact -o ${name}_transcript  -m diamond -i ${transcript} --cpu ${task.cpus} --go_evidence non-electronic  --target_orthologs all --translate
+    tac ${name}_transcript.emapper.annotations | sed "1,3d" | tac |sed "1,3d" > ${name}_transcript.annotations.tsv
+    cp ${name}_transcript.emapper.seed_orthologs ${name}_transcript.seed_orthologs.tsv
+    """
+}
\ No newline at end of file
diff --git a/modules/eggnog_get_databases.nf b/modules/eggnog_get_databases.nf
index 461ba80..2aea7aa 100644
--- a/modules/eggnog_get_databases.nf
+++ b/modules/eggnog_get_databases.nf
@@ -1,13 +1,14 @@
-process eggnog_download_db {
-        
-        if (workflow.profile == 'conda') { storeDir 'nextflow-autodownload-databases/eggnog' }
-        else { publishDir 'nextflow-autodownload-databases/eggnog', mode: 'copy', pattern: "eggnog-db" }
-        label 'eggnog' 
-      output:
-        file("eggnog-db")
-      script:
-        """
-        mkdir eggnog-db
-        download_eggnog_data.py --data_dir eggnog-db -y
-        """
+process eggnog_download_db {
+        
+        if (workflow.profile == 'conda') { storeDir 'nextflow-autodownload-databases/eggnog' }
+        else if (workflow.profile == 'gcloud') {publishDir 'gs://nf-muffin20/databases-nextflow/eggnog', mode: 'copy', pattern: "eggnog-db"}
+        else { publishDir 'nextflow-autodownload-databases/eggnog', mode: 'copy', pattern: "eggnog-db" }
+        label 'eggnog' 
+      output:
+        path("eggnog-db")
+      script:
+        """
+        mkdir eggnog-db
+        download_eggnog_data.py --data_dir eggnog-db -y
+        """
     }
\ No newline at end of file
diff --git a/modules/ext_check.nf b/modules/ext_check.nf
index dafd389..4f304f0 100644
--- a/modules/ext_check.nf
+++ b/modules/ext_check.nf
@@ -1,25 +1,25 @@
-process fasta_check { 
-        label 'ubuntu'
-      input:
-       set val(sample), val(bin_id), file(file)
-      output:
-       set val(name), val(bin_id), file("${bin_id}.fa")
-      shell:
-        """
-       case "${file}" in
-            *.gz)
-                zcat !{file} > ${bin_id}.fa
-                ;;
-            *.fasta)
-                
-                cp !{file} ${bin_id}.fa
-                ;;
-            *.fa)
-                cp !{file} ${bin_id}.fa
-                ;;
-            *)
-                echo "file format not supported...what the phage...(.fa .fasta .fna .gz is supported)"
-                exit 1
-        esac
-        """
+process fasta_check { 
+        label 'ubuntu'
+      input:
+       tuple val(sample), val(bin_id), path(file)
+      output:
+       tuple val(sample), val(bin_id), path("${bin_id}.fa")
+      shell:
+        """
+       case "${file}" in
+            *.gz)
+                zcat !{file} > ${bin_id}.fa
+                ;;
+            *.fasta)
+                
+                cp !{file} ${bin_id}.fa
+                ;;
+            *.fa)
+                cp !{file} ${bin_id}.fa
+                ;;
+            *)
+                echo "file format not supported (.fa, .fasta and .gz are supported)"
+                exit 1
+        esac
+        """
     }
\ No newline at end of file
diff --git a/modules/fastp.nf b/modules/fastp.nf
index 0fd0b58..ec00861 100644
--- a/modules/fastp.nf
+++ b/modules/fastp.nf
@@ -1,25 +1,29 @@
-process fastp {
-    label 'fastp'
-    publishDir "${params.output}/${name}/illumina_qc_out/", mode: 'copy', pattern: "*_R*_clean.fastq"
-    input:
-    set val(name), file(illumina)
-    output:
-    set val(name), file("*_R?_clean.fastq")
-    script:
-    """
-    fastp -i ${illumina[0]} -I ${illumina[1]} -o ${name}_R1_clean.fastq -O ${name}_R2_clean.fastq
-    """
-}
-
-process fastp_rna {
-    label 'fastp'
-    publishDir "${params.output}/${name}/rna_qc_out/", mode: 'copy', pattern: "*_R*_clean.fastq"
-    input:
-    set val(name), file(illumina)
-    output:
-    set val(name), file("*_R?_clean.fastq")
-    script:
-    """
-    fastp -i ${illumina[0]} -I ${illumina[1]} -o ${name}_R1_clean.fastq -O ${name}_R2_clean.fastq
-    """
-}
+process fastp {
+    label 'fastp'
+    publishDir "${params.output}/${name}/assemble/quality_control/illumina/", mode: 'copy', pattern: "*_R*_clean.fastq"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(illumina)
+    output:
+    tuple val(name), path("*_R?_clean.fastq")
+    script:
+    """
+    fastp -i ${illumina[0]} -I ${illumina[1]} -o ${name}_R1_clean.fastq -O ${name}_R2_clean.fastq
+    """
+}
+
+process fastp_rna {
+    label 'fastp'
+    publishDir "${params.output}/${name}/annotate/rna_quality_control/", mode: 'copy', pattern: "*_R*_clean.fastq"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(illumina)
+    output:
+    tuple val(name), path("*_R?_clean.fastq")
+    script:
+    """
+    fastp -i ${illumina[0]} -I ${illumina[1]} -o ${name}_R1_clean.fastq -O ${name}_R2_clean.fastq
+    """
+}
diff --git a/modules/flye.nf b/modules/flye.nf
index e726354..22f7474 100644
--- a/modules/flye.nf
+++ b/modules/flye.nf
@@ -1,15 +1,19 @@
-process flye {
-    label 'flye'
-    publishDir "${params.output}/${name}/flye_assembly/", mode: 'copy', pattern: "assembly.fasta"
-    input:
-    set val(name), file(ont), file(genome_size)
-    output:
-    set val(name), file("assembly.fasta")
-    shell:
-    """
-    size=\$(cat !{genome_size})
-    flye --nano-corr !{ont} -o flye_output -t !{task.cpus} --plasmids --meta --genome-size \$size
-    mv flye_output/assembly.fasta assembly.fasta
-    """
-
-}
\ No newline at end of file
+process flye {
+    label 'flye'
+    publishDir "${params.output}/${name}/assemble/assembly/flye_unpolished", mode: 'copy', pattern: "assembly.fasta"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(ont), path(genome_size)
+    output:
+    tuple val(name), path("assembly.fasta")
+    shell:
+    """
+    size=\$(cat !{genome_size})
+    flye --nano-raw ${ont} -o flye_output -t ${task.cpus} --plasmids --meta --genome-size \$size
+    mv flye_output/assembly.fasta assembly.fasta
+    """
+
+}
+
+// for Flye versions newer than 2.7, use --nano-raw
\ No newline at end of file
diff --git a/modules/list_ids.nf b/modules/list_ids.nf
index 76c3912..44950dc 100644
--- a/modules/list_ids.nf
+++ b/modules/list_ids.nf
@@ -1,16 +1,16 @@
-process contig_list {
-    label 'ubuntu'
-    input:
-    set val(name), file(bins)
-    output:
-    set val(name), file("*.contigs.list")
-    shell:\
-    """
-    for bin in !{bins}/bin.*.fa
-        do
-        bin_name=\$(basename \$bin )
-        cat \$bin | grep -o -E "^>\\w+\\.\\w+" |sed 's/>//g'| tr -d "@" > \$bin_name.contigs.list ;
-        done ;
-    """
-}
-
+process contig_list {
+    label 'ubuntu'
+    input:
+    tuple val(name), path(bins)
+    output:
+    tuple val(name), path("*.contigs.list")
+    shell:
+    """
+    for bin in ${bins}/bin.*.fa
+        do
+        bin_name=\$(basename \$bin )
+        cat \$bin | grep -o -E "^>\\w+\\.\\w+" |sed 's/>//g'| tr -d "@" > \$bin_name.contigs.list ;
+        done ;
+    """
+}
+
diff --git a/modules/maxbin2.nf b/modules/maxbin2.nf
index cc9a5a5..7567508 100644
--- a/modules/maxbin2.nf
+++ b/modules/maxbin2.nf
@@ -1,15 +1,18 @@
-process maxbin2 {
-    label 'maxbin2'
-    publishDir "${params.output}/${name}/maxbin2_bins/", mode: 'copy', pattern: "maxbin_bin" 
-    input:
-    set val(name), file(assembly), file(ont), file(illumina)
-    output:
-    set val(name), file("maxbin_bin/")
-    shell:
-    """
-    run_MaxBin.pl -contig !{assembly}  -reads !{illumina[0]} -reads2 !{illumina[1]} -reads3 !{ont}  -out maxbin2 -thread !{task.cpus}
-    mkdir maxbin_bin
-    mv maxbin2.*.fasta maxbin_bin/
-    """
-        
+process maxbin2 {
+    maxForks 1
+    label 'maxbin2'
+    publishDir "${params.output}/${name}/assemble/binning/maxbin2/", mode: 'copy', pattern: "maxbin_bin" 
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(assembly), path(ont), path(illumina)
+    output:
+    tuple val(name), path("maxbin_bin")
+    shell:
+    """
+    run_MaxBin.pl -contig ${assembly}  -reads ${illumina[0]} -reads2 ${illumina[1]} -reads3 ${ont}  -out maxbin2 -thread ${task.cpus}
+    mkdir maxbin_bin
+    mv maxbin2.*.fasta maxbin_bin/
+    """
+        
 }  // add -prob_threshold 0.5 -markerset 40 ??
\ No newline at end of file
diff --git a/modules/metabat2.nf b/modules/metabat2.nf
index 7da285c..d4e6438 100644
--- a/modules/metabat2.nf
+++ b/modules/metabat2.nf
@@ -1,27 +1,35 @@
-process metabat2 {
-    label 'metabat2'
-    publishDir "${params.output}/${name}/metabat2_bins/", mode: 'copy', pattern: "bins_dir"
-    input:
-    set val(name), file(assembly), file(ont_bam), file(illumina_bam)
-    output:
-    set val(name), file("bins_dir")
-    script:
-    """
-    metabat -i ${assembly} ${ont_bam} ${illumina_bam} -o bins_dir/metabat_bins -t ${task.cpus}
-    """
-}
-
-process metabat2_extra {
-    label 'metabat2'
-    publishDir "${params.output}/${name}/metabat2_bins/", mode: 'copy', pattern: "bins_dir" 
-    input:
-    set val(name), file(assembly), file(ont_bam), file(illumina_bam)
-    file(extra_bam)
-    output:
-    set val(name), file("bins_dir")
-    script:
-    """
-    metabat -i ${assembly} ${ont_bam} ${illumina_bam} ${extra_bam} -o bins_dir/metabat_bin -t ${task.cpus}
-    """
-
+process metabat2 {
+    maxForks 1
+    label 'metabat2'
+    publishDir "${params.output}/${name}/assemble/binning/metabat2/", mode: 'copy', pattern: "bins_dir"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(assembly), path(ont_bam), path(illumina_bam)
+    output:
+    tuple val(name), path("bins_dir")
+    script:
+    """
+    jgi_summarize_bam_contig_depths --outputDepth depth.txt *.bam
+    metabat2 -i ${assembly} -a depth.txt -o bins_dir/metabat_bins -t ${task.cpus}
+    """
+}
+
+process metabat2_extra {
+    maxForks 1
+    label 'metabat2'
+    publishDir "${params.output}/${name}/assemble/binning/metabat2/", mode: 'copy', pattern: "bins_dir" 
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(assembly), path(ont_bam), path(illumina_bam)
+    path(extra_bam)
+    output:
+    tuple val(name), path("bins_dir")
+    script:
+    """
+    jgi_summarize_bam_contig_depths --outputDepth depth.txt *.bam
+    metabat2 -i ${assembly} -a depth.txt -o bins_dir/metabat_bin -t ${task.cpus}
+    """
+
 }
\ No newline at end of file
diff --git a/modules/metawrap_refine_bin.nf b/modules/metawrap_refine_bin.nf
index abdc553..0d3cf8c 100644
--- a/modules/metawrap_refine_bin.nf
+++ b/modules/metawrap_refine_bin.nf
@@ -1,57 +1,73 @@
-process refine2 {
-    
-    if (workflow.profile == 'conda') {conda '/home/renaud/miniconda3/envs/metawrap-env'}
-    else {label 'metawrap'}
-    publishDir "${params.output}/${name}/metawrap_refined_bins/", mode: 'copy', pattern: "metawrap_bins" 
-    publishDir "${params.output}/${name}/metawrap_refined_bins/", mode: 'copy', pattern: "${name}_binning_stats.txt" 
-    input:
-    set val(name1), file(bins1), file(bins2)
-    file(path)
-    output:
-    set val(name1), file("metawrap_bins/*.fa")
-    file("${name}_binning_stats.txt")
-    shell:
-    """
-    mem=\$(echo !{task.memory} | sed 's/ GB//g')
-    path_db=\$(cat !{path})
-    echo \$path_db
-    echo -e "\$path_db" | checkm data setRoot
-    echo "checkm done"
-    metawrap bin_refinement -t !{task.cpus} -m \$mem -o refined_bins -A !{bins1} -B !{bins2} -o refined_bins
-    mkdir metawrap_bins/
-    mv refined_bins/metawrap_70_10_bins/*.fa metawrap_bins/
-    mv refined_bins/metawrap_70_10_bins.stats ${name}_binning_stats.txt
-    """
-
-}
-
-process refine3 {
-    if (workflow.profile == 'conda') {conda '/home/renaud/miniconda3/envs/metawrap-env'}
-    else {label 'metawrap'}
-    publishDir "${params.output}/${name}/metawrap_refined_bins/", mode: 'copy', pattern: "metawrap_bins" 
-    publishDir "${params.output}/${name}/metawrap_refined_bins/", mode: 'copy', pattern: "${name}_binning_stats.txt" 
-    input:
-        set val(name), file(bins1), file(bins2), file(bins3)
-        file(path)
-    output:
-    set val(name), file("metawrap_bins/*.fa")
-    file("${name}_binning_stats.txt")
-    shell:
-    """
-    mem=\$(echo !{task.memory} | sed 's/ GB//g')
-    path_db=\$(cat !{path})
-    echo \$path_db
-    echo -e "\$path_db" | checkm data setRoot
-    echo "checkm done"
-    metawrap bin_refinement -o refined_bins -A !{bins2} -B !{bins3} -C !{bins1} -t !{task.cpus} -m \$mem 
-    mkdir metawrap_bins/
-    mv refined_bins/metawrap_70_10_bins/*.fa metawrap_bins/
-    mv refined_bins/metawrap_70_10_bins.stats !{name}_binning_stats.txt
-    """
-
-}
-
-    // cp -r /home/renaud/mafin_modul/metawrap/metawrap_bins .
-    // cp /home/renaud/mafin_modul/metawrap/S_41_17_Cf_binning_stats.txt .
-
+process refine2 {
+    
+    if (workflow.profile.contains('conda')) {conda '/path/to/miniconda3/envs/metawrap-env'}
+    else {label 'metawrap'}
+    publishDir "${params.output}/${name}/assemble/binning/metawrap_refined_bins/", mode: 'copy', pattern: "metawrap_bins/*" 
+    publishDir "${params.output}/${name}/assemble/binning/metawrap_refined_bins/", mode: 'copy', pattern: "${name}_binning_stats.txt" 
+    input:
+    tuple val(name), path(bins1), path(bins2)
+    path(path)
+    output:
+    tuple val(name), path("metawrap_bins/*.fa")
+    path("${name}_binning_stats.txt")
+    shell:
+    """
+    mem=\$(echo ${task.memory} | sed 's/g//g')
+    path_db=\$(cat ${path})
+    echo \$path_db
+    echo -e "\$path_db" | checkm data setRoot
+    echo "checkm done"
+    metawrap bin_refinement -t ${task.cpus} -m \$mem -o refined_bins -A ${bins1} -B ${bins2} -o refined_bins
+    mkdir metawrap_bins/
+    mv refined_bins/metawrap_70_10_bins/*.fa metawrap_bins/
+    mv refined_bins/metawrap_70_10_bins.stats ${name}_binning_stats.txt
+    """
+
+}
+
+process refine3 {
+    if (workflow.profile.contains('conda')) {conda '/path/to/miniconda3/envs/metawrap-env'}
+    else {label 'metawrap'}
+    publishDir "${params.output}/${name}/assemble/binning/metawrap_refined_bins/", mode: 'copy', pattern: "metawrap_bins/*" 
+    publishDir "${params.output}/${name}/assemble/binning/metawrap_refined_bins/", mode: 'copy', pattern: "${name}_binning_stats.txt" 
+    errorStrategy { task.exitStatus in 1..1 ? 'retry' : 'finish'}
+    maxRetries 2
+    input:
+        tuple val(name), path(bins1), path(bins2), path(bins3)
+        path(path)
+    output:
+    tuple val(name), path("metawrap_bins/*.fa")
+    path("${name}_binning_stats.txt")
+    shell:
+    if (task.attempt == 1)
+    """
+    mem=\$(echo ${task.memory} | sed 's/g//g')
+    path_db=\$(cat ${path})
+    echo \$path_db
+    echo -e "\$path_db" | checkm data setRoot
+    echo "checkm done"
+    metawrap bin_refinement -o refined_bins -A ${bins2} -B ${bins3} -C ${bins1} -t ${task.cpus} -m \$mem 
+    mkdir metawrap_bins/
+    mv refined_bins/metawrap_70_10_bins/*.fa metawrap_bins/
+    mv refined_bins/metawrap_70_10_bins.stats ${name}_binning_stats.txt
+    """
+    else if (task.attempt == 2)
+    """
+    mem=\$(echo ${task.memory} | sed 's/g//g')
+    path_db=\$(cat ${path})
+    echo \$path_db
+    echo -e "\$path_db" | checkm data setRoot
+    echo "checkm done"
+    metawrap bin_refinement -o refined_bins -A ${bins2} -B ${bins1} -t ${task.cpus} -m \$mem 
+    mkdir metawrap_bins/
+    mv refined_bins/metawrap_70_10_bins/*.fa metawrap_bins/
+    mv refined_bins/metawrap_70_10_bins.stats ${name}_binning_stats.txt
+    """
+    else 
+    error "please pick the bins you want and submit them in the classification step then select the bins for the annotation step"
+}
+
+    // cp -r /home/renaud/mafin_modul/metawrap/metawrap_bins .
+    // cp /home/renaud/mafin_modul/metawrap/S_41_17_Cf_binning_stats.txt .
+
      
\ No newline at end of file
diff --git a/modules/minimap2.nf b/modules/minimap2.nf
index e1a4a1a..2fd3226 100644
--- a/modules/minimap2.nf
+++ b/modules/minimap2.nf
@@ -1,65 +1,73 @@
-process minimap2 {
-    label 'minimap2'
-    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "ont.bam"
-    //SINCE THIS module is use multiple times it migh not be advise to output the same name file mutiple times
-    input:
-    set val(name), file(assembly), file(ont)
-    output:
-    set val(name) , file("ont_sorted.bam")
-    script:
-    """
-    minimap2 -ax map-ont ${assembly} ${ont} > ont.sam
-    samtools view -bS ont.sam > ont.bam
-    samtools sort -@ ${task.cpus} -o ont_sorted.bam ont.bam
-    rm ont.*
-    """
-}
-
-process minimap_polish {
-    label 'minimap2'
-    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "ont.bam"
-    //SINCE THIS module is use multiple times it migh not be advise to output the same name file mutiple times
-    input:
-    set val(name), file(assembly), file(ont)
-    output:
-    set val(name) , file("ont.sam")
-    script:
-    """
-    minimap2 -ax map-ont ${assembly} ${ont} > ont.sam
-    """
-}
-
-process extra_minimap2 {
-    label 'minimap2'
-    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "ont.bam"
-    //SINCE THIS module is use multiple times it migh not be advise to output the same name file mutiple times
-    input:
-    set val(name), file(assembly), file(ont)
-    output:
-    set val(name) , file("*_sorted.bam")
-    script:
-    """
-    minimap2 -ax map-ont ${assembly} ${ont} > ont.sam
-    samtools view -bS ont.sam > ont.bam
-    samtools sort -@ ${task.cpus} -o ${ont}_sorted.bam ont.bam
-    rm ont.*
-    """
-}
-
-
-process minimap2_bin {
-    label 'minimap2'
-    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "ont.bam"
-    //SINCE THIS module is use multiple times it migh not be advise to output the same name file mutiple times
-    input:
-    set val(name), file(assembly), file(ont)
-    output:
-    set val(name) , file("ont_sorted.bam")
-    script:
-    """
-    minimap2 -ax map-ont ${assembly} ${ont} > ont.sam
-    samtools view -bS ont.sam > ont.bam
-    samtools sort -o ont_sorted.bam ont.bam
-    rm ont.*
-    """
+process minimap2 {
+    label 'minimap2'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "ont.bam"
+    // Since this module is used multiple times, it might not be advisable to output the same file name multiple times
+    input:
+    tuple val(name), path(assembly), path(ont)
+    output:
+    tuple val(name) , path("ont_sorted.bam")
+    script:
+    """
+    minimap2 -ax map-ont ${assembly} ${ont} > ont.sam
+    samtools view -bS ont.sam > ont.bam
+    samtools sort -@ ${task.cpus} -o ont_sorted.bam ont.bam
+    rm ont.*
+    """
+}
+
+process minimap_polish {
+    label 'minimap2'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "ont.bam"
+    // Since this module is used multiple times, it might not be advisable to output the same file name multiple times
+    input:
+    tuple val(name), path(assembly), path(ont)
+    output:
+    tuple val(name) , path("ont.paf")
+    script:
+    """
+    minimap2 -x map-ont ${assembly} ${ont} > ont.paf
+    """
+}
+
+process extra_minimap2 {
+    label 'minimap2'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "ont.bam"
+    // Since this module is used multiple times, it might not be advisable to output the same file name multiple times
+    input:
+    tuple val(name), path(assembly), path(ont)
+    output:
+    tuple val(name) , path("*_sorted.bam")
+    script:
+    """
+    minimap2 -ax map-ont ${assembly} ${ont} > ont.sam
+    samtools view -bS ont.sam > ont.bam
+    samtools sort -@ ${task.cpus} -o ${ont}_sorted.bam ont.bam
+    rm ont.*
+    """
+}
+
+
+process minimap2_bin {
+    label 'minimap2'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    //publishDir "${params.output}/${name}_bam/", mode: 'copy', pattern: "ont.bam"
+    // Since this module is used multiple times, it might not be advisable to output the same file name multiple times
+    input:
+    tuple val(name), path(assembly), path(ont)
+    output:
+    tuple val(name) , path("ont_sorted.bam")
+    script:
+    """
+    minimap2 -ax map-ont ${assembly} ${ont} > ont.sam
+    samtools view -bS ont.sam > ont.bam
+    samtools sort -o ont_sorted.bam ont.bam
+    rm ont.*
+    """
 }
\ No newline at end of file
diff --git a/modules/ont_qc.nf b/modules/ont_qc.nf
index 75839ce..afcdb42 100644
--- a/modules/ont_qc.nf
+++ b/modules/ont_qc.nf
@@ -1,41 +1,43 @@
-process discard_short {
-    label 'ubuntu'
-    input:
-    set val(name) , file(part)
-    output:
-    set val(name), file("filtered_${part}")
-    shell:
-    """
-        cat !{part} | paste - - - - | awk -F"\\t" 'length(\$2)  >= ${params.short_qc}' | sed 's/\\t/\\n/g' > "filtered_${part}"
-
-    """
-}
-
-
-
-process filtlong {
-    label 'filtlong'
-    input:
-    set val(name) , file(filtered)
-    output:
-    set val(name) , file("clean_${filtered}")
-    script:
-    """
-    filtlong --min_length ${params.short_qc} --keep_percent 90 --target_bases 500000000 ${filtered} > clean_${filtered}
-    """
-}
-
-
-process merge {
-    label 'ubuntu'
-    publishDir "${params.output}/${name}/nanopore_qc_out/", mode: 'copy', pattern: "*_all.fastq" 
-    input:
-    set val(name) , file(filtered)
-    output:
-    set val(name), file("${name}_all.fastq")
-    script:
-    """
-    cat *.fastq > ${name}_all.fastq
-    """
-
+process discard_short {
+    label 'ubuntu'
+    input:
+    tuple val(name) , path(part)
+    output:
+    tuple val(name), path("filtered_${part}")
+    shell:
+    """
+        cat !{part} | paste - - - - | awk -F"\\t" 'length(\$2)  >= ${params.short_qc}' | sed 's/\\t/\\n/g' > "filtered_${part}"
+
+    """
+}
+
+
+
+process filtlong {
+    label 'filtlong'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name) , path(filtered)
+    output:
+    tuple val(name) , path("clean_${filtered}")
+    script:
+    """
+    filtlong --min_length ${params.short_qc} --keep_percent 90 --target_bases 500000000 ${filtered} > clean_${filtered}
+    """
+}
+
+
+process merge {
+    label 'ubuntu'
+    publishDir "${params.output}/${name}/assemble/quality_control/nanopore/", mode: 'copy', pattern: "*_all.fastq" 
+    input:
+    tuple val(name) , path(filtered)
+    output:
+    tuple val(name), path("${name}_all.fastq")
+    script:
+    """
+    cat *.fastq > ${name}_all.fastq
+    """
+
 }
\ No newline at end of file
diff --git a/modules/parser.nf b/modules/parser.nf
index 388e9a3..6e54654 100644
--- a/modules/parser.nf
+++ b/modules/parser.nf
@@ -1,37 +1,41 @@
-process parser_bin_RNA {
-    label 'python38'
-    publishDir "${params.output}/${name}/", mode: 'copy', pattern: "parser_result/*"
-    input:
-        set val(name), file(rna_annot), file(quant)
-        set val(name), file(bins_annot)
-    output:
-        file("parser_result/*") 
-    script:
-        """
-        pankegg_bin_RNA.py -b ${bins_annot} -r ${rna_annot} -l ${quant} -o parser_result 
-        """
-    }
-process parser_bin {
-    label 'python38'
-    publishDir "${params.output}/${name}/", mode: 'copy', pattern: "parser_result/*"
-    input:
-        set val(name), file(bins_annot)
-    output:
-        file("parser_result/*") 
-    script:
-        """
-        pankegg_bin.py -b ${bins_annot} -o parser_result 
-        """
-    }
-
-    // """
-    //     #!/usr/bin/python
-
-    //     import PANKEGG
-    //     import PANKEGG.parser
-    //     from PANKEGG.parser import *
-    //     import sys
-    //     sys.argv = [sys.argv[0], '-b', '!{bin_annot}' , '-l' ,'!{quant}', '-o', 'result', '-r', '!{rna_annot}']
-    //     main()
-
+process parser_bin_RNA {
+    label 'ubuntu'
+    publishDir "${params.output}/${name}/annotate/", mode: 'copy', pattern: "parser_result/*"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+        tuple val(name), path(rna_annot), path(quant)
+        tuple val(name), path(bins_annot)
+    output:
+        path("parser_result/*") 
+    script:
+        """
+        pankegg_bin_RNA.py -b ${bins_annot} -r ${rna_annot} -l ${quant} -o parser_result 
+        """
+    }
+process parser_bin {
+    label 'ubuntu'
+    publishDir "${params.output}/${name}/annotate/", mode: 'copy', pattern: "parser_result/*"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+        tuple val(name), path(bins_annot)
+    output:
+        path("parser_result/*") 
+    script:
+        """
+        pankegg_bin.py -b ${bins_annot} -o parser_result 
+        """
+    }
+
+    // """
+    //     #!/usr/bin/python
+
+    //     import PANKEGG
+    //     import PANKEGG.parser
+    //     from PANKEGG.parser import *
+    //     import sys
+    //     sys.argv = [sys.argv[0], '-b', '!{bin_annot}' , '-l' ,'!{quant}', '-o', 'result', '-r', '!{rna_annot}']
+    //     main()
+
     //     """
\ No newline at end of file
diff --git a/modules/polish.nf b/modules/polish.nf
index 671fd63..6969fdf 100644
--- a/modules/polish.nf
+++ b/modules/polish.nf
@@ -1,94 +1,101 @@
-process racon {
-    label 'racon'
-    input:
-        set val(name), file(read), file(assembly), file(mapping) 
-    output:
-        set val(name), file(read), file("${name}_consensus.fasta") 
-    shell:
-        """
-        racon -t ${task.cpus} ${read} ${mapping} ${assembly} > ${name}_consensus.fasta
-        """
-    }
-
-process medaka {
-    label 'medaka'
-    input:
-        set val(name), file(read), file(consensus) 
-    output:
-        set val(name), file("${name}_polished.fasta") 
-    script:
-        """
-        medaka_consensus -i ${read} -d ${consensus} -o polished -t ${task.cpus} -m ${params.model}
-        mv polished/consensus.fasta ${name}_polished.fasta
-        """
-  }
-
-process pilon {
-    label 'pilon'
-    publishDir "${params.output}/${name}/flye_assembly/", mode: 'copy', pattern: "polished_assembly.fasta" 
-    input:
-        set val(name), file(assembly), file(ill_read)
-        val(iteration)
-    output:
-        set val(name) , file("polished_assembly.fasta")
-    shell:
-    """
-    mem=\$(echo !{task.memory} | sed 's/ GB//g')
-    assemb="!{assembly}"
-    for ite in {1..!{iteration}}
-    do
-        bwa index \$assemb
-        bwa mem \$assemb !{ill_read} | samtools view -bS - | samtools sort -@ !{task.cpus} - > \$ite.bam
-        samtools index -@ !{task.cpus} \$ite.bam
-        pilon -Xmx\$mem"g" --threads !{task.cpus} --genome \$assemb --bam \$ite.bam --output \$ite"_polished_assembly"
-        assemb=\$ite"_polished_assembly.fasta"
-    done
-    mv !{iteration}"_polished_assembly.fasta" polished_assembly.fasta
-    """
-
-}
-
-
-
-//*********************************
-// if polish with long on pilon add
-//*********************************
-// assemb2=\$bin_name"_illumina_polished.fasta"
-//         for ite in {1..!{iteration}}
-//         do
-//             bwa index \$assemb2
-//             bwa mem \$assemb2 !{ont_read} > assembly_ont_mapped.sam
-//             samtools view -bS assembly_ont_mapped.sam > assembly_ont_mapped.bam
-//             samtools sort -@ !{task.cpus} assembly_ont_mapped.bam > \$ite"_ont.bam"
-//             samtools index -@ !{task.cpus} \$ite"_ont.bam"
-//             pilon -Xmx24g --threads !{task.cpus} --genome \$assemb2 --bam \$ite"_ont.bam" --output \$ite"_polished_ont"
-//             assemb=\$ite"_polished_ont.fasta"
-//         done
-//         mv !{iteration}"_polished_ont.fasta" \$bin_name"_ont_polished.fasta"
-
-
-    // *************
-    // for threshold
-    // *************
-
-    // assemb="!{assembly}"
-    // if !{threshold}==""
-    // then
-    //     for ite in {1..!{iteration}}
-    //     do
-    //          pilon --genome \$assemb --bam !{ont_bam} --output \$ite"_polished_assembly"
-    //          assemb=\$ite"_polished_assembly.fasta"
-    //     done
-    //     mv \$iteration"_polished_assembly.fasta" polished_assembly.fasta
-    // fi
-
-    // if !{threshold}!=""
-    // then
-    //     thresh=\$(VALUE)
-    //     while \$thresh -lt !{threshold}
-    //          do
-    //          pilon --genome \$assemb --bam !{ont_bam} --output \$thresh"_polished_assembly"
-    //          assemb=\$thresh"_polished_assembly.fasta"
-    //          thresh=\$(GET NEW VAL FROM \$thresh"_polished_assembly")
-    //          done
+process racon {
+    label 'racon'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+        tuple val(name), path(read), path(assembly), path(mapping) 
+    output:
+        tuple val(name), path(read), path("${name}_consensus.fasta") 
+    shell:
+        """
+        racon -t ${task.cpus} ${read} ${mapping} ${assembly} > ${name}_consensus.fasta
+        """
+    }
+
+process medaka {
+    label 'medaka'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+        tuple val(name), path(read), path(consensus) 
+    output:
+        tuple val(name), path("${name}_polished.fasta") 
+    script:
+        """
+        medaka_consensus -i ${read} -d ${consensus} -o polished -t ${task.cpus} -m ${params.model}
+        mv polished/consensus.fasta ${name}_polished.fasta
+        """
+  }
+
+process pilon {
+    label 'pilon'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    publishDir "${params.output}/${name}/assemble/assembly/pilon_polished/", mode: 'copy', pattern: "polished_assembly.fasta" 
+    input:
+        tuple val(name), path(assembly), path(ill_read)
+        val(iteration)
+    output:
+        tuple val(name) , path("polished_assembly.fasta")
+    shell:
+    """
+    mem=\$(echo ${task.memory} | sed 's/ GB//g'| sed 's/g//g')
+    partial_mem=\$((\$mem*40/100))
+    assemb="${assembly}"
+    for ite in {1..${iteration}}
+    do
+        bwa index \$assemb
+        bwa mem \$assemb ${ill_read[0]} ${ill_read[1]} | samtools view -bS - | samtools sort -@ ${task.cpus} - > \$ite.bam
+        samtools index -@ ${task.cpus} \$ite.bam
+        pilon -Xmx\$partial_mem"g" --threads ${task.cpus} --genome \$assemb --bam \$ite.bam --output \$ite"_polished_assembly"
+        assemb=\$ite"_polished_assembly.fasta"
+    done
+    mv ${iteration}"_polished_assembly.fasta" polished_assembly.fasta
+    """
+
+}
+
+
+
+//*********************************
+// if polish with long on pilon add
+//*********************************
+// assemb2=\$bin_name"_illumina_polished.fasta"
+//         for ite in {1..!{iteration}}
+//         do
+//             bwa index \$assemb2
+//             bwa mem \$assemb2 !{ont_read} > assembly_ont_mapped.sam
+//             samtools view -bS assembly_ont_mapped.sam > assembly_ont_mapped.bam
+//             samtools sort -@ !{task.cpus} assembly_ont_mapped.bam > \$ite"_ont.bam"
+//             samtools index -@ !{task.cpus} \$ite"_ont.bam"
+//             pilon -Xmx24g --threads !{task.cpus} --genome \$assemb2 --bam \$ite"_ont.bam" --output \$ite"_polished_ont"
+//             assemb=\$ite"_polished_ont.fasta"
+//         done
+//         mv !{iteration}"_polished_ont.fasta" \$bin_name"_ont_polished.fasta"
+
+
+    // *************
+    // for threshold
+    // *************
+
+    // assemb="!{assembly}"
+    // if !{threshold}==""
+    // then
+    //     for ite in {1..!{iteration}}
+    //     do
+    //          pilon --genome \$assemb --bam !{ont_bam} --output \$ite"_polished_assembly"
+    //          assemb=\$ite"_polished_assembly.fasta"
+    //     done
+    //     mv \$iteration"_polished_assembly.fasta" polished_assembly.fasta
+    // fi
+
+    // if !{threshold}!=""
+    // then
+    //     thresh=\$(VALUE)
+    //     while \$thresh -lt !{threshold}
+    //          do
+    //          pilon --genome \$assemb --bam !{ont_bam} --output \$thresh"_polished_assembly"
+    //          assemb=\$thresh"_polished_assembly.fasta"
+    //          thresh=\$(GET NEW VAL FROM \$thresh"_polished_assembly")
+    //          done
     // fi
\ No newline at end of file
diff --git a/modules/readme_output.nf b/modules/readme_output.nf
new file mode 100644
index 0000000..d7a97cf
--- /dev/null
+++ b/modules/readme_output.nf
@@ -0,0 +1,12 @@
+process readme_output {
+    label 'ubuntu'
+    publishDir "${params.output}/", mode: 'copy', pattern: "README_output.txt"
+    input:
+    output:
+        path("README_output.txt") 
+    script:
+        """
+        wget https://osf.io/a6hru/download -O README_output.txt
+        """
+
+}
\ No newline at end of file
diff --git a/modules/seqtk_retrieve_reads.nf b/modules/seqtk_retrieve_reads.nf
index 1568bea..580a7ef 100644
--- a/modules/seqtk_retrieve_reads.nf
+++ b/modules/seqtk_retrieve_reads.nf
@@ -1,70 +1,74 @@
-process reads_retrieval {
-    label 'seqtk'
-    publishDir "${params.output}/${name}/reads_mapped_to_metawrap_bins/", mode: 'copy', pattern: "*.fastq"
-    input:
-    set val(name), file(contig_list), file(ill_bam), file(ont_bam), file(ill_reads), file(ont_reads)
-    output:
-    set val(name), val(file(file(file(contig_list).baseName).baseName).baseName), file("*_illumina_R{1,2}.fastq"), file("*_ont.fastq")
-    shell:
-    // first I extract the reads that NEED TO REDO IT WITH FRESH MIND (include BWA.nf)
-    """
-    bin=\$(basename -s .fa.contigs.list !{contig_list})
-    list=\$(cat !{contig_list} | tr "\\n" " " ) 
-
-    
-    ## illumina mapped reads retrieval
-    samtools index -@ !{task.cpus} !{ill_bam}
-    samtools view -bh !{ill_bam} \$list > illumina_contigs.bam  
-    samtools view -F4 illumina_contigs.bam > illumina_mapped_contigs.sam
-    cut -f1 illumina_mapped_contigs.sam | sort | uniq > \$bin"_illumina_mapped.list"
-    seqtk subseq !{ill_reads[0]} \$bin"_illumina_mapped.list" > \$bin"_illumina_R1.fastq"
-    seqtk subseq !{ill_reads[1]} \$bin"_illumina_mapped.list" > \$bin"_illumina_R2.fastq"
-
-    ## ONT mapped reads retrieval
-    samtools index -@ !{task.cpus} !{ont_bam}
-    samtools view -bh !{ont_bam} \$list > ont_contigs.bam  
-    samtools view -F4 ont_contigs.bam > ont_mapped_contigs.sam
-    cut -f1 ont_mapped_contigs.sam | sort | uniq > \$bin"_ont_mapped.list"
-    seqtk subseq !{ont_reads} \$bin"_ont_mapped.list" > \$bin"_ont.fastq"
-
-    rm illumina_contigs.bam
-    rm illumina_mapped_contigs.sam
-    rm ont_contigs.bam
-    rm ont_mapped_contigs.sam
-    rm *.bam.bai
-    """
-
-}
-
-// TODO PUT THE UNMAPPING AS A STAND ALONE STEP RETRIEVEING ALL UNMAPPED AS ONE FILE
-
-// notes that contigs.bam is the bam file of the reads aligned to the list of contigs used
-// notes that mapped.contigs.bam is the mapped reads to the contigs
-
-process unmapped_retrieve {
-    label 'seqtk'
-    publishDir "${params.output}/${name}/reads_unmapped_to_metawrap_bins/", mode: 'copy', pattern: "*unmapped_*.fastq"
-    input:
-    set val(name), file(ill_bam), file(ont_bam), file(ill_reads), file(ont_reads)
-    output:
-    file("unmapped_*.fastq") optionnal true
-    shell:
-    """
-    ## illumina unmapped reads retrieval
-    samtools view -f4 !{ill_bam} > illumina_unmapped_contigs.sam
-    cut -f1 illumina_unmapped_contigs.sam | sort | uniq > illumina_unmapped.list
-    seqtk subseq !{ill_reads[0]} illumina_unmapped.list > unmapped_ILL_R1.fastq
-    seqtk subseq !{ill_reads[1]} illumina_unmapped.list > unmapped_ILL_R2.fastq
-
-    ## ONT unmapped reads retrieval
-    samtools view -f4 !{ont_bam} > ont_unmapped_contigs.sam
-    cut -f1 ont_unmapped_contigs.sam | sort | uniq > ont_unmapped.list
-    seqtk subseq !{ont_reads} ont_unmapped.list > unmapped_ONT.fastq
-
-    rm illumina_unmapped_contigs.sam
-    rm illumina_unmapped.list
-    rm ont_unmapped_contigs.sam
-    rm ont_unmapped.list
-    """
-
+process reads_retrieval {
+    label 'seqtk'
+    publishDir "${params.output}/${name}/assemble/reassembly/mapped_reads/", mode: 'copy', pattern: "*.fastq"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(contig_list), path(ill_bam), path(ont_bam), path(ill_reads), path(ont_reads)
+    output:
+    tuple val(name), val(file(file(file(contig_list).baseName).baseName).baseName), path("*_illumina_R{1,2}.fastq"), path("*_ont.fastq")
+    shell:
+    // TODO: the read-extraction step below needs to be reworked (including BWA.nf)
+    """
+    bin=\$(basename -s .fa.contigs.list ${contig_list})
+    list=\$(cat ${contig_list} | tr "\\n" " " ) 
+
+    
+    ## illumina mapped reads retrieval
+    samtools index -@ ${task.cpus} ${ill_bam}
+    samtools view -bh ${ill_bam} \$list > illumina_contigs.bam  
+    samtools view -F4 illumina_contigs.bam > illumina_mapped_contigs.sam
+    cut -f1 illumina_mapped_contigs.sam | sort | uniq > \$bin"_illumina_mapped.list"
+    seqtk subseq ${ill_reads[0]} \$bin"_illumina_mapped.list" > \$bin"_illumina_R1.fastq"
+    seqtk subseq ${ill_reads[1]} \$bin"_illumina_mapped.list" > \$bin"_illumina_R2.fastq"
+
+    ## ONT mapped reads retrieval
+    samtools index -@ ${task.cpus} ${ont_bam}
+    samtools view -bh ${ont_bam} \$list > ont_contigs.bam  
+    samtools view -F4 ont_contigs.bam > ont_mapped_contigs.sam
+    cut -f1 ont_mapped_contigs.sam | sort | uniq > \$bin"_ont_mapped.list"
+    seqtk subseq ${ont_reads} \$bin"_ont_mapped.list" > \$bin"_ont.fastq"
+
+    rm illumina_contigs.bam
+    rm illumina_mapped_contigs.sam
+    rm ont_contigs.bam
+    rm ont_mapped_contigs.sam
+    rm *.bam.bai
+    """
+
+}
+
+// TODO: make the unmapped-read retrieval a stand-alone step that collects all unmapped reads into one file
+
+// Note: contigs.bam is the BAM file of the reads aligned to the list of contigs used
+// Note: mapped_contigs.sam holds the reads that mapped to those contigs
+
+process unmapped_retrieve {
+    label 'seqtk'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    publishDir "${params.output}/${name}/assemble/reassembly/unmapped_reads/", mode: 'copy', pattern: "*unmapped_*.fastq"
+    input:
+    tuple val(name), path(ill_bam), path(ont_bam), path(ill_reads), path(ont_reads)
+    output:
+    path("unmapped_*.fastq") optional true
+    shell:
+    """
+    ## illumina unmapped reads retrieval
+    samtools view -f4 ${ill_bam} > illumina_unmapped_contigs.sam
+    cut -f1 illumina_unmapped_contigs.sam | sort | uniq > illumina_unmapped.list
+    seqtk subseq ${ill_reads[0]} illumina_unmapped.list > unmapped_ILL_R1.fastq
+    seqtk subseq ${ill_reads[1]} illumina_unmapped.list > unmapped_ILL_R2.fastq
+
+    ## ONT unmapped reads retrieval
+    samtools view -f4 ${ont_bam} > ont_unmapped_contigs.sam
+    cut -f1 ont_unmapped_contigs.sam | sort | uniq > ont_unmapped.list
+    seqtk subseq ${ont_reads} ont_unmapped.list > unmapped_ONT.fastq
+
+    rm illumina_unmapped_contigs.sam
+    rm illumina_unmapped.list
+    rm ont_unmapped_contigs.sam
+    rm ont_unmapped.list
+    """
+
 }
\ No newline at end of file
diff --git a/modules/sourmash.nf b/modules/sourmash.nf
index 71167d4..9be40f5 100644
--- a/modules/sourmash.nf
+++ b/modules/sourmash.nf
@@ -1,38 +1,40 @@
-process sourmash_genome_size {
-    label 'sourmash' 
-    input:
-    set val(name), file(ont)
-    file(json)
-    output:
-    set val(name), file(ont), file('genome_size.txt')
-    shell:
-    """
-    echo "100M" >genome_size.txt
-    """
-/*
-    sourmash compute -p !{task.cpus} --scaled 10000 -k 31 !{ont} -o !{name}.sig 
-    sourmash lca gather  !{name}.sig !{json} --ignore-abundance -o metagenomic-composition.txt
-    sum_ont=\$(cat metagenomic-composition.txt | cut -d ',' -f 1 | paste -sd+ | bc)
-    total_m_ont=\$(bc -l <<< "scale=2 ; \$sum_ont /10^6")
-    if (( \$(echo "\$total_m_ont < 100" |bc -l) ));
-        then echo "100M" >genome_size.txt;
-        else echo \$total_m_ont"M" >genome_size.txt;
-    fi*/
-
-}
-
-process sourmash_bins {
-    label 'sourmash' 
-    publishDir "${params.output}/${name}/sourmash/", mode: 'copy', pattern: "*.txt"
-    input:
-    set val(name), file(bins)
-    file(json)
-    output:
-    file('*.txt')
-    shell:
-    """
-    bin_id=\$(basename !{bins} | sed -r "s/\\.\\w+//2")
-    sourmash compute -p !{task.cpus} --scaled 10000 -k 31 !{bins} -o !{bins}.sig
-    sourmash lca classify --query !{bins}.sig --db !{json} > \$bin_id.txt   
-    """
-}
+process sourmash_genome_size {
+    label 'sourmash' 
+    input:
+    tuple val(name), path(ont)
+    path(json)
+    output:
+    tuple val(name), path(ont), path('genome_size.txt')
+    shell:
+    """
+    echo "100M" >genome_size.txt
+    """
+/*
+    sourmash compute -p !{task.cpus} --scaled 10000 -k 31 !{ont} -o !{name}.sig 
+    sourmash lca gather  !{name}.sig !{json} --ignore-abundance -o metagenomic-composition.txt
+    sum_ont=\$(cat metagenomic-composition.txt | cut -d ',' -f 1 | paste -sd+ | bc)
+    total_m_ont=\$(bc -l <<< "scale=2 ; \$sum_ont /10^6")
+    if (( \$(echo "\$total_m_ont < 100" |bc -l) ));
+        then echo "100M" >genome_size.txt;
+        else echo \$total_m_ont"M" >genome_size.txt;
+    fi*/
+
+}
+
+process sourmash_bins {
+    label 'sourmash' 
+    publishDir "${params.output}/${name}/classify/sourmash/", mode: 'copy', pattern: "*.txt"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(bins)
+    path(json)
+    output:
+    path('*.txt')
+    shell:
+    """
+    bin_id=\$(basename ${bins} | sed -r "s/\\.\\w+//2")
+    sourmash compute -p ${task.cpus} --scaled 10000 -k 31 ${bins} -o ${bins}.sig
+    sourmash lca classify --query ${bins}.sig --db ${json} > \$bin_id.txt   
+    """
+}
diff --git a/modules/sourmashgetdatabase.nf b/modules/sourmashgetdatabase.nf
index ca2c694..4150189 100644
--- a/modules/sourmashgetdatabase.nf
+++ b/modules/sourmashgetdatabase.nf
@@ -1,13 +1,15 @@
-process sourmash_download_db {
-        if (workflow.profile == 'gcloud') { publishDir 'gs://databases-nextflow/databases/sourmash', mode: 'copy', pattern: "genbank-k31.lca.json.gz" }
-        else { storeDir 'nextflow-autodownload-databases/sourmash' }  
-        //this condition is here only for gcloud usage 
-        label 'ubuntu' 
-      output:
-        file("genbank-k31.lca.json.gz")
-      script:
-        """
-        #wget https://ndownloader.figshare.com/files/18809423?private_link=ed98a281ef089c033352 -O gtdb.lca.json
-        wget https://osf.io/4f8n3/download -O genbank-k31.lca.json.gz
-        """
-    }
\ No newline at end of file
+process sourmash_download_db {
+        if (workflow.profile.contains('gcloud')) { publishDir 'gs://databases-nextflow/databases/sourmash', mode: 'copy', pattern: "genbank-k31.lca.json.gz" }
+        else { storeDir 'nextflow-autodownload-databases/sourmash' }  
+        //this condition is here only for gcloud usage 
+        label 'ubuntu' 
+      output:
+        path("genbank-k31.lca.json.gz")
+      script:
+        """
+        #wget https://ndownloader.figshare.com/files/18809423?private_link=ed98a281ef089c033352 -O gtdb.lca.json
+        wget https://osf.io/4f8n3/download -O genbank-k31.lca.json.gz 
+        """
+    }
+
+    // The osf.io link used above requires sourmash v3 (sourmash v3.3.0 is known to work)
\ No newline at end of file
diff --git a/modules/spades.nf b/modules/spades.nf
index 9479e80..6c92789 100644
--- a/modules/spades.nf
+++ b/modules/spades.nf
@@ -1,15 +1,20 @@
-process spades {
-    label 'spades'
-    publishDir "${params.output}/${name}/spades_assembly/", mode: 'copy', pattern: "assembly.fasta" 
-    input:
-    set val(name), file(illumina), file(ont)
-    output:
-    set val(name), file("assembly.fasta")
-    
-    script:
-    """
-    spades.py -1 ${illumina[0]} -2 ${illumina[1]}  --meta --nanopore ${ont} -o spades_output -t ${task.cpus}
-    mv spades_output/contigs.fasta  assembly.fasta
-    """
-
+process spades {
+    label 'spades'
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    publishDir "${params.output}/${name}/assemble/assembly/spades/", mode: 'copy', pattern: "assembly.fasta" 
+    input:
+    tuple val(name), path(illumina), path(ont)
+    output:
+    tuple val(name), path("assembly.fasta")
+    
+    script:
+    """
+    mem=\$(echo ${task.memory} |sed 's/ GB//g' | sed 's/g//g')
+    cpus=\$(echo ${task.cpus})
+    echo \$cpus \$mem
+    spades.py -1 ${illumina[0]} -2 ${illumina[1]}  --meta --nanopore ${ont} -o spades_output -t \$cpus -m \$mem
+    mv spades_output/contigs.fasta  assembly.fasta
+    """
+
 }
\ No newline at end of file
diff --git a/modules/split_ont.nf b/modules/split_ont.nf
index 7e50abb..176a734 100644
--- a/modules/split_ont.nf
+++ b/modules/split_ont.nf
@@ -1,18 +1,18 @@
-process split_ont {
-    label 'ubuntu'
-    input:
-        set val(name), file(ont)
-    output:
-    val(name), file("part0.fastq")    emit:   ont_1
-    val(name), file("part1.fastq")    emit:   ont_2
-    val(name), file("part2.fastq")    emit:   ont_3
-    val(name), file("part3.fastq")    emit:   ont_4
-    script:
-    """
-    total_line=$(wc -l !{ont} | cut -d ' ' -f 1)
-    line_per_file=$((\$total_line/4))
-    split -d -a 1 -l \$line_per_file !{ont} part
-    for part in part*; do mv \$part \$part".fastq"; done
-    """
-
+process split_ont {
+    label 'ubuntu'
+    input:
+        tuple val(name), path(ont)
+    output:
+    tuple val(name), path("part0.fastq"), emit: ont_1
+    tuple val(name), path("part1.fastq"), emit: ont_2
+    tuple val(name), path("part2.fastq"), emit: ont_3
+    tuple val(name), path("part3.fastq"), emit: ont_4
+    script:
+    """
+    total_line=\$(wc -l ${ont} | cut -d ' ' -f 1)
+    line_per_file=\$((\$total_line/4))
+    split -d -a 1 -l \$line_per_file ${ont} part
+    for part in part*; do mv \$part \$part".fastq"; done
+    """
+
 }
\ No newline at end of file
diff --git a/modules/test_data_dll.nf b/modules/test_data_dll.nf
new file mode 100644
index 0000000..9eaf3c7
--- /dev/null
+++ b/modules/test_data_dll.nf
@@ -0,0 +1,19 @@
+process test {
+    label 'ubuntu'
+    input:
+    output:
+    tuple val("subset"), path("ill/*")
+    tuple val("subset"), path("ont/subset.fastq")
+    tuple val("subset"), path("rna/*")
+    shell:
+    """
+    wget https://osf.io/9xmh4/download -O subset_data.tar.gz
+    tar -xzvf subset_data.tar.gz
+    mkdir ill
+    mv ./subset/subset_ill/* ill/
+    mkdir ont/
+    mv ./subset/subset_ont/* ont/
+    mkdir rna/
+    mv ./subset/subset_rna/* rna/
+    """
+}
\ No newline at end of file
diff --git a/modules/trinity_and_salmon.nf b/modules/trinity_and_salmon.nf
index 1317628..f452748 100644
--- a/modules/trinity_and_salmon.nf
+++ b/modules/trinity_and_salmon.nf
@@ -1,18 +1,21 @@
-process de_novo_transcript_and_quant {
-    label 'trinity'
-    publishDir "${params.output}/${name}/de_novo_transcript/", mode: 'copy', pattern: "*_transcript.fasta"
-    publishDir "${params.output}/${name}/quant_of_transcript/", mode: 'copy', pattern: "*_transcript_quant.sf"
-    input:
-    set val(name), file(rna)
-    output:
-    set val(name), file("*_transcript.fasta"), file("*_transcript_quant.sf")
-    shell:
-    """
-    mem=\$(echo "!{task.memory}" | sed 's/ GB/G/g')
-    echo \$mem
-    Trinity --seqType fq --max_memory 20G --CPU !{task.cpus} --left !{rna[0]} --right !{rna[1]}
-    cp trinity_out_dir/Trinity.fasta !{name}_transcript.fasta
-    align_and_estimate_abundance.pl --transcripts !{name}_transcript.fasta --est_method salmon --left !{rna[0]} --right !{rna[1]} --seqType fq --output_dir quant_salmon --thread_count !{task.cpus}  --prep_reference
-    cp quant_salmon/quant.sf !{name}_transcript_quant.sf
-    """
-}
+process de_novo_transcript_and_quant {
+    maxForks 1
+    label 'trinity'
+    publishDir "${params.output}/${name}/annotate/de_novo_transcript/", mode: 'copy', pattern: "*_transcript.fasta"
+    publishDir "${params.output}/${name}/annotate/quant_of_transcript/", mode: 'copy', pattern: "*_transcript_quant.sf"
+    errorStrategy { task.exitStatus in 14..14 ? 'retry' : 'finish'}
+    maxRetries 3 
+    input:
+    tuple val(name), path(rna)
+    output:
+    tuple val(name), path("*_transcript.fasta"), path("*_transcript_quant.sf")
+    shell:
+    """
+    mem=\$(echo "!{task.memory}" | sed 's/ GB/g/g' | sed 's/g/G/g')
+    echo \$mem
+    Trinity --seqType fq --max_memory \$mem --CPU !{task.cpus} --left !{rna[0]} --right !{rna[1]}
+    cp trinity_out_dir/Trinity.fasta !{name}_transcript.fasta
+    align_and_estimate_abundance.pl --transcripts !{name}_transcript.fasta --est_method salmon --left !{rna[0]} --right !{rna[1]} --seqType fq --output_dir quant_salmon --thread_count !{task.cpus}  --prep_reference
+    cp quant_salmon/quant.sf !{name}_transcript_quant.sf
+    """
+}
diff --git a/modules/unicycler_reassemble_from_bin.nf b/modules/unicycler_reassemble_from_bin.nf
index c460a47..12e31e8 100644
--- a/modules/unicycler_reassemble_from_bin.nf
+++ b/modules/unicycler_reassemble_from_bin.nf
@@ -1,41 +1,44 @@
-process unicycler {
-    maxForks 1
-    label 'unicycler'
-    publishDir "${params.output}/${name}/unicycler_assembly/", mode: 'copy', pattern: "*.fa"
-    publishDir "${params.output}/${name}/unicycler_assembly/", mode: 'copy', pattern: "*.gfa"
-    errorStrategy { task.exitStatus in 1..1 ? 'retry' : 'terminate'}
-    maxRetries 3
-    input:
-    set val(name), val(bin_name), file(illumina), file(ont)    
-    output:
-    set val(name), file("*.fa") optional true
-    file("*.gfa") optional true
-    shell:
-    if (task.attempt == 1)
-    """
-    mkdir spades_tmp
-    unicycler -1 ${illumina[0]} -2 ${illumina[1]} -l ${ont} -o output -t ${task.cpus} --keep 0 --no_pilon --spades_tmp_dir spades_tmp
-    mv output/assembly.fasta ${bin_name}".fa"
-    mv output/assembly.gfa ${bin_name}".gfa"
-    """
-    if (task.attempt == 2)
-    """
-    mkdir spades_tmp
-    unicycler -1 ${illumina[0]} -2 ${illumina[1]} -l ${ont} -o output -t ${task.cpus} --keep 0 --no_pilon --spades_tmp_dir spades_tmp --max_kmer_frac 0.85
-    mv output/assembly.fasta ${bin_name}".fa"
-    mv output/assembly.gfa ${bin_name}".gfa"
-    """
-    if (task.attempt == 3)
-    """
-    mkdir spades_tmp
-    unicycler -1 ${illumina[0]} -2 ${illumina[1]} -l ${ont} -o output -t ${task.cpus} --keep 0 --no_pilon --spades_tmp_dir spades_tmp --max_kmer_frac 0.70
-    mv output/assembly.fasta ${bin_name}".fa"
-    mv output/assembly.gfa ${bin_name}".gfa"
-    """
-    if (task.attempt == 4)
-    """
-    mkdir spades_tmp
-    unicycler -1 ${illumina[0]} -2 ${illumina[1]} -l ${ont} -o output -t ${task.cpus} --keep 0 --no_pilon --spades_tmp_dir spades_tmp --max_kmer_frac 0.50
-    mv output/assembly.fasta ${bin_name}".fa"
-    mv output/assembly.gfa ${bin_name}".gfa"
-    """
\ No newline at end of file
+process unicycler {
+    maxForks 1
+    label 'unicycler'
+    publishDir "${params.output}/${name}/assemble/reassembly/unicycler_bins/", mode: 'copy', pattern: "*.fa"
+    publishDir "${params.output}/${name}/assemble/reassembly/unicycler_bins/", mode: 'copy', pattern: "*.gfa"
+    errorStrategy { task.exitStatus in 1..1 ? 'retry' : 'finish'}
+    maxRetries 4
+    input:
+    tuple val(name), val(bin_name), path(illumina), path(ont)    
+    output:
+    tuple val(name), path("*.fa") optional true
+    path("*.gfa") optional true
+    shell:
+    if (task.attempt == 1)
+    """
+    mkdir spades_tmp
+    unicycler -1 ${illumina[0]} -2 ${illumina[1]} -l ${ont} -o output -t ${task.cpus} --keep 0 --no_pilon --spades_tmp_dir spades_tmp
+    mv output/assembly.fasta ${bin_name}".fa"
+    mv output/assembly.gfa ${bin_name}".gfa"
+    """
+    else if (task.attempt == 2)
+    """
+    mkdir spades_tmp
+    unicycler -1 ${illumina[0]} -2 ${illumina[1]} -l ${ont} -o output -t ${task.cpus} --keep 0 --no_pilon --spades_tmp_dir spades_tmp --max_kmer_frac 0.85
+    mv output/assembly.fasta ${bin_name}".fa"
+    mv output/assembly.gfa ${bin_name}".gfa"
+    """
+    else if (task.attempt == 3)
+    """
+    mkdir spades_tmp
+    unicycler -1 ${illumina[0]} -2 ${illumina[1]} -l ${ont} -o output -t ${task.cpus} --keep 0 --no_pilon --spades_tmp_dir spades_tmp --max_kmer_frac 0.70
+    mv output/assembly.fasta ${bin_name}".fa"
+    mv output/assembly.gfa ${bin_name}".gfa"
+    """
+    else if (task.attempt == 4)
+    """
+    mkdir spades_tmp
+    unicycler -1 ${illumina[0]} -2 ${illumina[1]} -l ${ont} -o output -t ${task.cpus} --keep 0 --no_pilon --spades_tmp_dir spades_tmp --max_kmer_frac 0.50
+    mv output/assembly.fasta ${bin_name}".fa"
+    mv output/assembly.gfa ${bin_name}".gfa"
+    """
+    else
+    error "Unicycler was unable to process your data. Please restart MUFFIN without Unicycler activated."
+}
\ No newline at end of file
diff --git a/nextflow.config b/nextflow.config
index 784536a..8ca6fb8 100644
--- a/nextflow.config
+++ b/nextflow.config
@@ -1,181 +1,113 @@
-manifest {
-  mainScript = 'main.nf'
-}
-
-// default parameters
-params {
-    cpus = "2"
-    memory = '16g'
-    help = false
-    profile = false
-
-    // Inputs (considered as dir)
-    ont = './nanopore'
-    illumina = './illumina'
-    bin_classify = false
-    rna = false
-    bin_annotate = false
-    bin_classify = false
-
-    // Databases
-    checkm_db = false
-    checkm_tar_db = false
-    sourmash_db = false
-    // dammit_db = false
-    // dammit_user_db = false
-    // busco_db = 'metazoa'
-    eggnog_db = false
-
-    // Options
-    modular = "full" // different option: full ; assemble ; classify ; annotate ; assem-class ; assem-annot ; class-annot
-    skip_ill_qc = false
-    skip_ont_qc = false
-    short_qc = "2000"
-    filtlong = false
-    model = "r941_min_high_g303"
-    polish_threshold = ""
-    polish_iteration = 2
-    extra_ill = false
-    extra_ont = false
-    //SRA_ill = false                  a list of additional ill sample from SRA accession number to use for the binning in Metabat2 and concoct (not implemented yet)
-    //SRA_ont = false                  a list of additional ont sample from SRA accession number to use for the binning in Metabat2 and concoct (not implemented yet)
-    skip_metabat2 = false
-    skip_maxbin2 = false
-    skip_concoct = false
-    reassembly = false
-    assembler = 'metaflye'
-
-    // Output
-    output = './results'
-
-}
-
-profiles {
-    standard {
-
-    }
-
-    conda {
-        process.executor = 'local'
-        docker.enabled=false
-        process.memory = params.memory
-        conda.createTimeout = '2h'
-        process {
-            withLabel : fastp { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::fastp'}
-            withLabel : filtlong { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::filtlong'}
-            withLabel : sourmash { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::sourmash=2.0.0a10 '}
-            withLabel : spades { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::spades'}
-            withLabel : flye { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::flye'}
-            withLabel : racon { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::racon '}
-            withLabel : medaka { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::medaka=0.11.2 '}
-            withLabel : pilon { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::pilon bioconda::bwa bioconda::samtools'}
-            withLabel : minimap2 { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::minimap2 bioconda::samtools'}
-            withLabel : bwa { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::bwa bioconda::samtools'}
-            withLabel : metabat2 { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::metabat2'}
-            withLabel : maxbin2 { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::maxbin2'}
-            withLabel : concoct { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::concoct'}
-            withLabel : checkm { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::checkm-genome'}
-            withLabel : metawrap { cpus = params.cpus ; memory = params.memory;
-            conda = 'ursky::metawrap-mg'}
-            withLabel : seqtk { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::seqtk bioconda::samtools '}
-            withLabel : unicycler { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::unicycler '}
-            withLabel : dammit { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::dammit=1.0 '}
-            withLabel : eggnog { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::diamond anaconda::biopython bioconda::eggnog-mapper=2.0.1 '}
-            withLabel : trinity { cpus = params.cpus ; memory = params.memory
-            conda = 'bioconda::trinity '}
-            withLabel : python38 { cpus = params.cpus ; memory = params.memory
-            conda = 'python=3.8 '}
-            // withLabel : { cpus = params.cpus ; memory = params.memory
-            // conda = 'bioconda:: '}
-        }
-    }
-
-    gcloud { 
-        docker.enabled = true 
-        process.executor = 'google-pipelines'
-        process.memory = params.memory
-        bucketDir = ''
-        google { project = ''; zone = '' }
-        //cloud { preemptible = true }
-        process{
-            // withLabel: busco { cpus = 8 ; memory = '30 GB' ; container = 'nanozoo/busco:3.0.2--0d4c614' } 
-            withLabel: bwa { cpus = 30 ; memory = '60 GB'; container = 'nanozoo/shovill:1.0.9--dc1de54' } 
-            withLabel: concoct { cpus = 10 ; memory = '30 GB' ; container = 'nanozoo/concoct:1.1.0--03a3888' }
-            withLabel: fastp { cpus = 12 ; memory = '16 GB' ; container = 'nanozoo/fastp:0.20.0--78a7c63' }
-            withLabel: filtlong { cpus = 4 ; memory = '12 GB' ; container = 'nanozoo/filtlong:v0.2.0--afa175e' }
-            withLabel: flye { cpus = 50 ; memory = '100 GB' ; container = 'nanozoo/flye:2.5--bae51d9' } 
-            withLabel: maxbin2 { cpus = 10 ; memory = '30 GB' ; container = 'nanozoo/maxbin2:2.2.7--b643a6b' }  
-            withLabel: medaka { cpus = 20 ; memory = '40 GB' ; container = 'nanozoo/medaka:0.10.0--1e71fdd' } 
-            withLabel: metabat2 { cpus = 10 ; memory = '30 GB' ; container = 'nanozoo/metabat2:2.13--0e2577e' }  
-            withLabel: metawrap { cpus = 20 ; memory = '40 GB' ; container = 'nanozoo/metawrap:1.2.2--de94241' } 
-            withLabel: minimap2 { cpus = 8 ; memory = '24 GB' ; container = 'nanozoo/minimap2:2.17--caba7af' }
-            withLabel: checkm { cpus = 8 ; memory = '14 GB' ; container = 'nanozoo/checkm:1.0.13--248242f' }
-            // withLabel: nanoplot { cpus = 8 ; memory = '14 GB' ; container = 'nanozoo/nanoplot:1.25.0--4e2882f' }
-            //withLabel: checkm(withLabel: metawrap)
-            withLabel: pilon { cpus = 30 ; memory = '60 GB'; container = 'nanozoo/shovill:1.0.9--dc1de54' } 
-            withLabel: racon { cpus = 10 ; memory = '30 GB' ; container = 'nanozoo/racon:1.4.7--239559c' } 
-            withLabel: seqtk { cpus = 4 ; memory = '20 GB' ; container = 'nanozoo/seqtk:1.3--dc0d16b' } 
-            withLabel: sourmash { cpus = 8 ; memory = '24 GB' ; container = 'nanozoo/sourmash:2.0.1--6970ddc'  }
-            withLabel: spades { cpus = 50 ; memory = '100 GB' ; container = 'nanozoo/spades:3.13.1--2c2a4c0'  }
-            withLabel: ubuntu { cpus = 4 ; memory = '20 GB' ; container = 'nanozoo/template:3.8--ccd0653' } 
-            withLabel: unicycler { cpus = 8 ; memory = '24 GB' ; container = 'nanozoo/unicycler:0.4.7-0--c0404e6' }
-            //withLabel: dammit { cpus = 16 ; memory = '48 GB' ; container = 'rvandamme/dammit:1' } //NOT USED ANYMORE
-            withLabel: eggnog { cpus = 16 ; memory = '48 GB' ; container = 'pgcbioinfo/eggnog-mapper:latest' }
-
-        }
-    }
-
-    docker { 
-        docker.enabled = true 
-        process.executor = 'local'
-        process.memory = params.memory
-        process{
-            // withLabel: busco { cpus = 8 ; memory = '30 GB' ; container = 'nanozoo/busco:3.0.2--0d4c614' } 
-            withLabel: bwa { cpus = params.cpus ; memory = params.memory; container = 'nanozoo/shovill:1.0.9--dc1de54' } 
-            withLabel: concoct { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/concoct:1.1.0--03a3888' }
-            withLabel: fastp { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/fastp:0.20.0--78a7c63' }
-            withLabel: filtlong { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/filtlong:v0.2.0--afa175e' }
-            withLabel: flye { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/flye:2.5--bae51d9' } 
-            withLabel: maxbin2 { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/maxbin2:2.2.7--b643a6b' }  
-            withLabel: medaka { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/medaka:0.10.0--1e71fdd' } 
-            withLabel: metabat2 { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/metabat2:2.13--0e2577e' }  
-            withLabel: metawrap { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/metawrap:1.2.2--de94241' } 
-            withLabel: minimap2 { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/minimap2:2.17--caba7af' }
-            withLabel: checkm { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/checkm:1.0.13--248242f' }
-            //withLabel: checkm(withLabel: metawrap)
-            // withLabel: nanoplot { cpus = 8 ; memory = '14 GB' ; container = 'nanozoo/nanoplot:1.25.0--4e2882f' }
-            withLabel: pilon { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/shovill:1.0.9--dc1de54' } 
-            withLabel: racon { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/racon:1.4.7--239559c' } 
-            withLabel: seqtk { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/seqtk:1.3--dc0d16b' } 
-            withLabel: sourmash { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/sourmash:2.3.0--4257650'  }
-            withLabel: python38 { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/template:3.8--ccd0653'  }
-
-
-            withLabel: spades { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/spades:3.13.1--2c2a4c0'  }
-            withLabel: ubuntu { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/template:3.8--ccd0653' } 
-            withLabel: unicycler { cpus = params.cpus ; memory = params.memory ; container = 'nanozoo/unicycler:0.4.7-0--c0404e6' }
-            //withLabel: dammit { cpus = params.cpus ; memory = params.memory ; container = 'rvandamme/dammit:1' } //NOT USED ANYMORE
-            withLabel: eggnog { cpus = params.cpus ; memory = params.memory ; container = 'pgcbioinfo/eggnog-mapper:latest' }
-        }
-
-    }
-}
-
+manifest {
+  mainScript = 'main.nf'
+}
+
+// default parameters
+params {
+    cpus = "2"
+    memory = '16g'
+    help = false
+    profile = false
+
+    // Inputs (treated as directories)
+    ont = './nanopore'
+    illumina = './illumina'
+    bin_classify = false
+    rna = false
+    bin_annotate = false
+
+    // Databases
+    checkm_db = false
+    checkm_tar_db = false
+    sourmash_db = false
+    // dammit_db = false
+    // dammit_user_db = false
+    // busco_db = 'metazoa'
+    eggnog_db = false
+
+    // Options
+    modular = "full" // available options: full ; assemble ; classify ; annotate ; assem-class ; assem-annot ; class-annot
+    skip_ill_qc = false
+    skip_ont_qc = false
+    short_qc = "2000"
+    filtlong = false
+    model = "r941_min_high_g303"
+    polish_threshold = ""
+    polish_iteration = 2
+    extra_ill = false
+    extra_ont = false
+    //SRA_ill = false                  list of additional Illumina samples, given as SRA accession numbers, to use for binning in MetaBAT2 and CONCOCT (not implemented yet)
+    //SRA_ont = false                  list of additional ONT samples, given as SRA accession numbers, to use for binning in MetaBAT2 and CONCOCT (not implemented yet)
+    skip_metabat2 = false
+    skip_maxbin2 = false
+    skip_concoct = false
+    reassembly = false
+    assembler = 'metaspades'
+    modular = 'full'
+
+    // Output
+    output = './results'
+
+}
+
+profiles {
+    // executor
+    standard {
+
+    }
+
+    test {
+
+    }
+
+    local {
+        process.executor = 'local'    
+    }
+
+    gcloud {  // TODO: update some Docker containers to the versions used in local_conda
+        //workDir = "/tmp/nextflow-docker_pipelines-$USER"
+        process.executor = 'google-lifesciences'
+        process.memory = params.memory
+        bucketDir = 'gs://bucket/work-dir'
+        google { project = 'project-name-111111'; zone = 'europe-north1-a' }
+        google.lifeSciences.copyImage = 'google/cloud-sdk:latest'
+        //google.lifeSciences.preemptible = true
+        //google.lifeSciences.bootDiskSize = "10GB"
+        google.lifeSciences.debug = true
+        includeConfig 'configs/preemptible.config'
+    }
+
+    slurm {
+        process.executor = 'slurm'
+    }
+
+    // engine
+    local_engine {
+        docker.enabled=false
+        process.memory = params.memory
+        includeConfig 'configs/local.config'
+    }
+
+    conda {
+        docker.enabled=false
+        process.memory = params.memory
+        conda.createTimeout = '2h'
+        includeConfig 'configs/conda.config'
+
+    }
+
+    docker {  // TODO: update some Docker containers to the versions used in local_conda
+        docker.enabled = true 
+        process.memory = params.memory
+        includeConfig 'configs/container.config'
+    }
+
+    singularity { // TODO: update some Docker containers to the versions used in local_conda
+        singularity.enabled = true
+        singularity.autoMounts = true
+        docker.enabled = false
+        process.memory = params.memory
+        includeConfig 'configs/container.config'
+    }
+}
+