Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Do not merge!] Pseudo PR for first release #9

Closed
wants to merge 195 commits into from
Closed
Show file tree
Hide file tree
Changes from 3 commits
Commits
Show all changes
195 commits
Select commit Hold shift + click to select a range
e59fb9e
Intall the assemblyscan module with nf-core tools
charles-plessy Apr 11, 2024
5b04820
added module gfastats (like installation)
U13bs1125 Apr 12, 2024
28102f7
Sample sheet for one genome file in compressed FASTA format
charles-plessy Apr 12, 2024
875558f
installed a dotplot last
U13bs1125 Apr 14, 2024
c92a3d7
installed lastal
U13bs1125 Apr 14, 2024
4251a26
installed lastdb lastsplit
U13bs1125 Apr 14, 2024
eaff71c
installed last train mafswap
U13bs1125 Apr 14, 2024
d39dff7
Merge branch 'dev' of github.com:oist/plessy_pairwiseGenomeComparison…
charles-plessy Apr 15, 2024
cf6cac8
Added a 'target' parameter
U13bs1125 Apr 17, 2024
497ee63
Merge pull request #18 from oist/adtag
charles-plessy Apr 17, 2024
bf16f5d
Remove handling of paired-end data.
charles-plessy Apr 17, 2024
45d5d3a
Run assemblyscan instead of fastqc
charles-plessy Apr 17, 2024
5e271af
Merge pull request #19 from oist/runAssemblyScan
U13bs1125 Apr 17, 2024
012a2dc
Added 1st batch of new parameters
U13bs1125 Apr 18, 2024
e2301db
Merge pull request #20 from oist/1stnewparams
charles-plessy Apr 18, 2024
3b78b0f
Remove FastQC
charles-plessy Apr 18, 2024
d830867
Add an input channel for the target genome.
charles-plessy Apr 18, 2024
1be75d1
Run lastdb
charles-plessy Apr 18, 2024
7f7fa15
Merge remote-tracking branch 'origin/addLASTDB' into dev
U13bs1125 Apr 19, 2024
61804e2
Inclusion of targetName parameter
U13bs1125 Apr 19, 2024
6f2ee0b
Inclusion of a new module: Last_train
U13bs1125 Apr 19, 2024
ecb5bb4
Run lastal
charles-plessy Apr 19, 2024
c69c8ac
Added modules: split and dotplot
U13bs1125 Apr 23, 2024
95bc221
Added modules: split and dotplot
U13bs1125 Apr 23, 2024
abffa46
tuple val(meta), path(maf)
U13bs1125 Apr 23, 2024
96f32df
correction
U13bs1125 Apr 23, 2024
7911279
...
U13bs1125 Apr 25, 2024
5321778
REGULARIZATION
U13bs1125 Apr 26, 2024
d845557
Merge branch 'dev' into addedmodules
charles-plessy Apr 26, 2024
cd517dd
Merge pull request #24 from oist/addedmodules
charles-plessy Apr 26, 2024
d7859fe
Merge branch 'dev' of github.com:oist/plessy_pairwiseGenomeComparison…
U13bs1125 Apr 26, 2024
5499a51
Remove unwanted spaces to avoid pre-commit failures in GitHub actions.
charles-plessy Apr 26, 2024
91c9e5e
Make YASS the default seed.
charles-plessy Apr 26, 2024
717ed54
Rename the pipeline to pairgenomealign
charles-plessy Apr 26, 2024
2351f32
Merge branch 'TEMPLATE' into dev
charles-plessy Apr 26, 2024
6a5b792
Also rename files accordingly.
charles-plessy Apr 26, 2024
d7b5449
Merge branch 'TEMPLATE' into dev
charles-plessy Apr 26, 2024
57ac4d6
Refreshed logos with nf-core create-logo
charles-plessy Apr 26, 2024
3da9cf8
Put the refreshed logos in the correct directory.
charles-plessy Apr 26, 2024
cdc4661
Finish overlooked merge.
charles-plessy Apr 26, 2024
fc33d26
Put the files in correct folder.
charles-plessy Apr 26, 2024
6c672df
Added modules for m2m, o2o, o2m modules
U13bs1125 Apr 30, 2024
f1b70ff
REGULARIZATION
U13bs1125 Apr 30, 2024
aa6b978
Reduce test time so that it is easier to run on OISTs short queue
charles-plessy Apr 30, 2024
eb36120
ignore that are difficult to reproducibly generate
U13bs1125 May 2, 2024
b9c35f1
Changed the prefices
U13bs1125 May 2, 2024
8d8b79f
Merge branch 'dev' into m2mdotplotsmodules
charles-plessy May 2, 2024
f53d21c
Fix whitespace issues.
charles-plessy May 2, 2024
9703460
Merge pull request #26 from oist/m2mdotplotsmodules
charles-plessy May 2, 2024
0a0194a
Complete pipeline
U13bs1125 May 2, 2024
acbe435
regularization
U13bs1125 May 2, 2024
e14145c
Draft a tube map representation of the pipeline.
charles-plessy May 7, 2024
eb435e0
Cleared TODOs
U13bs1125 May 7, 2024
17b094f
cleared some TODOs
U13bs1125 May 7, 2024
cb100ed
added svg
U13bs1125 May 7, 2024
3910ef1
..
U13bs1125 May 7, 2024
ff50dd1
..
U13bs1125 May 7, 2024
53422ec
,,
U13bs1125 May 7, 2024
b94028b
mm
U13bs1125 May 7, 2024
04e7d37
m
U13bs1125 May 7, 2024
8006076
Editing the README file
U13bs1125 May 8, 2024
b044e01
Clear TODOs
U13bs1125 May 8, 2024
78befc7
Update of subworkflos utils_nfcore_pipeline
U13bs1125 May 8, 2024
c0b46db
Move the many-to-many and downstream alignments to a subworkflow
charles-plessy May 8, 2024
fd6f601
Use the new PAIRALIGN_M2M subworkflow
charles-plessy May 9, 2024
5c21c99
New m2o subworkflow
U13bs1125 May 9, 2024
5ed0636
Added alternataive statement on subworkflows
U13bs1125 May 9, 2024
c05e9de
Regularize groupinf of m2m param
U13bs1125 May 9, 2024
6016d13
mxc
U13bs1125 May 9, 2024
480d5e8
Merge branch 'TEMPLATE' into dev
charles-plessy May 10, 2024
2a4cbbc
added new params arguments; last_extr_args...
U13bs1125 May 13, 2024
9d05170
Merge branch 'subworkflow2' into dev
U13bs1125 May 13, 2024
8b496ff
Added extra arguments for lastal and split: mismap...
U13bs1125 May 14, 2024
7943e4b
Nf-core lint clearing
U13bs1125 May 15, 2024
ca8f726
Remove whitespace
charles-plessy May 21, 2024
dbe4eb8
Ran `pre-commit run --all-files` by hand.
charles-plessy May 21, 2024
36ecc17
pick file name that does not look like unassembled reads
charles-plessy May 21, 2024
6b1679b
Drop the last_split_options parameter
charles-plessy May 21, 2024
0510c6c
Updated Documentations
U13bs1125 May 22, 2024
d9b967d
Merge branch 'dev' of github.com:oist/plessy_pairwiseGenomeComparison…
U13bs1125 May 23, 2024
dd64e51
implement the datasets folder to tests
U13bs1125 May 23, 2024
c21e3e2
Implement dotplot_options
U13bs1125 May 23, 2024
34266b3
Implemented the issue: lastal params
U13bs1125 May 23, 2024
6cfbe17
cleared docs/Readme.md file
U13bs1125 May 23, 2024
a3fef68
Fix typo
charles-plessy May 24, 2024
aed4dff
removed the hanging "i"
U13bs1125 May 24, 2024
ef7d436
Merge branch 'dev' of github.com:oist/plessy_pairwiseGenomeComparison…
charles-plessy May 24, 2024
0e555c6
chema build done
U13bs1125 May 24, 2024
517bdd4
Merge branch 'dev' of github.com:oist/plessy_pairwiseGenomeComparison…
U13bs1125 May 24, 2024
08a3e57
Ran pre-commit run --all-files
charles-plessy May 24, 2024
21386e2
Remove Windowmasker and adjust borders
May 24, 2024
686a1db
Simplify information already given in docs/output.md
May 24, 2024
3f360d1
Transfer information to usage page
May 24, 2024
e102ddd
Credits
May 24, 2024
124eec1
Format reference.
May 24, 2024
8f595db
pre-commit run --all-files
charles-plessy May 24, 2024
4e5aa3f
Brush up schema
charles-plessy May 24, 2024
cdeaa73
Merge branch 'TEMPLATE' into dev
charles-plessy May 28, 2024
5090a0e
Correct input parameter type.
charles-plessy May 28, 2024
97dd3ec
Ensure last-train is ran with --revsym and the other lastal options
charles-plessy May 28, 2024
dbaf9b8
Add a zero to make nf-core lint happier
charles-plessy May 28, 2024
73bdd22
Show information directly in description.
charles-plessy May 28, 2024
5b9c9ac
Try with uppercase E like in funcscan pipeline
charles-plessy May 28, 2024
9ea4a53
Typo
charles-plessy May 28, 2024
7fbb90f
Prefix output file names with ${params.targetName}___
charles-plessy May 28, 2024
8ea3c9f
Allow for .fna and .fna.gz suffixed.
charles-plessy May 28, 2024
58efb12
Use the nf-core repository
charles-plessy May 28, 2024
1832f81
Whitespace changes by pre-commit
charles-plessy May 28, 2024
8493b3b
Numeric argument with non-scientific notation
charles-plessy May 28, 2024
fc353a4
Update nf-core modules
charles-plessy Jun 11, 2024
b354802
Adjust test config to the small size of the sequence
charles-plessy Jun 11, 2024
76f78c9
Set parameters of lastdb to -R01 -c -u${params.seed} -S2
charles-plessy Jun 11, 2024
7bae94e
Rename LAST_LASTAL LAST_LASTAL_M2M in the M2M workflow for consistency.
charles-plessy Jun 11, 2024
8498a70
installed and run seqtk successfully
U13bs1125 Jun 18, 2024
dfb7e54
Effected the changes of Seqtk workflow to run on each channel individ…
U13bs1125 Jun 18, 2024
bcb46f8
Again, prettier and lint
charles-plessy Jun 18, 2024
5b3fa86
Merge pull request #1 from charles-plessy/minus-s-option
U13bs1125 Jun 18, 2024
57fd146
Update pairgenomealign.nf
U13bs1125 Jun 19, 2024
62518a2
Merge pull request #2 from charles-plessy/addseqtk
charles-plessy Jun 19, 2024
9a4724f
Small-scale test suite with fungal genomes.
charles-plessy Jun 19, 2024
d3fde9e
Merge pull request #3 from charles-plessy/fusarium
U13bs1125 Jun 19, 2024
ae22d57
Fix file names in modules runnign on single genomes.
charles-plessy Jun 19, 2024
b5862bd
Merge pull request #4 from charles-plessy/target__query
U13bs1125 Jun 19, 2024
caba650
Merge branch 'dev' of github.com:charles-plessy/pairgenomealign into dev
U13bs1125 Jun 19, 2024
b11fe3a
Update last/dotplot
charles-plessy Jun 28, 2024
588955f
Pass the seqtk cutN output to last-dotplot
charles-plessy Jun 28, 2024
99eb125
Document the use of seqtk cutN to plot polyN regions.
charles-plessy Jun 28, 2024
0ec1923
Document assemblyscan and seqtk; remove postmask
charles-plessy Jun 28, 2024
6f146d6
Reorganise documentation of dot-plots and other outputs
charles-plessy Jun 28, 2024
fb6fc8d
Merge pull request #5 from charles-plessy/plotContigBoundaries2
U13bs1125 Jun 28, 2024
3a061df
Update modules
charles-plessy Jul 16, 2024
36825d7
Collect and report training and alignment statistics.
charles-plessy Jul 16, 2024
a2e223a
Collect more software version numbers.
charles-plessy Jul 16, 2024
fc92de6
Merge pull request #7 from charles-plessy/lastal-multiqc2
U13bs1125 Jul 16, 2024
9499a67
new branch to resolve multiqc version issue
U13bs1125 Jul 17, 2024
52bcfac
Combine two reports, plus minor changes.
charles-plessy Jul 18, 2024
c62835d
Cite LAST papers.
charles-plessy Jul 19, 2024
f43f129
Fix typo in output file name.
charles-plessy Jul 19, 2024
ae2bd85
Change comments and indentation
charles-plessy Jul 19, 2024
6ea049a
Full tests comparing the human genome to other primates.
charles-plessy Jul 19, 2024
e0cbc98
Name the target
charles-plessy Jul 19, 2024
db6492d
Release 1.0.0
charles-plessy Jul 19, 2024
eb13134
Remove the PSEUDO seed from the schema.
charles-plessy Jul 23, 2024
b58818a
Correct duplicated text
charles-plessy Jul 24, 2024
94545b9
Reduce redundancy with `doc/output.md` as suggested in PR #9.
charles-plessy Jul 24, 2024
18a2ba1
Also cite the paper describing the original implementation.
charles-plessy Jul 24, 2024
c9d2be8
Correct the list of accepted file suffixes
charles-plessy Jul 24, 2024
adda591
Remove mention of unported parameters.
charles-plessy Jul 24, 2024
27b0c0a
Improve wording of docs/usage.md
charles-plessy Jul 24, 2024
a58961b
Mention --input explicitely
charles-plessy Jul 24, 2024
2ed941b
Fix typo
charles-plessy Jul 24, 2024
157e675
Fix markdown formatting.
charles-plessy Jul 24, 2024
5463b6b
modified the pipeline logo, now png formatted
U13bs1125 Jul 24, 2024
6df876d
modified the pipeline logo, now png formatted
U13bs1125 Jul 24, 2024
8eefc01
added the new svg formatted pipeline logo/map
U13bs1125 Jul 24, 2024
cd92f37
Merge pull request #10 from oist/devop
charles-plessy Jul 24, 2024
3476a81
Merge branch 'dev' of github.com:nf-core/pairgenomealign into dev
charles-plessy Jul 24, 2024
2940837
Indent workflows/pairgenomealign.nf
charles-plessy Jul 24, 2024
2baf4b4
Remove un-needed example
charles-plessy Jul 24, 2024
054e436
Remove dangling filename.
charles-plessy Jul 24, 2024
f279c5c
Show the full sample sheet as an example.
charles-plessy Jul 24, 2024
11d1457
Multi-query example
charles-plessy Jul 24, 2024
c5ec40a
Slim nextflow.config
charles-plessy Jul 24, 2024
5d36b75
Move LAST output to `alignment/`
charles-plessy Jul 24, 2024
7fb4f49
Merge branch 'dev' of github.com:nf-core/pairgenomealign into dev
charles-plessy Jul 24, 2024
850605e
Put `seqtk cutN` output in `cutn/` and document it.
charles-plessy Jul 24, 2024
cabd248
Remove mention of FastQC
charles-plessy Jul 24, 2024
d3e2e86
[automated] Fix code linting
nf-core-bot Jul 24, 2024
0bb9ad7
Remove duplicated documentation.
charles-plessy Jul 24, 2024
d2406ac
Merge branch 'dev' of github.com:nf-core/pairgenomealign into dev
charles-plessy Jul 24, 2024
a93536f
Fix typo
charles-plessy Jul 24, 2024
c029ae4
Add a human–monkey alignment as example.
charles-plessy Jul 25, 2024
3e349a9
Update workflows/pairgenomealign.nf
charles-plessy Jul 25, 2024
340d9c6
Rename the custom module and document its output.
charles-plessy Jul 25, 2024
6949ad5
Merge branch 'dev' of github.com:nf-core/pairgenomealign into dev
charles-plessy Jul 25, 2024
ab93bb4
Revert "Update workflows/pairgenomealign.nf"
charles-plessy Jul 25, 2024
420a929
Polish parameter description.
charles-plessy Jul 25, 2024
0b417aa
Move tube map to docs/ hoping it solves display problem.
charles-plessy Jul 25, 2024
e9fb4bd
Add an example dot-plot
charles-plessy Jul 25, 2024
eca7b83
Remove FASTQC examples.
charles-plessy Jul 25, 2024
823bcdc
Add new multiqc examples
charles-plessy Jul 25, 2024
591ee73
Merge branch 'dev' of github.com:nf-core/pairgenomealign into dev
charles-plessy Jul 25, 2024
057a097
Display example MultiQC plots
charles-plessy Jul 25, 2024
123d9dc
prettier
charles-plessy Jul 25, 2024
44fff18
modified the logomap again as advised by nfcore team
U13bs1125 Jul 25, 2024
cc8234a
Merge pull request #13 from oist/devlogo
charles-plessy Jul 26, 2024
dd56788
Add a codename
charles-plessy Jul 26, 2024
d5df279
Fix filename
charles-plessy Jul 26, 2024
3737d98
Merge branch 'dev' of github.com:nf-core/pairgenomealign into dev
charles-plessy Jul 26, 2024
872991d
Use a Markdown link instead of HTML.
charles-plessy Jul 26, 2024
57224ea
pre-commit fixes
charles-plessy Jul 26, 2024
2d4f08c
Rename and document some table columns
charles-plessy Aug 7, 2024
830d557
Thank Martin and teammates
charles-plessy Aug 7, 2024
a298b19
Remove mention of lastdb -P because it does not impact the alignment …
charles-plessy Aug 8, 2024
c493be3
Update release date in CHANGELOG.md
charles-plessy Aug 26, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 5 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -72,7 +72,11 @@ For more details about the output files and reports, please refer to the

We thank the following people for their extensive assistance in the development of this pipeline:

- [Mahdi Mohammed](https://github.com/U13bs1125): ported the original pipeline to _nf-core_ template 2.14.x.
- [Mahdi Mohammed](https://github.com/U13bs1125) ported the original pipeline to _nf-core_ template 2.14.x.
- [Martin Frith](https://github.com/mcfrith/), the author of LAST, gave us extensive feedback and advices.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- [Martin Frith](https://github.com/mcfrith/), the author of LAST, gave us extensive feedback and advices.
- [Martin Frith](https://github.com/mcfrith/), the author of LAST, gave us extensive feedback and advice.

- [Michael Mansfield](https://github.com/mjmansfi) tested the pipeline and provided critical comments.
- [Aleksandra Bliznina](https://github.com/aleksandrabliznina) contributed to the creation of the initial `last/*` modules.
- [Jiashun Miao](https://github.com/miaojiashun) and [Huyen Pham](https://github.com/ngochuyenpham) tested the pipeline on vertebrate genomes.

## Contributions and Support

Expand Down
29 changes: 29 additions & 0 deletions assets/multiqc_config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -19,10 +19,39 @@ custom_data:
file_format: "tsv"
section_name: "Training parameter statistics"
plot_type: "table"
headers:
id:
title: "ID"
description: "target___query"
Comment on lines +23 to +25
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure why this is not working, but not such a big thing maybe...

substitution_percent_identity:
title: "Substitution Percent Identity"
"last -t":
title: "Temperature"
description: "Parameter for converting between scores and probability ratios. This affects the column ambiguity estimates. A score is converted to a probability ratio by this formula: exp(score / TEMPERATURE). The default value is 1/lambda, where lambda is the scale factor of the scoring matrix, which is calculated by the method of Yu and Altschul (YK Yu et al. 2003, PNAS 100(26):15688-93)."
"last -a":
title: "Gap existence"
description: "Gap existence cost (lastal -a)"
"last -b":
title: "Gap extension"
description: "Gap extension cost (lastal -b)"
"last -A":
title: "Insertion existence"
description: "Insertion existence cost (lastal -A)"
"last -B":
title: "Insertion extension"
description: "Insertion extension cost (lastal -B)"
last_o2o:
file_format: "tsv"
section_name: "Alignment statistics"
plot_type: "table"
headers:
id:
title: "ID"
description: "target__query"
TotalAlignmentLength:
title: "Total alignment length"
PercentSimilarity:
title: "Percent similarity"

sp:
last_o2o:
Expand Down
6 changes: 0 additions & 6 deletions docs/usage.md
Original file line number Diff line number Diff line change
Expand Up @@ -47,12 +47,6 @@ Please see the [parameter documentation](https://nf-co.re/pairgenomealign/parame

- The `last-train` commands runs with `--revsym` as the DNA strands play equivalent roles in the studied genomes.

- The `lastdb` command takes all CPU cores available (`lastdb -P0`). Note that
when using more than one core, the output of `lastdb` may vary, causing the
order of tied positions to change. To prevent this to happen, you can pass a
nextflow parameter file to the pipeline that reduces the number of CPUs
allocated to `LAST_LASTDB` to 1.

## Running the pipeline

The typical command for running the pipeline is as follows:
Expand Down
Loading