Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add sam/bam/cram output options to nucmer #5867

Merged
merged 30 commits into from
Mar 18, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
30 commits
Select commit Hold shift + click to select a range
fe9bc14
add sam output - means no delta.out so some filters and a new test.
fubar2 Mar 14, 2024
3878c4d
Much pfaffing about - mummer creates malformed sam headers!!
fubar2 Mar 15, 2024
0097f34
conditional added to tests - passing now
fubar2 Mar 15, 2024
7e8a1ef
bump version prefix - same mummer4 conda from 4 years ago :(
fubar2 Mar 15, 2024
ba002ad
add all test data and fix old warts in other modules...
fubar2 Mar 15, 2024
ef6345a
trying a mesa dependency for libGL
fubar2 Mar 15, 2024
9a7630c
add version to libgl thingy
fubar2 Mar 15, 2024
502f0f9
remove useless mesa test
fubar2 Mar 15, 2024
1c0373f
restore separate gnuplot macro
fubar2 Mar 15, 2024
11f984c
fix spelling
bgruening Mar 15, 2024
e80a86b
update test outs
fubar2 Mar 15, 2024
9a4a7fe
Merge branch 'nucmerfix' of github.com:fubar2/tools-iuc into nucmerfix
fubar2 Mar 15, 2024
b9fec78
updates
bgruening Mar 15, 2024
6298481
Merge branch 'nucmerfix' of github.com:fubar2/tools-iuc into nucmerfix
fubar2 Mar 15, 2024
efc3033
remove bogus samtools index steps - thanks Bjoern
fubar2 Mar 15, 2024
fb27873
add note on long and short outputs
fubar2 Mar 15, 2024
6d9b011
Update tools/mummer4/nucmer.xml
fubar2 Mar 15, 2024
90d0187
Update tools/mummer4/nucmer.xml
fubar2 Mar 15, 2024
43948e3
revert invalid xml
fubar2 Mar 15, 2024
08d6763
remove empty sam from test data and update tests again..
fubar2 Mar 15, 2024
e0da3ab
fix a silly strange form bug - misnamed hidden variable
fubar2 Mar 15, 2024
13c3d9f
removed all short-sam output options
fubar2 Mar 16, 2024
961152f
add missing sam.out test file :(
fubar2 Mar 16, 2024
8ac9199
Update tools/mummer4/nucmer.xml
fubar2 Mar 16, 2024
d589b9d
remove sam output option
fubar2 Mar 16, 2024
cdcaea1
Merge branch 'nucmerfix' of github.com:fubar2/tools-iuc into nucmerfix
fubar2 Mar 16, 2024
8c2b97d
Noticed that cat and tail each take a long time when there are many G…
fubar2 Mar 17, 2024
3366f6b
Incorporating suggestions for samtools wrangling.
fubar2 Mar 18, 2024
ae1faad
do not remove any headers - they've been repaired!
fubar2 Mar 18, 2024
4a39538
remove redundant samtools view
fubar2 Mar 18, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 3 additions & 3 deletions tools/mummer4/dnadiff.xml
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@
dnadiff
'reference.fa' 'query.fa'

]]>
]]>
</command>
<inputs>
<param name="reference_sequence" type="data" format="fasta" label="Reference Sequence" help="FastA or multi-FastA" />
Expand Down Expand Up @@ -48,7 +48,7 @@
<data name="snps" format="tabular" from_work_dir="out.snps" label="${tool.name} on ${on_string}: snps" >
<filter> report_only == 'no' </filter>
<actions>
<action name="column_names" type="metadata" default="[P1], [REF SUB], [QUERY SUB], [P2], [BUFF], [DIST], [LEN REF], [LEN QUERY], [REF FRAME], [QUERY FRAME], [REF TAG], [QUERY TAG]" />
<action name="column_names" type="metadata" default="[P1], [REF SUB], [QUERY SUB], [P2], [BUFF], [DIST], [LEN REF], [LEN QUERY], [REF FRAME], [QUERY FRAME], [REF TAG], [QUERY TAG]" />
</actions>
</data>
<data name="rdiff" format="tabular" from_work_dir="out.rdiff" label="${tool.name} on ${on_string}: rdiff" >
Expand All @@ -65,7 +65,7 @@
</data>
</outputs>
<tests>
<test>
<test expect_num_outputs="9">
<param name="reference_sequence" ftype="fasta" value="human_aqp3.fasta" />
<param name="query_sequence" ftype="fasta" value="mouse_aqp3.fasta"/>
<param name="report_only" value="no" />
Expand Down
19 changes: 10 additions & 9 deletions tools/mummer4/macros.xml
Original file line number Diff line number Diff line change
Expand Up @@ -2,23 +2,24 @@
<token name="@MUMMER_GNUPLOT_MANUAL@"><![CDATA[&& gnuplot < out.gp]]></token>
<xml name="bio_tools">
<xrefs>
<xref type="bio.tools">mumer4</xref>
<xref type="bio.tools">mummer4</xref>
</xrefs>
</xml>
<xml name="citation">
<citations>
<citation type="doi">10.1371/journal.pcbi.1005944</citation>
</citations>
</xml>
<xml name="gnuplot_requirement">
<requirement type="package" version="5.4.1">gnuplot</requirement>
</xml>
<token name="@TOOL_VERSION@">4.0.0rc1</token>
<token name="@VERSION_SUFFIX@">2</token>
<token name="@VERSION_SUFFIX@">3</token>
<token name="@PROFILE@">20.05</token>
<xml name="gnuplot_requirement">
<requirement type="package" version="5.4.8">gnuplot</requirement>
</xml>
<xml name="requirements">
<requirements>
<requirement type="package" version="@TOOL_VERSION@">mummer4</requirement>
<requirement type="package" version="1.19.2">samtools</requirement>
<yield />
</requirements>
</xml>
Expand All @@ -30,13 +31,13 @@
<option value="">Color</option>
<option value="-color">No color (-color)</option>
</param>
<param name="coverage" type="select" label="Coverage Plot" help="Generate a reference coverage plot (default for .tiling) or the defualt dotplot." >
<param name="coverage" type="select" label="Coverage Plot" help="Generate a reference coverage plot (default for .tiling) or the default dotplot." >
<option value="">Dotplot</option>
<option value="-c">Coverage Plot (-c)</option>
</param>
<param name="filter" type="boolean" argument="--filter" truevalue="--filter" falsevalue="" label="Filter"
<param type="boolean" argument="--filter" truevalue="--filter" falsevalue="" label="Filter"
help="Only display .delta alignments which represent the 'best' hit to any particular spot on either sequence, i.e. a one-to-one mapping of reference and query subsequences. (--filter)" />
<param name="fat" type="boolean" argument="--fat" truevalue="--fat" falsevalue="" label="Layout sequences using fattest alignment only" help="(--fat)" />
<param type="boolean" argument="--fat" truevalue="--fat" falsevalue="" label="Layout sequences using fattest alignment only" help="(--fat)" />
<conditional name="labels" >
<param name="IDs" type="select" label="Plot a particular reference or query sequence?" help="For alignments that used more than one reference/query." >
<option value="no">NO</option>
Expand All @@ -54,7 +55,7 @@
<option value="large">Large</option>
</param>
<param name="snp" type="boolean" argument="--SNP" truevalue="--SNP" falsevalue="" label="SNPs" help="Highlight SNP locations in each alignment. (--SNP)" />
<param name="title" type="text" argument="-title" value="Title" label="Plot Title" help="(-title)" />
<param type="text" argument="-title" value="Title" label="Plot Title" help="(-title)" />
<conditional name="range" >
<param name="custom" type="select" label="Choose custom X and Y axis ranges?" >
<option value="no">NO</option>
Expand Down
18 changes: 9 additions & 9 deletions tools/mummer4/mummer.xml
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@
</expand>
<command detect_errors="exit_code">
<![CDATA[
mummer
mummer
$anchoring
-l '$min'
$direction
Expand Down Expand Up @@ -87,9 +87,9 @@
<option value="0">No</option>
<option value="1">Yes</option>
</param>
<param name="skip" type="integer" argument="-skip" value="10" label="Sparsify"
<param type="integer" argument="-skip" value="10" label="Sparsify"
help="Sparsify the MEM-finding algorithm even more, performing jumps of skip*k [auto (l-10)/k]. (-skip)" />
<param name="kmer" type="integer" argument="-kmer" value="1" label="kmer Table"
<param type="integer" argument="-kmer" value="1" label="kmer Table"
help="Use kmer table containing sa-intervals (speeds up searching first k characters) in the index and during search. (-kmer)" />
</when>
<when value="defaults" />
Expand All @@ -112,7 +112,7 @@
</data>
</outputs>
<tests>
<test>
<test expect_num_outputs="2">
<param name="reference_sequence" ftype="fasta" value="human_aqp3.fasta" />
<param name="query_sequence" ftype="fasta" value="mouse_aqp3.fasta" />
<param name="options|advanced" value="defaults" />
Expand All @@ -128,7 +128,7 @@

mummer

-mumreference Compute maximal matches that are unique in the reference- sequence but not
-mumreference Compute maximal matches that are unique in the reference- sequence but not
necessarily in the query-sequence (default)

-maxmatch Compute all maximal matches regardless of their uniqueness
Expand All @@ -154,16 +154,16 @@

mummerplot

-b Highlight alignments with breakpoints further than breaklen nucleotides from the nearest
-b Highlight alignments with breakpoints further than breaklen nucleotides from the nearest
sequence end

-color Color plot lines with a percent similarity gradient or turn off all plot color (default
color by match dir) If the plot is very sparse, edit the .gp script to plot with
-color Color plot lines with a percent similarity gradient or turn off all plot color (default
color by match dir) If the plot is very sparse, edit the .gp script to plot with
'linespoints' instead of 'lines'

-c Generate a reference coverage plot (default for .tiling)

--filter Only display .delta alignments which represent the "best" hit to any particular spot on
--filter Only display .delta alignments which represent the "best" hit to any particular spot on
either sequence, i.e. a one-to-one mapping of reference and query subsequences

--fat Layout sequences using fattest alignment only
Expand Down
12 changes: 6 additions & 6 deletions tools/mummer4/mummerplot.xml
Original file line number Diff line number Diff line change
Expand Up @@ -49,7 +49,7 @@
<option value="yes">YES</option>
</param>
<when value="yes">
<param name="layout" type="boolean" argument="--layout" truevalue="--layout" falsevalue="" label="Layout" help="Layout a .delta multiplot in an intelligible fashion. (--layout)" />
<param type="boolean" argument="--layout" truevalue="--layout" falsevalue="" label="Layout" help="Layout a .delta multiplot in an intelligible fashion. (--layout)" />
</when>
<when value="no" />
</conditional>
Expand All @@ -75,7 +75,7 @@
<data name="output_png" format="png" from_work_dir="out.png" label="${tool.name} on ${on_string}: plot" />
</outputs>
<tests>
<test>
<test expect_num_outputs="5">
<param name="delta" value="nucmer.txt" ftype="tabular" />
<param name="reference_sequence" ftype="fasta" value="human_aqp3.fasta" />
<param name="query_sequence" ftype="fasta" value="mouse_aqp3.fasta" />
Expand All @@ -99,16 +99,16 @@ Mummerplot is a perl script that generates gnuplot scripts and data collections
**Options:**::


-b Highlight alignments with breakpoints further than breaklen nucleotides from the nearest
-b Highlight alignments with breakpoints further than breaklen nucleotides from the nearest
sequence end

-color Color plot lines with a percent similarity gradient or turn off all plot color (default
color by match dir) If the plot is very sparse, edit the .gp script to plot with
-color Color plot lines with a percent similarity gradient or turn off all plot color (default
color by match dir) If the plot is very sparse, edit the .gp script to plot with
'linespoints' instead of 'lines'

-c Generate a reference coverage plot (default for .tiling)

--filter Only display .delta alignments which represent the "best" hit to any particular spot on
--filter Only display .delta alignments which represent the "best" hit to any particular spot on
either sequence, i.e. a one-to-one mapping of reference and query subsequences

--fat Layout sequences using fattest alignment only
Expand Down
Loading