Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Pairtools sort #5910

Closed
wants to merge 32 commits into from
Closed

Add Pairtools sort #5910

wants to merge 32 commits into from

Conversation

diegomics
Copy link
Contributor

FOR CONTRIBUTOR:

  • I have read the CONTRIBUTING.md document and this tool is appropriate for the tools-iuc repo.
  • License permits unrestricted use (educational + commercial)
  • This PR adds a new tool or tool collection
  • This PR updates an existing tool or tool collection
  • This PR does something else (explain below)

diegomics and others added 24 commits March 19, 2024 18:44
Fixed errors, added more options and tests for the tool
Restructured the folder structure
tools/pairtools/sort.xml Outdated Show resolved Hide resolved
--nproc-out \${GALAXY_SLOTS:-4}
]]></command>
<inputs>
<param name="pairs_path" type="data" format="pairs" label="Input pairs file" help="Input .pairs/.pairsam file"/>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

pairs is not a valid Galaxy format

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:(
in the parse xml we have this line and it worked...

<data name="output_parsed_pairs" format="pairs"

I can change it to text,tabular. Also pairsam?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I just double checked, .pairs and .pairsam both are tabular format data as per https://pairtools.readthedocs.io/en/latest/formats.html#pairs

If we all agree, we can stick to "format=tabular"? If not, we are also happy to include .pairs and .pairsam format to Galaxy.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok, I'll update all to tabular

Copy link
Contributor

@SaimMomin12 SaimMomin12 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @diegomics,

A small suggestion inline.

tools/pairtools/sort.xml Outdated Show resolved Hide resolved
diegomics and others added 2 commits March 27, 2024 11:06
Copy link
Member

@bgruening bgruening left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We need to decide if we add the PAIR datatype to Galaxy or if we go for tabular for the time being.

Comment on lines 18 to 19
<data name="output_sorted_pairs" format="pairs,pairsam" label="${tool.name} on ${on_string}: .pairsam"/>
</data>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It can only be one format.

Suggested change
<data name="output_sorted_pairs" format="pairs,pairsam" label="${tool.name} on ${on_string}: .pairsam"/>
</data>
<data name="output_sorted_pairs" format="pairs,pairsam" label="${tool.name} on ${on_string}: .pairsam"/>

This seems to be the format, correct? https://github.com/4dn-dcic/pairix/blob/master/pairs_format_specification.md

We should create a Galaxy datatype for it. @SaimMomin12 maybe we can talk about that tomorrow?

tools/pairtools/sort.xml Outdated Show resolved Hide resolved
tools/pairtools/sort.xml Outdated Show resolved Hide resolved
diegomics and others added 2 commits March 27, 2024 15:09
Co-authored-by: Björn Grüning <[email protected]>
Co-authored-by: Björn Grüning <[email protected]>
@SaimMomin12
Copy link
Contributor

Resolved: #5917

@diegomics
Copy link
Contributor Author

Nice! Yes... both generating part of the files for the test dataset messed up the diff. I'm closing this PR now that #5917 is done

@diegomics diegomics closed this Mar 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants