-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error parsing attribute #1
Comments
Hope everything is ok! Gave it a look and the problem arised because of the way these .gtf/.gff test files are build. I had planned to integrate this "parent/child/feat" functionality in the future but given the circumstances the new release got ready earlier. Just to clarify some things, I build this mostly thinking in working with .gtf/.gff tha followed the Ensembl/GENCODE guidelines (so basically GFF3 and GTF2.5). Test files did not have "common" attribute lines, that is why those test runs failed. Now In order to pass the tests nextflow provides, use:
for the .gtf file, and
for the .gff3 file. This will ensure pass the testing modules. If you want to ensure the test to produce a solid .bed, I'd recommend to just cat few lines from any GENCODE or Ensembl GTF file and upload it as custom tests. I will publish the new version in the next minutes, so I think you could update some of the files there (hope the conda env gets updated asap, but this will depend on how fast my PR wil be attended). I am very grateful for your time building the Best, |
Hey! Thanks for the extemely detailed response! I've updated the version in the module, and added the args to the tests. The gtf one passed, but the gff didn't. 😞 Any other ideas? Also, is it okay if I add you as a maintainer of the module? |
Hi @edmundmiller! That was kind of strange, with those args no exit code 1 should appear. (with gff3 a blank file should be produced) Anyways, I gave it a deeper look and found some particular features with these test files that produced abnormal outputs (e.g. blank files). I did not consider 1-feature GFF3 lines (my approach was to regex for ";" in both formats, but in 1-feature lines this did not work) , like this for example:
After fixing these some errors with GTF files start appearing. Seems that this GTF test file has an additional blank space at the end of each line:
"_" at the end denotes that blank space. This new version takes care of all these little things, so nf test should pass this time. These are the commands to use:
this outputs:
and
this outputs:
No functionality have been compromised. GENCODE and Ensembl GTFs/GFFs work smoothly (now the tool takes ~0.7s less time due to additional enhancements).
Yes, of course. Thank you for your time doing this, it is really motivating! This new version is already published but we still need to wait for the bioconda team to update it. Let me know if there is anything else I can do! Best, |
Hey! Love the concept going on here and wanted to replace some perl scripts in some pipelines with the tool.
I went to make a nf-core module for it and just got errors though.
nf-core/modules#4951
Details
Reproduced it locally with
gxf2bed -i genome.gff3 -o genome.bed
as well.The text was updated successfully, but these errors were encountered: