-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AliTV on Cactus alignment #166
Comments
Hi! The capabilities of AliTV for importing pre-calculated alignments are somewhat limited but in general MAF import is possible: https://github.com/AliTVTeam/AliTV-perl-interface/blob/master/doc/alitv.md#section-alignment |
Hi, I was trying to visualize the alignment between two of my species starting from the HAL file. I extracted a MAF referenced at the chicken genome, with the barn swallow genome as a target. I faced the sequence name problem as in the issue you linked to me. I resolved it by rename both the fasta files and the maf with new sequence names (e.g. GgC1 for Gallus gallus chromosome 1) and AliTV is actually running without errors. Here's part of the output a video while running: Use of uninitialized value in subroutine entry at /gpfs/home/projects/Hirundo/bin/AliTV-perl-interface/bin/../lib/AliTV/Alignment.pm line 262, line 2155751. However, it has been running for almost an entire day.. is that normal? My plan was to make a trial like so and then provide to AliTV the MAFs for each couple of genomes that are near in the phylogenetic tree (and I will provide also that) or the entire MAF file.. what do you think will work better? Can AliTV take the entire MAFs for multiple species ? |
Hi, great progress! The Regarding pairwise MAF vs entire MAF I'm not sure. As far as I know AliTV normally uses pairwise MAF files. Maybe @greatfireball can answer whether it will also handle MAFs for multiple species? |
Hello, AliTV tooks more that 2 days and the job has been killed (there is a 2 day limit runtime).. do you know any ways to speed up the runtime? |
I'm not sure what the bottleneck is in this case. Could you try subsetting the alignment file to see whether it properly finishes at all? Can you share an example file so I can run it locally and profile what causes this extreme runtime? |
Hi, I'm sorry for the late reply but I had to pause the analysis for a while. Now I'm back on it! I subset the MAF alignment focusing on a single chromosome alignment between two species (I referenced the MAF on chromosome 6 of my reference species and targeted it on the query species). The program ran for more that 24h but it failed (see below). I passed to AliTV the entire fasta files for the two aligned genomes (and not only the chromosomes of interest). Moreover, in the yml file (AliTV_Hirundo_Gallus_SUPER_6.yml.txt), I passed to AliTV the query species first (Gallus gallus) and then the reference (Hirundo rustica). The command: The error: You are using version v1.0.6. Many thanks! |
I'm afraid this might be due to renaming of sequence IDs AliTV needs to do internally because it calls external programs that have restrictions on what they consider valid IDs. Therefore, when reading the maf file with original IDs it tries to match them against the renamed ones and can not find them. This seems to be the exact same problem as in AliTVTeam/AliTV-perl-interface#167 (comment) As this problem affects multiple users and there is no simple workaround from a user perspective (beside manually renaming IDs and later mapping back, which is a pain) we really need to fix this in |
I just pushed a branch https://github.com/AliTVTeam/AliTV-perl-interface/tree/improve-maf-import where you can call |
Sorry, I was busy for the last week. I will check your branch later today. If okay, we can merge it into our master branch as well, especially, if @SimonaSecomandi can verify, that the issue is solved with your new parameter (and its implementation). |
Hello,
I generated a CACTUS alignment between 9 bird species and I'm trying to find a way to visualize them.
Cactus is a reference-free software and outputs an HAL file that can be converted to MAF.
It is possible to visualize an alignment like that? The maf file can be referenced at the root or at any of the aligned species. By doing so, how can I visualize that alignment in AliTv ?
Many thanks
The text was updated successfully, but these errors were encountered: