You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Running toil-vg index with something like --vcf my.vcf.gz --fasta_regions will assume my.vcf.gz should be associated with each fasta region. This is harmless in itself (and often what we want when passing in a whole-genome vcf) but when --gbwt_prune is used, then there can be trouble.
Since the pruning has to be done in series to incrementally update the mapping_id, then toil-vg makes a follow-on chain, the length of which is the number of sequences in the fasta. In hs38d1, this can be very long, and I've run into a stack overflow in getRootJobs() in Toil's job.py.
The work around is to do something like --vcf "$(for i in $(seq 1 22; echo X; echo Y); do echo my.vcf.gz; done)" to apply the input VCF only to sequences that it covers. But it would be nicer if this were more transparent (perhaps scanning the VCF at first to see which chromosomes it covers using accordingly)
The text was updated successfully, but these errors were encountered:
Running
toil-vg index
with something like--vcf my.vcf.gz --fasta_regions
will assumemy.vcf.gz
should be associated with each fasta region. This is harmless in itself (and often what we want when passing in a whole-genome vcf) but when--gbwt_prune
is used, then there can be trouble.Since the pruning has to be done in series to incrementally update the mapping_id, then toil-vg makes a follow-on chain, the length of which is the number of sequences in the fasta. In hs38d1, this can be very long, and I've run into a stack overflow in
getRootJobs()
in Toil's job.py.The work around is to do something like
--vcf "$(for i in $(seq 1 22; echo X; echo Y); do echo my.vcf.gz; done)"
to apply the input VCF only to sequences that it covers. But it would be nicer if this were more transparent (perhaps scanning the VCF at first to see which chromosomes it covers using accordingly)The text was updated successfully, but these errors were encountered: