generateDecoyTranscriptome.sh gets 21 killed #6

Open
antonkulaga opened this issue Jul 5, 2019 · 2 comments

antonkulaga commented Jul 5, 2019

I've made a Docker container for SalmonTools: https://quay.io/repository/comp-bio-aging/salmon-tools
However, I constantly get:

/opt/SalmonTools/scripts/generateDecoyTranscriptome.sh: line 105: 21 Killed $mashmap -r reference.masked.genome.fa -q $txpfile -t $threads --pi 80 -s 500

I run it on a 32-core machine with 64 GB of RAM, and I use the Ensembl human genome.
I think something may be wrong in the bash script itself:

/opt/SalmonTools/scripts/generateDecoyTranscriptome.sh: line 105:    21 Killed                  $mashmap -r reference.masked.genome.fa -q $txpfile -t $threads --pi 80 -s 500

***************
*** ABORTED ***
***************

An error occurred. Exiting...

The command is:

/opt/SalmonTools/scripts/generateDecoyTranscriptome.sh -a /cromwell-executions/decoy/9f2ca769-5a26-4149-a40c-ecc606e9b76c/call-generate/inputs/-848260311/Homo_sapiens.GRCh38.96.gtf -g /cromwell-executions/decoy/9f2ca769-5a26-4149-a40c-ecc606e9b76c/call-generate/inputs/-848260311/Homo_sapiens.GRCh38.dna.primary_assembly.fa -t /cromwell-executions/decoy/9f2ca769-5a26-4149-a40c-ecc606e9b76c/call-generate/inputs/-848260311/Homo_sapiens.GRCh38.cdna.all.fa -j 16 -o output

The stdout file is:

*** getDecoy ***
****************
-a <Annotation GTF file> = /cromwell-executions/decoy/9f2ca769-5a26-4149-a40c-ecc606e9b76c/call-generate/inputs/-848260311/Homo_sapiens.GRCh38.96.gtf
-g <Genome fasta> = /cromwell-executions/decoy/9f2ca769-5a26-4149-a40c-ecc606e9b76c/call-generate/inputs/-848260311/Homo_sapiens.GRCh38.dna.primary_assembly.fa
-t <Transcriptome fasta> = /cromwell-executions/decoy/9f2ca769-5a26-4149-a40c-ecc606e9b76c/call-generate/inputs/-848260311/Homo_sapiens.GRCh38.cdna.all.fa
-j <Concurrency level> = 16
-o <Output files Path> = output
[1/10] Extracting exonic features from the gtf
[2/10] Masking the genome fasta
[3/10] Aligning transcriptome to genome
>>>>>>>>>>>>>>>>>>
Reference = [reference.masked.genome.fa]
Query = [/cromwell-executions/decoy/9f2ca769-5a26-4149-a40c-ecc606e9b76c/call-generate/inputs/-848260311/Homo_sapiens.GRCh38.cdna.all.fa]
Kmer size = 16
Window size = 5
Segment length = 500 (read split allowed)
Alphabet = DNA
Percentage identity threshold = 80%
Mapping output file = mashmap.out
Filter mode = 1 (1 = map, 2 = one-to-one, 3 = none)
Execution threads  = 16
>>>>>>>>>>>>>>>>>>
INFO, skch::Sketch::build, minimizers picked from reference = 938129647

k3yavi (Member) commented Jul 5, 2019

I think this is related to #5.
The problem appears to be memory usage. We've raised the issue on mashmap's repo here.
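
A "Killed" line from bash means the process received SIGKILL, which the shell reports as exit status 137 (128 + 9). That is a common symptom of the kernel's out-of-memory killer or a container memory limit, although any SIGKILL looks the same. A minimal sketch for telling these cases apart when running the step by hand; `describe_exit` and `run_and_report` are hypothetical helper names, not part of SalmonTools:

```shell
#!/usr/bin/env bash
# Classify an exit status. Values above 128 mean the process was
# terminated by signal number (status - 128).
describe_exit() {
    local status=$1
    if [ "$status" -eq 0 ]; then
        echo "ok"
    elif [ "$status" -eq 137 ]; then
        # 137 = 128 + 9 (SIGKILL). Often the OOM killer or a container
        # memory limit, but any SIGKILL produces this status.
        echo "killed by SIGKILL (possibly out of memory)"
    elif [ "$status" -gt 128 ]; then
        echo "terminated by signal $((status - 128))"
    else
        echo "failed with exit status $status"
    fi
}

# Run a command, print how it ended to stderr, and pass its status on.
run_and_report() {
    local status=0
    "$@" || status=$?
    echo "$1: $(describe_exit "$status")" >&2
    return "$status"
}

# Example, using the mashmap call quoted in the error message above:
# run_and_report "$mashmap" -r reference.masked.genome.fa -q "$txpfile" \
#     -t "$threads" --pi 80 -s 500
```

If the status is 137, the kernel log on the host (for example via `dmesg`) usually shows whether the OOM killer was responsible.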

antonkulaga (Author) commented Jul 6, 2019

I have 64 GB of RAM; is that not enough? Also, why did you choose mashmap? It has not been updated for a year. Why not minimap2, which is fast, uses less memory, and works well for both short and long reads?
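
Whether 64 GB is enough also depends on how much of it the job can actually use: a Docker container or a workflow engine may impose a memory limit below what the host has. A rough sketch for checking this from inside the container, assuming a Linux system with `/proc/meminfo` and one of the standard cgroup v1 or v2 paths; the function names are hypothetical:

```shell
#!/usr/bin/env bash
# Print MemAvailable in GiB from a meminfo-style file.
# /proc/meminfo reports values in kB, so divide by 1024 * 1024.
mem_available_gib() {
    local file=${1:-/proc/meminfo}
    awk '/^MemAvailable:/ { printf "%.1f\n", $2 / 1048576 }' "$file"
}

# Print the container memory limit if the cgroup files are readable.
# cgroup v2 uses memory.max ("max" means unlimited); cgroup v1 uses
# memory.limit_in_bytes. Prints "unknown" if neither is present.
cgroup_mem_limit() {
    local f
    for f in /sys/fs/cgroup/memory.max \
             /sys/fs/cgroup/memory/memory.limit_in_bytes; do
        if [ -r "$f" ]; then
            cat "$f"
            return 0
        fi
    done
    echo "unknown"
}

echo "MemAvailable (GiB): $(mem_available_gib)"
echo "cgroup memory limit: $(cgroup_mem_limit)"
```

Running this in the same container just before the mashmap step gives a baseline to compare against the point at which the process is killed.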
