-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add clumpify-based dedup #970
base: master
Are you sure you want to change the base?
Commits on Jul 2, 2019
-
add bbmap.BBMapTool().dedup_clumpify()
add bbmap.BBMapTool().dedup_clumpify(), along with unit tests
Configuration menu - View commit details
-
Copy full SHA for a5d58b5 - Browse repository at this point
Copy the full SHA a5d58b5View commit details -
pass JVMmemory; add read_utils.rmdup_clumpify_bam; dedup_bam WDL task
pass JVMmemory to bbmap and clumpify; add rmdup_clumpify_bam to read_utils.py; change TestRmdupUnaligned unit tests for bbmap to use read_utils.py::rmdup_clumpify_bam; add dedup_bam WDL task to tasks_read_utils.wdl
Configuration menu - View commit details
-
Copy full SHA for 595764e - Browse repository at this point
Copy the full SHA 595764eView commit details -
switch from mvicuna to clumpify-based dedup in taxon_filter.py deplete
replace mvicuna-based read deduplication in taxon_filter.py::deplete() with clumpify-based deduplication that occurs farther upstream in advance of BWA-based depletion; add dedup_bam WDL workflow; in dedup_bam WDL task, create and emit FastQC report of only de-duplicated reads; update unit test input to include dup reads, and update expected output for the test_taxon_filter::TestDepleteHuman integration tests to reflect difference in output from clumpify vs previous mvicuna output
Configuration menu - View commit details
-
Copy full SHA for 09901d3 - Browse repository at this point
Copy the full SHA 09901d3View commit details -
Configuration menu - View commit details
-
Copy full SHA for df208ea - Browse repository at this point
Copy the full SHA df208eaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 98ac4fc - Browse repository at this point
Copy the full SHA 98ac4fcView commit details -
dump dx-toolkit version and update URL to reflect new source
DNAnexus seems to have replaced their wiki with a new documentation page ( https://documentation.dnanexus.com/downloads ) and the old download URLs along with it
Configuration menu - View commit details
-
Copy full SHA for 784877a - Browse repository at this point
Copy the full SHA 784877aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 232f9cd - Browse repository at this point
Copy the full SHA 232f9cdView commit details
Commits on Jul 3, 2019
-
Configuration menu - View commit details
-
Copy full SHA for c01bb5b - Browse repository at this point
Copy the full SHA c01bb5bView commit details -
Configuration menu - View commit details
-
Copy full SHA for e25ef52 - Browse repository at this point
Copy the full SHA e25ef52View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6ba96d4 - Browse repository at this point
Copy the full SHA 6ba96d4View commit details -
Configuration menu - View commit details
-
Copy full SHA for e8a4081 - Browse repository at this point
Copy the full SHA e8a4081View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8280063 - Browse repository at this point
Copy the full SHA 8280063View commit details -
Configuration menu - View commit details
-
Copy full SHA for b86b1c9 - Browse repository at this point
Copy the full SHA b86b1c9View commit details
Commits on Jul 4, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 8afe18f - Browse repository at this point
Copy the full SHA 8afe18fView commit details
Commits on Jul 10, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 7d2f45a - Browse repository at this point
Copy the full SHA 7d2f45aView commit details -
Configuration menu - View commit details
-
Copy full SHA for f48038e - Browse repository at this point
Copy the full SHA f48038eView commit details
Commits on Jul 11, 2019
-
Configuration menu - View commit details
-
Copy full SHA for c78f246 - Browse repository at this point
Copy the full SHA c78f246View commit details -
Configuration menu - View commit details
-
Copy full SHA for 72fb4cd - Browse repository at this point
Copy the full SHA 72fb4cdView commit details
Commits on Jul 13, 2019
-
Configuration menu - View commit details
-
Copy full SHA for d97f773 - Browse repository at this point
Copy the full SHA d97f773View commit details -
Configuration menu - View commit details
-
Copy full SHA for a685a8a - Browse repository at this point
Copy the full SHA a685a8aView commit details
Commits on Aug 1, 2019
-
specify containment=t for bbmap clumpify
Allow containments (where one sequence is shorter) when using bbmap clumpify to deduplicate
Configuration menu - View commit details
-
Copy full SHA for a2ce0f1 - Browse repository at this point
Copy the full SHA a2ce0f1View commit details
Commits on Aug 26, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 6f26717 - Browse repository at this point
Copy the full SHA 6f26717View commit details
Commits on Sep 11, 2019
-
Configuration menu - View commit details
-
Copy full SHA for b73950e - Browse repository at this point
Copy the full SHA b73950eView commit details -
Configuration menu - View commit details
-
Copy full SHA for a3010ea - Browse repository at this point
Copy the full SHA a3010eaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8199b12 - Browse repository at this point
Copy the full SHA 8199b12View commit details
Commits on Sep 12, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 6bb3f6b - Browse repository at this point
Copy the full SHA 6bb3f6bView commit details
Commits on Sep 20, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 97eff11 - Browse repository at this point
Copy the full SHA 97eff11View commit details -
Configuration menu - View commit details
-
Copy full SHA for f6f9b85 - Browse repository at this point
Copy the full SHA f6f9b85View commit details
Commits on Oct 3, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 1674c44 - Browse repository at this point
Copy the full SHA 1674c44View commit details
Commits on Nov 7, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 4ca4693 - Browse repository at this point
Copy the full SHA 4ca4693View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8f8aaae - Browse repository at this point
Copy the full SHA 8f8aaaeView commit details -
Merge branch 'ct-add-clumpify' of ssh://github.com/broadinstitute/vir…
…al-ngs into ct-add-clumpify
Configuration menu - View commit details
-
Copy full SHA for 218a12b - Browse repository at this point
Copy the full SHA 218a12bView commit details -
Configuration menu - View commit details
-
Copy full SHA for fa5e01e - Browse repository at this point
Copy the full SHA fa5e01eView commit details
Commits on Nov 8, 2019
-
Configuration menu - View commit details
-
Copy full SHA for a0735c7 - Browse repository at this point
Copy the full SHA a0735c7View commit details
Commits on Nov 20, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 663deba - Browse repository at this point
Copy the full SHA 663debaView commit details -
demux_plus/demux_metag: merge linear parts of scatters, run spike-in …
…on raw rather than de-duped reads
Configuration menu - View commit details
-
Copy full SHA for 5a7ed3b - Browse repository at this point
Copy the full SHA 5a7ed3bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2be4a85 - Browse repository at this point
Copy the full SHA 2be4a85View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1d691b2 - Browse repository at this point
Copy the full SHA 1d691b2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1ba7415 - Browse repository at this point
Copy the full SHA 1ba7415View commit details -
fix bug in conda command quiet calling
fix bug in conda command quiet calling ('-q -y' must be after 'conda <command>')
Configuration menu - View commit details
-
Copy full SHA for bb589a1 - Browse repository at this point
Copy the full SHA bb589a1View commit details -
maintain RG info in clumpify dedup; move processing to bbmap.py
for bbmap clumpify de-dup, merge like-library RGs and perform deduplication on each, then gather the IDs of kept reads, and filter the input sam based on the list of IDs to keep so as to maintain header and RG information. move most of the theprocessing to bbmap.py::dedup_clumpify so it has a more simple interface that accepts one bam and emits one bam. ToDo: parallelize across LBs
Configuration menu - View commit details
-
Copy full SHA for 472703b - Browse repository at this point
Copy the full SHA 472703bView commit details -
Configuration menu - View commit details
-
Copy full SHA for ca726d0 - Browse repository at this point
Copy the full SHA ca726d0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3f9f188 - Browse repository at this point
Copy the full SHA 3f9f188View commit details -
Configuration menu - View commit details
-
Copy full SHA for 995cf0d - Browse repository at this point
Copy the full SHA 995cf0dView commit details -
Configuration menu - View commit details
-
Copy full SHA for d54eff3 - Browse repository at this point
Copy the full SHA d54eff3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 21a6ac4 - Browse repository at this point
Copy the full SHA 21a6ac4View commit details
Commits on Nov 21, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 13f5172 - Browse repository at this point
Copy the full SHA 13f5172View commit details -
change to clumpify for pre-depletion dedup
change to clumpify for pre-depletion dedup; dedup lication can be likely be removed from depletion entirely in the future once all calls in the codebase have been updated to have one fewer arg
Configuration menu - View commit details
-
Copy full SHA for c1d18be - Browse repository at this point
Copy the full SHA c1d18beView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7c45da6 - Browse repository at this point
Copy the full SHA 7c45da6View commit details -
remove rmdup from depletion call
remove rmdup from depletion call, remove *.rmdup.bam from positional arguments for depletion CLI parser, remove *.rmdup.bam from inputs where depletion is called (test cases, WDL), remove *.rmdup.bam from expected depletion outputs. Chance Snakemake merge_one_per_sample rule to call rmdup_clumpify_bam rather than rmdup_mvicuna_bam
Configuration menu - View commit details
-
Copy full SHA for d91eca5 - Browse repository at this point
Copy the full SHA d91eca5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 12f73cb - Browse repository at this point
Copy the full SHA 12f73cbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 16a2b50 - Browse repository at this point
Copy the full SHA 16a2b50View commit details
Commits on Dec 2, 2019
-
Configuration menu - View commit details
-
Copy full SHA for f1f9a40 - Browse repository at this point
Copy the full SHA f1f9a40View commit details
Commits on Dec 3, 2019
-
Configuration menu - View commit details
-
Copy full SHA for 49bffcb - Browse repository at this point
Copy the full SHA 49bffcbView commit details
Commits on Mar 27, 2020
-
pass through single-end IDs for bbmap dedup
single-end reads do not have /1 /2 mate suffix, so pass through IDs missing the suffix
Configuration menu - View commit details
-
Copy full SHA for 362d0f3 - Browse repository at this point
Copy the full SHA 362d0f3View commit details -
Configuration menu - View commit details
-
Copy full SHA for f816f6b - Browse repository at this point
Copy the full SHA f816f6bView commit details