Fast `yacht train`

This implements a fast version of yacht train, as of v1.2.3: https://github.com/KoslickiLab/YACHT/releases/tag/v1.2.3

Why is it important?

Its many times faster, and should consume a whole lot less memory. The main improvement in terms of speed comes from implementing a faster many-vs-many sketch comparator as an alternative of sourmash branchwater multisearch (as of Nov 2024, link: https://github.com/sourmash-bio/sourmash_plugin_branchwater), which is quadratic in nature. Our implementation here takes advantage of the sparsity in data (if any), avoids unnecessary computation, and can run in almost linear time (again, if there is enough sparsity to take advantage of).

Installation

After downloading, just do:

make

The bin directory should contain the executable.

Usage

yacht_train -h

Assumptions

All input sketches were computed using the same scale factor.
The input sketches have only a single sketch inside of them.

Arguments

Argument	Description
`file_list`	A file where each line is a path to a sketch
`working_directory`	Where similarity values are written
`output_filename`	Output file, this will contain a subset of the paths given as input in file_list
`threads`	Number of threads to use
`passes`	Number of passes to make. More passes make the program slower but uses less memory
`containment_threshold`	Containmnet threshold

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
src		src
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fast `yacht train`

Why is it important?

Installation

Usage

Assumptions

Arguments

About

Releases

Packages

Contributors 2

Languages

KoslickiLab/fast_yacht_train

Folders and files

Latest commit

History

Repository files navigation

Fast yacht train

Why is it important?

Installation

Usage

Assumptions

Arguments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Fast `yacht train`

Packages