Use GNU sort instead of "external-sort" package #17

Merged: 5 commits, Aug 31, 2023

cmdcolin
Collaborator

This requires the user to have GNU sort installed, but I think that should be common. Windows users can either install a Windows build of GNU sort on their PATH or use WSL. The result should be much faster and more reliable.
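
For anyone unsure whether the `sort` on their PATH is the GNU one, a quick check could look like the sketch below. `is_gnu_sort` is a hypothetical helper name, not something from this PR, and it assumes GNU sort identifies itself with "GNU coreutils" on the first line of `sort --version`.

```shell
# Hypothetical helper (not part of this PR): succeeds if the given command is GNU sort.
is_gnu_sort() {
  # $1: command name or path to probe (defaults to `sort`)
  # Assumption: GNU sort prints "sort (GNU coreutils) X.Y" as the first line of --version;
  # other sort implementations reject the flag or print a different banner.
  "${1:-sort}" --version 2>/dev/null | head -n 1 | grep -q 'GNU coreutils'
}

if is_gnu_sort; then
  echo "GNU sort found"
else
  echo "GNU sort not found on PATH"
fi
```

On Windows the same check would be run inside WSL or whichever shell provides the Windows build of GNU sort.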

@cmdcolin force-pushed the use_gnu_sort branch 5 times, most recently from 504e0ac to 9fe2f58, August 31, 2023 17:23
@cmdcolin
Collaborator Author

The sort should be much faster now, and hopefully this also keeps the aberrant memory usage under control.

# with GNU sort
bin/run text-index --out /media/cdiesh/Beezle/Downloads/data/large_names2
4m16s.40 total

# without GNU sort
jbrowse text-index --out /media/cdiesh/Beezle/Downloads/data/large_names
7m40s.51 total
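
On the memory side, GNU sort can cap its in-memory buffer and spill to temporary files when the input is larger than that. The sketch below only illustrates those standard flags; the exact invocation this PR uses is not shown in this thread, and `bounded_sort` and the `64M` value are assumptions chosen for the example.

```shell
# Hypothetical wrapper; the sort invocation used by this PR is not shown in the thread.
bounded_sort() {
  # $1: input file, $2: output file
  # LC_ALL=C : compare raw bytes, independent of the user's locale
  # -S 64M   : cap the in-memory buffer (assumed value); bigger inputs spill to temp files
  # -T       : directory where those temp files are written
  LC_ALL=C sort -S 64M -T "${TMPDIR:-/tmp}" -o "$2" "$1"
}

# Small demo on a throwaway file.
demo_dir=$(mktemp -d)
printf 'banana\napple\ncherry\n' > "$demo_dir/in.txt"
bounded_sort "$demo_dir/in.txt" "$demo_dir/out.txt"
cat "$demo_dir/out.txt"
rm -rf "$demo_dir"
```

A small `-S` trades some speed for a predictable memory ceiling, which is the kind of control an in-process sort does not give as easily.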

@cmdcolin
Collaborator Author

cmdcolin commented Aug 31, 2023

The output is identical as well:

/media/cdiesh/Beezle/Downloads/data
% shasum large_names/trix/hg19.ix
a09b30311c6ed6eb0cb69b6bd9078fd3b1c0f991  large_names/trix/hg19.ix

/media/cdiesh/Beezle/Downloads/data
% shasum large_names2/trix/hg19.ix
a09b30311c6ed6eb0cb69b6bd9078fd3b1c0f991  large_names2/trix/hg19.ix
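
The same comparison can be scripted rather than read off two digests by eye. This is a minimal sketch using `cmp`, which compares the files byte for byte; `same_output` is a hypothetical helper name, and the paths are the ones from the commands above, relative to the data directory.

```shell
# Hypothetical helper: exit status 0 only if both files exist and are byte-identical.
same_output() {
  cmp -s "$1" "$2" 2>/dev/null
}

if same_output large_names/trix/hg19.ix large_names2/trix/hg19.ix; then
  echo "identical"
else
  echo "different (or a file is missing)"
fi
```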

@cmdcolin
Collaborator Author

I might go ahead with the merge. We could make external-sort a fallback if no GNU sort exists, but I think this should be OK for most users.

@cmdcolin merged commit 8530f5c into master on Aug 31, 2023
1 check passed
@cmdcolin deleted the use_gnu_sort branch on August 31, 2023 17:54