-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Now able to specify filenames of downloads #84
Conversation
I can confirm that this works now with the modifications, as intended (downloading to the filename specified) |
Thanks @sbthandras for opening this PR! While I understand it would be great to be able to give the files e.g. simpler names, I wonder if this would introduce too much complexity for the function? Apart from simplicity, I'm a bit worried about the errors this could introduce:
I'm OK with adding this new argument, but I wonder if it would be simpler to just download the files using this function and then rename them separately? Not sure, what do you think? (The tests fail due to some other reason. This is some new stuff, I hope to fix this soon) |
I thi
I think these are all valid points, I myself was thinking a lot about the last one. Especially that a user might mistakenly set an unzipped extension for a compressed file. The reason I added it, is because I don't like the idea of the identity of the files being 'lost' when I'm downloading 'genomic.fna'. I assume that if I was downloading a more compelx file format then I could parse that and retrieve the information about what genome it is exactly, but now when I download a set of accessions the file names are not regular at all but come with different suffixes. I assume, based on the UID the function should be able to find an accession name? How about an option that forces the downloaded file to be ACCESSION.fna? |
OK, so if I download e.g. GCF_003007635.1 the filename will be GCF_003007635.1_ASM300763v1_genomic.fna.gz by default and instead of this you would want to have GCF_003007635.1.fna.gz, right? I think this is possible, but it may be simpler to add an argument that specifies whether the filenames should contain 1. only the assembly, 2. only the name or 3. both (default) and then assemble the final filename using the FTP filename (which contains both). Would this solve the issue? |
No description provided.