Add `get_bytes_range()` function #196

epwalsh · 2023-10-04T00:18:00Z

We use a function like this a lot in the LLM repo, so I thought we should add it here.
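For readers unfamiliar with the idea, here is a minimal sketch of what a bytes-range read looks like for a local file. The helper `read_bytes_range` and its parameter names `index` and `length` are hypothetical and chosen for illustration only; this is not the implementation added in this PR, which also has to deal with remote schemes and caching.

```python
import os
import tempfile


def read_bytes_range(path: str, index: int, length: int) -> bytes:
    """Hypothetical helper: return `length` bytes of `path` starting at byte offset `index`."""
    with open(path, "rb") as f:
        f.seek(index)          # jump to the start of the requested range
        return f.read(length)  # may return fewer bytes if the file ends early


# Usage: write a small temporary file and read a slice out of the middle.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"hello world")
    tmp_path = tmp.name
try:
    chunk = read_bytes_range(tmp_path, 6, 5)
finally:
    os.remove(tmp_path)

print(chunk)  # b'world'
```

Reading only the requested slice avoids loading a large file into memory when just a small part of it is needed.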

cached_path/schemes/scheme_client.py

cached_path/schemes/http.py

2015aroras · 2023-10-05T17:53:20Z

cached_path/bytes_range.py

+
+    # If we're using the /a/b/foo.zip!c/d/file.txt syntax, handle it here.
+    exclamation_index = url_or_filename.find("!")
+    if extract_archive and exclamation_index >= 0:


If `extract_archive` is true and there is no `!`, we should probably raise some sort of error/exception.

epwalsh · 2023-10-05

It's possible the `!` could be part of the filename though, no? I think this is how `cached_path()` behaves, so I wanted to match the behavior.

2015aroras · 2023-10-05

What if an archive file has an `!` in its name? It's an edge case, but it would cause things to break. Having a separate optional parameter for the relative path of the file within the archive would be a way to get around the problem.

epwalsh · 2023-10-06

> Having a separate optional parameter for the relative path of the file within the archive would be a way to get around the problem

I agree, but I don't want to deviate from the API of `cached_path()`. At least for now.
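The trade-off in this thread can be sketched as follows. Both helpers are hypothetical illustrations, not code from this PR: `split_archive_path` mirrors the `find("!")` check shown in the diff above, and `explicit_archive_member` shows the suggested alternative, where the parameter name `member_path` is an assumption.

```python
from typing import Optional, Tuple


def split_archive_path(url_or_filename: str, extract_archive: bool) -> Tuple[str, Optional[str]]:
    """Hypothetical helper: split on the first "!" as in the diff above."""
    exclamation_index = url_or_filename.find("!")
    if extract_archive and exclamation_index >= 0:
        archive = url_or_filename[:exclamation_index]
        member = url_or_filename[exclamation_index + 1:]
        return archive, member
    # No "!" (or extraction disabled): treat the whole string as a plain path.
    return url_or_filename, None


def explicit_archive_member(archive_path: str, member_path: Optional[str] = None) -> Tuple[str, Optional[str]]:
    """Hypothetical alternative: the member path is passed separately, so no parsing is needed."""
    return archive_path, member_path


# The common case works as intended.
print(split_archive_path("/a/b/foo.zip!c/d/file.txt", extract_archive=True))
# ('/a/b/foo.zip', 'c/d/file.txt')

# The edge case raised in review: an archive whose own name contains "!".
print(split_archive_path("/a/b/wow!.zip!c/file.txt", extract_archive=True))
# ('/a/b/wow', '.zip!c/file.txt')

# With an explicit parameter the same archive name is unambiguous.
print(explicit_archive_member("/a/b/wow!.zip", "c/file.txt"))
# ('/a/b/wow!.zip', 'c/file.txt')
```

The explicit parameter removes the ambiguity, at the cost of diverging from the single-string convention that `cached_path()` uses, which is the reason given for not adopting it here.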

tests/cached_path_test.py

epwalsh added 3 commits October 3, 2023 17:17

Add get_bytes_range() function

7634d2d

CHANGELOG.md

2893f50

Add tests, resort to default behavior for HTTP

4c9ee3c

epwalsh requested review from AkshitaB and 2015aroras October 4, 2023 23:35

2015aroras approved these changes Oct 5, 2023


epwalsh merged commit f435b48 into main Oct 6, 2023
12 checks passed

epwalsh deleted the bytes-range branch October 6, 2023 00:37
