-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Options for reading list of netcdf files on OSN pod for rechunking #485
Comments
Best way would be to kerchunk the files! But you can also read them as xr.open_dataset(fs.open('s3://file.nc') |
@rsignell
If I try to open the list of files with something similar to the following:
I get the error
I can open individual files without any problem. I thought perhaps there was some fsspec command to handle the list but I've been unable to find it in the documentation. I'm not sure that kerchunk would be a very good option here because we have over 380,000 hourly netcdf files. |
@rsignell
|
Can you open one of these files? |
See note above - I can open individual files. They are netcdf4 files. |
Does the kerchunk allow for easy modification of the metadata? What's an example of doing this with parquet refs? |
@pnorton-usgs check out this example I made a few weeks ago: https://gist.github.com/rsignell/84f727f25d923aab5aa7c534cef14151 |
Sorry for the delayed and distracted responses. I'm currently in low bandwidth mode but will be back on Monday |
@rsignell , @rsignell-usgs I'm trying out the rechunk workflow for CONUS404 using the OSN pod for the source hourly WRF output files. What are my options for reading those files? I've been trying to figure out the fsspec way of doing this but so far have not found good documentation for how to get this to work.
The text was updated successfully, but these errors were encountered: