Experimenting with Dask integration #208

abkfenris · 2023-06-15T01:14:33Z

After the Dask discussion two weeks ago (see https://github.com/orgs/xpublish-community/discussions/4) I sat down and sketched out what an implementation could look like in Xpublish. It's really rough, and throughly un-tested.

This adds two local plugins and associated infrastructure for most hooks to be able to use Dask.

In most cases for different types of Dask infrastruture, a plugin that provides a get_dask_cluster() method should do the trick. The hook is set up to only return one result, and the built in plugin will be the last.

The Dask client plugin in theory should work with different types of clusters, but is similarly set up to be able to be overridden (dask-on-ray?). The client can be both sync and async, and once it gets accessed, it's cached on xpublish.Rest.

For hooks that have access to deps (which now includes dataset providers), deps.dask_sync_client and deps.dask_async_client now should give you the client.

The async client may need to be passed the current event loop. It appears the way to access the event loop varies by server, so that will probably take some research.

Adds two local plugins and associated infrastructure for most hooks to be able to use Dask. In most cases for different types of Dask infrastruture, a plugin that provides a `get_dask_cluster()` method should do the trick. The hook is set up to only return one result, and the built in plugin will be the last. The Dask client plugin in theory should work with different types of clusters, but is similarly set up to be able to be overridden (dask-on-ray?). The client can be both sync and async, and once it gets accessed, it's cached on `xpublish.Rest`. For hooks that have access to `deps` (which now includes dataset providers), `deps.dask_sync_client` and `deps.dask_async_client` now should give you the client. The async client may need to be passed the current event loop. It appears the way to access the event loop varies by server, so that will probably take some research. - fastapi/fastapi#7876 - encode/uvicorn#706 - https://stackoverflow.com/questions/66275747/how-to-use-event-loop-created-by-uvicorn

abkfenris · 2023-07-07T18:51:34Z

Some of the many tabs I had open while pondering Dask, Xarray, async, and FastAPI, and resources from others:

jonmjoyce mentioned this pull request Apr 4, 2024

[XPublish]: Performance and scaling improvements ioos/ioos-code-sprint#43

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Experimenting with Dask integration #208

Experimenting with Dask integration #208

abkfenris commented Jun 15, 2023

abkfenris commented Jul 7, 2023

Experimenting with Dask integration #208

Are you sure you want to change the base?

Experimenting with Dask integration #208

Conversation

abkfenris commented Jun 15, 2023

abkfenris commented Jul 7, 2023