diff --git a/content/pages/datasets.rst b/content/pages/datasets.rst index 4415553..24e3481 100644 --- a/content/pages/datasets.rst +++ b/content/pages/datasets.rst @@ -33,7 +33,9 @@ Some specification of the dataset: * The gridded main Eumetnet postprocessing benchmark dataset contains `ECMWF `_ ensemble and deterministic forecasts over a large portion of Europe, from 36 to 67° in latitude and from -6 to 17° of longitude, and covers the years 2017-2018. * It also contains the corresponding ERA5 reanalysis for the purpose of providing observations for the benchmark. -* For some dates, it contains also reforecasts that covers 20 years of past forecasts recomputed with the most recent model version. +* For some dates, it contains also `reforecasts `_ that covers + 20 years of past forecasts recomputed with the most recent model version. +* Forecasts and reforecasts lead time extent of 10 days. * All the forecasts and reforecasts provided are the noon ECMWF runs. * The ensemble forecasts and reforecasts also contain by default the control run (the 0-th member). * The gridded data resolution is 0.25° x 0.25° which corresponds roughly to 25 kilometers. @@ -54,7 +56,41 @@ EUPPBench dataset .. _eupp: -TODO +The EUPPBench dataset was designed out of `the base dataset `_ forecasts, reforecasts and gridded observations on a smaller domain and a shorter +leadtime extent of 5 days, and stored in the `Zarr format `_. Stations observation data from different meteorological centers were also added. +This dataset was thus intended to provide a user-friendly, analysis-ready dataset that can be easily manipulated and processed. +The dataset was published in `Earth System Science Data (ESSD) `_ (see `here `_ our +news publication about this). + +Like the base dataset, data from the EUPPBench dataset can be downloaded from our python `climetlab plugin `_, +but it is possible to access the Zarr archive with tools from other computing languages (e.g. R or Julia). See the ESSD publication above for more details about how to use the dataset with other languages. + +Some specification of the dataset: + +For the gridded data +~~~~~~~~~~~~~~~~~~~~ + +* The gridded EUPPBench postprocessing benchmark dataset contains `ECMWF `_ ensemble and deterministic forecasts over a small domain in Europe, from 45.75° to 53.5° in latitude, + and from 2.5° to 10.5° in longitude, and covers the years 2017-2018. +* It also contains the corresponding ERA5 reanalysis for the purpose of providing observations for the benchmark. +* For some dates, it contains also `reforecasts `_ that covers 20 years of past forecasts recomputed with the + most recent model version at the given date. +* Forecasts and reforecasts lead time extent of 5 days. +* All the forecasts and reforecasts provided are the noon ECMWF runs. +* The ensemble forecasts and reforecasts also contain by default the control run (the 0-th member). +* The gridded data resolution is 0.25° x 0.25° which corresponds roughly to 25 kilometers. +* Forecasts and reforecasts are 6-hourly, and include the analysis at 00Z. + +For the stations data +~~~~~~~~~~~~~~~~~~~~~ + +* The stations EUPPBench postprocessing benchmark dataset contains ECMWF ensemble and deterministic forecasts at the grid point closest to the station locations, and covers the years 2017-2018. +* Forecasts and reforecasts lead time extent of 5 days. +* It also contains the corresponding stations observations for some important variables. +* For some dates, it contains also reforecasts that covers 20 years of past forecasts recomputed with the most recent model version at the given date. +* All the forecasts and reforecasts provided are the noon ECMWF runs. +* The ensemble forecasts and reforecasts also contain by default the control run (the 0-th member). +* 5 countries are presently available: Belgium, Austria, France, Germany, The Netherlands. .. figure:: /images/EUPP_domain.png :align: center