
Weather4cast: Multi-sensor weather forecasting competition & benchmark dataset

  • Study satellite multi-channel weather movies.
  • Predict weather products in various earth regions.
  • Apply transfer learning to new earth regions.


Introduction

The aim of our core competition is to predict the next 32 images (8 hours ahead, in 15-minute intervals) in our weather movies, which encode four different variables: (i) Temperature, taken from the Cloud Top Temperature or from the ground Skin Temperature if there are no clouds, (ii) Convective Rainfall Rate, (iii) Probability of Occurrence of Tropopause Folding, and (iv) Cloud Mask. Each image is an observation of these 4 channels over a 15-minute period, where each pixel corresponds to a spatial area of ~3 km x 3 km, and test predictions are requested for 11 regions of 256 x 256 pixels. Five of these regions also contain training and validation data for learning purposes; for the other six, only inference is requested, to assess the transfer learning capabilities of the models.

Regions

The submission format for each day of the test set is a multi-dimensional array (tensor) of shape (32, 4, 256, 256), and the objective function over all submitted tensors (one per day in the test set and region) is the mean squared error between all predicted pixel channel values and the pixel channel values derived from true observations. Note that we normalize these pixel channel values to lie between 0 and 1 by dividing each value by the maximum value of its variable (see below).
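
For illustration, here is a minimal sketch of how such a score could be computed for a single day and region, assuming the prediction and the ground truth are already loaded as arrays of shape (32, 4, 256, 256) in the variables' native ranges, and using the per-variable maxima listed in the Data summary below (this is not the official scoring code):

import numpy as np

# Per-variable maxima used for normalization (see the Data summary table below),
# in the channel order [temperature, crr_intensity, asii_turb_trop_prob, cma]
MAXIMA = np.array([11000.0, 500.0, 100.0, 1.0]).reshape(1, 4, 1, 1)

def normalized_mse(prediction, target):
    """MSE over all pixel channel values after scaling each variable to [0, 1]."""
    pred = prediction.astype(np.float32) / MAXIMA
    true = target.astype(np.float32) / MAXIMA
    return float(np.mean((pred - true) ** 2))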

There are two competitions running in parallel that expect independent submissions (participants can join one or both of them):

  • Core Competition: Train your models on these regions with the provided data and submit predictions on the test subset.
  • Transfer Learning Competition: Only the test subset is provided for these regions; test the generalization capacity of your models.

Get the data

You can download the data once registered in the competition.

Submission guide

Currently, the competition data is provided as a zip file with the following folder structure.

+-- RegionX -- ...
+-- RegionY 
        +-- training -- ... (~96 files per variable)
        +-- validation -- ... (~96 files per variable)
        +-- test -- (4 files each variable)
            +-- 2019047
            +-- ... 
            +-- 2019074
                +-- ASII -- ...
                +-- CMA -- ...
                +-- CRR -- ...
                +-- CTTH
                    + -- S_NWC_CTTH_MSG4_Europe-VISIR_20190216T170000Z.nc
                    + -- S_NWC_CTTH_MSG4_Europe-VISIR_20190216T171500Z.nc
                    + -- S_NWC_CTTH_MSG4_Europe-VISIR_20190216T173000Z.nc
                    + -- S_NWC_CTTH_MSG4_Europe-VISIR_20190216T174500Z.nc

Each region has three splits, training/validation/test, and each split contains folders named yyyyddd, where ddd is the day number within that year (day_in_year); e.g. 2019365 refers to the last day of 2019. Each day_in_year folder has 4 sub-folders containing the weather variables, with all 15-minute images available for that day inside. Note that there is a maximum of 96 files per day for training/validation (4 images/hour * 24 hours), and exactly 4 files in test (1 hour of input for the next 32 requested consecutive images).
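
For example, a yyyyddd folder name can be converted to a calendar date with Python's standard library:

from datetime import datetime

# '%j' parses the day-of-year part: day 47 of 2019 is 16 February 2019
day = datetime.strptime("2019047", "%Y%j").date()
print(day)  # 2019-02-16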

Each file S_NWC_variable_MSG4_Europe-VISIR_yyyymmddThhmm00Z.nc is a netCDF file encoding the respective target variable, plus other attributes that may help, for the same region of 256 x 256 pixels and the same 15-minute interval.
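
As an illustration, here is a minimal sketch of reading one such file with the netCDF4 package (the variable name comes from the Data summary table below; xarray works equally well):

from netCDF4 import Dataset

# Hypothetical path to one CTTH test file, following the tree shown above
path = "RegionY/test/2019047/CTTH/S_NWC_CTTH_MSG4_Europe-VISIR_20190216T170000Z.nc"

with Dataset(path) as ds:
    # 'temperature' is the requested target variable of the CTTH product;
    # netCDF4 applies scale_factor/add_offset automatically by default
    temperature = ds.variables["temperature"][:]
    print(temperature.shape, list(ds.variables))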

For the submission, we expect a zip file back that, when unpacked, decomposes into the following folder structure:

+-- RegionX -- ...
+-- RegionY 
        +-- test -- (1 file per day, encoding 32 images of 4 channels each)
            + -- 2019047.h5
            + -- ... 
            + -- 2019074.h5

where each h5 file yyyyddd.h5 now contains a uint16 tensor of shape (32, 4, 256, 256) holding the predictions of the 32 successive images that follow the sequence of 4 input images in the corresponding test folders, for all regions of the competition data. Note that each variable should stay in its own native range, since we will scale it as mentioned above.
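
A minimal sketch of writing one such prediction file with h5py is shown below; the dataset name "array" is only an assumption for illustration, so check utils/data_utils.py or the Persistence example in the onboarding notebook for the exact layout expected by the scorer:

import h5py
import numpy as np

# Predictions: uint16 tensor of shape (32, 4, 256, 256), channels in the order
# [temperature, crr_intensity, asii_turb_trop_prob, cma], each in its native range
predictions = np.zeros((32, 4, 256, 256), dtype=np.uint16)  # placeholder values

with h5py.File("RegionY/test/2019047.h5", "w") as f:
    # NOTE: the dataset name below is hypothetical; use the starter-kit helpers
    # to write the files you actually submit
    f.create_dataset("array", data=predictions, dtype="uint16", compression="gzip")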

To check that the shape of a predicted tensor is the expected (32, 4, 256, 256), the following script should report exactly that:

python3 utils/h5file.py -i path_to_RegionY/test/yyyyddd.h5

To generate the compressed folder, cd to the parent folder containing the region folders (RegionX, ..., RegionY) and zip all regions together, as in the following example:

user@comp:~/tmp_files/core-competition-predictions$ ls
R1/ R2/ R3/ R7/ R8/
user@comp:~/tmp_files/core-competition-predictions$ zip -r ../core-predictions.zip .
...
user@comp:~/tmp_files/core-competition-predictions$ ls ../
core-competition-predictions/   transfer-learning-competition-predictions/  core-predictions.zip

Please delete any files that would make your submission deviate from the requested structure. An example of finding and deleting unexpected files is shown below:

user@comp:~/tmp_files/core-competition-predictions$ find . -wholename '*nb*' -print
user@comp:~/tmp_files/core-competition-predictions$ find . -wholename '*DS*' -print
./.DS_Store
./R1/.DS_Store
./R3/.DS_Store
./R2/.DS_Store
user@comp:~/tmp_files/core-competition-predictions$ find . -wholename '*DS*' -delete
user@comp:~/tmp_files/core-competition-predictions$ find . -wholename '*DS*' -print

The submission file can then be uploaded via the corresponding submission link of each competition.

Data summary

The following table summarizes the data provided, detailing the requested target variables and all other variables included. For a detailed explanation of each variable, see the link provided in the Introduction. To keep the size of the prediction files small, please deliver them in uint16 format with each variable in its own native range. When computing the score, we will divide each channel by its maximum value so that it lies in the interval [0, 1]:

| Name | Folder | Variables | Target | Variable type | Range | Scale Factor | Offset |
|------|--------|-----------|--------|---------------|-------|--------------|--------|
| Temperature at Ground/Cloud | CTTH | temperature, ctth_tempe, ctth_pres, ctth_alti, ctth_effectiv, ctth_method, ctth_quality, ishai_skt, ishai_quality | temperature | uint16 | 0-11000 | 10 | 0 |
| Convective Rainfall Rate | CRR | crr, crr_intensity, crr_accum, crr_quality | crr_intensity | uint16 | 0-500 | 0.1 | 0 |
| Probability of Occurrence of Tropopause Folding | ASII | asii_turb_trop_prob, asiitf_quality | asii_turb_trop_prob | uint8 | 0-100 | 1 | 0 |
| Cloud Mask | CMA | cma_cloudsnow, cma, cma_dust, cma_volcanic, cma_smoke, cma_quality | cma | uint8 | 0-1 | 1 | 0 |
| Cloud Type | CT | ct, ct_cumuliform, ct_multilayer, ct_quality | None | uint8 | None | None | None |

Cloud Type is provided because it contains rich information that might help the models, but no variable from this product is required. For the other products, we expect predictions for [temperature, crr_intensity, asii_turb_trop_prob, cma], in this order, along the channel dimension of the submitted tensor.
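
For instance, here is a sketch of stacking per-variable predictions into the expected channel order and dtype, assuming each variable has already been predicted as a (32, 256, 256) array in its native range:

import numpy as np

# Hypothetical per-variable predictions, each of shape (32, 256, 256),
# already expressed in the native ranges from the table above
temperature = np.zeros((32, 256, 256))          # 0-11000
crr_intensity = np.zeros((32, 256, 256))        # 0-500
asii_turb_trop_prob = np.zeros((32, 256, 256))  # 0-100
cma = np.zeros((32, 256, 256))                  # 0-1

# Channel order required by the submission: (32, 4, 256, 256), uint16
tensor = np.stack(
    [temperature, crr_intensity, asii_turb_trop_prob, cma], axis=1
).round().astype(np.uint16)
assert tensor.shape == (32, 4, 256, 256)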

Data obtained in collaboration with AEMet - Agencia Estatal de Meteorología/ NWC SAF.

Start here

We provide an introduction notebook in utils/1. Onboarding.ipynb where we cover all the basic concepts of the competition from scratch:

  1. How to read, explore, and visualize netCDF4 files
  2. Load and transform context variables: altitudes and latitude/longitude
  3. Data split training/validation/test, and list of days with missing time-bins
  4. Data Loader Example
  5. Generate a valid submission for the Persistence model

Furthermore, all the methods explained in the notebook are ready to use in utils/data_utils.py and utils/context_variables.py, so you can import them out of the box.

The code assumes that, once you download the regions for the core or transfer learning competition, they are laid out as follows:

+-- data
    +-- core-w4c
        +-- R1
        +-- R2
        +-- R3
        +-- R7
        +-- R8
    +-- transfer-learning-w4c
        +-- R4
        +-- R5
        +-- R6
        +-- R9
        +-- R10
        +-- R11
    +-- static
        +-- Navigation_of_S_NWC_CT_MSG4_Europe-VISIR_20201106T120000Z.nc
        +-- S_NWC_TOPO_MSG4_+000.0_Europe-VISIR.raw

Please provide the path to the parent folder data as the data_path argument of the function get_params(...) in config.py.

In the same way, if you plan to use the provided static context variables, provide the folder containing the static files, data/static, as the static_data_path argument of get_params(...).
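
For example, assuming get_params(...) accepts these values as keyword arguments (adapt to the actual signature in config.py):

from config import get_params

# Hypothetical call; adjust the paths to your local setup
params = get_params(
    data_path="../data",                # parent folder containing core-w4c/ and transfer-learning-w4c/
    static_data_path="../data/static",  # folder containing the static context files
)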

Benchmarks

Generate a submission

We provide a notebook (2. Submission_UNet.ipynb) showing how to create a submission using pre-trained UNet models. In particular, we produce 3 sets of predictions:

  • A valid submission for the core-competition (R1, R2, R3, R7, R8) using pre-trained UNets per region, i.e. individual models per region
  • A valid submission for the transfer-learning-competition (R4, R5, R6, R9, R10, R11) using a single UNet trained on region R1
  • A valid submission for the transfer-learning-competition (R4, R5, R6, R9, R10, R11) using an ensemble of the models trained on regions R1-R3, averaging their predictions

The weights needed to generate such submissions with the UNets can be downloaded here once registered for the competition. The notebook uses an architecture and a PyTorch Lightning class defined in weather4cast/benchmarks/, but understanding them is not required to learn how to generate submissions from a pre-trained model.

Train/evaluate a UNet

We provide a script (3-train-UNet-example.py) with all the code necessary to train a UNet model from scratch or to fine-tune from any of the checkpoints provided for 2. Submission_UNet.ipynb. The same code with the flag -m val can be used to evaluate a model on the validation data split.

To use it, set the correct data paths in config.py and use the following syntax:

user@comp:~/projects/weather4cast-participants/utils$ python 3-train-UNet-example.py --h                                    
usage: 3-train-UNet-example.py [-h] [-g GPU_ID] [-r REGION] [-m MODE]
                               [-c CHECKPOINT]

optional arguments:
  -h, --help            show this help message and exit
  -g GPU_ID, --gpu_id GPU_ID
                        specify a gpu ID. 1 as default
  -r REGION, --region REGION
                        region_id to load data from. R1 as default
  -m MODE, --mode MODE  choose mode: train (default) / val
  -c CHECKPOINT, --checkpoint CHECKPOINT
                        init a model from a checkpoint path. '' as default
                        (random weights)

A few examples:

cd utils

    # a.1) train from scratch
python 3-train-UNet-example.py --gpu_id 1 --region R1

    # a.2) fine tune a model from a checkpoint
python 3-train-UNet-example.py --gpu_id 2 --region R1 -c 'epoch=03-val_loss_epoch=0.027697.ckpt'

    # b.1) evaluate an untrained model (with random weights)
python 3-train-UNet-example.py --gpu_id 3 --region R1 --mode val

    # b.2) evaluate a trained model from a checkpoint
python 3-train-UNet-example.py --gpu_id 4 --region R1 --mode val -c 'epoch=03-val_loss_epoch=0.027697.ckpt'

Every time we run the script, PyTorch Lightning creates a new folder lightning_logs/version_i/, incrementing the version i automatically. This is where the model parameters and checkpoints are saved, together with the files explained below.

The class LeadTimeEval in benchmarks/validation_metrics.py is used by the UNet model to store the error per variable during evaluation on the validation data split. After the evaluation, all errors are printed to standard output. Furthermore, a plot lead_times_mse_fig_R1.png showing the evolution of the mean error across lead times (from 1 to 32 future predictions) is produced, and the values are also saved to disk as lead_times_mse_fig_R1.csv, both in the respective lightning_logs/version_i/ folder. The latter can be used to compare different models across lead times.

The lead-times figure above shows the mean error per time bin (y-axis) and its standard deviation up to 8 hours ahead (32 time bins, x-axis). The further ahead the prediction, the worse the error. The plot title indicates that this model used latitude/longitude and elevation (l-e) as context, and reports the mean error per variable averaged over all 32 lead times.
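
As a sketch, the saved CSV files can be used to compare different runs across lead times; the exact CSV layout is an assumption here, so adjust the column handling to the files you obtain:

import pandas as pd
import matplotlib.pyplot as plt

# Hypothetical paths to two runs' lead-time MSE files (see lightning_logs/version_i/)
runs = {
    "UNet v0": "lightning_logs/version_0/lead_times_mse_fig_R1.csv",
    "UNet v1": "lightning_logs/version_1/lead_times_mse_fig_R1.csv",
}

for name, path in runs.items():
    df = pd.read_csv(path)
    # Assumes one row per lead time with the mean error in the first column
    plt.plot(range(1, len(df) + 1), df.iloc[:, 0], label=name)

plt.xlabel("lead time (15-minute bins ahead)")
plt.ylabel("MSE")
plt.legend()
plt.savefig("lead_time_comparison_R1.png")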

Code and abstract submission

At the end of the competition you must provide:

  1. A short scientific paper with a sufficiently detailed description of your approach (4-6 pages plus references).
  2. The code and models (with their learned weights) that you used for your predictions, with instructions to reproduce them.

We will notify participants about how to provide the paper. For the code, you will need to submit it to a public repository like GitHub, providing a link to download the models' learned weights. Ideally, your repository should contain at least:

  • a) A list of dependencies. In the case of using Python, we suggest using conda/pip to generate them: conda env export > environment.yml. Make sure that your code can be executed from a fresh environment using the provided list of requirements: conda env create -f environment.yml.
  • b) Code, models, and a folder with all the models' weights.
  • c) An out-of-the-box script to use your best model to generate predictions. The script will read the inputs for the model from a given path and region, using its test folder (like the one used for the leaderboard), and save the outputs to a given path. The path to the folder containing the weights to be loaded by the models can also be an argument of the script. We provide a Python example in utils/4-inference.py.

An example of using c) is:

cd utils
R=R5
INPUT_PATH=../data
WEIGHTS=../weights
OUT_PATH=.
python 4-inference.py -d $INPUT_PATH -r $R -w $WEIGHTS -o $OUT_PATH -g 1

Since region R5 belongs to the transfer learning challenge, the code will generate predictions for each day in the folder $INPUT_PATH/transfer-learning-w4c/$R/test/ and will save the .h5 predictions in the folder $OUT_PATH/$R/test/, just as is done for the leaderboard submissions. The script is responsible for using the appropriate models given the target region $R and the location of the weights $WEIGHTS.

The only difference from the test folders used for the leaderboard is that this script should compute predictions for any date and any sequence of 4 time intervals. This means that the splits utils/splits.csv and utils/test_split.json can no longer be used; instead, they need to be generated for the days found in $INPUT_PATH/transfer-learning-w4c/$R/test/ (this allows using your models for any date, and for any hour of that day as input). The code to generate these files is already provided in the example script utils/4-inference.py.
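
A minimal sketch of enumerating the day folders present in a test split, which is the information needed to regenerate those split files (utils/4-inference.py already does this; shown here only for illustration):

import os

# Hypothetical input path and region, as in the example above
input_path, region = "../data", "R5"
test_dir = os.path.join(input_path, "transfer-learning-w4c", region, "test")

# Each sub-folder is a day_in_year (yyyyddd) for which predictions are requested
days = sorted(d for d in os.listdir(test_dir)
              if os.path.isdir(os.path.join(test_dir, d)))
print(days)  # e.g. ['2019047', ..., '2019074']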

Cite

When referencing the data or the models provided, please cite these papers:

@INPROCEEDINGS{9672063,
  author={Herruzo, Pedro and Gruca, Aleksandra and Lliso, Llorenç and Calbet, Xavier and Rípodas, Pilar and Hochreiter, Sepp and Kopp, Michael and Kreil, David P.},
  booktitle={2021 IEEE International Conference on Big Data (Big Data)},
  title={High-resolution multi-channel weather forecasting – First insights on transfer learning from the Weather4cast Competitions 2021},
  year={2021},
  pages={5750-5757},
  doi={10.1109/BigData52589.2021.9672063}
}

@inbook{10.1145/3459637.3482044,
author = {Gruca, Aleksandra and Herruzo, Pedro and R\'{\i}podas, Pilar and Kucik, Andrzej and Briese, Christian and Kopp, Michael K. and Hochreiter, Sepp and Ghamisi, Pedram and Kreil, David P.},
title = {CDCEO'21 - First Workshop on Complex Data Challenges in Earth Observation},
year = {2021},
isbn = {9781450384469},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3459637.3482044},
booktitle = {Proceedings of the 30th ACM International Conference on Information & Knowledge Management},
pages = {4878–4879},
numpages = {2}
}