Skip to content

Latest commit

 

History

History
18 lines (10 loc) · 2.1 KB

input_data_preparation.md

File metadata and controls

18 lines (10 loc) · 2.1 KB

Input Data Preparation

Choosing the variables

SeasFire Cube v3 provides a list of 59 climate variables on a regular grid that are related to wildfires in an xArray dataset stored in .zarr format. As described previously in the Data section, a subset of 9 climate variables was chosen. Most of them have time, latitude and longitude as their coordinates. The variable lsm is the only variable having latitude and longitude as its coordinates; this is because the Land-Sea Mask is a binary map of water bodies.

Preprocessing data

In order to have a dataset with time, latitude and longitude coordinates, the variable lsm must be adjusted. The variable lsm has latitude and longitude as coordinates, since it is a binary map of the globe in which pixels are equal to 1 in land areas and 0 in sea areas. To add the time coordinate to the variable lsm, the map was repeated for each timestamp in the dataset.

Scalers Creation

Once the dataset has been created, it is necessary to choose a scaler to scale data. Tipically, the data are scaled using Standard or MinMax scalers. Two different approaches can be used to achieve this, one based on maps and one based on arrays containing individual values of the reference statistics for the chosen scaler:

  • Standard Scaler: Mean and Standard Deviation maps or arrays;
  • MinMax Scaler: Maximum and Minimum maps or arrays.

The climatic variables considered not only change over time, but also geographically. Although it is more intuitive to proceed with the first approach and use maps, the second approach is used because the variations that the climatic variables can undergo in the time frame considered are not significant and, consequently, it is conceivable to use a single representative value for the Mean, Standard Deviation, Maximum and Minimum statistics.