forked from harphub/harpIO
-
Notifications
You must be signed in to change notification settings - Fork 0
/
README.Rmd
76 lines (61 loc) · 3.87 KB
/
README.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
---
output: github_document
---
<!-- README.md is generated from README.Rmd. Please edit that file -->
```{r setup, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = "100%"
)
```
# harpIO
harpIO provides a set of read and write functions for the harp ecosystem.
The package is currently under development, but the aim is to provide functions to read grib1, grib2, FA, NetCDF and vfld files from weather forecast / climate models; station observations from vobs files, radar / satellite observations from HDF5 files and in house databases at selected meteorological institutes. Further functions will interpolate gridded forecast fields to weather station locations which will in turn be written to portable sqlite database files.
## Installation:
```r
#>install.packages("devtools")
devtools::install_github("andrew-MET/harpIO")
```
If you wish to read data from netcdf files and don't already have the system libraries installed, you will need to install them:
```bash
sudo apt-get install libnetcdf-dev netcdf-bin libudunits2-dev
```
Additionally, for reading from grib and FA files, the R packages Rgrib2, Rfa and meteogrid are needed - these are not publically available, but are available on demand. For interpolating data from NetCDF files, the miIO package is needed, which can be obtained from MET Norway.
## Examples
To run these examples, you should have access to vfld and vobs files - replace all paths with your own paths and `start_date`, `end_date` etc. to fit your data. Note also that to read vfld files from multiple forecast models (or experiments), the data are expected to be in their own directory for each forecast model underneath a common directory . i.e `/path/to/vfld/data/model1`, `/path/to/vfld/data/model2/` etc.
To start, load the harpIO package and set up the paths and dates etc. Here we assume the forecast models are called model1, model2 and model3. To begin with a test will be done for two forecasts - one for 00 UTC on 30 May 2017, and then for the same time the next day.
```{r}
library(harpIO)
vfld_path <- "/path/to/vfld/data"
first_forecast <- 2017053000
last_forecast <- 2017053100
forecast_models <- c("model1", "model2", "model3")
forecast_frequency <- "1d"
```
In this case, model1 has 5 members (0-4), and model2 and model3 have 8 members (0-7). We will ask for lead times 0 - 36 hours every 3 hours and use `read_eps_interpolate()` to read in the data setting `return_data = TRUE` so that we can inspect the data. We set `parameter = NULL` to read all parameters from the vfld files. Note that when more than one `eps_model` is asked for, the number of members must be specified for each model in a list, even if each model has the same number of members.
```{r echo=FALSE}
vfld_path = "/lustre/storeB/users/andrewts/HarpResults/vfld"
```
To run this example on ecgate you can set `vfld_path <- "/scratch/ms/no/fa1m/vfld"`
```{r message=FALSE, warning=FALSE}
forecast_members <- list(model1 = seq(0, 4), model2 = seq(0, 7), model3 = seq(0, 7))
forecast_lead_times <- seq(0, 36, 3)
forecast_data <- read_eps_interpolate(
start_date = first_forecast,
end_date = last_forecast,
eps_model = forecast_models,
parameter = NULL,
lead_time = forecast_lead_times,
members_in = forecast_members,
by = forecast_frequency,
file_path = vfld_path,
return_data = TRUE
)
```
The model data are interpolated to station locations that are supplied with harpIO, which can be seen in the variable `station_list`.
```{r df_print="tibble"}
head(forecast_data, n = 10)
```
To write out the data to sqlite files, set `sqlite_path = /path/to/output` where /path/to/output is the directory you want to which you want to write the sqlite files. It is also a good idea to set `return_data = FALSE` when reading a large amount of data, although this is the default behaviour.