-
Notifications
You must be signed in to change notification settings - Fork 5
/
1_intro.Rmd
47 lines (31 loc) · 1.17 KB
/
1_intro.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
---
title: "1 - Introduction to {disk.frame} Tutorial"
author: "ZJ"
date: "2020-07-23"
output: html_document
---
```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```
## What is {disk.frame}
Let's look at the [useR! 2019 presentation](https://www.beautiful.ai/-LgwO_sf5qrBouC0H0sG/1)
Let us go through the common questions on https://diskframe.com
## What you need to get started - Setting up
### Installation
Simply run
```{r}
install.packages("disk.frame")
# install the dataset package we will use in today's tutorial
install.packages("nycflights13")
```
### Recommended boilerplate
Running `setup_disk.frame()` immediately after `library(disk.frame)` is highly recommended. It starts up multiple R sessions and `{disk.frame}` will use those R sessions to parallel the workloads.
```
# this willl set disk.frame with multiple workers
setup_disk.frame()
```
*Note* by default it still start as many session as there are cores. So only servers with many many cores, this may not be a good idea. Hence you may control the number of sessions by setting `workers=`.
```
# this willl set disk.frame with multiple workers
setup_disk.frame(workers = 4) # set up 4 session
```