-
-
Notifications
You must be signed in to change notification settings - Fork 40
Home
Welcome to the disk.frame wiki!
Many of disk.frame
's functions, such as map.disk.frame
and delayed
are just convenience functions to let you perform the same operation to each chunk. The convenience comes from the fact that it loads every chunk into a data.table/data.frame and does the saving to disk automatically into .fst files.
- Tests covering all user-facing functions (2019 03 03)
- Implement #50
- Submit to CRAN
I only work on Sunday morning on disk.frame to avoid this eating into my other (paid) work. So progress can be slow. If you would like to speed things up feel free to contact me for consulting services.
disk.frame_code({
libname(a, path1)
libname(b, path2)
a~disk.frame2 = delayed(b~disk.frame1, some_fn)
})
future
has backends for clusters. So with a little bit (a lot) of work we can scale out to computer. But we need to identify a simple way to set up these servers on AWS or on a local network. But this is not going to happen for a while.