-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Alternative approach to reproducible research with drakepkg
#6
Comments
This looks great! I think your decision to include the |
Also, FYI: you may have noticed that the I look forward to playing with your forked version and will be happy to give you more detailed feedback afterwards! |
@tiernanmartin Thanks for getting back to me!
You're absolutely right. I saw your discussion with @wlandau about using the OSF instead of Github to store files. One option for projects with larger size data files and
I'm looking forward to seeing your changes/improvements and hearing your suggestions! Maybe we can come up with a common framework that accommodates the most common use cases. Potentially, in the long run an integration into |
Great discussion, you guys! It is exciting to see
library(drake)
load_main_example()
make(plan)
#> target raw_data
#> target data
#> target fit
#> target hist
#> target report
cache <- get_cache()
cache$get(file_store("report.Rmd"))
#> [1] "5a49b18f8d579dffda0983cbdecc44acc5099f3ee92c34b18f641fab86e6558e"
drake_cache_log()
#> # A tibble: 12 x 3
#> hash type name
#> <chr> <chr> <chr>
#> 1 a668e310782f864c import create_plot
#> 2 27115496d692f3d2 target data
#> 3 cfae01896d60312f target fit
#> 4 25cdbd93912e0269 import forcats::fct_inorder
#> 5 eb54142bf4029c58 target hist
#> 6 25efbda5da1aa408 target raw_data
#> 7 7cfd4cac5787a46e import "\"raw_data.xlsx\""
#> 8 c2ee4ecf9dd1c922 import readxl::read_excel
#> 9 d1813aad07a6a9ba target report
#> 10 01cc33b4bbba9d14 target "\"report.html\""
#> 11 b9bbfe573f3087b7 import "\"report.Rmd\""
#> 12 b7299c6d33b92763 import rmarkdown::render Created on 2018-12-02 by the reprex package (v0.2.1) |
Note: the hash from |
@tiernanmartin Thanks for developing
drakepkg
! I found it 2 weeks ago when I was researching ways to package adrake
workflow for a research project I plan to publish as a "research compendium" (according to the methods outlined by @benmarwick inrrtools
).I have since been working on an alternative to your approach which I have now uploaded to a fork of your repo here.
The main difference, I think, is that I also distribute the
.drake/
directory with the package, so that users can check the consistency of the workflow with all its inputs and outputs and can check out intermediate results/targets, without having to re-run the analysis on their own computer. The package includes simple wrapper functions that are intended to lower the barrier for the user to interact with the analysis (e.g., just runningreproduce_analysis()
is enough to copy and check the analysis). I wrote a vignette that hopefully explains the procedure well.I am interested in your thoughts on my approach. Thank you!
The text was updated successfully, but these errors were encountered: