data.table has a great feature to join two large datasets efficiently by reference. I'm wondering if this could be available in dtplyr.
Below is a benchmark from a StackOverflow answer (by Jaap) that shows the difference between a regular join and a join by reference:
library(bench)
bm <- mark(AA <- BB[AA, on = .(aa)],
           AA[BB, on = .(aa), cc := cc],
           iterations = 1)
> bm[,c(1,3,5)]
# A tibble: 2 x 3
  expression                         median mem_alloc
  <bch:expr>                       <bch:tm> <bch:byt>
1 AA <- BB[AA, on = .(aa)]            4.98s     4.1GB
2 AA[BB, on = .(aa), `:=`(cc, cc)] 560.88ms   384.6MB
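For readers who do not have the benchmark data, here is a minimal sketch of the same update-join idiom on toy tables. The table contents are made up for illustration; only the names `AA`, `BB`, `aa` and `cc` come from the benchmark above. It assumes `data.table` is installed and uses the explicit `i.cc` prefix to make clear that the value comes from `BB`.

```r
library(data.table)

# Hypothetical toy tables; contents are invented for illustration.
AA <- data.table(aa = c(1L, 2L, 3L), bb = c("x", "y", "z"))
BB <- data.table(aa = c(1L, 3L), cc = c(10, 30))

# Record the memory address of AA before the join.
addr_before <- address(AA)

# Update join: for rows of AA that match BB on `aa`,
# add BB's `cc` column to AA in place (no new table is assigned).
AA[BB, on = .(aa), cc := i.cc]

# Rows of AA with no match in BB (here aa == 2) get NA in `cc`.
print(AA)

# Compare the address afterwards to see whether AA was modified in place.
same_object <- identical(address(AA), addr_before)
print(same_object)
```

The regular join `AA <- BB[AA, on = .(aa)]` builds a whole new table and rebinds `AA`, which is where the extra time and memory in the benchmark go. The `:=` form only adds one column to the existing table.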