Oct 21, 2024: This week in DataFusion #13035
Comments
Congrats @goldmedal for becoming a committer! Thanks for all your hard work 🚀
Many thanks!
@XiangpengHao and @tustvold are also working on Adaptive Predicate Pushdown in the parquet reader of apache/arrow-rs#5523 -- this has the potential to allow DataFusion to always push predicates into the parquet decoder itself, further improving parquet read performance.
As @Dandandan notes, this would let us finally do
See the great writeup from @tustvold here with analysis: apache/arrow-rs#6454 (comment)
@SamSynnada started a great discussion about better spreading the word about DataFusion. Thank you 🙏 -- it is going to be a great year
Hi, I want to highlight datafusion-contrib/datafusion-orc#120 as a great example of community collaboration. Given DataFusion's unique position, I believe there are many opportunities for such collaboration. It's also a great chance to thank everyone who made it possible, especially those who have maintained the project for a year.
@2010YOUY01 is rallying a team for improvements in externalized hash aggregation:
Next week: #13167
Introduction
This ticket is a weekly summary of interesting things happening in DataFusion. Note this is not a complete list (it is what I remember / can find). Please feel free to leave comments on this ticket about things that I may have missed or that you think should get wider attention from the community.
Loosely inspired by https://this-week-in-rust.org/
Highlights from last week(s):
(I am sorry if I missed you -- please add a note to this ticket with anything you would like to add)
unnest even more: Improve recursive unnest options API #12836
0.45.0 #12863
BuiltInWindowFunction::{Lead, Lag} to a user defined window function #12857
Looking to get more involved? Try code review!
DataFusion has a long history of community members contributing in all aspects of the project. Reviewing PRs is an especially great way to get introduced to the project, help the community and grow your own knowledge -- researching and understanding the code enough to review PRs also often inspires additional ideas for improvements.
We have docs about reviews. TLDR: check for test coverage, whether the change is understandable and well documented, and whether the code can be improved. When you think the PR looks good to merge, try @ mentioning one of the committers.
Help wanted
Please feel free to leave your own comments on the ticket if you are looking for help
regr* aggregate functions to code #12871
Andrew's Focus Areas:
We are preparing for the 43.0.0 release and I am personally pretty excited about (and thus actively helping with / putting at the top of my review list):
WindowFunction Interface (remove built in list of BuiltInWindowFunctions) #8709 (very close to finishing thanks @jcsherin @jatin510)
Recent and Upcoming Releases
0.52.0 datafusion-sqlparser-rs#1423 (huge kudos to @iffyio for all the reviews)
53.2.0 (~November~ October 2024) arrow-rs#6341 (to support turning on string view by default)
Interesting discussions underway:
Community
Upcoming meetups:
Background:
Previous update: #12973