Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Kernel] Add support for nested column reference expression #2078

Merged
merged 5 commits into from
Sep 20, 2023

Conversation

vkorukanti
Copy link
Collaborator

@vkorukanti vkorukanti commented Sep 19, 2023

Which Delta project/connector is this regarding?

  • Spark
  • Standalone
  • Flink
  • Kernel
  • Other (fill in here)

Description

Part of #2071 (Partition Pruning in Kernel). We need a way to reference the partitionValues nested column in scan file ColumnarBatch.

Currently, the Column expression can only be used to refer to a top-level column. There is no way to refer to a nested column. This PR updates the Column expression to be a multi-part identifier. This is similar to the Spark's NamedReference DSv2 expression.

Fixes #2040 (also contains different approaches to refer to a nested column).

How was this patch tested?

Added a UT

Copy link
Collaborator

@scottsand-db scottsand-db left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM with 2 comments

@vkorukanti vkorukanti merged commit 8e3943f into delta-io:master Sep 20, 2023
6 checks passed
@vkorukanti vkorukanti deleted the columnPathExpr branch October 2, 2023 05:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Feature Request][Kernel] Add support for referring to nested columns in Kernel Column expression
2 participants