-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimise hash joins #14644
Optimise hash joins #14644
Conversation
Review ChecklistHello reviewers! 👋 Please follow this checklist when reviewing this Pull Request. General
Tests
Documentation
New flags
If a workflow is added or modified:
Backward compatibility
|
a40c08f
to
ef546e7
Compare
ef546e7
to
15bcdba
Compare
Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
…ough Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Manan Gupta <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
15bcdba
to
5ba9cd4
Compare
Signed-off-by: Andres Taylor <[email protected]>
5ba9cd4
to
9d37cc8
Compare
Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
fcd773c
to
a41459a
Compare
"t2.shardKey", | ||
} | ||
|
||
// quickly construct a big number of left join aggregation queries that have to be executed using the hash join |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nice
inOffset = op.AddColumn(ctx, false, false, aeWrap(expr)) | ||
} | ||
|
||
// we turn the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks like an incomplete comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
so, the idea was that we
Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
also changing the reading of the json test files so that our plan tests fail if they don't follow the expected format Signed-off-by: Andres Taylor <[email protected]>
c345522
to
ca8dc6e
Compare
Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
Signed-off-by: Andres Taylor <[email protected]>
Co-authored-by: Manan Gupta <[email protected]>
Description
We recently added support for more queries by using hash joins when a nested loop join is not possible to use.
At the moment these queries would work, but only by moving the data to the vtgate proxy and there doing the necessary operations on the data. This makes a lot of queries intractable - they work only on toy datasets and would time out on real data sizes.
This PR adds logic to push most of the work down to mysql.
Related Issue(s)
PR that introduced hash joins: #14448
Checklist