Merge integration to Main #1676

yu-shipit · 2023-09-27T20:42:10Z

Pull Request checklist

The commit message(s) follow the contribution guidelines?
Tests for the changes have been added (for bug fixes / features)?
Docs have been added / updated (for bug fixes / features)?

Current behavior: (link existing issues here: https://help.github.com/articles/basic-writing-and-formatting-syntax/#referencing-issues-and-pull-requests)

New behavior :

BREAKING CHANGES

If this PR contains a breaking change, please describe the impact and migration
path for existing applications.
If not please remove this section.

Breaking changes may include:

Any schema changes to any Cassandra tables
The serialized format for Dataset and Column (see .toString methods)
Over the wire formats for Akka messages / case classes
Changes to the HTTP public API
Changes to query parsing / PromQL parsing

Other information:

) This reverts commit fa731f4. Co-authored-by: Amol Nayak <[email protected]>

…db#1610) Today, scaling FiloDB horizontally requires manually recalculating memory settings, which relies on tribal knowledge. This PR aims to automate that and reduce it to simple math. It is backward compatible and behind a feature flag.
* Min-num-nodes moved from the dataset config into the server config.
* The new server config drives the automatic memory calculation.
* Each dataset requires another configuration that determines what fraction of resources it gets.
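To make the "simple math" concrete, here is a minimal, language-neutral sketch in Python. The formula, the function name `per_shard_memory`, and its parameters are assumptions for illustration only; the commit message does not spell out the actual calculation or config keys. The sketch assumes shards are spread evenly over `min_num_nodes` and that each dataset gets a fixed percentage of a node's memory.

```python
import math

def per_shard_memory(node_memory_bytes, dataset_percent, num_shards, min_num_nodes):
    """Hypothetical sizing rule: bytes available to each shard of one dataset on one node."""
    if not 0 < dataset_percent <= 100:
        raise ValueError("dataset_percent must be in (0, 100]")
    # With shards spread evenly, a node hosts at most this many shards.
    shards_per_node = math.ceil(num_shards / min_num_nodes)
    # The dataset's slice of the node's memory budget.
    dataset_bytes = node_memory_bytes * dataset_percent // 100
    return dataset_bytes // shards_per_node

GIB = 1024 ** 3
# Doubling the node count halves the shards per node, so each shard gets twice the memory.
small_cluster = per_shard_memory(32 * GIB, 50, num_shards=64, min_num_nodes=8)
large_cluster = per_shard_memory(32 * GIB, 50, num_shards=64, min_num_nodes=16)
```

Under these assumptions, changing the node count only changes one input, rather than requiring every memory setting to be recomputed by hand.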

* filodb(core) add debugging info for empty histogram. Some queries occasionally hit exceptions because of an empty histogram, but the same exception could not be reproduced later. The hunch is that the bug is caused by a race condition, so this adds debug logging that prints the chunk id, the chunk info, and a memory dump.
---------
Co-authored-by: Yu Zhang <[email protected]>
Co-authored-by: alextheimer <[email protected]>
Co-authored-by: sandeep6189 <[email protected]>

Co-authored-by: Yu Zhang <[email protected]>

…1613)" (filodb#1623) This reverts commit 90303aa.

Co-authored-by: Kier Petrov <[email protected]>

…ata scans (filodb#1628) Double.isNaN involves conversion to a boxed Java Double. Local heap profiling showed that this is a significant source of allocations. Switching to the static java.lang.Double.isNaN removes them.

…filodb#1630) Co-authored-by: Yu Zhang <[email protected]> (cherry picked from commit 5b05779)

skip busting the problematic entry. Co-authored-by: Yu Zhang <[email protected]>

Some queries occasionally hit exceptions because of an empty histogram, but the same exception could not be reproduced later. The hunch is that the bug is caused by a race condition, so this adds debug logging that prints the chunk id, the chunk info, and a memory dump. Co-authored-by: Yu Zhang <[email protected]>

cherry-pick query-planner fix

There are two configs for num-nodes used by clustering-v2 and automatic memory alloc code. Consolidating them.

Default alloc configs should sum to 100.

…n without the label (filodb#1639)

filodb#1629) * fix(core) fix the unless operator for aggregators. For regex shard key we need to aggregate across all nodes. InProcessPlanDispatcher is needed. --------- Co-authored-by: Yu Zhang <[email protected]>

feat(query): Cardinality V2 API Query Plan changes

convert unary expressions through binary expressions. Co-authored-by: Yu Zhang <[email protected]>

) There was a bug in calculating the size of an SRV. Earlier, for efficiency, we calculated the size of the containers associated with the SRV. But a container can actually hold multiple SRVs, so the size calculated for several SRVs at a time can end up wrong because the shared container sizes are added up cumulatively. The fix for now is to calculate the size by going through the records. This introduces a small inefficiency, but the other ways to calculate the size were more invasive and risked regressions. We can optimize this later if it is really needed. I have also reduced the number of calls to this method from two to one. The unit tests didn't catch this earlier because they used only one SRV. I have now added a unit test that adds multiple SRVs; it failed with the earlier code.
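The double counting described above can be illustrated with a small Python sketch. The `Container` and `RangeVector` classes are hypothetical stand-ins, not FiloDB's actual types; they only model the situation where several range vectors share one container.

```python
from dataclasses import dataclass, field

@dataclass
class Container:
    """Hypothetical record container; each record is a bytes object."""
    records: list = field(default_factory=list)

    def num_bytes(self):
        return sum(len(r) for r in self.records)

@dataclass
class RangeVector:
    """Hypothetical range vector that owns a slice of a (possibly shared) container."""
    container: Container
    start: int
    count: int

    def size_by_container(self):
        # Old approach: cheap, but counts records that belong to other range vectors.
        return self.container.num_bytes()

    def size_by_records(self):
        # Fixed approach: walk only the records this range vector owns.
        own = self.container.records[self.start:self.start + self.count]
        return sum(len(r) for r in own)

shared = Container([b"aaaa", b"bb", b"cccccc"])  # 12 bytes in total
first = RangeVector(shared, start=0, count=2)
second = RangeVector(shared, start=2, count=1)

overcounted = first.size_by_container() + second.size_by_container()
accurate = first.size_by_records() + second.size_by_records()
```

With a single range vector per container the two methods agree, which is consistent with the note that single-SRV unit tests did not expose the bug.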

… enum and prefix regex filters (filodb#1641) Production profiling shows that Lucene regex automata create a hotspot in both method and allocation profiles. This PR optimizes two kinds of queries:
* Regex with enumerated values is converted to TermInSetQuery
* Regex with a prefix is converted to PrefixQuery
It also wraps Lucene queries in ConstantScoreQuery to prevent any scoring that may be happening.
Observed:
* 2.2x performance improvement in JMH benchmark for a specific enum regex query
* 1.5x performance improvement in JMH benchmark for a specific prefix regex query
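The classification step behind this optimization can be sketched as follows. This is an illustrative Python sketch, not the PR's code: the function name, the returned tags, and the set of characters treated as "plain literal" are all assumptions. It only decides which cheaper query shape a pattern qualifies for; it does not build Lucene queries.

```python
import re

# Assumption: a "literal" is a run of characters with no regex meaning.
_LITERAL = r"[A-Za-z0-9_:-]+"
_ENUM_RE = re.compile(rf"{_LITERAL}(?:\|{_LITERAL})+")
_PREFIX_RE = re.compile(rf"({_LITERAL})\.\*")

def classify_regex_filter(pattern):
    """Return a (kind, payload) pair describing the cheapest matching strategy."""
    if _ENUM_RE.fullmatch(pattern):
        # e.g. "a|b|c": an exact-match lookup over a set of terms suffices.
        return ("term_in_set", pattern.split("|"))
    prefix = _PREFIX_RE.fullmatch(pattern)
    if prefix:
        # e.g. "app-.*": a prefix scan suffices.
        return ("prefix", prefix.group(1))
    # Anything else still needs the general regex automaton.
    return ("regex", pattern)
```

For example, `classify_regex_filter("us-east|us-west")` yields a term set, while a pattern such as `a.+b` falls through to the general regex path.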

…ix (filodb#1645)
* fix(core) make the error message more frendly to users. (filodb#1593) Co-authored-by: Yu Zhang <[email protected]> (cherry picked from commit 5b05779)
* fix nullpointer happened in cardinality busting job. (filodb#1631) skip busting the problematic entry. Co-authored-by: Yu Zhang <[email protected]> (cherry picked from commit 6ac0255)

…ilodb#1649)
* filodb(core) add debugging info for empty histogram. Some queries occasionally hit exceptions because of an empty histogram, but the same exception could not be reproduced later. The hunch is that the bug is caused by a race condition, so this adds debug logging that prints the chunk id, the chunk info, and a memory dump.
---------
Co-authored-by: Yu Zhang <[email protected]> (cherry picked from commit 90303aa)

feat(query): Cardinality V2 API Query Plan changes

feat(query): Cardinality V2 API Query Plan changes (filodb#1637)

…c calls (filodb#1651) * Adding UserDatasets for remote calls * Updating UT

fix(query): Cardinality multi partition queries

Adds the CPU time (in nanoseconds) consumed by Lucene index lookups to query stats, since this frequently shows up as a hotspot in CPU profiles.

…anner for TenantIngestionMetering changes

…anner for TenantIngestionMetering changes (filodb#1659)

…lodb#1661)

…ilodb#1664)

…s for cardinality calculation time (filodb#1666) * Adding config support for DS Card flushCount and perf logs for cardinality calculation time

filodb#1629) (filodb#1668) * fix(core) fix the unless operator for aggregators. For regex shard key we need to aggregate across all nodes. InProcessPlanDispatcher is needed. --------- Co-authored-by: Yu Zhang <[email protected]> (cherry picked from commit f5018ae)

Merge branch 'develop' into integration

Bump filodb version to 0.9.23.

alextheimer and others added 30 commits July 11, 2023 15:38

Revert "maint(core): upgrade lucene to 9.7.0 (filodb#1617)" (filodb#1622

7679af3

) This reverts commit fa731f4. Co-authored-by: Amol Nayak <[email protected]>

fix(core) make the error message more frendly to users. (filodb#1593)

5b05779

Co-authored-by: Yu Zhang <[email protected]>

Revert "filodb(core) add debugging info for empty histogram. (filodb#…

a93666d

…1613)" (filodb#1623) This reverts commit 90303aa.

Adding logging statement when warning is produced. (filodb#1625)

a37bf5f

Co-authored-by: Kier Petrov <[email protected]>

perf(query): Remove boxed Double allocations from NaN checks during d…

8ecf630

…ata scans (filodb#1628) Double.isNaN involves conversion to a boxed Java Double. Local heap profiling showed that this is a significant source of allocations. Switching to the static java.lang.Double.isNaN removes them.

fix(core) make the error message more frendly to users. (filodb#1593) (…

f14a13c

…filodb#1630) Co-authored-by: Yu Zhang <[email protected]> (cherry picked from commit 5b05779)

fix nullpointer happened in cardinality busting job. (filodb#1631)

6ac0255

skip busting the problematic entry. Co-authored-by: Yu Zhang <[email protected]>

fix(query): prevent list.head on empty list (filodb#1632)

eebd5f4

maint(kafka): update consumer client id (filodb#1633)

6c1693a

fix(query): prevent list.head on empty list (filodb#1632)

5dadfb9

Merge pull request filodb#1634 from alextheimer/cherry-pick

8929fb2

cherry-pick query-planner fix

fix(core): Consolidate num-nodes duplicate config (filodb#1635)

59cae2a

There are two configs for num-nodes used by clustering-v2 and automatic memory alloc code. Consolidating them.

Fix memory alloc config (filodb#1638)

ea1644b

Default alloc configs should sum to 100.
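A minimal sketch of the invariant this commit restores, in Python. The function name and the config keys in the example are hypothetical; the commit message only states that the default allocation percentages should sum to 100.

```python
import math

def validate_alloc_percents(percents):
    """Reject an allocation config whose percentages do not add up to 100."""
    total = sum(percents.values())
    if not math.isclose(total, 100.0):
        raise ValueError(f"allocation percentages sum to {total}, expected 100")
    return percents

# Hypothetical keys, for illustration only.
defaults = validate_alloc_percents({"index": 25, "ingestion-buffer": 25, "block-memory": 50})
```

Failing fast at config load time is one way to catch a mistyped default before it skews every derived memory size.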

fix(query) Regex equals .* must ignore the label and match series eve…

955814e

…n without the label (filodb#1639)
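The intended matching rule can be sketched in Python as below. It assumes the usual Prometheus convention that a missing label behaves like a label with an empty value; the function name and sample data are illustrative, not FiloDB code.

```python
import re

def regex_filter_matches(series_labels, label, pattern):
    """Apply a fully anchored regex filter, treating a missing label as ''."""
    value = series_labels.get(label, "")
    return re.fullmatch(pattern, value) is not None

with_label = {"__name__": "http_requests_total", "job": "api"}
without_label = {"__name__": "http_requests_total"}

# ".*" matches the empty string, so both series are selected.
both_match = [regex_filter_matches(s, "job", ".*") for s in (with_label, without_label)]
# ".+" requires at least one character, so the series without the label drops out.
only_labelled = [regex_filter_matches(s, "job", ".+") for s in (with_label, without_label)]
```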

feat(query): Cardinality V2 API Query Plan changes (filodb#1637)

84a185f

feat(query): Cardinality V2 API Query Plan changes

fix(query) Fix regression with regex match (filodb#1640)

c7e26a9

fix(query) support unary operators(+/-) (filodb#1642)

dd59325

convert unary expressions through binary expressions. Co-authored-by: Yu Zhang <[email protected]>
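One way to read "convert unary expressions through binary expressions" is a desugaring pass that rewrites `-x` as `0 - x` and `+x` as `0 + x`. The Python sketch below shows that idea on a toy expression tree; the node classes and the choice of `0` as the left operand are assumptions, not the PR's actual planner code.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Scalar:
    value: float

@dataclass(frozen=True)
class Selector:
    name: str

@dataclass(frozen=True)
class Unary:
    op: str          # "+" or "-"
    operand: object

@dataclass(frozen=True)
class Binary:
    lhs: object
    op: str
    rhs: object

def desugar(expr):
    """Rewrite every unary node into an equivalent binary node with 0 on the left."""
    if isinstance(expr, Unary):
        return Binary(Scalar(0.0), expr.op, desugar(expr.operand))
    if isinstance(expr, Binary):
        return Binary(desugar(expr.lhs), expr.op, desugar(expr.rhs))
    return expr

negated = desugar(Unary("-", Selector("foo")))
nested = desugar(Binary(Selector("a"), "*", Unary("-", Selector("b"))))
```

The appeal of this approach is that the rest of the planner only has to understand binary expressions.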

feat(query): Cardinality V2 API Query Plan changes (filodb#1637)

523999c

feat(query): Cardinality V2 API Query Plan changes

Merge pull request filodb#1650 from amolnayak311/integration

9ed8685

feat(query): Cardinality V2 API Query Plan changes (filodb#1637)

fix(query): Adding user datasets for Cardinality V2 RemoteMetadataExe…

594ffce

…c calls (filodb#1651) * Adding UserDatasets for remote calls * Updating UT

Fix MultiPartition Card Queries (filodb#1652)

89bd678

fix(query): Adding user datasets for Cardinality V2 RemoteMetadataExe…

bdcfde7

…c calls (filodb#1651) * Adding UserDatasets for remote calls * Updating UT

sandeep6189 and others added 17 commits August 18, 2023 09:53

Fix MultiPartition Card Queries (filodb#1652)

17fb077

Merge pull request filodb#1654 from sandeep6189/integration

225617c

fix(query): Cardinality multi partition queries

feat(core): Add Query CPU Time for Index Lookups (filodb#1655)

89095e2

Adds the CPU time (in nanoseconds) consumed by Lucene index lookups to query stats, since this frequently shows up as a hotspot in CPU profiles.
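As a rough illustration of the idea, the Python sketch below accumulates per-thread CPU time around a lookup call into a stats object. `QueryStats`, `timed_index_lookup`, and the dictionary standing in for the index are hypothetical; the PR's actual stats fields and measurement mechanism are not shown in the commit message.

```python
import time

class QueryStats:
    """Hypothetical stats holder with a counter for index-lookup CPU time."""
    def __init__(self):
        self.index_lookup_cpu_nanos = 0

def timed_index_lookup(stats, lookup, *args):
    """Run lookup(*args) and add the CPU time used by this thread to stats."""
    start = time.thread_time_ns()
    try:
        return lookup(*args)
    finally:
        # Thread CPU time excludes time spent blocked or descheduled.
        stats.index_lookup_cpu_nanos += time.thread_time_ns() - start

index = {"job=api": [1, 4, 7], "job=db": [2, 3]}  # stand-in for a real index
stats = QueryStats()
part_ids = timed_index_lookup(stats, index.get, "job=api")
```

Measuring CPU time rather than wall-clock time keeps the counter comparable across loaded and idle nodes.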

fix(metering): Overriding the cluster name .passed to SingleClusterPl…

1e56ef9

…anner for TenantIngestionMetering changes

fix(metering): Overriding the cluster name .passed to SingleClusterPl…

c77a95a

…anner for TenantIngestionMetering changes (filodb#1659)

misc(core): add downsample support for aggregated data (filodb#1661)

710c3d2

misc(core): add downsample support for aggregated data (filodb#1661)

df3922d

cherry-pic misc(core): add downsample support for aggregated data (fi…

14115cf

…lodb#1661)

maint(core): upgrade to Lucene 9.7.0 (filodb#1662)

6cb5433

bug(query): Streaming query execution allocated too much mem via RB (f…

7adc382

…ilodb#1664)

perf(card): Adding config support for DS card flushCount and perf log…

bf8ead0

…s for cardinality calculation time (filodb#1666) * Adding config support for DS Card flushCount and perf logs for cardinality calculation time

Merge branch 'develop' into integration

f7d60ac

Merge pull request filodb#1673 from yu-shipit/integration

ba0023f

Merge branch 'develop' into integration

Bump filodb version to 0.9.23.

78a33f6

Merge pull request filodb#1674 from yu-shipit/integration

b5a2e40

Bump filodb version to 0.9.23.

Merge branch 'integration'

4e97f53

yu-shipit changed the title ~~merge integration to main~~ Merge integration to Main Sep 27, 2023

sherali42 approved these changes Sep 27, 2023

alextheimer approved these changes Sep 27, 2023

sandeep6189 approved these changes Sep 27, 2023

yu-shipit merged commit b8f7dca into filodb:main Sep 27, 2023
1 check passed
