Add compaction stats for filtered files #13136
Conversation
@jowlyzhang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@jowlyzhang has updated the pull request. You must reimport the pull request before landing.
@jowlyzhang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
LGTM
@@ -30,6 +32,7 @@ void CompactionJobStats::Reset() {
   total_blob_bytes_read = 0;
   total_output_bytes = 0;
   total_output_bytes_blob = 0;
+  total_skipped_input_bytes = 0;
nit: total_filtered_input_bytes to be consistent with the other new stats
Yeah, thanks for the suggestion! I thought about this a bit, and I feel the filtering action applies to the file as a whole: all of the bytes in the file end up being skipped, rather than the bytes themselves being filtered. So I made that distinction in the name. The documentation of the fields should make it clear where they come from.
@cbi42 Thanks for the review!
@jowlyzhang merged this pull request in ef119c9.
As titled. This PR adds compaction job stats, internal stats, and some logging for filtered files.
Example logging:
[default] compacted to: files[0 0 0 0 2 0 0] max score 0.25, estimated pending compaction bytes 0, MB/sec: 0.3 rd, 0.2 wr, level 6, files in(1, 0) filtered(0, 2) out(1 +0 blob) MB in(0.0, 0.0 +0.0 blob) filtered(0.0, 0.0) out(0.0 +0.0 blob), read-write-amplify(2.0) write-amplify(1.0) OK, records in: 1, records dropped: 1 output_compression: Snappy
Test plan:
Added unit tests