
pageserver: revert flush backpressure (#8550) #10135

Merged: 5 commits from erik/revert-flush-wait into main on Dec 15, 2024
Conversation

erikgrinaker (Contributor) commented Dec 13, 2024:

Problem

In #8550, we made the flush loop wait for uploads after every layer. This was to avoid unbounded buildup of uploads, and to reduce compaction debt. However, the approach has several problems:

  • It prevents upload parallelism.
  • It prevents flush and upload pipelining.
  • It slows down ingestion even when there is no need to backpressure.
  • It does not directly backpressure WAL ingestion (only via disk_consistent_lsn), and will build up in-memory layers.
  • It does not directly backpressure based on compaction debt and read amplification.

An alternative solution to these problems is proposed in #8390.

In the meantime, we revert the change to reduce the impact on ingest throughput. This does reintroduce some risk of unbounded upload/compaction buildup. Until #8390 lands, this can be addressed in other ways:

  • Use max_replication_apply_lag (which backpressures based on remote_consistent_lsn); this more directly limits upload debt.
  • Shard the tenant, which will spread the flush/upload work across more Pageservers and move the bottleneck to Safekeeper.

Touches #10095.

Summary of changes

Remove waiting on the upload queue in the flush loop.
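
For illustration, a minimal sketch of the shape of the change. All names here (`Layer`, `RemoteClient`, `schedule_layer_upload`, `wait_completion`) are illustrative stand-ins, not the actual pageserver APIs:

```rust
// Hedged sketch only; stand-in types for the flush loop and remote client.
use std::sync::Arc;
use tokio::sync::Semaphore;

struct Layer; // stand-in for a flushed delta layer

struct RemoteClient {
    upload_slots: Arc<Semaphore>, // bounds concurrent background uploads
}

impl RemoteClient {
    // Schedule an upload to run in the background and return immediately.
    fn schedule_layer_upload(&self, _layer: Layer) {
        let slots = self.upload_slots.clone();
        tokio::spawn(async move {
            let _permit = slots.acquire_owned().await.expect("semaphore closed");
            // ... perform the remote storage upload here ...
        });
    }
}

async fn flush_loop_iteration(remote: &RemoteClient) {
    // Write the frozen in-memory layer to a delta layer on local disk.
    let layer = Layer;

    // Schedule the upload; uploads proceed in the background, in parallel
    // with further flushes.
    remote.schedule_layer_upload(layer);

    // Before this PR, the loop also waited here for the upload queue to
    // drain after every layer, which serialized flush and upload:
    //
    //     remote.wait_completion().await;
    //
    // That per-layer wait is what this change removes.
}
```

With the per-layer wait gone, any backpressure has to come from elsewhere, e.g. max_replication_apply_lag or the proposal in #8390, as noted above.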

erikgrinaker requested a review from arpad-m, December 13, 2024 09:29
erikgrinaker requested a review from a team as a code owner, December 13, 2024 09:29

github-actions bot commented Dec 13, 2024

7110 tests run: 6813 passed, 0 failed, 297 skipped (full report)


Flaky tests (7)

Postgres 17

Postgres 16

  • test_physical_replication_config_mismatch_too_many_known_xids: release-arm64

Postgres 15

  • test_pgdata_import_smoke[None-1024-RelBlockSize.MULTIPLE_RELATION_SEGMENTS]: release-arm64
  • test_lr_with_slow_safekeeper: release-x86-64

Code coverage* (full report)

  • functions: 31.4% (8399 of 26729 functions)
  • lines: 48.0% (66605 of 138626 lines)

* collected from Rust tests only


The comment gets automatically updated with the latest test results
9f3f060 at 2024-12-13T14:32:24.415Z

arpad-m requested a review from problame, December 13, 2024 11:46
arpad-m (Member) left a comment:

Fine by me but I want to see if Christian is okay with this (I think he is the one who originally suggested it)

arpad-m (Member) left a comment:

There are a few tests that fail; they need fixing.

erikgrinaker (Contributor, Author) commented:

There are a few tests that fail; they need fixing.

I know, working my way through them.

erikgrinaker (Contributor, Author) commented:

In #10144, I add parallel layer uploads by only scheduling index uploads (which act as upload queue barriers) every 8 layers.

If we want to derisk this, a halfway solution might be to keep the flush backpressure for now, but only trigger it every time we perform an index upload (every 8 layers). This would provide some parallelism and pipelining without building up an unbounded upload queue; see the sketch below.
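
A rough illustration of that halfway option, with stand-in names only (not the actual pageserver upload queue): schedule an index upload every 8 layers and, if we keep any backpressure, wait on the upload queue only at those points.

```rust
// Hedged sketch only; stand-in types, not the real upload queue.
struct Layer;

struct RemoteClient;

impl RemoteClient {
    fn schedule_layer_upload(&self, _layer: Layer) { /* background layer upload */ }
    fn schedule_index_upload(&self) { /* index upload, acts as a queue barrier */ }
    async fn wait_completion(&self) { /* wait for the upload queue to drain */ }
}

const LAYERS_PER_INDEX_UPLOAD: usize = 8;

async fn flush_one_layer(remote: &RemoteClient, layers_since_index: &mut usize) {
    remote.schedule_layer_upload(Layer);
    *layers_since_index += 1;

    // Only schedule an index upload every 8 layers, so the layer uploads
    // within a batch can run in parallel (as in #10144).
    if *layers_since_index >= LAYERS_PER_INDEX_UPLOAD {
        remote.schedule_index_upload();
        // Halfway option: backpressure only here, bounding the upload queue
        // to roughly one batch while still allowing some pipelining.
        remote.wait_completion().await;
        *layers_since_index = 0;
    }
}
```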

problame (Contributor) left a comment:

Approving under the condition that this time we actually implement compaction-debt-based backpressure.

Is this in Q1 planning?

I like your plan in #10095

erikgrinaker (Contributor, Author) commented:

Merging this to get some benchmark runtime.

Is this in Q1 planning?

I took this to be part of "Write path throttling", which is right on the edge of Q1 inclusion. I agree that it's worth bumping this, certainly more so than S3 throttling. cc @jcsp

I like your plan in #10095

Sharding complicates #10095 and can lead to deadlock. I have an updated proposal in #8390.

erikgrinaker added this pull request to the merge queue Dec 15, 2024
Merged via the queue into main with commit f3ecd5d, Dec 15, 2024
83 checks passed
erikgrinaker deleted the erik/revert-flush-wait branch December 15, 2024 09:47