
Add InstalledScheduler for blockstore_processor #33875

Merged (5 commits) on Oct 27, 2023

Conversation

@ryoqun (Member) commented Oct 26, 2023

Problem

There's no actual usage of the recently introduced BankWithScheduler (#33704) in the monorepo.

Summary of Changes

Start to use it incrementally; first in blockstore_processor with minimal definition of a newly-introduced trait called InstalledScheduler.

The trait is currently really bare-bones; more methods will be added in upcoming PRs, especially to properly shut down the installed scheduler and to manage its lifecycle via the installed scheduler pool.

Also note that the plumbing in this PR isn't complete either: there should be no way to hit the added code path for the unified scheduler in the released binaries.

extracted from: #33070
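For illustration, here's a toy sketch of what a bare-bones scheduler trait of this shape could look like. The `Tx` type and `RecordingScheduler` are invented stand-ins for this sketch; the real trait operates on sanitized transactions inside the monorepo:

```rust
use std::sync::{Arc, Mutex};

// Invented stand-in for a sanitized transaction (illustrative only).
#[derive(Clone, Debug, PartialEq)]
struct Tx(u64);

// Deliberately bare-bones, mirroring the PR's description: scheduling
// returns immediately and never blocks on execution.
trait InstalledScheduler: Send + Sync {
    fn schedule_execution(&self, tx: Tx, index: usize);
}

// A toy scheduler that just records what was scheduled.
#[derive(Default)]
struct RecordingScheduler {
    scheduled: Mutex<Vec<(Tx, usize)>>,
}

impl InstalledScheduler for RecordingScheduler {
    fn schedule_execution(&self, tx: Tx, index: usize) {
        self.scheduled.lock().unwrap().push((tx, index));
    }
}

fn main() {
    let scheduler = Arc::new(RecordingScheduler::default());
    scheduler.schedule_execution(Tx(1), 0);
    scheduler.schedule_execution(Tx(2), 1);
    let scheduled = scheduler.scheduled.lock().unwrap();
    assert_eq!(scheduled.len(), 2);
    assert_eq!(scheduled[0], (Tx(1), 0));
}
```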

@ryoqun ryoqun requested a review from apfitzge October 26, 2023 08:13
codecov bot commented Oct 26, 2023

Codecov Report

Merging #33875 (5d856f9) into master (70107e2) will increase coverage by 0.0%.
Report is 9 commits behind head on master.
The diff coverage is 98.6%.

@@           Coverage Diff           @@
##           master   #33875   +/-   ##
=======================================
  Coverage    81.8%    81.9%           
=======================================
  Files         809      809           
  Lines      217824   217882   +58     
=======================================
+ Hits       178387   178463   +76     
+ Misses      39437    39419   -18     

log_messages_bytes_limit: Option<usize>,
prioritization_fee_cache: &PrioritizationFeeCache,
) -> Result<()> {
if !bank.has_installed_scheduler() {
Contributor:

nit:
I think the `!` makes this slightly less straightforward, to me at least. It would be clearer as `if has_scheduler { schedule } else { rebatch_and_execute }`. wdyt?

Member Author:

done: 9491e33
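The suggested positive-condition shape might look roughly like this toy sketch; the `bool` parameter and the helper bodies are illustrative stand-ins, not the monorepo signatures:

```rust
// Sketch of the review suggestion: put the scheduler branch first so the
// condition reads positively. Helper bodies are stubs for illustration.
fn process_batches(has_installed_scheduler: bool) -> Result<(), String> {
    if has_installed_scheduler {
        schedule_batches_for_execution()
    } else {
        rebatch_and_execute_batches()
    }
}

fn schedule_batches_for_execution() -> Result<(), String> {
    // Scheduling returns immediately; execution errors surface later
    // when the scheduler is waited on.
    Ok(())
}

fn rebatch_and_execute_batches() -> Result<(), String> {
    // Blocking execution path used when no scheduler is installed.
    Ok(())
}

fn main() {
    assert!(process_batches(true).is_ok());
    assert!(process_batches(false).is_ok());
}
```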

@@ -294,6 +294,55 @@ fn execute_batches_internal(
})
}

fn process_batches(
Contributor:

So this fn is a drop-in replacement for the previous execute_batches.

In process_entries we still call this drop-in at the same points we called execute_batches, i.e. tick boundaries, entry conflicts, and the end of the loop.
Scheduling doesn't block on execution, and also cannot fail with an error; how does replay stage become aware of invalid block state without this fn returning it?

Contributor:

With that in mind, I'm curious why the switch was introduced here instead of one layer up. Is there a benefit in collecting the batches until a conflict/tick before scheduling?

Member Author:

Oh, thanks for the well-deserved good questions! I hope these in-source comments answer both of them: 2897280

Contributor:

I think so? I think the main reason is that we want to share most of the same code, since there are checks for non-self-conflicting entries. And since scheduling doesn't block on execution, there shouldn't be much overhead from collecting the multiple entries instead of scheduling them as they come in?

Member Author:

Yeah, that's correct.

Member Author:

FYI, I improved the in-source comment a bit: 6e63d87

@ryoqun ryoqun requested a review from apfitzge October 27, 2023 04:25
Comment on lines +297 to +298
// This fn diverts the code-path into two variants. Both must provide exactly the same set of
// validations. For this reason, this fn is deliberately inserted into the code path to be called
Member Author (@ryoqun), Oct 27, 2023:

Both must provide exactly the same set of validations

I'm aware that apply_cost_tracker_during_replay needs to be handled for unified scheduler ;)

);
// scheduling always succeeds here without being blocked on actual transaction executions.
// The transaction execution errors will be collected via the blocking fn called
// BankWithScheduler::wait_for_completed_scheduler(), if any.
Member Author (@ryoqun), Oct 27, 2023:

FYI, wait_for_completed_scheduler() doesn't exist in the master branch yet; the comment is written as if it exists to avoid a needless future rewrite. That fn will be added in the very next PR.
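The "schedule immediately, surface errors on a later blocking wait" behavior described in that diff comment can be sketched with a channel-backed toy. Everything here, including this `wait_for_completed_scheduler`, is an invented stand-in, not the real implementation:

```rust
use std::sync::mpsc;
use std::thread;

struct ToyScheduler {
    sender: mpsc::Sender<u64>,
    handle: thread::JoinHandle<Result<(), String>>,
}

impl ToyScheduler {
    fn new() -> Self {
        let (sender, receiver) = mpsc::channel::<u64>();
        // Execution happens on a background thread; scheduling never blocks.
        let handle = thread::spawn(move || {
            for tx_id in receiver {
                if tx_id == 0 {
                    return Err(format!("transaction {tx_id} failed"));
                }
            }
            Ok(())
        });
        Self { sender, handle }
    }

    // Always succeeds immediately, mirroring the comment in the diff.
    fn schedule_execution(&self, tx_id: u64) {
        self.sender.send(tx_id).unwrap();
    }

    // Blocking: any execution error is surfaced only here.
    fn wait_for_completed_scheduler(self) -> Result<(), String> {
        drop(self.sender); // close the channel so the worker drains and exits
        self.handle.join().unwrap()
    }
}

fn main() {
    let ok = ToyScheduler::new();
    ok.schedule_execution(1);
    ok.schedule_execution(2);
    assert!(ok.wait_for_completed_scheduler().is_ok());

    let failing = ToyScheduler::new();
    failing.schedule_execution(0);
    assert!(failing.wait_for_completed_scheduler().is_err());
}
```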

Comment on lines 92 to 97
// 'a is needed; anonymous_lifetime_in_impl_trait isn't stabilized yet...
pub fn schedule_transaction_executions<'a>(
&self,
transactions: &[SanitizedTransaction],
transaction_indexes: impl Iterator<Item = &'a usize>,
) {
Contributor:

Sorry I missed this on my previous pass. Could you explain/justify this interface? The comment on 'a is what drew my attention.

It feels a bit odd for transactions to be a slice while transaction_indexes is an iterator.
AFAICT, the source of both of these is TransactionBatchWithIndexes, which should be able to give us a slice for both fields. Which makes me wonder why we need/want transaction_indexes to be an iterator.

I can see that iterators are more generic than slices; if that's the motivation, why not something like:

pub fn schedule_transaction_executions(
  &self,
  transactions_with_indexes: impl IntoIterator<Item = (...)>
) {
...
}

Member Author:

You're right. I simplified it, and I like this: 5d856f9
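The direction this thread converged on, a single iterator of pairs with no explicit lifetime parameter, might be sketched like this. The `Tx` type and `SchedulerHandle` are invented stand-ins for illustration, not the actual monorepo types:

```rust
// Invented stand-in for a sanitized transaction.
#[derive(Clone, Debug, PartialEq)]
struct Tx(u64);

struct SchedulerHandle {
    scheduled: Vec<(Tx, usize)>,
}

impl SchedulerHandle {
    fn new() -> Self {
        Self { scheduled: Vec::new() }
    }

    // One generic parameter covers both fields, so no explicit
    // lifetime annotation is needed on the signature.
    fn schedule_transaction_executions(
        &mut self,
        transactions_with_indexes: impl IntoIterator<Item = (Tx, usize)>,
    ) {
        for (tx, index) in transactions_with_indexes {
            self.scheduled.push((tx, index));
        }
    }
}

fn main() {
    let txs = vec![Tx(10), Tx(20)];
    let indexes = vec![5usize, 7];
    let mut handle = SchedulerHandle::new();
    // The caller zips transactions with their indexes into one iterator.
    handle.schedule_transaction_executions(txs.into_iter().zip(indexes));
    assert_eq!(handle.scheduled, vec![(Tx(10), 5), (Tx(20), 7)]);
}
```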

@ryoqun ryoqun requested a review from apfitzge October 27, 2023 08:10
@apfitzge (Contributor) left a comment:

LGTM - appreciate all the work!

@ryoqun ryoqun merged commit 950ca5e into solana-labs:master Oct 27, 2023
28 checks passed