
[merkledb benchmark] implement simple write profile benchmark #3372

Open · wants to merge 44 commits into base: master

Conversation

tsachiherman (Contributor):

Why this should be merged

How this works

How this was tested

)

func getMerkleDBConfig(promRegistry prometheus.Registerer) merkledb.Config {
	const defaultHistoryLength = 300

Contributor Author:

done.

	Hasher:             merkledb.DefaultHasher,
	RootGenConcurrency: 0,
	HistoryLength:      defaultHistoryLength,
	ValueNodeCacheSize: units.MiB,
Contributor:

These seem really small. If I'm reading this correctly, there is about 2 MiB of total cache?

Contributor Author:

I attempted to tweak this value, but it had no performance impact.

}
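For context, the helper under discussion assembles to roughly the following. This is a sketch built only from the fields quoted in this thread; wiring the registry through a Reg field is an assumption, not something shown in the diff.

```go
package bench

import (
	"github.com/ava-labs/avalanchego/utils/units"
	"github.com/ava-labs/avalanchego/x/merkledb"

	"github.com/prometheus/client_golang/prometheus"
)

// Sketch of the benchmark's MerkleDB configuration, assembled from the
// snippets quoted above; fields not quoted in the thread are omitted.
func getMerkleDBConfig(promRegistry prometheus.Registerer) merkledb.Config {
	const defaultHistoryLength = 300
	return merkledb.Config{
		Hasher:                    merkledb.DefaultHasher,
		RootGenConcurrency:        0,
		HistoryLength:             defaultHistoryLength,
		ValueNodeCacheSize:        units.MiB,
		IntermediateNodeCacheSize: 1024 * units.MiB,
		Reg:                       promRegistry, // assumption: registry wiring not shown in the diff
	}
}
```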

fmt.Printf("Initializing database.")
ticksCh := make(chan interface{})
Contributor:

nit: do you really need this ticker? Might be easier to report every 100k rows or something.

Contributor Author:

Reporting every 100k rows doesn't work nicely because of the batch writing (which blocks for a long time).
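A time-based reporter sidesteps the blocking issue. Here is a minimal sketch of the ticker approach, assuming the write loop bumps an atomic counter; startProgressReporter and entriesWritten are illustrative names, not the PR's:

```go
package bench

import (
	"fmt"
	"sync/atomic"
	"time"
)

// startProgressReporter prints progress once per second, independent of how
// long any individual batch write blocks. Closing done stops the goroutine.
func startProgressReporter(entriesWritten *atomic.Uint64, done <-chan struct{}) {
	go func() {
		ticker := time.NewTicker(time.Second)
		defer ticker.Stop()
		for {
			select {
			case <-ticker.C:
				fmt.Printf("wrote %d entries\n", entriesWritten.Load())
			case <-done:
				return
			}
		}
	}()
}
```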


const (
	defaultDatabaseEntries    = 2000000
	databaseCreationBatchSize = 1000000
Contributor:

Batch size is supposed to be 10k. This is 1M.

Contributor Author:

done.
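With that fix applied, the constants presumably read:

```go
const (
	defaultDatabaseEntries    = 2000000
	databaseCreationBatchSize = 10000 // was 1000000; the intended batch size is 10k
)
```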

return err
}
}
deleteDuration = time.Since(startDeleteTime)
Contributor:

You can avoid all this math and just report the raw number of deletes. Grafana can convert this to a rate for you.

Contributor Author:

I've added both. I believe that my calculation would be more accurate, but let's have both for the time being.

Comment on lines +60 to +79
	deleteRate = prometheus.NewGauge(prometheus.GaugeOpts{
		Namespace: "merkledb_bench",
		Name:      "entry_delete_rate",
		Help:      "The rate at which elements are deleted",
	})
	updateRate = prometheus.NewGauge(prometheus.GaugeOpts{
		Namespace: "merkledb_bench",
		Name:      "entry_update_rate",
		Help:      "The rate at which elements are updated",
	})
	insertRate = prometheus.NewGauge(prometheus.GaugeOpts{
		Namespace: "merkledb_bench",
		Name:      "entry_insert_rate",
		Help:      "The rate at which elements are inserted",
	})
	batchWriteRate = prometheus.NewGauge(prometheus.GaugeOpts{
		Namespace: "merkledb_bench",
		Name:      "batch_write_rate",
		Help:      "The rate at which the batch was written",
	})
Contributor:

We should not be calculating the rates in the benchmark. The Prometheus server should do this based on the counts.

Suggested change
	deleteRate = prometheus.NewGauge(prometheus.GaugeOpts{
		Namespace: "merkledb_bench",
		Name:      "entry_delete_rate",
		Help:      "The rate at which elements are deleted",
	})
	updateRate = prometheus.NewGauge(prometheus.GaugeOpts{
		Namespace: "merkledb_bench",
		Name:      "entry_update_rate",
		Help:      "The rate at which elements are updated",
	})
	insertRate = prometheus.NewGauge(prometheus.GaugeOpts{
		Namespace: "merkledb_bench",
		Name:      "entry_insert_rate",
		Help:      "The rate at which elements are inserted",
	})
	batchWriteRate = prometheus.NewGauge(prometheus.GaugeOpts{
		Namespace: "merkledb_bench",
		Name:      "batch_write_rate",
		Help:      "The rate at which the batch was written",
	})

Contributor Author:

I believe that it won't generate accurate results, since we're mixing batch writes and puts in the same sequence.
I've included both the counter and the rate metrics so that we can get both numbers in Grafana.
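For reference, the counter-based shape the suggestion points at would look something like this (illustrative names, one counter per operation; a Grafana panel can then derive the rate with e.g. rate(merkledb_bench_entries_deleted[1m])):

```go
package bench

import "github.com/prometheus/client_golang/prometheus"

// Illustrative counter equivalents of the gauges above: the benchmark
// exports raw totals and the Prometheus server computes the rates.
var (
	deleteCount = prometheus.NewCounter(prometheus.CounterOpts{
		Namespace: "merkledb_bench",
		Name:      "entries_deleted",
		Help:      "Total number of deleted entries",
	})
	insertCount = prometheus.NewCounter(prometheus.CounterOpts{
		Namespace: "merkledb_bench",
		Name:      "entries_inserted",
		Help:      "Total number of inserted entries",
	})
)
```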

Comment on lines 214 to 223
	err = mdb.Close()
	if err != nil {
		fmt.Fprintf(os.Stderr, "unable to close levelDB database : %v\n", err)
		return err
	}
	err = levelDB.Close()
	if err != nil {
		fmt.Fprintf(os.Stderr, "unable to close merkleDB database : %v\n", err)
		return err
	}
Contributor:

The logs seem inverted here

Contributor Author:

fixed.
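The fix presumably swaps the two messages so each matches the database actually being closed:

```go
if err := mdb.Close(); err != nil {
	fmt.Fprintf(os.Stderr, "unable to close merkleDB database : %v\n", err)
	return err
}
if err := levelDB.Close(); err != nil {
	fmt.Fprintf(os.Stderr, "unable to close levelDB database : %v\n", err)
	return err
}
```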

Comment on lines +90 to +91
	ValueNodeCacheSize:        units.MiB,
	IntermediateNodeCacheSize: 1024 * units.MiB,
Contributor:

How much memory are we using? Feels like we could probably increase these

Contributor Author:

I've attempted to tweak these, but haven't seen any concrete gains. Adjusting the levelDB config was helpful, though.
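The thread doesn't show which levelDB settings were changed. As a hedged illustration, tuning via syndtr/goleveldb (the backend used by avalanchego's levelDB wrapper) typically means options along these lines, with placeholder values:

```go
package bench

import "github.com/syndtr/goleveldb/leveldb/opt"

// Placeholder tuning knobs; the concrete values the PR settled on are not
// shown in this thread.
var levelDBOptions = &opt.Options{
	BlockCacheCapacity:     512 * opt.MiB, // cache for data blocks
	WriteBuffer:            256 * opt.MiB, // memtable size before a flush
	OpenFilesCacheCapacity: 1024,          // file-descriptor cache size
}
```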

	startUpdateTime := time.Now()
	for keyToUpdateIdx := low + ((*databaseEntries) / 2); keyToUpdateIdx < low+((*databaseEntries)/2)+databaseRunningUpdateSize; keyToUpdateIdx++ {
		updateEntryKey := calculateIndexEncoding(keyToUpdateIdx)
		updateEntryValue := calculateIndexEncoding(keyToUpdateIdx - ((*databaseEntries) / 2))
Contributor:

This is incorrect, should be:

Suggested change
updateEntryValue := calculateIndexEncoding(keyToUpdateIdx - ((*databaseEntries) / 2))
updateEntryValue := calculateIndexEncoding(low)

Contributor Author:

Hmm, I think it's ok to use low as you suggested, although the expression above would yield distinct, unique values per key (i.e. [low..low+5k]).
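A worked example of the difference, with hypothetical numbers:

```go
package main

import "fmt"

func main() {
	const entries, low, updateSize = 10000, 0, 3
	for i := low + entries/2; i < low+entries/2+updateSize; i++ {
		// the PR's expression: each updated key gets a distinct value index
		// (0, 1, 2, ...); the suggested calculateIndexEncoding(low) would
		// instead write value index 0 to every updated key.
		fmt.Println("key index:", i, "value index:", i-entries/2)
	}
}
```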

	levelDB.Close()
}()

low := uint64(0)
Contributor:

low never changes; it should be increased by 2.5k each pass.

Contributor Author:

good catch; fixed.
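A minimal sketch of the fixed loop shape (the 2.5k step comes from the comment above; the names and pass count are illustrative):

```go
package main

import "fmt"

func main() {
	const passSize = 2500 // advance the window by 2.5k each pass
	low := uint64(0)
	for pass := 0; pass < 4; pass++ {
		// each pass deletes keys in [low, low+passSize) and inserts
		// replacements above the window; here we just print the range
		fmt.Printf("pass %d: window [%d, %d)\n", pass, low, low+passSize)
		low += passSize
	}
}
```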


This PR has become stale because it has been open for 30 days with no activity. Adding the lifecycle/frozen label will cause this PR to ignore lifecycle events.
