fix: remove mutex in prefixdb #239

faddat · 2022-04-25T16:55:15Z

closes:

Adds additional testing to all databases
Adds .gitpod.yml so that devs can click a single button and ensure that changes work, in the exact docker environment that we use in ci

codecov · 2022-04-25T16:56:36Z

Codecov Report

Merging #239 (f39f7bb) into master (d1b9b74) will decrease coverage by 0.50%.
The diff coverage is 92.30%.

@@            Coverage Diff             @@
##           master     #239      +/-   ##
==========================================
- Coverage   68.54%   68.03%   -0.51%     
==========================================
  Files          27       27              
  Lines        2130     2093      -37     
==========================================
- Hits         1460     1424      -36     
+ Misses        595      594       -1     
  Partials       75       75

Impacted Files	Coverage Δ
badger_db.go	`87.77% <0.00%> (ø)`
boltdb_batch.go	`82.69% <ø> (ø)`
boltdb_iterator.go	`92.10% <ø> (ø)`
cleveldb.go	`70.99% <ø> (ø)`
cleveldb_batch.go	`82.35% <ø> (ø)`
prefixdb.go	`63.55% <ø> (-6.79%)`	⬇️
prefixdb_iterator.go	`81.01% <ø> (-0.24%)`	⬇️
rocksdb.go	`72.26% <ø> (ø)`
rocksdb_batch.go	`84.31% <ø> (ø)`
boltdb.go	`56.55% <100.00%> (-0.71%)`	⬇️
... and 4 more

Impacted Files	Coverage Δ
badger_db.go	`87.77% <0.00%> (ø)`
boltdb_batch.go	`82.69% <ø> (ø)`
boltdb_iterator.go	`92.10% <ø> (ø)`
cleveldb.go	`70.99% <ø> (ø)`
cleveldb_batch.go	`82.35% <ø> (ø)`
prefixdb.go	`63.55% <ø> (-6.79%)`	⬇️
prefixdb_iterator.go	`81.01% <ø> (-0.24%)`	⬇️
rocksdb.go	`72.26% <ø> (ø)`
rocksdb_batch.go	`84.31% <ø> (ø)`
boltdb.go	`56.55% <100.00%> (-0.71%)`	⬇️
... and 4 more

creachadair

In light of the argument in #156, this seems like it should be safe.

I am a bit worried that we don't seem to have any tests that exercise the documented concurrency-safety requirement, though -- and there is at least one case where the implementations differ about what operations are allowed to overlap (cf. https://github.com/tendermint/tm-db/blob/master/util_test.go#L29).

Running the race detector can't really help, when there is no concurrency for it to check. This PR doesn't create that problem, but it does make it more relevant.

prefixdb.go

faddat · 2022-04-25T18:28:54Z

@creachadair

I may need to nuke-reinstall my test machine, but I did a bit of a survey on databases recently.

That led me to the terra-money/tmdb:performance branch.

The speed is super impressive but I've had an explosion (concurrent map read and write error).

Can you review my method for testing this stuff?

I am thinking like:

make a gitlab instance
automate a replace of tm-db
run syncs of gaia and osmosis in parallel, check the impact on sync time
when we've got synced nodes, do an automated test of queries with vegeta

My chief concern is sync times. They are too long and this slows down all other work in and on cosmos.

But if we don't pay careful attention to this stuff, we can miss things-- for example there's a state breaking change somewhere in the 42 series. (you'll note I didn't mention where-- as I don't know :P )

But how do I know it's there?

I'm sure it's there because I applied tendermint 34.19 and sdk v42.11 to gaia and could not do a genesis sync.

faddat · 2022-04-25T18:47:48Z

cosmos/gaia#1415

faddat · 2022-04-25T19:32:10Z

PS:

https://github.com/cosmos/gaia/runs/6163960772?check_suite_focus=true#step:8:31

codecov generally broken

creachadair · 2022-04-26T06:25:26Z

Can you review my method for testing this stuff?

I am thinking like:

make a gitlab instance

automate a replace of tm-db

run syncs of gaia and osmosis in parallel, check the impact on sync time

when we've got synced nodes, do an automated test of queries with vegeta

I'm not convinced we need anything that elaborate. Even just a regular Go test that creates a database instance and does some amount of parallel insertions, lookups, and deletions would give the race detector something to push against. I don't think we need an elaborate multi-process test harness to verify the basic concurrency properties the library is intended to guarantee.

Of course, integration tests are nice too, but as far as tm-db itself, integration is a concern for the clients of the library.

Very roughly, I was thinking along these lines:

// Run generates concurrent reads and writes to db so the race detector can
// verify concurrent operations are properly synchronized.
// The contents of db are garbage after Run returns.
func Run(t *testing.T, db tmdb.DB) {
	t.Helper()

	const numWorkers = 10
	const numKeys = 32

	var wg sync.WaitGroup
	for i := 0; i < numWorkers; i++ {
		wg.Add(1)
		i := i
		go func() {
			defer wg.Done()

			// Insert a bunch of keys with random data.
			for k := 1; k <= numKeys; k++ {
				key := taskKey(i, k) // say, "task-<i>-key-<k>"
				value := someRandomValue()
				if err := db.Set(key, value); err != nil {
					t.Errorf("Task %d: db.Set(%q=%q) failed: %v", 
						i, string(key), string(value), err)
				}
			}

			// Iterate over the database to make sure our keys are there.
			it, err := db.Iterator(nil, nil)
			if err != nil {
				t.Errorf("Iterator[%d]: %v", i, err)
				return
			}
			found := make(map[string][]byte)
			mine := []byte(fmt.Sprintf("task-%d-", i))
			for it.Valid() {
				it.Next()
				if key := it.Key(); bytes.HasPrefix(key, mine) {
					found[string(key)] = it.Value()
				}
			}
			if err := it.Error(); err != nil {
				t.Errorf("Iterator[%d] reported error: %v" i, err)
			}
			if err := it.Close(); err != nil {
				t.Errorf("Close iterator[%d]: %v", i, err)
			}
			if len(mine) != numKeys {
				t.Errorf("Task %d: found %d keys, wanted %d", i, len(mine), numKeys)
			}

			// Delete all the keys we inserted.
			for key := range mine {
				if err := db.Delete([]byte(key)); err != nil {
					t.Errorf("Delete %q: %v", key, err)
				}
			}
		}()
	}
	wg.Wait()
}

N.B. this is not tested, but should be the right general flavour.

faddat · 2022-04-27T14:07:34Z

@creachadair thank you! it might even help me with the PebbleDB implementation

prefixdb_test.go

faddat · 2022-05-19T04:45:06Z

I've added .gitpod.yml to this

@creachadair if you wouldn't mind trying the gitpod browser extension, I think you'd instantly understand why it is a nice-to-have.

https://www.gitpod.io/docs/browser-extension

Just saved me a whole bunch of time. I was able to work on the linter issues in the same environment as CI, and didn't need to think about my development environment.

…tr.isInvalid were considered to be ineffective assignments by the linter.

faddat · 2022-05-19T05:08:33Z

That was dramatically easier than making a dev env, and now it's all set :D!

faddat · 2022-05-20T08:18:09Z

I think this one is set now.

creachadair

This looks pretty close, there are just a few details to sort out.

cleveldb_test.go

prefixdb_test.go

.gitpod.yml

prefixdb_test.go

Co-authored-by: M. J. Fromberger <[email protected]>

faddat · 2022-06-04T09:14:33Z

@creachadair I hope to have the tests moved today, wish me luck...

faddat · 2022-07-15T14:27:30Z

is this wanted?

tac0turtle · 2022-07-15T17:21:43Z

id vote for yes

tac0turtle · 2022-07-29T07:48:41Z

generally the scope should be fixed on a single item in prs. Expanding scope makes it take longer to get things merged.

merging

fix: remove mutex in prefixdb

5c9bfe8

faddat requested review from alexanderbez, cmwaters, creachadair, tac0turtle, tychoish and williambanfield as code owners April 25, 2022 16:55

faddat added 3 commits April 25, 2022 16:57

fmt

a0ff309

fmt

636377b

Delete .gitpod.yml

0155808

creachadair reviewed Apr 25, 2022

View reviewed changes

prefixdb.go Outdated Show resolved Hide resolved

faddat added 3 commits April 25, 2022 18:22

remove mutex entirely instead of commenting out

973a74a

fmt

34ca0e7

remove .gitpod.yml

d81b67b

update changelog

4c9b066

tac0turtle approved these changes Apr 25, 2022

View reviewed changes

faddat mentioned this pull request Apr 25, 2022

support rocksdb v7 osmosis-labs/osmosis#1285

Closed

4 tasks

Merge branch 'master' into remove-mutex

0566540

faddat and others added 3 commits May 6, 2022 12:43

prefixdb test

f531a3f

all test passed in prefixdb_test.go

cc11706

Merge branch 'master' into remove-mutex

417a583

creachadair reviewed May 6, 2022

View reviewed changes

prefixdb_test.go Show resolved Hide resolved

simple taskKey in prefixdb_test.go

3214560

devli13 mentioned this pull request May 6, 2022

bug: Invalid Nonce / Transactions Dropped from Mempool evmos/evmos#562

Closed

4 tasks

faddat added 2 commits May 19, 2022 05:00

fix linting

d57bb1f

itr.isInvalid would be unconditionally set anyhow, which is why the i…

8e4e401

…tr.isInvalid were considered to be ineffective assignments by the linter.

Merge branch 'master' into remove-mutex

602ef89

faddat requested a review from tac0turtle May 20, 2022 08:17

creachadair previously requested changes May 20, 2022

View reviewed changes

cleveldb_test.go Outdated Show resolved Hide resolved

prefixdb_test.go Outdated Show resolved Hide resolved

prefixdb_test.go Outdated Show resolved Hide resolved

.gitpod.yml Show resolved Hide resolved

prefixdb_test.go Show resolved Hide resolved

faddat and others added 4 commits May 23, 2022 12:56

Update cleveldb_test.go

d4bb034

Co-authored-by: M. J. Fromberger <[email protected]>

Update prefixdb_test.go

2640477

Co-authored-by: M. J. Fromberger <[email protected]>

Update prefixdb_test.go

d75f0c0

Co-authored-by: M. J. Fromberger <[email protected]>

Merge branch 'master' into remove-mutex

4355c2a

faddat added 8 commits June 4, 2022 09:22

Firing this into CI, let's see.

f330226

add filepath lib to tests

230b7cc

fix filepath in goleveldb

cdc2374

gofumpt formatting

e74373d

Merge branch 'master' into remove-mutex

1ab7644

Merge branch 'master' into remove-mutex

ea504e8

Merge branch 'master' into remove-mutex

7d4b2c7

Merge branch 'master' into remove-mutex

f39f7bb

This was referenced Jul 27, 2022

PebbleDB #281

Closed

Pebble #284

Closed

tac0turtle approved these changes Jul 29, 2022

View reviewed changes

tac0turtle merged commit 00fb04a into tendermint:master Jul 29, 2022

ghost mentioned this pull request Aug 1, 2022

Revert "fix: remove mutex in prefixdb" #286

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: remove mutex in prefixdb #239

fix: remove mutex in prefixdb #239

faddat commented Apr 25, 2022 •

edited

Loading

codecov bot commented Apr 25, 2022 •

edited

Loading

creachadair left a comment

faddat commented Apr 25, 2022 •

edited

Loading

faddat commented Apr 25, 2022

faddat commented Apr 25, 2022

creachadair commented Apr 26, 2022 •

edited

Loading

faddat commented Apr 27, 2022

faddat commented May 19, 2022 •

edited

Loading

faddat commented May 19, 2022

faddat commented May 20, 2022

creachadair left a comment

faddat commented Jun 4, 2022

faddat commented Jul 15, 2022

tac0turtle commented Jul 15, 2022

tac0turtle commented Jul 29, 2022

fix: remove mutex in prefixdb #239

fix: remove mutex in prefixdb #239

Conversation

faddat commented Apr 25, 2022 • edited Loading

codecov bot commented Apr 25, 2022 • edited Loading

Codecov Report

creachadair left a comment

Choose a reason for hiding this comment

faddat commented Apr 25, 2022 • edited Loading

faddat commented Apr 25, 2022

faddat commented Apr 25, 2022

creachadair commented Apr 26, 2022 • edited Loading

faddat commented Apr 27, 2022

faddat commented May 19, 2022 • edited Loading

faddat commented May 19, 2022

faddat commented May 20, 2022

creachadair left a comment

Choose a reason for hiding this comment

faddat commented Jun 4, 2022

faddat commented Jul 15, 2022

tac0turtle commented Jul 15, 2022

tac0turtle commented Jul 29, 2022

faddat commented Apr 25, 2022 •

edited

Loading

codecov bot commented Apr 25, 2022 •

edited

Loading

faddat commented Apr 25, 2022 •

edited

Loading

creachadair commented Apr 26, 2022 •

edited

Loading

faddat commented May 19, 2022 •

edited

Loading