Jepsen test for TiKV.
# generate clojure client from protobuf
make gen-proto-clojure-client
# build rust-client-server which will be referenced by `./rpc-server`
make build-rust-client-server
-
Install LXC using
scripts/install-lxc.h
. -
Set up tester using
scripts/setup-tester.sh
. -
Set up virtual network using
scripts/setup-network.sh
. -
Generate RSA key pair.
ssh-keygen -t rsa
-
Set up nodes via LXC using
scripts/setup-nodes.sh
,scripts/prepare-nodes.sh
. -
Build rust-client-server
make build-rust-client-server
- Run the test on nodes:
xvfb-run lein run test --ssh-private-key ~/.ssh/id_rsa --version v4.0.0 --workload register --concurrency 10 --leave-db-running --time-limit 30
When we run a Jepsen test, we need several test nodes to deploy our databases, and a control node which runs the tests written in Clojure. Under our circustance, we setup 5 CentOS 7 nodes via LXC and make sure it can be logged in via ssh from control node, which is the host.
The whole process of running a Jepsen test consist of four phases. First, we try to tear down the database if it's already running, and setup a brand-new database on test nodes. And then, we run specific workloads on it and record everything we observe from the client-side as a history. Third, we analyze the history with a specific model and a checker to see whether it shows any anomolies. Finally, we tear down the databases (This pharse will be skipped if --leave-db-running
option is present).
Each workload usually has more than one concurrent unit. In Jepsen, it is called a process. Each process executes operations in a blocking way. Each process create a client connected to a specific node for operations.
Normally, the client is implemented natively in Clojure. But in our case, we implement it in a "client server" way. While setting up each test node, we also start a "client server" connected to the node. The "client server" exposes a RPC service for the TiKV client implemented in Clojure.
┌───────────────────────────────────────────┐ ┌─────────────┐
│ │ │ │
│ ┌────────┐ │ │ ┌──────┐ │
│ ┌─────────────────┐ ┌───►│ client ├──┼───┼──►│ node │ │
│ │ Jepsen │ │ │ server │ │ │ └──────┘ │
│ │ │ │ └────────┘ │ │ │
│ │ ┌─────────┐ │ │ │ │ │
│ │ │ Process ├───┼───┘ ┌────────┐ │ │ ┌──────┐ │
│ │ └─────────┘ │ │ client ├──┼───┼──►│ node │ │
│ │ │ │ server │ │ │ └──────┘ │
│ │ ┌─────────┐ │ └────────┘ │ │ │
│ │ │ Process ├───┼───┐ │ │ │
│ │ └─────────┘ │ │ ┌────────┐ │ │ ┌──────┐ │
│ │ . │ └───►│ client ├──┼───┼──►│ node │ │
│ │ . │ │ server │ │ │ └──────┘ │
│ │ . │ └────────┘ │ │ │
│ │ . │ │ │ │
│ │ . │ ┌────────┐ │ │ ┌──────┐ │
│ │ . │ │ client ├──┼───┼──►│ node │ │
│ │ ┌─────────┐ │ │ server │ │ │ └──────┘ │
│ │ │ Process ├───┼───┐ └────────┘ │ │ │
│ │ └─────────┘ │ │ │ │ │
│ │ │ │ ┌────────┐ │ │ ┌──────┐ │
│ │ │ └───►│ client ├──┼───┼──►│ node │ │
│ └─────────────────┘ │ server │ │ │ └──────┘ │
│ └────────┘ │ │ │
│ │ │ │
│ Host │ │ LXC │
└───────────────────────────────────────────┘ └─────────────┘
- register concurrent atomic updates to multiple shared registers
- set concurrent unique appends to a single key
- list-append Checks for dependency cycles in append/read transactions
Copyright © 2021 Ziyi Yan
This program and the accompanying materials are made available under the terms of the Eclipse Public License 2.0 which is available at http://www.eclipse.org/legal/epl-2.0.
This Source Code may also be made available under the following Secondary Licenses when the conditions for such availability set forth in the Eclipse Public License, v. 2.0 are satisfied: GNU General Public License as published by the Free Software Foundation, either version 2 of the License, or (at your option) any later version, with the GNU Classpath Exception which is available at https://www.gnu.org/software/classpath/license.html.