chainrep - Chain Replication in Go

chainrep is a Go implementation of chain replication with a coordinator control plane, multi-replica storage nodes, gRPC transport, and both non-HA and HA coordinator modes.

Highlights:

gRPC transport between clients, coordinator, and storage nodes
epoch-gated coordinator HA failover
durable local Badger-backed storage backend
dynamic storage-node auto-registration and ready gating
durable non-HA coordinator dispatch retry and restart recovery
flapping-node eviction through coordinator liveness policy
conditional writes with per-object metadata
optional TLS/mTLS for gRPC transports
read-only HTTP admin/health endpoints plus Prometheus metrics

Documentation:

Ops Surfaces

Observability is documented in OBSERVABILITY.md.

At a high level, each coordinator or storage process can optionally expose a separate read-only HTTP admin listener with:

/livez
/readyz
/metrics
/admin/v1/state

The admin listener is unauthenticated in v1 and is intended for loopback or a trusted network only.

Performance Notes

The repo includes an end-to-end localhost gRPC benchmark in transport/grpcx/grpc_benchmark_test.go. It uses:

a coordinator gRPC server
a client router
storage-node gRPC servers
gRPC replication between nodes
Badger-backed storage on local temp directories

These numbers are end-to-end client latencies on localhost through a setup: router -> coordinator snapshot -> storage-node gRPC server(s) -> storage, with replication over gRPC where applicable.

Benchmark command:

go test ./transport/grpcx -run '^$' -bench BenchmarkClientLatencyGRPC_Localhost -benchmem -benchtime=3s -count=5 -cpu=1

Average results from 5 localhost benchmark runs on an Apple M3 Max, using the command above:

single_replica_get: 0.044 ms/op, 10,656 B/op, 186 allocs/op
single_replica_put: 0.100 ms/op, 13,976 B/op, 291 allocs/op
three_replica_get: 0.045 ms/op, 10,657 B/op, 186 allocs/op
three_replica_put: 0.325 ms/op, 58,215 B/op, 1,118 allocs/op

These are localhost benchmark numbers, not SLOs or cross-machine production guarantees. They include gRPC and durable local storage costs, but not network latency, TLS, or multi-host deployment effects. The read path likely reflects cache-warm access through Badger and the OS page cache rather than cold disk-read latency.

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
adminhttp		adminhttp
bin		bin
client		client
cmd/chainrep-quickstart		cmd/chainrep-quickstart
consistenthash		consistenthash
coordinator		coordinator
coordserver		coordserver
examples/quickstart		examples/quickstart
gologger		gologger
ops		ops
proto/chainrep/v1		proto/chainrep/v1
quickstart		quickstart
storage		storage
transport/grpcx		transport/grpcx
.gitignore		.gitignore
AGENTS.md		AGENTS.md
ARCHITECTURE.md		ARCHITECTURE.md
HA_STORE.md		HA_STORE.md
OBSERVABILITY.md		OBSERVABILITY.md
QUICKSTART.md		QUICKSTART.md
README.md		README.md
SECURITY.md		SECURITY.md
TODO.md		TODO.md
buf.gen.yaml		buf.gen.yaml
buf.yaml		buf.yaml
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

chainrep - Chain Replication in Go

Ops Surfaces

Performance Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

chainrep - Chain Replication in Go

Ops Surfaces

Performance Notes

About

Resources

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages