Write on memory TSDB database (gocon tokyo autumn 2018)

Write a TSDB database like playing Lego
@linxgnu
@huydx
gocon.autumn 2018

Observability team
Building and maintaining large-scale
- Metrics system
- Log system
- Alert system
- Distributed tracing system

Today's talk is the story of how we wrote our own database for our metrics system

A time series is a sequence of float values along a time axis

(t1, x1), (t2, x2), …, (tn, xn), where t is a timestamp and x is the value at that moment

What we need
- A storage for time series data which is
  - Extremely fast to write (hundreds of millions of data points per minute)
  - Very fast to read (a few thousand queries per second)
  - Efficient in space usage (memory/disk)

What we can trade off
- Data consistency (data could be duplicated)
- Immutable data (once it is written, it cannot be changed)

We need a storage for data that looks like
message Sample {
  double value = 1;
  sint64 timestamp = 2;
}
// Serie is a collection of samples that share the same serie id
message Serie {
  uint64 id = 1;
  repeated Sample samples = 2;
  repeated Label labels = 3;
}
// Series is a collection of Serie
message Series {
  repeated Serie series = 1;
  uint32 total_samples = 2;
}

With interfaces like
 
func (c *Storage) Save(series *proto.Series) (failed []*proto.Series, err error)
func (c *Storage) Load(serieIDs []uint64, fromTimestampMillis, toTimestampMillis int64)
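
To make the shape of these two calls concrete, here is a minimal in-memory sketch. It is not our actual storage: `memStorage` and the plain `Sample` struct are hypothetical stand-ins for the generated protobuf types, `Save` takes one serie at a time for brevity, and the return type of `Load` is an assumption because the slide does not show it.

```go
package main

import (
	"fmt"
	"sync"
)

// Sample is a hypothetical plain-Go stand-in for the protobuf Sample message.
type Sample struct {
	Value     float64
	Timestamp int64 // milliseconds
}

// memStorage is a hypothetical in-memory store: serie ID -> samples in arrival order.
type memStorage struct {
	mu     sync.RWMutex
	series map[uint64][]Sample
}

func newMemStorage() *memStorage {
	return &memStorage{series: make(map[uint64][]Sample)}
}

// Save appends samples to a serie. Existing samples are never modified,
// which matches the "immutable data" trade-off.
func (s *memStorage) Save(id uint64, samples []Sample) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.series[id] = append(s.series[id], samples...)
}

// Load returns, for each requested serie ID, the samples whose timestamp lies
// in [fromMillis, toMillis]. The return type is an assumption.
func (s *memStorage) Load(ids []uint64, fromMillis, toMillis int64) map[uint64][]Sample {
	s.mu.RLock()
	defer s.mu.RUnlock()
	out := make(map[uint64][]Sample, len(ids))
	for _, id := range ids {
		for _, sm := range s.series[id] {
			if sm.Timestamp >= fromMillis && sm.Timestamp <= toMillis {
				out[id] = append(out[id], sm)
			}
		}
	}
	return out
}

func main() {
	st := newMemStorage()
	st.Save(1, []Sample{{1.5, 1000}, {2.5, 2000}, {3.5, 3000}})
	fmt.Println(st.Load([]uint64{1}, 1500, 3000)[1])
}
```

A linear scan per serie is only for illustration; a real store would keep samples in compressed chunks, as discussed later in the deck.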

We tried many options but
- None of the available solutions fit: performance problems (ClickHouse) or overpriced (InfluxDB)
- Some fit, but are poorly maintained (Facebook Beringei, netflix/atlas)
- Some look promising, but are poorly documented and very unstable (uber/m3)

We decided to
- Build our own in-memory TSDB
  - For fast reads and writes
  - At the cost of low retention (1 day instead of months or years)
- But …
  - We're not database experts

Solutions
- Reuse as much as possible of what people have already done well
- TANSTAAFL: "there ain't no such thing as a free lunch"

TSDB anatomy
// Series is a collection of Serie
message Series {
  repeated Serie series = 1;
  uint32 total_samples = 2;
}
- How to store Series efficiently
  - Especially in terms of space (because we're using RAM)

Prometheus/tsdb package
- Which provides us
  - An implementation that stores series as chunks
  - And compresses them very efficiently with a lossless delta-of-delta encoding algorithm (bstream.go) (the original idea is from Beringei)
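
To show why delta-of-delta helps, here is a minimal sketch of the integer transform on timestamps only. It is not the code from the Prometheus package: the real implementation also packs the results into a variable-width bit stream, which is omitted here. The function names `deltaOfDelta` and `restore` are made up for this example.

```go
package main

import "fmt"

// deltaOfDelta turns timestamps into: the first value, the first delta,
// then the difference between each delta and the previous delta.
func deltaOfDelta(ts []int64) []int64 {
	out := make([]int64, len(ts))
	var prev, prevDelta int64
	for i, t := range ts {
		switch i {
		case 0:
			out[i] = t
		case 1:
			prevDelta = t - prev
			out[i] = prevDelta
		default:
			delta := t - prev
			out[i] = delta - prevDelta
			prevDelta = delta
		}
		prev = t
	}
	return out
}

// restore reverses deltaOfDelta, so the transform is lossless.
func restore(enc []int64) []int64 {
	out := make([]int64, len(enc))
	var prev, prevDelta int64
	for i, v := range enc {
		switch i {
		case 0:
			out[i] = v
		case 1:
			prevDelta = v
			out[i] = prev + prevDelta
		default:
			prevDelta += v
			out[i] = prev + prevDelta
		}
		prev = out[i]
	}
	return out
}

func main() {
	// Scrapes roughly every 60 units, with a little jitter.
	ts := []int64{1000, 1060, 1120, 1181, 1240}
	enc := deltaOfDelta(ts)
	fmt.Println(enc)          // [1000 60 0 1 -2]
	fmt.Println(restore(enc)) // [1000 1060 1120 1181 1240]
}
```

With regular scrape intervals most encoded values are zero or close to it, so a bit-level encoder can spend only a bit or a few bits on each timestamp.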

We need better compression
- Saving a single byte per data point == saving dozens of GB of RAM
- Further compress (freeze) old data (not frequently read)
- Lossless compression
  - Brotli
  - Zstd
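
A rough back-of-the-envelope check of the "single byte" claim, using figures quoted elsewhere in this deck (about 1M samples written per second, 1 day of retention). Treating that rate as constant over the whole day is an assumption, and `bytesSavedPerDay` is a name made up for this sketch.

```go
package main

import "fmt"

// bytesSavedPerDay returns how many bytes are saved over one day of retention
// when every sample shrinks by bytesPerSample at the given write rate.
func bytesSavedPerDay(samplesPerSecond, bytesPerSample int64) int64 {
	const secondsPerDay = 24 * 60 * 60
	return samplesPerSecond * secondsPerDay * bytesPerSample
}

func main() {
	saved := bytesSavedPerDay(1_000_000, 1)
	fmt.Printf("%.1f GB\n", float64(saved)/1e9) // 86.4 GB
}
```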

valyala/gozstd package
- Datadog/zstd has some memory allocation problems
- Can do stream compression with a reader/writer interface

We need data replication
- We cannot just lose data on restart; replication solves that problem
- Replication in a distributed environment is hard

What do we need for data replication?
- Leader election

We know where to find the best-quality distributed systems packages
- github.com/hashicorp

hashicorp/raft package
- Golang implementation of the Raft consensus protocol (https://raft.github.io/)
- Provides us
  - Leader election
  - Log replication
- EVERY communication between nodes is stored as a replicated log (like event sourcing)
- You need to provide your own replicated log implementation
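
The event-sourcing idea behind a replicated log can be illustrated with a toy example. This is not the hashicorp/raft API and contains no consensus logic; it only shows that replicas applying the same ordered log end up with the same state. The `entry` and `node` types are hypothetical.

```go
package main

import "fmt"

// entry is one record in the replicated log (hypothetical shape).
type entry struct {
	Index   uint64
	SerieID uint64
	Value   float64
}

// node is a hypothetical replica whose state is derived only from applied entries.
type node struct {
	lastApplied uint64
	latest      map[uint64]float64 // serie ID -> most recent value
}

func newNode() *node {
	return &node{latest: make(map[uint64]float64)}
}

// apply replays entries in order and skips the ones already applied,
// so replaying the same log twice is harmless.
func (n *node) apply(entries []entry) {
	for _, e := range entries {
		if e.Index <= n.lastApplied {
			continue
		}
		n.latest[e.SerieID] = e.Value
		n.lastApplied = e.Index
	}
}

func main() {
	entries := []entry{{1, 42, 0.5}, {2, 42, 0.7}, {3, 7, 1.0}}
	leader, follower := newNode(), newNode()
	leader.apply(entries)
	follower.apply(entries[:2]) // the follower lags behind
	follower.apply(entries)     // and catches up by replaying the log
	fmt.Println(leader.latest[42] == follower.latest[42], follower.lastApplied)
}
```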

bsm/raft-badger package
- Implementation of the replicated log based on the Badger KV database (https://github.com/dgraph-io/badger)
- Badger is fast on SSDs

Topology management
- We need to store some cluster information like
  - Seed nodes
  - Shard information
  - …
- Candidates:
  - Etcd / Centraldogma

LINE/centraldogma
- Configuration store
- Stores data as arbitrary text (JSON/YAML…)
- Interesting features
  - Watch changes
  - Version controlled

LINE/centraldogma-go package
- Centraldogma Go client
- Full-featured (JSON parsing, watching changes, …)

Thanks to tons of awesome Golang OSS
- Our storage now serves, on average, 1M samples written per second without any problem
- And can store a few billion samples on a single machine

Building your own database is hard, but not impossible
- We feel that it's like playing Lego, where the building blocks are awesome Golang OSS packages
- Maybe that's the reason why many awesome databases are written in Golang
- https://github.com/gostor/awesome-go-storage

We still have tons of things to share, so stay tuned!
