tidesdb-kafka

TidesDB + Kafka Streams

A drop-in replacement for RocksDB state stores in Kafka Streams applications, powered by TidesDB.

Features

Swap RocksDB for TidesDB with a one-line change
Every TidesDB database and column family option exposed through TidesDBStoreConfig
Default TTL per store or per-key TTL via putWithTtl()
Optional B+tree SSTables for faster point lookups
Change data capture callbacks on every transaction commit
Online backup and near-instant hard-link checkpoints
Column family stats, database stats, and block cache stats
Synchronous compaction and explicit durability control
Change write buffer, bloom FPR, sync mode without restart
Pooled transaction with reset() to minimize allocation on the hot path

Installation

./install.sh

This will:

Build and install TidesDB native library
Build and install TidesDB Java bindings
Install the Kafka Streams plugin

Quick Start

import com.tidesdb.kafka.store.TidesDBStoreSupplier;

// Default config
KTable<String, Long> counts = input
    .groupByKey()
    .count(Materialized.as(new TidesDBStoreSupplier("my-counts")));

// Custom config
TidesDBStoreConfig config = TidesDBStoreConfig.builder()
    .compressionAlgorithm(CompressionAlgorithm.ZSTD_COMPRESSION)
    .syncMode(SyncMode.SYNC_NONE)
    .enableBloomFilter(true)
    .blockCacheSize(128 * 1024 * 1024)
    .defaultTtlSeconds(3600)
    .build();

KTable<String, Long> counts = input
    .groupByKey()
    .count(Materialized.as(new TidesDBStoreSupplier("my-counts", config)));

See kafka.md for the full reference documentation.

Running Tests and Benchmarks

./run.sh [options]

Options:
  -t, --tests            Run unit tests
  -b, --benchmarks       Run benchmarks
  -c, --charts           Generate charts from benchmark data
  -a, --all              Run everything
  -d, --data-dir <path>  Set data directory for benchmark databases
  -h, --help             Show this help message

Examples

# Run unit tests
./run.sh -t

# Run benchmarks with default temp directory
./run.sh -b

# Run benchmarks on a fast SSD
./run.sh -b -d /mnt/nvme/bench

# Run benchmarks and generate charts
./run.sh -b -c

# Run everything
./run.sh -a

Dynamic Benchmark Configuration

All benchmark parameters are configurable via -D system properties:

mvn test -Dtest=StateStoreBenchmark \
    -DargLine="-Djava.library.path=/usr/local/lib \
               -Dbenchmark.sizes=1000,10000,100000 \
               -Dbenchmark.value.size=256 \
               -Dbenchmark.mixed.ratio=80 \
               -Dbenchmark.threads=1,4,8 \
               -Dbenchmark.percentiles=true"

Key parameters:

Property	Default	Description
`benchmark.sizes`	1000,5000,10000,50000,100000	Operation counts for standard benchmarks
`benchmark.large.sizes`	100000,...,25000000	Sizes for large dataset benchmarks
`benchmark.threads`	1,2,4,8,16	Thread counts for concurrent benchmarks
`benchmark.value.size`	64	Value size in bytes
`benchmark.mixed.ratio`	50	Read percentage for mixed workload (0-100)
`benchmark.warmup`	3	Warmup iterations
`benchmark.iterations`	5	Measurement iterations
`benchmark.percentiles`	true	Track p50/p90/p95/p99/p99.9/max latencies
`benchmark.seed`	42	Random seed for reproducibility

Fair Comparison

Both engines are configured with equivalent settings:

Compression: LZ4 on both
Bloom filters: ~1% FPR on both (TidesDB 0.01 FPR, RocksDB 10 bits/key)
Block cache: 64 MB on both
Write buffer: 64 MB on both
Background threads: 4 on both (2 flush + 2 compaction)
Sync: disabled on both (TidesDB SYNC_NONE, RocksDB sync=false)

License

Apache Kafka is a trademark of the Apache Software Foundation.

This project/product is not affiliated with, endorsed by, or sponsored by the Apache Software Foundation.

Licensed under Mozilla Public License 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.github/workflows		.github/workflows
benchmarks		benchmarks
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
install.sh		install.sh
pom.xml		pom.xml
run.sh		run.sh
utils.sh		utils.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tidesdb-kafka

Features

Installation

Quick Start

Running Tests and Benchmarks

Examples

Dynamic Benchmark Configuration

Fair Comparison

License

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

tidesdb-kafka

Features

Installation

Quick Start

Running Tests and Benchmarks

Examples

Dynamic Benchmark Configuration

Fair Comparison

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages