A high-performance LSM (Log-Structured Merge) tree key-value store implemented in Rust, designed for high write throughput and concurrent access.
- High Write Concurrency: Optimized for concurrent write operations using async/await and efficient locking strategies
- LSM Tree Architecture: Multi-level storage with automatic compaction
- LRU Cache: Thread-safe LRU cache for SSTable entries to dramatically reduce I/O
- Bloom Filters: Fast negative lookups to eliminate unnecessary disk reads
- Write-Ahead Logging (WAL): Ensures durability and crash recovery
- Concurrent Memtables: Active and immutable memtables for non-blocking writes
- Background Compaction: Automatic SSTable merging and cleanup
- Checksums: Data integrity verification for all persistent storage
- Async/Await Support: Fully asynchronous API using Tokio
Architecture overview:
```
┌─────────────┐          ┌─────────────┐
│   Writes    │          │    Reads    │
└─────┬───────┘          └─────┬───────┘
      │                        │
      ▼                        ▼
┌─────────────┐          ┌─────────────┐
│     WAL     │◄─────────┤   Active    │───► Check first
│ (Durability)│          │  MemTable   │
└─────────────┘          └─────┬───────┘
                               │ (when full)
                               ▼
                         ┌─────────────┐
                         │  Immutable  │───► Then check
                         │  MemTables  │
                         └─────┬───────┘
                               │ (background flush)
                               ▼
                         ┌─────────────┐     ┌─────────────┐
                         │   Level 0   │◄────┤  LRU Cache  │
                         │  SSTables   │     │ (Hot Data)  │
                         │  + Bloom    │     └─────────────┘
                         │  Filters    │            ▲
                         └─────┬───────┘            │
                               │ (compaction)       │
                               ▼                    │
                         ┌─────────────┐            │
                         │  Level 1+   │────────────┘
                         │  SSTables   │   1. Bloom filter check
                         │  + Bloom    │   2. Cache check
                         │  Filters    │   3. Disk read (if needed)
                         └─────────────┘
```
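The numbered read path in the diagram is the key to lookup performance: the Bloom filter is consulted before the cache, and the cache before the disk. A minimal self-contained sketch of that ordering (every type here is an illustrative stand-in, not the crate's internals):

```rust
use std::collections::HashMap;

// Stand-in types for the sketch; not the crate's real structures.
struct BloomFilter;
impl BloomFilter {
    // A real filter answers "maybe present" / "definitely absent".
    fn may_contain(&self, _key: &[u8]) -> bool { true }
}

struct SSTable { bloom: BloomFilter }
impl SSTable {
    // Stand-in for the on-disk index + data-file lookup.
    fn read_from_disk(&self, _key: &[u8]) -> Option<Vec<u8>> { None }
}

fn lookup(table: &SSTable, cache: &mut HashMap<Vec<u8>, Vec<u8>>, key: &[u8]) -> Option<Vec<u8>> {
    // 1. Bloom filter: definitely-absent keys never touch the cache or disk.
    if !table.bloom.may_contain(key) {
        return None;
    }
    // 2. Cache: serve hot entries from memory (a HashMap stands in for the LRU).
    if let Some(v) = cache.get(key) {
        return Some(v.clone());
    }
    // 3. Disk: read from the SSTable, then populate the cache for next time.
    let v = table.read_from_disk(key)?;
    cache.insert(key.to_vec(), v.clone());
    Some(v)
}
```

Basic usage of the actual API: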
```rust
use slice::LSMTree;
#[tokio::main]
async fn main() -> slice::Result<()> {
// Initialize the LSM tree with default cache (1000 entries)
let lsm = LSMTree::new("./data").await?;
// Or initialize with custom cache size
let lsm = LSMTree::new_with_cache_size("./data", 5000).await?;
// Put key-value pairs
lsm.put(b"user:1", b"Alice").await?;
lsm.put(b"user:2", b"Bob").await?;
// Get values
if let Some(value) = lsm.get(b"user:1").await? {
println!("user:1 = {}", String::from_utf8_lossy(&value));
}
// Delete keys
lsm.delete(b"user:2").await?;
// Manual flush and compaction
lsm.flush().await?;
lsm.compact().await?;
// Check cache statistics
let stats = lsm.stats();
stats.print(); // Shows cache hit ratio and size
// Clear cache if needed
lsm.clear_cache();
Ok(())
}
```

Configure multi-threaded compaction for better performance:
```rust
use slice::{LSMTree, compaction::CompactionConfig};
#[tokio::main]
async fn main() -> slice::Result<()> {
// Configure multi-threaded compaction
let compaction_config = CompactionConfig {
max_concurrent_compactions: 4, // Allow up to 4 concurrent compactions
merge_parallelism: 8, // Use 8 parallel merge tasks
max_sstables_per_job: 5, // Limit job size for better parallelism
enable_parallel_merge: true, // Enable parallel merging
};
// Create LSM tree with custom compaction configuration
let lsm = LSMTree::new_with_config(
"./data",
1000, // cache size
compaction_config
).await?;
// Check compaction statistics
let stats = lsm.stats();
println!("Compaction stats:");
println!(" Total compactions: {}", stats.compaction_stats.total_compactions);
println!(" Active compactions: {}", stats.compaction_stats.concurrent_compactions);
println!(" Parallel merges: {}", stats.compaction_stats.parallel_merges);
println!(" Bytes compacted: {} MB", stats.compaction_stats.bytes_compacted / 1024 / 1024);
Ok(())
}
```

Concurrent writes from multiple tasks:

```rust
use slice::LSMTree;
use std::sync::Arc;
#[tokio::main]
async fn main() -> slice::Result<()> {
let lsm = Arc::new(LSMTree::new("./data").await?);
// Spawn multiple concurrent writers
let mut handles = Vec::new();
for writer_id in 0..10 {
let lsm_clone = lsm.clone();
let handle = tokio::spawn(async move {
for i in 0..1000 {
let key = format!("writer{}:key{}", writer_id, i);
let value = format!("value_{}_{}", writer_id, i);
lsm_clone.put(key.as_bytes(), value.as_bytes()).await.unwrap();
}
});
handles.push(handle);
}
// Wait for all writers to complete
for handle in handles {
handle.await.unwrap();
}
println!("All concurrent writes completed!");
Ok(())
}
```

The LSM tree is optimized for:
- High Write Throughput: Writes go to memory first (the memtable), then are batched to disk
- Fast Reads: LRU cache provides ~20x speedup for frequently accessed data
- Ultra-Fast Negative Lookups: Bloom filters eliminate disk I/O for non-existent keys (~2µs per lookup; see the sketch after this list)
- Write Amplification: Minimized through efficient compaction strategies
- Concurrent Access: Multiple readers and writers can operate simultaneously
- Memory Efficiency: Configurable memtable sizes and cache capacity
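You can observe the Bloom-filter effect directly by timing lookups for keys that were never written; the ~2µs figure will vary with hardware. A minimal sketch using only the public API shown above:

```rust
use slice::LSMTree;
use std::time::Instant;

#[tokio::main]
async fn main() -> slice::Result<()> {
    let lsm = LSMTree::new("./data").await?;
    lsm.put(b"present", b"value").await?;
    lsm.flush().await?; // ensure at least one SSTable (and its Bloom filter) is on disk

    // Time lookups for keys that were never inserted; the Bloom filter
    // should answer most of them without any disk I/O.
    let n: u32 = 10_000;
    let start = Instant::now();
    for i in 0..n {
        let key = format!("absent:{}", i);
        assert!(lsm.get(key.as_bytes()).await?.is_none());
    }
    println!("avg negative lookup: {:?}", start.elapsed() / n);
    Ok(())
}
```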
We've implemented comprehensive benchmarks that mirror industry standards:
```bash
# Run standardized YCSB and db_bench workloads
cargo bench --bench standard_benchmarks
# Run specific benchmark groups
cargo bench --bench standard_benchmarks -- "YCSB" # Yahoo! Cloud Serving Benchmark
cargo bench --bench standard_benchmarks -- "DBBench" # RocksDB/LevelDB standard benchmarks
cargo bench --bench standard_benchmarks -- "Industry" # Industry comparison workloads
```

Performance Results vs Industry Standards:
- Read Performance: 2.8-3.9M ops/sec (competitive with RocksDB)
- Write Performance: 228-329K ops/sec (good LSM characteristics)
- Mixed Workloads: 405K-1.8M ops/sec (scales with read proportion)
- YCSB Compliance: Full implementation of standard workloads A, B, C
See BENCHMARK_RESULTS.md for detailed analysis.
Run the included benchmarks to test performance on your system:
```bash
cargo bench
```

Example benchmark categories:
- Sequential writes
- Concurrent writes (1, 2, 4, 8, 16 writers)
- Read performance
- Mixed read/write workloads
- Cache performance (cached vs uncached reads)
- LRU eviction behavior
- Bloom filter effectiveness (negative lookup performance)
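To add a custom workload alongside these, a Criterion benchmark can drive the store directly. A sketch, assuming criterion is built with its async_tokio feature (the path and benchmark name are illustrative):

```rust
use criterion::{criterion_group, criterion_main, Criterion};
use slice::LSMTree;

fn bench_put(c: &mut Criterion) {
    let rt = tokio::runtime::Runtime::new().unwrap();
    let lsm = rt.block_on(LSMTree::new("./bench_data")).unwrap();

    // Benchmark a single-key overwrite; real workloads should vary the keys.
    c.bench_function("put_small_value", |b| {
        b.to_async(&rt).iter(|| async {
            lsm.put(b"bench:key", b"bench:value").await.unwrap();
        });
    });
}

criterion_group!(benches, bench_put);
criterion_main!(benches);
```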
Key configuration parameters:
- Cache Size: Default 1000 entries (configurable via new_with_cache_size)
- Memtable Size: Default 64MB (configurable)
- Max Levels: Default 7 levels
- Compaction Frequency: Every 10 seconds
- Flush Frequency: Every 1 second
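The flush and compaction frequencies are driven by background Tokio tasks. The crate wires these up internally; a sketch of the pattern, using the 1-second flush default from the list above:

```rust
use std::sync::Arc;
use std::time::Duration;

// Illustrative background flush loop; errors are ignored for brevity.
fn spawn_flush_task(lsm: Arc<slice::LSMTree>) {
    tokio::spawn(async move {
        let mut tick = tokio::time::interval(Duration::from_secs(1));
        loop {
            tick.tick().await;
            let _ = lsm.flush().await; // real code should log failures
        }
    });
}
```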
Configure compaction parallelism for optimal performance:
```rust
use slice::compaction::CompactionConfig;
let config = CompactionConfig {
max_concurrent_compactions: 4, // Number of parallel compaction workers (default: 2)
merge_parallelism: 8, // Parallel merge tasks within a compaction (default: 4)
max_sstables_per_job: 10, // Max SSTables per compaction job (default: 10)
enable_parallel_merge: true, // Enable parallel merging (default: true)
};
```

Configuration Guidelines:
- max_concurrent_compactions: Set to the number of CPU cores for I/O-bound workloads (see the sketch after this list)
- merge_parallelism: Higher values improve large compaction performance but use more memory
- max_sstables_per_job: Smaller values improve parallelism, larger values reduce overhead
- enable_parallel_merge: Disable for very small datasets or memory-constrained systems
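For example, to follow the first guideline on a machine whose core count isn't known ahead of time (only CompactionConfig and its fields come from this crate; the values are examples):

```rust
use slice::compaction::CompactionConfig;

// Derive compaction parallelism from the machine's core count,
// following the guidelines above.
let cores = std::thread::available_parallelism()
    .map(|n| n.get())
    .unwrap_or(2);
let config = CompactionConfig {
    max_concurrent_compactions: cores,
    merge_parallelism: cores * 2, // more merge tasks than compaction workers
    max_sstables_per_job: 5,      // smaller jobs favor parallelism
    enable_parallel_merge: true,
};
```

On disk, the data directory is laid out as follows: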
```
data/
├── wal_000000.log # Write-ahead log files (sequential)
├── wal_000001.log # Multiple WAL files for safety
├── sstable_000001.db # SSTable data files
├── sstable_000001.idx # SSTable index files
├── sstable_000001.bloom # Bloom filter files
├── sstable_000002.db
├── sstable_000002.idx
├── sstable_000002.bloom
└── ...
```
```bash
cargo build --release

# Run all tests
cargo test
# Run unit and integration tests
cargo test --lib
# Run standardized correctness tests
cargo test --test correctness_tests
```

The test suite includes critical durability and crash recovery tests:
- Basic Operations: Put, get, delete, and updates
- WAL Recovery (Unflushed Memtable): Simulates a crash before the background flush completes (see the sketch after this list)
- WAL Recovery with Deletions: Ensures tombstones are properly recovered
- Concurrent Writes During Recovery: Tests system stability during recovery
- Crash Safety: Verifies no data loss in various failure scenarios
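These recovery tests follow a pattern you can reuse when extending the suite: write, drop the tree without flushing (standing in for a crash), reopen the same directory, and assert the data survived WAL replay. A condensed sketch using only the public API (the path is illustrative):

```rust
use slice::LSMTree;

#[tokio::test]
async fn wal_recovery_unflushed() -> slice::Result<()> {
    let dir = "./recovery_test_data"; // illustrative path
    {
        let lsm = LSMTree::new(dir).await?;
        lsm.put(b"k", b"v").await?;
        // Dropped here without an explicit flush: the value exists only
        // in the memtable and the WAL, as it would after a crash.
    }
    // Reopening replays the WAL files found in the directory.
    let lsm = LSMTree::new(dir).await?;
    assert_eq!(lsm.get(b"k").await?.as_deref(), Some(&b"v"[..]));
    Ok(())
}
```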
Industry-standard correctness tests verify system reliability:
- Linearizability Testing: Jepsen-style history checking for concurrent operations
- ACID Compliance: Atomicity, Consistency, Isolation, and Durability verification
- Consistency Models: Sequential consistency, Read-Your-Writes, Monotonic reads
- Fault Injection: Concurrent operations with random delays and crash recovery
- Crash Recovery: Data durability and system resilience testing
See CORRECTNESS_TESTS.md for detailed documentation.
```bash
cargo run

# Run the multi-threaded compaction example
cargo run --example multi_threaded_compaction

cargo bench
```

Licensed under either of
- Apache License, Version 2.0 (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0)
- MIT license (LICENSE-MIT or http://opensource.org/licenses/MIT)
at your option.
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.
This project was entirely "vibe coded" using AI assistance! 🤖✨
Development Stack:
- IDE: Cursor - AI-powered code editor
- AI Assistant: Claude Sonnet 4 (Anthropic)
- Approach: Conversational development with AI pair programming
What "Vibe Coded" Means: This LSM tree implementation was built through natural language conversations with AI, starting from high-level concepts and iteratively refining the implementation. The AI assistant helped with:
- Architecture design and data structure choices
- Implementation of complex algorithms (LSM compaction, Bloom filters, LRU cache)
- Multi-threaded compaction system design
- Comprehensive testing strategies (unit tests, correctness tests, benchmarks)
- Performance optimization and code quality improvements
- Documentation and examples
Why This Approach Works:
- Rapid Prototyping: From concept to working LSM tree in hours, not days
- Best Practices: AI knowledge of Rust idioms and database internals
- Comprehensive Testing: AI suggested industry-standard test methodologies
- Documentation: AI helped create thorough documentation and examples
- Code Quality: Automated suggestions for clippy fixes and optimizations
The result is a production-quality LSM tree implementation with features that would typically take much longer to develop manually, including standardized benchmarks, correctness tests, and multi-threaded compaction.
This demonstrates the power of AI-assisted development for complex systems programming projects.
Core components:
- MemTable: In-memory sorted structure (BTreeMap) with concurrent access
- WAL: Write-ahead log for durability with checksums
- SSTable: Persistent sorted storage with bloom filters for fast negative lookups
- Bloom Filter: Probabilistic data structure to eliminate unnecessary disk reads
- LRU Cache: In-memory cache for frequently accessed SSTable entries
- Storage Manager: Manages multiple SSTable levels with automatic compaction
- Compaction Manager: Background compaction and cleanup
Concurrency model:
- Readers: Lock-free reads from memtables, shared locks for SSTables
- Writers: Minimal locking with async coordination
- Background Tasks: Separate tasks for flushing and compaction
- WAL: Synchronized writes for durability
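A sketch of the active/immutable memtable handoff this model implies (names and locking granularity are assumptions; parking_lot is a real dependency):

```rust
use parking_lot::RwLock;
use std::collections::BTreeMap;
use std::sync::Arc;

type MemTable = BTreeMap<Vec<u8>, Vec<u8>>;

// Writers take a short write lock; when the active table fills, it is
// moved (not copied) to the immutable list for the background flusher.
struct Tables {
    active: MemTable,
    immutable: Vec<Arc<MemTable>>,
}

fn put(tables: &RwLock<Tables>, key: Vec<u8>, value: Vec<u8>, max_entries: usize) {
    let mut t = tables.write();
    t.active.insert(key, value);
    if t.active.len() >= max_entries {
        // Swap in a fresh active table; readers keep checking both.
        let full = std::mem::take(&mut t.active);
        t.immutable.push(Arc::new(full));
    }
}
```

The WAL provides the durability guarantees around this handoff: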
- All writes are logged to WAL before acknowledgment
- WAL entries have checksums for corruption detection
- Multiple WAL files: New WAL created when memtable becomes immutable
- Safe WAL cleanup: WAL files only removed AFTER successful disk flush and sync
- Crash safety: No data loss even if crash occurs during memtable flushing
- Automatic recovery from multiple WAL files on startup
- SSTable files include checksums for integrity
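For concreteness, here is a sketch of a length-prefixed, checksummed record of the kind a WAL stores. The exact on-disk layout is an assumption; crc32fast is the crate's actual checksum dependency:

```rust
use crc32fast::Hasher;

// Illustrative record layout: [len: u32][crc32: u32][payload].
fn encode_record(payload: &[u8]) -> Vec<u8> {
    let mut h = Hasher::new();
    h.update(payload);
    let crc = h.finalize();

    let mut buf = Vec::with_capacity(8 + payload.len());
    buf.extend_from_slice(&(payload.len() as u32).to_le_bytes());
    buf.extend_from_slice(&crc.to_le_bytes());
    buf.extend_from_slice(payload);
    buf
}

// On replay, a mismatched CRC marks the corruption point and replay stops.
fn verify_record(buf: &[u8]) -> Option<&[u8]> {
    let len = u32::from_le_bytes(buf.get(0..4)?.try_into().ok()?) as usize;
    let crc = u32::from_le_bytes(buf.get(4..8)?.try_into().ok()?);
    let payload = buf.get(8..8 + len)?;
    let mut h = Hasher::new();
    h.update(payload);
    (h.finalize() == crc).then_some(payload)
}
```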
Dependencies:
- tokio: Async runtime
- parking_lot: High-performance synchronization primitives
- serde + bincode: Serialization
- crc32fast: Fast checksums
- crossbeam: Concurrent data structures
- criterion: Benchmarking
This project is dual-licensed under MIT and Apache-2.0, as described above - see the LICENSE-MIT and LICENSE-APACHE files for details.
Contributions are welcome! Please feel free to submit a Pull Request.
- Bloom filters for faster negative lookups ✅
- Multi-threaded compaction ✅
- Block-based SSTable format
- Compression support
- Range queries and iterators
- Snapshot isolation
- Metrics and monitoring
- Configurable compaction strategies