A research experiment testing whether language models can perform robust, stateful reasoning by acting as a filesystem.
This project trains an LLM to function as a complete filesystem implementation, with a mountable FUSE filesystem as the testbed. The core approach is "full state rewrite": on each operation (e.g., `mkdir /foo`), the model receives the entire filesystem state and must output the complete new state.
This tests a fundamental question: can LLMs maintain consistent state across many sequential operations without drift or hallucination?
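
In pseudocode, each filesystem operation becomes one full-state prediction. The sketch below only illustrates the idea; the function names and prompt layout are hypothetical, not the project's actual interface.

```python
# Illustration of "full state rewrite". Function names and prompt layout here
# are hypothetical, not the project's actual interface.
def apply_operation(model, state_xml: str, operation: str) -> str:
    """Send the entire current state plus one operation to the model and
    treat its output as the complete new state."""
    prompt = (
        f"Current filesystem state:\n{state_xml}\n"
        f"Operation: {operation}\n"
        "New filesystem state:\n"
    )
    return model.generate(prompt)

# Every FUSE callback (mkdir, write, unlink, ...) would funnel through a call
# like apply_operation(model, state, "mkdir /foo"), and the returned XML
# replaces the stored state.
```

The pipeline that produces and serves such a model has three stages:
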
- Training data generation: A reference FUSE filesystem logs real operations and their resulting states
- Supervised fine-tuning: The model learns to predict state transitions from (state, operation) pairs (see the sketch after this list)
- Inference: The trained model backs a real FUSE mount, handling all filesystem operations
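
As a hypothetical illustration of such a pair, one generated record could look like the following, using the XML state format described below; the actual schema emitted by `train.generate_data` may differ.

```python
# Hypothetical shape of one generated training example (e.g., one JSONL line).
# The real generator's field names and XML details may differ.
example = {
    "prompt": (
        '<filesystem><directory path="/" name="/" mode="755" '
        'owner="root" group="root"/></filesystem>\n'
        "OPERATION: mkdir /foo"
    ),
    "completion": (
        '<filesystem><directory path="/" name="/" mode="755" '
        'owner="root" group="root"><directory path="/foo" name="foo" '
        'mode="755" owner="root" group="root"/></directory></filesystem>'
    ),
}
```
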
State is represented as XML:

```xml
<filesystem>
  <directory path="/" name="/" mode="755" owner="root" group="root" mtime="...">
    <file path="hello.txt" name="hello.txt" mode="644" size="12">
      <body>hello world</body>
    </file>
  </directory>
</filesystem>
```

Repository layout:

```
llmfuse/    # FUSE filesystem backed by LLM inference
llmencode/  # Arithmetic coding compression using LLM probabilities
train/      # Data generation and fine-tuning scripts
eval/       # Evaluation pipelines
infra/      # Modal deployment for GPU inference
common/     # Shared model utilities
scripts/    # Helper scripts
```
Generate training data:

```bash
docker-compose run --rm datagen python3 -m train.generate_data \
    --num_examples 10000 \
    --output_dir /app/data/train
```

Fine-tune the model on Modal:

```bash
modal run train/sft_modal.py::train_qwen \
    --model-name "qwen3-4b" \
    --training-data "path/to/data.jsonl" \
    --num-epochs 8
```

Evaluate a checkpoint:

```bash
modal run eval/modal_eval.py::eval_on_dataset \
    --model-path "your-model-checkpoint" \
    --dataset-path "path/to/eval.jsonl"
```

Deploy the inference service:

```bash
modal run infra/modal_llmfuse.py::deploy --model-path your-model-checkpoint
```

Mount the filesystem (requires Linux/Docker for FUSE):

```bash
export LLMFUSE_REMOTE_ENDPOINT="https://your-modal-endpoint/generate"
bash scripts/run_llmfuse.sh
```

The llmencode package demonstrates prediction-compression equivalence using arithmetic coding guided by model probabilities, exploring the theoretical connection between language modeling and data compression.
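
Arithmetic coding makes that connection concrete: each symbol narrows an interval in proportion to the probability the model assigned it, so text the model predicts well can be named with very few bits. Below is a toy, exact-arithmetic sketch of the mechanism (hypothetical code, not llmencode's implementation), with a fixed stand-in distribution where llmencode would query the LLM for next-token probabilities.

```python
# Toy arithmetic coder driven by a probability model (hypothetical code,
# not llmencode's implementation). A real coder would replace model_probs
# with the LLM's conditional next-token distribution.
from fractions import Fraction

ALPHABET = ["h", "e", "l", "o", " ", "w", "r", "d", "<eot>"]

def model_probs(_context: str) -> dict[str, Fraction]:
    """Stand-in for the language model: a fixed uniform distribution."""
    p = Fraction(1, len(ALPHABET))
    return {s: p for s in ALPHABET}

def encode(text: str) -> Fraction:
    low, high = Fraction(0), Fraction(1)
    context = ""
    for sym in list(text) + ["<eot>"]:
        probs = model_probs(context)
        width = high - low
        cum = Fraction(0)
        for s in ALPHABET:  # deterministic symbol order shared with decode
            if s == sym:
                high = low + width * (cum + probs[s])
                low = low + width * cum
                break
            cum += probs[s]
        context += sym
    return low  # any number in [low, high) identifies the message

def decode(code: Fraction) -> str:
    low, high = Fraction(0), Fraction(1)
    context, out = "", []
    while True:
        probs = model_probs(context)
        width = high - low
        cum = Fraction(0)
        for s in ALPHABET:
            s_low = low + width * cum
            s_high = s_low + width * probs[s]
            if s_low <= code < s_high:
                if s == "<eot>":
                    return "".join(out)
                out.append(s)
                low, high = s_low, s_high
                context += s
                break
            cum += probs[s]

assert decode(encode("hello world")) == "hello world"
```

Swapping the stand-in distribution for a real model's conditional probabilities is what ties compression ratio to prediction quality. The package's own API is shown below:
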
```python
from llmencode import LLMEncode
encoder = LLMEncode(model_name="qwen3-4b")
result = encoder.test_roundtrip("Hello world", verbose=True)
```

Requirements:

- Docker (for FUSE operations and consistent environments)
- Modal account (for GPU training and inference)
- Python 3.11+ with dependencies in `requirements.txt`
This project draws inspiration from work on LLM reasoning, state tracking, and the theoretical connections between prediction and compression.