kduxin/corrdim

Correlation Dimension of Autoregressive LLMs

Paper · NeurIPS · Homepage

A fractal-geometric approach to quantifying the epistemological complexity of text as perceived by language models.

This repository contains the implementation for computing correlation dimension, a measure that bridges local and global properties of text generation in autoregressive language models. Unlike perplexity, correlation dimension captures long-range structural complexity and self-similarity patterns, revealing insights into model behavior, hallucination tendencies, and various forms of text degeneration.

Quick Links

πŸ“š Publications

πŸ”— Resources

Features

  • Efficient computation using next-token log-probability vectors
  • Robust to model quantization (stable down to 4-bit precision)
  • Applicable across autoregressive architectures (Transformer, Mamba, etc.)
  • Real-time inference integration
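Since the official code is not yet released, here is a minimal, generic Grassberger–Procaccia-style sketch of the quantity involved. It is not the paper's implementation: it assumes the text has already been mapped to a sequence of vectors (e.g., next-token log-probability vectors) and estimates the correlation dimension as the log-log slope of the correlation integral over a chosen scaling range.

```python
import numpy as np

def correlation_integral(points, radii):
    """C(r): fraction of distinct point pairs within Euclidean distance r."""
    diffs = points[:, None, :] - points[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=-1))
    pair_dists = dists[np.triu_indices(len(points), k=1)]
    return np.array([(pair_dists < r).mean() for r in radii])

def correlation_dimension(points, r_min, r_max, n_radii=16):
    """Slope of log C(r) vs. log r over the scaling range [r_min, r_max]."""
    radii = np.geomspace(r_min, r_max, n_radii)
    C = correlation_integral(points, radii)
    mask = C > 0  # drop radii with no pairs before taking logs
    slope, _ = np.polyfit(np.log(radii[mask]), np.log(C[mask]), deg=1)
    return slope
```

As a sanity check, i.i.d. Gaussian points in 2-D should yield an estimate near 2 at small radii; the choice of scaling range `[r_min, r_max]` is the usual practical subtlety.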

Example: Correlation Integral Curves

The following figure shows correlation integral curves for various pre-trained language models on the "Newton's Philosophy" article from the Stanford Encyclopedia of Philosophy, compared to i.i.d. Gaussian noise:

Correlation Integral Curves

Models shown: GPT2-1.5B, Mamba-2.8B, Pythia-12B, Falcon3-10B, OpenLlama-13B, Yi1.5-34B
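The qualitative contrast in the figure can be reproduced with purely synthetic data: i.i.d. Gaussian noise produces a correlation-integral slope near the ambient dimension, while points lying on a low-dimensional structure produce a much smaller slope. The sketch below is illustrative only (the curve construction is invented for the demo and is unrelated to any language model):

```python
import numpy as np

def corr_integral(X, radii):
    """Fraction of distinct point pairs closer than each radius r."""
    dists = np.sqrt(((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1))
    pair_dists = dists[np.triu_indices(len(X), k=1)]
    return np.array([(pair_dists < r).mean() for r in radii])

rng = np.random.default_rng(0)
noise = rng.standard_normal((1000, 5))   # i.i.d. Gaussian noise in 5-D
t = rng.uniform(0.0, 2 * np.pi, 1000)    # a 1-D closed curve embedded in 5-D
curve = np.stack([np.cos(t), np.sin(t),
                  np.cos(2 * t), np.sin(2 * t), np.cos(3 * t)], axis=1)

radii = np.geomspace(0.4, 1.6, 10)
slopes = {}
for name, X in [("noise", noise), ("curve", curve)]:
    C = corr_integral(X, radii)
    slopes[name] = np.polyfit(np.log(radii), np.log(C), deg=1)[0]
print(slopes)  # the noise slope is several times the curve slope
```

On log-log axes the two correlation-integral curves have visibly different slopes, which is the gap the figure above highlights between model-generated trajectories and i.i.d. Gaussian noise.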


Code release coming soon. πŸš€
