A Phase-Transition Model for Synthetic Epistemic Drift
Author: Bentley DeVilling Affiliation: Course Correct Labs Date: 2025
We propose a theoretical framework and toy-model validation for synthetic epistemic drift—the degradation of truth signals in information ecosystems recursively populated by large language models (LLMs). Analytical derivation predicts a phase transition in epistemic integrity at a critical synthetic share
| ⟨k⟩ | Empirical |
Theoretical |
Deviation |
|---|---|---|---|
| 8 | 0.130 | 0.143 | 9% |
| 10 | 0.110 | 0.111 | 1% |
| 12 | 0.090 | 0.091 | 1% |
Key Result: Configuration model simulations with N=100,000 nodes validate the percolation threshold
Click the badge above or visit: Echo Chamber Zero on Colab
Runtime: ~5 minutes
git clone https://github.com/Course-Correct-Labs/echo-chamber-zero.git
cd echo-chamber-zero
pip install -r requirements.txt
python simulate_percolation.pyRuntime: ~30 minutes
jupyter notebook Echo_Chamber_Zero_Simulation.ipynbThe model predicts a phase transition at:
where:
-
$p$ = probability that a node is synthetic -
$\langle k \rangle$ = mean degree of the network -
$p_c$ = critical threshold for giant synthetic component emergence
Fraction of nodes in the largest connected synthetic-only component:
Measures the extent of synthetic "echo chamber" formation.
Shannon entropy over the distribution of component sizes:
where
echo-chamber-zero/
├── README.md # This file
├── LICENSE # CC-BY 4.0 (text/figures/data)
├── LICENSE-CODE # MIT License (source code)
├── requirements.txt # Python dependencies
├── simulate_percolation.py # Main simulation script
├── Echo_Chamber_Zero_Simulation.ipynb # Full Jupyter notebook
├── Echo_Chamber_Zero_Colab.ipynb # Colab-optimized version
├── data/
│ └── simulation_results.csv # Complete dataset (153 points)
└── figures/
├── sri_vs_p.png # SRI phase transition plot
├── re_vs_p.png # RE fragmentation plot
└── sri_re_vs_p_combined.png # Combined visualization
- Python 3.8 or higher
- pip package manager
- Clone the repository:
git clone https://github.com/Course-Correct-Labs/echo-chamber-zero.git
cd echo-chamber-zero- Install dependencies:
pip install -r requirements.txtnumpy- Numerical computationspandas- Data manipulationnetworkx- Graph construction and analysismatplotlib- Visualizationtqdm- Progress barsjupyter- Interactive notebook environment
Run the complete simulation pipeline:
python simulate_percolation.pyThis will:
- Generate configuration model graphs (N=100k nodes)
- Sweep synthetic probability p ∈ [0.0, 0.5] for ⟨k⟩ ∈ {8, 10, 12}
- Compute SRI and RE metrics for each configuration
- Save results to
data/simulation_results.csv - Generate publication-quality plots in
figures/ - Print threshold analysis to console
Expected runtime: 10-20 minutes (depending on hardware)
Launch the Jupyter notebook for step-by-step execution and visualization:
jupyter notebook Echo_Chamber_Zero_Simulation.ipynbThe notebook includes:
- Detailed methodology documentation
- Inline visualizations
- Parameter sensitivity analysis
- Threshold comparison tables
| Parameter | Value | Description |
|---|---|---|
| N | 100,000 | Number of nodes |
| ⟨k⟩ | 8, 10, 12 | Mean degree values |
| p | 0.0 → 0.5 (step 0.01) | Synthetic probability range |
| Graph type | Configuration model | Random graph with specified degree distribution |
| Random seed | 42 | For reproducibility |
- Phase transition confirmed: SRI exhibits sharp transitions at predicted thresholds
-
Theory validated: Empirical
$p_c$ matches$1/(\langle k \rangle - 1)$ within ~5-10% - Network fragmentation: RE peaks near threshold, indicating maximum fragmentation
- Finite-size effects: Small deviations attributable to finite N and Poisson variance
| ⟨k⟩ | Theoretical |
Empirical |
|---|---|---|
| 8 | 0.1429 | ~0.14-0.15 |
| 10 | 0.1111 | ~0.11-0.12 |
| 12 | 0.0909 | ~0.09-0.10 |
All plots show:
- Solid lines: Empirical SRI/RE measurements
-
Dashed lines: Theoretical
$p_c$ predictions - Color coding: Different mean degree values
See figures/ directory for high-resolution outputs (300 DPI).
All results are fully reproducible:
- Fixed random seed (42)
- Deterministic graph generation
- Versioned dependencies in
requirements.txt - Complete parameter documentation
To regenerate all results:
# Clean previous outputs
rm -rf data/ figures/
# Run simulation
python simulate_percolation.py
# Or run notebook
jupyter nbconvert --execute --to notebook --inplace Echo_Chamber_Zero_Simulation.ipynbIf you use this simulation in your research, please cite:
@misc{devillinng2025echochamber,
title={Echo Chamber Zero: A Phase-Transition Model for Synthetic Epistemic Drift},
author={DeVilling, Bentley},
year={2025},
howpublished={\url{https://github.com/Course-Correct-Labs/echo-chamber-zero}},
note={arXiv preprint (forthcoming)}
}Paper: DeVilling, B. (2025). Echo Chamber Zero: A Phase-Transition Model for Synthetic Epistemic Drift. Course Correct Labs / arXiv preprint TBD.
© Course Correct Labs 2025
This project uses dual licensing:
- Text, figures, and data: Creative Commons Attribution 4.0 International (CC BY 4.0)
- Source code: MIT License (see LICENSE-CODE)
You are free to share and adapt this material with attribution.
© 2025 Course Correct Labs
Course Correct Labs Email: [contact information] Website: [website URL]
Configuration model graphs are generated using:
- Poisson-distributed degree sequences with mean ⟨k⟩
- NetworkX
configuration_model()function - Self-loops and parallel edges removed
- Degree sum adjusted to ensure even parity
For each simulation trial:
- Generate graph G(N, ⟨k⟩)
- Assign each node as synthetic independently with probability p
- Compute metrics on resulting network
SRI Algorithm:
- Extract subgraph of synthetic nodes only
- Find all connected components
- Identify largest component size
- Normalize by total network size
RE Algorithm:
- Find all connected components in full graph
- Compute size fraction for each component
- Calculate Shannon entropy over distribution
Empirical thresholds estimated via:
-
Maximum derivative:
$p$ where$\frac{d(\text{SRI})}{dp}$ is maximized -
Crossing threshold:
$p$ where SRI first exceeds 0.05
Both methods yield consistent estimates within 1-2% of theoretical predictions.
Last updated: 2025 Version: 1.0.0