| 2025.05 | Fast-dLLM | Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding | | | |
| 2025.05 | EB-Sampler | Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking | | — | |
| 2025.05 | DINGO | DINGO: Constrained Inference for Diffusion LLMs | | — | |
| 2025.06 | Dilated-Scheduling | Plan for Speed: Dilated Scheduling for Masked Diffusion Language Models | | — | |
| 2025.06 | SlowFast-Sampling | Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles | | | |
| 2025.06 | WINO | Wide-In, Narrow-Out: Revokable Decoding for Efficient and Effective DLLMs | | | |
| 2025.06 | APD | Accelerating Diffusion LLMs via Adaptive Parallel Decoding | | | |
| 2025.08 | Prophet | Diffusion Language Models Know the Answer Before Decoding | | | |
| 2025.08 | RWS | Reward-Weighted Sampling: Enhancing Non-Autoregressive Characteristics in Masked Diffusion LLMs | — | — | |
| 2025.10 | LocalLeap | Accelerated Diffusion LLM Inference via Local Determinism Propagation | | | |
| 2025.10 | FreeDave | Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models | | | |
| 2025.10 | Saber | Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model | | | |
| 2025.12 | SchED | Fast-Decoding Diffusion Language Models via Progress-Aware Confidence Schedules | | | |
| 2025.12 | CadLLM | Improving the Throughput of Diffusion-based Large Language Models via a Training-Free Confidence-Aware Calibration | | — | |
| 2025.12 | LoPA | LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding | | | |
| 2025.09 | dParallel | dParallel: Learnable Parallel Decoding for dLLMs | | | |
| 2025.09 | LSD | Learnable Sampler Distillation for Discrete Diffusion Models | | | |
| 2025.09 | ADJUST | Enabling Approximate Joint Sampling in Diffusion LMs | | — | |
| 2025.09 | Learn2PD | Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding | | | |
| 2025.12 | Learning-Unmasking-Policies | Learning Unmasking Policies for Diffusion Language Models | | — | |
| 2025.09 | Spiffy | Spiffy: Multiplying Diffusion LLM Acceleration via Lossless Speculative Decoding | | — | |
| 2025.10 | SSD | Self Speculative Decoding for Diffusion Large Language Models | | — | |
| 2025.10 | DiffuSpec | DiffuSpec: Unlocking Diffusion Language Models for Speculative Decoding | | — | |
| 2025.12 | DEER | DEER: Draft with Diffusion, Verify with Autoregressive Models | | | |
| 2026.01 | DFlash | DFlash: Block Diffusion for Flash Speculative Decoding | — | | |
| 2026.01 | DART | DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference | | | |
| 2026.03 | ES-dLLM | ES-dLLM: Efficient Inference for Diffusion Large Language Models by Early-Skipping | | | |
| 2026.03 | EntropyCache | EntropyCache: Decoded Token Entropy Guided KV Caching for Diffusion Language Models | | | |