Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Offloading] Support Disk Offloading documentation Improvements or additions to documentation
#2373 opened Feb 17, 2026 by kylesayrs Draft
[model_free_ptq] Earlier Shape Validation codex documentation Improvements or additions to documentation ready When a PR is ready for review
#2372 opened Feb 16, 2026 by kylesayrs Loading…
input_id not required for Step3-VL-10B ready When a PR is ready for review
#2370 opened Feb 16, 2026 by gDINESH13 Loading…
[GPTQ] Move modifier to top-level for consistent folder structure documentation Improvements or additions to documentation ready When a PR is ready for review
#2368 opened Feb 16, 2026 by dik654 Loading…
[Sequential Pipeline] only cache unique offloaded values ready When a PR is ready for review
#2366 opened Feb 13, 2026 by kylesayrs Loading…
4 tasks done
add qwen3 vl autoround example documentation Improvements or additions to documentation ready When a PR is ready for review
#2357 opened Feb 12, 2026 by xin3he Loading…
feat: early group-size divisibility check with layer FQNs enhancement New feature or request ready When a PR is ready for review
#2353 opened Feb 11, 2026 by GOavi101 Loading…
DataLoader options, single-pass weight calibration, optional sequential prefetch documentation Improvements or additions to documentation ready When a PR is ready for review
#2349 opened Feb 11, 2026 by GOavi101 Loading…
Add model_free_ptq example for glm 4.6 block fp8 documentation Improvements or additions to documentation
#2343 opened Feb 10, 2026 by mgoin Loading…
[Bugfix] Guard against MLA ready When a PR is ready for review
#2337 opened Feb 6, 2026 by kylesayrs Loading…
Improve how we identify and run e2e smoke tests
#2336 opened Feb 6, 2026 by dhuangnm Loading…
[MoE] MiniMax-M2/M2.1 calibration follow-up documentation Improvements or additions to documentation ready When a PR is ready for review
#2335 opened Feb 6, 2026 by LudovicoYIN Loading…
[GPTQ][ddp] PoC for GPTQ with DDP enhancement New feature or request gptq For any PR / issue related to GPTQ support quality-failed
#2333 opened Feb 6, 2026 by HDCharles Loading…
[AutoRound] Add DP Support
#2331 opened Feb 5, 2026 by yiliu30 Loading…
Add GSM8K evaluation script and AWQ+FP8 results documentation Improvements or additions to documentation
#2330 opened Feb 4, 2026 by rtj1 Loading…
Benchmark torch.compile optimization for quantization ready When a PR is ready for review
#2320 opened Jan 31, 2026 by colldata79 Loading…
Update vLLM GPU Utilization
#2319 opened Jan 30, 2026 by dsikka Draft
Add AFMOE mappings for awq and smoothquant ready When a PR is ready for review
#2316 opened Jan 30, 2026 by bartowski1182 Loading…
move smoothquant to transforms documentation Improvements or additions to documentation ready When a PR is ready for review
#2314 opened Jan 30, 2026 by Etelis Loading…
Support FP8 Block Quantization for Non-Divisible Shapes
#2290 opened Jan 26, 2026 by Etelis Loading…
3 of 4 tasks
Refactor Matching Logic to Use compressed-tensors Utilities needs-rebase ready When a PR is ready for review
#2284 opened Jan 24, 2026 by Etelis Loading…
[Docs][Examples] Add MoE Guide and remove finetune examples documentation Improvements or additions to documentation needs-rebase ready When a PR is ready for review
#2281 opened Jan 23, 2026 by dsikka Loading…
ProTip! Adding no:label will show everything without a label.