-
Notifications
You must be signed in to change notification settings - Fork 396
Pull requests: vllm-project/llm-compressor
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Improvements or additions to documentation
ready
When a PR is ready for review
model_free_ptq] Earlier Shape Validation
codex
documentation
#2372
opened Feb 16, 2026 by
kylesayrs
Loading…
input_id not required for Step3-VL-10B
ready
When a PR is ready for review
#2370
opened Feb 16, 2026 by
gDINESH13
Loading…
[GPTQ] Move modifier to top-level for consistent folder structure
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2368
opened Feb 16, 2026 by
dik654
Loading…
[Sequential Pipeline] only cache unique offloaded values
ready
When a PR is ready for review
#2366
opened Feb 13, 2026 by
kylesayrs
Loading…
4 tasks done
add qwen3 vl autoround example
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2357
opened Feb 12, 2026 by
xin3he
Loading…
feat: early group-size divisibility check with layer FQNs
enhancement
New feature or request
ready
When a PR is ready for review
#2353
opened Feb 11, 2026 by
GOavi101
Loading…
DataLoader options, single-pass weight calibration, optional sequential prefetch
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2349
opened Feb 11, 2026 by
GOavi101
Loading…
Add model_free_ptq example for glm 4.6 block fp8
documentation
Improvements or additions to documentation
#2343
opened Feb 10, 2026 by
mgoin
Loading…
[Bugfix] Guard against MLA
ready
When a PR is ready for review
#2337
opened Feb 6, 2026 by
kylesayrs
Loading…
[MoE] MiniMax-M2/M2.1 calibration follow-up
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2335
opened Feb 6, 2026 by
LudovicoYIN
Loading…
[GPTQ][ddp] PoC for GPTQ with DDP
enhancement
New feature or request
gptq
For any PR / issue related to GPTQ support
quality-failed
#2333
opened Feb 6, 2026 by
HDCharles
Loading…
Add GSM8K evaluation script and AWQ+FP8 results
documentation
Improvements or additions to documentation
#2330
opened Feb 4, 2026 by
rtj1
Loading…
[AWQ] Add option to consider smooth layer quantization in scale search
needs-rebase
#2323
opened Jan 31, 2026 by
Ramshankar07
Loading…
Benchmark torch.compile optimization for quantization
ready
When a PR is ready for review
#2320
opened Jan 31, 2026 by
colldata79
Loading…
Add AFMOE mappings for awq and smoothquant
ready
When a PR is ready for review
#2316
opened Jan 30, 2026 by
bartowski1182
Loading…
move smoothquant to transforms
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2314
opened Jan 30, 2026 by
Etelis
Loading…
Support FP8 Block Quantization for Non-Divisible Shapes
#2290
opened Jan 26, 2026 by
Etelis
Loading…
3 of 4 tasks
Refactor Matching Logic to Use compressed-tensors Utilities
needs-rebase
ready
When a PR is ready for review
#2284
opened Jan 24, 2026 by
Etelis
Loading…
[Docs][Examples] Add MoE Guide and remove finetune examples
documentation
Improvements or additions to documentation
needs-rebase
ready
When a PR is ready for review
#2281
opened Jan 23, 2026 by
dsikka
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.