Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Fix for e2e tests
#927 opened Nov 19, 2024 by horheynm Loading…
[1/2] Expand e2e testing to prepare for lm-eval ready When a PR is ready for review
#922 opened Nov 17, 2024 by dsikka Loading…
Update kv_cache example ready When a PR is ready for review
#921 opened Nov 17, 2024 by dsikka Loading…
Actually make the run_compressed test useful ready When a PR is ready for review
#920 opened Nov 17, 2024 by dsikka Loading…
[Bugfix] Support model offloading SparseGPTQ
#918 opened Nov 16, 2024 by kylesayrs Loading…
Implement HooksMixin
#917 opened Nov 14, 2024 by kylesayrs Loading…
Kylesayrs/gptq partition
#914 opened Nov 13, 2024 by kylesayrs Draft
fix consecutive oneshot
#898 opened Nov 5, 2024 by horheynm Loading…
Allow Shortcutting Min-max Observer
#887 opened Nov 1, 2024 by kylesayrs Loading…
[Bugfix] Correct metrics calculations
#878 opened Oct 30, 2024 by kylesayrs Loading…
FSDP utils cleanup
#854 opened Oct 19, 2024 by kylesayrs Loading…
[Bugfix] DisableKVCache Context
#834 opened Oct 9, 2024 by kylesayrs Loading…
Awq re implementation
#824 opened Oct 7, 2024 by rahul-tuli Draft
Enable Sparse compression
#822 opened Oct 7, 2024 by rahul-tuli Loading…
1 of 3 tasks
e2e tests
#742 opened Oct 1, 2024 by horheynm Loading…
Add: Weight clipping to AWQModifier
#184 opened Sep 18, 2024 by rahul-tuli Loading…
1 task
ProTip! Adding no:label will show everything without a label.