Pull requests: vllm-project/llm-compressor
Pull requests list
Load Compressed Tensors Models with run_compressed=False
#923
opened Nov 18, 2024 by
horheynm
[1/2] Expand e2e testing to prepare for lm-eval
ready
When a PR is ready for review
#922
opened Nov 17, 2024 by
dsikka
Update kv_cache example
ready
#921
opened Nov 17, 2024 by
dsikka
Actually make the run_compressed test useful
ready
#920
opened Nov 17, 2024 by
dsikka
Support pack_quantized format for nonuniform mixed-precision
#913
opened Nov 13, 2024 by
mgoin
[SparseAutoModelForCausalLM Deprecation] Update examples
#880
opened Oct 31, 2024 by
horheynm
[DO NOT MERGE] Increase Sparsity Threshold to 50% for sparse compression
#876
opened Oct 30, 2024 by
rahul-tuli