Repositories
Showing 10 of 26 repositories
- llm-compressor Public
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
- compressed-tensors Public
A safetensors extension to efficiently store sparse quantized tensors on disk
- speculators Public
A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM
- vllm-project.github.io Public