llm-d is a well-lit path for serving large language models at scale with the fastest time-to-value and competitive performance per dollar. Built on vLLM, Kubernetes, and Inference Gateway, llm-d provides modular solutions for distributed inference with features like KV-cache aware routing and disaggregated serving.
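To give a flavor of what a Kubernetes deployment looks like, here is a minimal, hypothetical install sketch. The chart repository URL, chart name, and `model.id` values key are placeholders, not llm-d's confirmed interface; follow the Documentation link below for the actual quickstart.

```bash
# Hypothetical sketch only: the repo URL, chart name, and model.id values key
# are placeholders, not llm-d's confirmed API -- see llm-d.ai for real steps.
helm repo add llm-d https://example.com/llm-d-charts   # placeholder URL
helm repo update

# Stand up a serving stack for a single model in its own namespace.
helm install llama-demo llm-d/llm-d \
  --namespace llm-d --create-namespace \
  --set model.id=meta-llama/Llama-3.1-8B-Instruct      # assumed values key
```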
- 📖 Documentation: llm-d.ai
- 🏗️ Architecture: llm-d architecture docs
- 📖 Project Details: PROJECT.md
- 📦 Releases: GitHub Releases
 
- 💬 Slack: Join our development discussions at llm-d.slack.com
- 📧 Google Group: Subscribe to llm-d-contributors for architecture docs and meeting invites
- 🗓️ Weekly Standup: Wednesdays at 12:30 ET - Public Calendar
 
- Read Guidelines: Review our Code of Conduct and contribution process
- Sign Commits: All commits require DCO sign-off (`git commit -s`; see the example after this list)
- 🐛 Bug fixes and small features - Submit PRs directly to component repos
- 🚀 New features with APIs - Require project proposals
- 📚 Documentation - Help improve guides and examples
- 🧪 Testing & Benchmarking - Contribute to our test coverage
- 💡 Experimental features - Start in llm-d-incubation org
 
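The DCO sign-off mentioned above uses standard git flags; the commit message below is only illustrative:

```bash
# -s / --signoff appends a "Signed-off-by:" trailer taken from your
# configured git user.name and user.email.
git commit -s -m "fix: tolerate empty routing table"   # illustrative message

# Forgot the sign-off on your last commit? Amend it in place:
git commit --amend -s --no-edit
```
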
License: Apache 2.0