aphrodite-engine
https://github.com/pygmalionai/aphrodite-engine
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
not yet supported
1 Subscriber
Help out
- Issues
  - [Installation]: Install v0.6.6/v0.6.7 on amd gpu gfx906 failed, v0.6.5 success but cannot run gptq
  - [Bug]: VPTQ quantitative model Inference error
  - [Feature]: Automatic max-model-len or max-num-seqs
  - [Bug]: FP8 KV Cache FLASHINFER AssertionError
  - Add Olmo2
  - Add Repetition Range ('rep_range')
  - [Bug]: ModuleNotFoundError: No module named 'ray'
  - [Bug]: Generation sometimes slows to a crawl for all requests when there is a DRY sampler request
  - [Usage]: Distributed Inference Without Docker.
  - [Bug]: unable to load 14B Qwen2.5 GGUF with newest version (0.6.2.post1)
- Docs
  - not yet supported