deepspeed
https://github.com/microsoft/deepspeed
Python
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
16 Subscribers
Help out
- Issues
  - fix(zero): Ensure full gradient reduction for Muon optimizer with reduce_scatter
  - Let PyTorch set `-gencode` flags
  - fix: keep fp32-pinned parameters out of the bf16 cast path in ZeRO-3 (#7747)
  - fix: correct DistributedAttention output shape and pad uneven sequence lengths (#7842)
  - Revert "fix: remove premature MPI environment variable check in OpenMPIRunner"
  - (Draft) [Roadmap] DeepSpeed Roadmap Q2 2026
  - Fix zero/division safety gaps in utility and inference paths
  - Reject non-finite fp16 loss_scale across config and ZeRO paths
  - Fix ZeRO legacy grad-hook crash when next_functions is missing
  - Fix global .cuh ignore and enforce tracked CUDA headers
- Docs
  - Python not yet supported