lightning
https://github.com/lightning-ai/lightning
Python
Deep learning framework to train, deploy, and ship AI products Lightning fast.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported5 Subscribers
Add a CodeTriage badge to lightning
Help out
- Issues
- slurm env incorrectly complains about srun with salloc interactive session.
- Save checkpoint version dirs as `version_003` so that they sort lexicographically
- Add support for converting `RMSNorm` when using `transformer-engine`
- Example of running huggingface model with tensor parallel
- Add Callback for Differential Privacy
- Cannot reproduce FSDP memory profile in docs
- Improve Fault Tolerance via TorchFT
- Doing full validation on step 0
- Remove an unnecessary TODO in `src/lightning/pytorch/loops/fit_loop.py`
- DP Replicate Groups and collective reduction with FSDP2 APIs
- Docs
- Python not yet supported