lightning
https://github.com/lightning-ai/lightning
Python
Deep learning framework to train, deploy, and ship AI products Lightning fast.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
5 Subscribers
Help out
- Issues
  - Add `on_validation_model_train/eval` to Callback API as well
  - Slurm multi-node work fine but multi-gpu doesn't
  - Multi-gpu training with slurm times out
  - Call `configure_module` before `freeze_before_training`
  - Why only one GPU is getting used in the kaggle kernel
  - When interrupting a run with Ctrl+C, sometimes the WandbLogger does not upload a checkpoint artifact
  - Log default metrics
  - How to deal with uneven inputs in DDP with sharded data without hanging
  - FSDP full state dict mangles fsspec path
  - put the monitor metric into default filename for ModelCheckpoint
- Docs
  - Python not yet supported