lightning
https://github.com/lightning-ai/lightning
Python
Deep learning framework to train, deploy, and ship AI products Lightning fast.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
5 Subscribers
Help out
- Issues
  - Crash when using a non-scalar together with scalar pytorch-metrics in distributed training
  - `LightningModule.on_train_batch_end` executes after `Callback.on_train_batch_end`
  - Fix: filesystem edge case in `ModelCheckpoint`
  - Resuming from checkpoint, mid epoch gives a very distorted time estimate
  - Epoch Skipping During Training After Multiple Jobs Submitted On Same Slurm Node
  - Improve error message when `LightningCLI(run=False)` but user passes a stage as argument
  - Support multi-run with hydra + DDP
  - TPU v3-8 deadlocks when using datasets larger than 2^15 on 8 devices
  - CUDA OOM with DeepSpeed ZeRO Stage 3 Offload
  - Unable to train on v3-8 TPUs with lightning. Training is stuck/deadlocked ?
- Docs
  - Python not yet supported