transformers
https://github.com/huggingface/transformers
Python
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported51 Subscribers
View all SubscribersAdd a CodeTriage badge to transformers
Help out
- Issues
- Add native masked MSE loss for Sapiens2ForPoseEstimation
- [Offloading] Support full disk offloading
- [`peft`] Support key_mapping with PEFT models
- LabelSmoother ignores num_items_in_batch, inflating gradients by ~gradient_accumulation_steps with label smoothing
- Several models hardcode `torch.float64` in their forward pass, crashing on MPS (Apple Silicon)
- Fix gradient inflation when combining label smoothing with gradient accumulation
- Fix beam cache reordering for non-default cache keys
- Fix hardcoded torch.float64 crashing on MPS (#46723)
- LoRA fine-tuning of an FP8 checkpoint is blocked: `get_peft_model` does not clear `validate_quantization_for_training` (QuantizationMethod.FP8)
- Allow metal-flash-sdpa for OpenAIPrivacyFilter on MPS
- Docs
- Python not yet supported