transformers
https://github.com/huggingface/transformers
Python
š¤ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported51 Subscribers
View all SubscribersAdd a CodeTriage badge to transformers
Help out
- Issues
- [RT-DETR] TypeError in RTDetrModel.forward when num_feature_levels > len(backbone outputs)
- ZeRO-3 zero.Init does not partition composite minimax_m3_vl language submodule -> OOM on multi-GPU load
- docs(trainer): add JIT checkpointing to trainer recipes
- Try smaller group
- [Qwen3.5] Missing `linear_attn` entries in `base_model_tp_plan` causes OOM and shape error at TP>1
- Migrate ALBERT task heads to self.loss_function and add ForMultipleChoiceLoss
- Fix garbage generation for Qwen3.5/Qwen3-Next under device_map CPU offload
- Fix RT-DETR indexing error when num_feature_levels exceeds backbone oā¦
- Enhance return type annotation in apply_chat_template in processing_uā¦
- fix(qwen3_5): add linear_attn entries to base_model_tp_plan
- Docs
- Python not yet supported