transformers
https://github.com/huggingface/transformers
Python
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
46 Subscribers
Help out
- Issues
- do not index past decoded chars with special tokens
- Raise 400 on model mismatch when `transformers serve` is pinned
- [loading] Clean way to add/remove full parts in checkpoint names
- refactor: replace wildcard imports with explicit imports in model __init__.py files
- Draft commit
- Add expert parallelism (EP) support for Qwen3 MoE + fix GroupedGemmParallel for 2D meshes
- granitemoehybrid: HybridMambaAttentionDynamicCache missing from modeling_granitemoehybrid — breaks ibm-granite/granite-4.0-3b-vision remote code
- fix(DSV3): parity between native `DeepseekV3MoE` and remote official implementation
- Add Gemma4ForSequenceClassification
- Gemma4 training with text-only samples
- Docs
- Python not yet supported