transformers
https://github.com/huggingface/transformers
Python
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported51 Subscribers
View all SubscribersAdd a CodeTriage badge to transformers
Help out
- Issues
- [CTRL] Support attn_implementation="sdpa" dispatch
- Widen tols for float16/bfloat16
- Model quantized via sinq broken after save_pretrained and from_pretrained
- [`Kernels`] Sync to latest version and add new kernels (SwiGLU, CE)
- Add TDT loss kernel
- DeepSeek-V4 shared expert not gated.
- [docs] tp for continuous batching
- Support Granite Speech NAR (NLE)
- fix(llama4): align MoE interface for EP/TP compatibility
- Mamba2Mixer: use_cache with seq_len > 1 silently produces incorrect results (both CPU and GPU paths)
- Docs
- Python not yet supported