transformers
https://github.com/huggingface/transformers
Python
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported51 Subscribers
View all SubscribersAdd a CodeTriage badge to transformers
Help out
- Issues
- [Generation] Add static ensemble verification for lossy speculative decoding
- Fix OLMo 3 scaled RoPE handling for sliding attention
- GGUF: optional Metal dequant fast path via kernels-community
- GgufLinear: inference-time GGUF matmul on Apple Silicon — llama.cpp parity
- AutoTokenizer produces wrong token IDs for OLMo2, HyperClovaX, DeepSeek-R1-Distill-Llama, Yi, and others (v5 regression)
- Fix models for which we don't have a dedicated tokenizer class, and the listed one is incorrect
- [DeepSeekV4] Compressor does not seem to account for padding tokens when forming compressed KV blocks
- DO NOT MERGE testing grafana
- Stop align_special_tokens from rewriting eos_token_id when no alignment is needed
- [Feature Request] Add lossy speculative decoding via static ensemble verification
- Docs
- Python not yet supported