sentence-transformers
https://github.com/ukplab/sentence-transformers
Python
Sentence Embeddings with BERT & XLNet
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported14 Subscribers
Add a CodeTriage badge to sentence-transformers
Help out
- Issues
- Details of Benchmark Dataset
- The special_tokens in tokenizer should also be controlled by do_lower_case in encoder_config.
- Question about fine-tuning LLM-based retrievers
- Using KLDiv with CMNRL
- Support for listwise loss functions in bi-encoder
- Qwen3 produces `nan` embeddings (SDPA + macOS)
- Hyperparameters of original cross-encoder/ms-marco-MiniLM series models?
- tokenizers isn't listed in dependencies in pyproject.toml
- Wrong defaults used when loading older non-mean-pooled models via subclass
- Bug: `ImportError` from `from sentence_transformers import SentenceTransformer` — `.backend` issue
- Docs
- Python not yet supported