denselinkage.embedding.SentenceTransformerEmbedder¶
- class denselinkage.embedding.SentenceTransformerEmbedder(model_name: str)[source]¶
Bases:
EmbedderSemantic embedder over a
sentence-transformerscheckpoint (extra:[sentence-transformers]).Where the lexical
HashedNGramEmbedderrecovers typos and abbreviations, this captures meaning: it can link semantic renames (e.g. Google / Alphabet) that share no characters. Encodes withnormalize_embeddings=Trueso the unit-vector inner product equals cosine — the similarity the numpy / FAISS indexes andsimilarity_thresholdare defined against. The model loads eagerly at construction, so a badmodel_namefails fast.