Skip to content
ai-supply.store
DiscoverCategoriesLeaderboardsCommunityAgent APIFAQ
PublishSign in
catalog / Language & NLP / BGE-large-en-v1.5
⠿EmbeddingLanguage & NLPFree

BGE-large-en-v1.5

MIT-licensed SOTA English embedding model from BAAI — top MTEB leaderboard performer, commercial-friendly.

@ai-supply
Installs230k
Rating★ 4.8
Reviews77
Install (free) to download the source.↗ Source repository

BGE-large-en-v1.5

BGE-large-en-v1.5 (Beijing Academy of AI General Embedding) is a state-of-the-art English text embedding model released by BAAI under the MIT license. It consistently ranks at the top of the MTEB (Massive Text Embedding Benchmark) leaderboard for retrieval, reranking, and semantic similarity tasks.

Key features

  • 1024-dimensional embeddings — high-fidelity semantic representation
  • Top MTEB scores across retrieval, classification, and clustering tasks
  • Dual-encoder architecture optimized for retrieval
  • MIT license — fully commercial-friendly
  • Works out-of-the-box with sentence-transformers, LangChain, and LlamaIndex
  • Prefix instructions (Represent this sentence for retrieval:) boost retrieval performance

Quick start

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("BAAI/bge-large-en-v1.5")

queries = ["Represent this sentence for retrieval: What is RAG?"]
docs = ["Retrieval-Augmented Generation grounds LLMs with external knowledge."]

q_emb = model.encode(queries, normalize_embeddings=True)
d_emb = model.encode(docs, normalize_embeddings=True)
scores = q_emb @ d_emb.T
print(scores)  # cosine similarity

Install via ai-supply

npx ai-supply add bge-large-en-v1-5

Curated mirror of the open-source BGE-large-en-v1.5 (MIT). Get it from the source.

More from @ai-supply

View profile →
◆Skill
OpenCV Python
The world's most popular computer vision library with Python bindings — image processing, video, and ML pipelines.
↓ 500k★ 4.9
◐Model
timm (PyTorch Image Models)
The largest collection of pretrained image models for PyTorch — ViT, ConvNeXt, EfficientNet, Swin, and 900+ more.
↓ 490k★ 4.9
⌬Workflow
Apache Airflow
Apache-2.0 workflow orchestration platform — define, schedule, and monitor data and AI pipelines as Python DAGs.
↓ 395k★ 4.7
◐Model
Segment Anything Model (SAM)
Meta AI's promptable image segmentation model that can segment any object from a single click or bounding box.
↓ 320k★ 4.9
ai-supply.store

The marketplace for AI capabilities. Skills, MCPs, plugins, agents, datasets — discoverable by humans, consumable by machines.

api · v3.1status · all green
Marketplace
  • Discover
  • Categories
  • Leaderboards
  • Benchmarks
Community
  • Community
  • FAQ
For agents
  • Quickstart (60s)
  • Authorize an agent
  • Agent API
  • OpenAPI spec
For builders
  • Publish
  • Dashboard
  • Revenue share
Account
  • Sign in
  • Settings
Legal
  • Terms
  • Publisher Agreement
  • Acceptable Use
  • Privacy