Skip to content
ai-supply.store
탐색카테고리리더보드커뮤니티Agent APIFAQ
로그인무료 가입
catalog / Language & NLP / BEIR
▣DatasetLanguage & NLPFree

BEIR

Heterogeneous zero-shot information-retrieval benchmark bundling 15+ diverse IR datasets behind one evaluation API.

@ai-supply
설치 수829
↗ 소스 저장소

BEIR

BEIR (Benchmarking IR) is the standard for measuring how well a retriever generalizes zero-shot across domains it was never tuned on. Instead of overfitting to a single collection, it aggregates 15+ heterogeneous datasets — spanning fact-checking, question answering, bio-medical, scientific, financial, duplicate-detection, and news retrieval — into a common format with unified corpus/queries/qrels loaders and evaluation.

Key features

  • 15+ ready-to-use retrieval datasets in one consistent schema
  • Standardized nDCG@k, MAP, Recall, and Precision evaluation out of the box
  • Compare BM25, dense bi-encoders, ColBERT, rerankers, and hybrid systems apples-to-apples
  • Focus on zero-shot generalization, exposing where dense models quietly underperform lexical baselines
  • Widely cited reference used to report embedding and retriever quality

Use it to sanity-check a new embedding model or reranker before shipping it into a RAG stack, so you know it holds up beyond your own domain.

Curated mirror of the open-source BEIR (Apache-2.0). Get it from the source.

More from @ai-supply

View profile →
◇MCP server
GitHub MCP Server
Official GitHub MCP server — give your AI agent full read/write access to repos, issues, PRs, and actions.
↓ 771k
⠿Embedding
Sentence Transformers
State-of-the-art sentence and text embeddings — compute semantic similarity, clustering, and dense retrieval.
↓ 751k
◆Skill
NLTK
The Natural Language Toolkit — Python's foundational NLP library for tokenization, POS tagging, parsing, and corpora.
↓ 641k
◇MCP server
MCP TypeScript SDK
Official TypeScript/JavaScript SDK for building MCP servers and clients — the Node.js foundation for the Model Context Protocol.
↓ 629k
ai-supply.store

무료로 제공하는 보안 검증 AI 역량 — skill, MCP, plugin, agent, 데이터셋을 비롯한 모든 항목에 보안 점수를 매기고 최신성을 추적하며, 사람과 agent 모두를 위해 만들었습니다.

api · v3.1status · all green
문의하기
support@ai-supply.storesecurity@ai-supply.store
카탈로그
  • 탐색
  • 카테고리
  • 리더보드
  • 벤치마크
  • 보안
커뮤니티
  • 커뮤니티
  • FAQ
에이전트용
  • 빠른 시작 (60s)
  • 에이전트 승인
  • Agent API
  • OpenAPI 사양
빌더용
  • 게시
  • 대시보드
계정
  • 계정 만들기
  • 로그인
  • 설정
법적 정보
  • 이용약관
  • 게시자 계약
  • 이용 정책
  • 개인정보 처리방침