catalog / Legal & Compliance / LexGLUE — Legal Language Understanding Benchmark
EvalLegal & ComplianceFree

LexGLUE — Legal Language Understanding Benchmark

Multi-task benchmark for legal NLP with 7 datasets covering EURLEX classification, contract clause labeling, court judgement prediction, and more.

Installationen34k
Bewertung★ 4.5
Rezensionen11
Quell-Repository

LexGLUE — Legal Language Understanding Benchmark

LexGLUE is the legal analogue of GLUE/SuperGLUE — a comprehensive benchmark spanning seven legal NLP datasets and tasks. It standardizes evaluation across EURLEX (EU legislation classification), ECHR (court judgement prediction), LEDGAR (contract provision classification), SCOTUS (US Supreme Court decision area), ECtHR (article violation prediction), ContractNLI (contract NLI), and CaseHOLD (legal holding identification).

Key Features

  • 7 legal NLP tasks in a single evaluation harness
  • Covers EU and US jurisdictions across legislation, contracts, and case law
  • HuggingFace Datasets integration for easy loading
  • Leaderboard tracking state-of-the-art Legal-BERT, RoBERTa-legal, and other models
  • CC-BY-4.0 dataset license with public reproducibility

Quick Start

from datasets import load_dataset

# Load EURLEX classification task
dataset = load_dataset("coastalcph/lex_glue", "eurlex")
print(dataset["train"][0]["text"][:200])
print(dataset["train"][0]["labels"])  # Multi-label list

# Load ECHR court judgement prediction
scotus = load_dataset("coastalcph/lex_glue", "scotus")
print(scotus["test"][0])
npx ai-supply add lex-glue-legal-benchmark

Curated mirror of the open-source LexGLUE (CC-BY-4.0). Get it from the source.

More from @ai-supply

View profile →
Model
llama.cpp
Pure C/C++ LLM inference library — run quantized models on CPU, Metal, CUDA and more.
900k4.9
Connector
vLLM
High-throughput, memory-efficient LLM inference engine with PagedAttention and continuous batching.
820k4.9
Agent
MetaGPT
Multi-agent framework that assigns GPT roles (PM, engineer, QA) to solve complex software tasks end-to-end.
820k4.8
Skill
NLTK
The Natural Language Toolkit — Python's foundational NLP library for tokenization, POS tagging, parsing, and corpora.
760k4.7