Dataset · Legal & Compliance · Free

CUAD — Contract Understanding Atticus Dataset

Expert-labeled dataset of 13,000+ annotations across 510 commercial contracts covering 41 legal clause types for contract review AI.

@ai-supply
Installs: 89k
Rating: ★ 4.7
Reviews: 30

CUAD (Contract Understanding Atticus Dataset) is a large-scale dataset created by The Atticus Project in collaboration with dozens of legal experts. It contains 13,000+ annotations across 510 real commercial contracts, labeling 41 distinct clause types, including parties, payment terms, termination clauses, IP ownership, and liability caps. It serves as a standard benchmark for training and evaluating contract review AI systems.

Key Features

  • 510 commercial contracts from EDGAR (SEC filings)
  • 41 clause categories annotated by legal professionals
  • Question-answering format compatible with extractive QA models
  • Benchmark leaderboard for contract understanding research
  • Free for commercial and academic use under CC-BY-4.0
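To make the question-answering format concrete, here is a minimal sketch of one record in a SQuAD-style layout, which is the layout extractive QA models typically expect. The contract title, text, and question wording are invented for illustration and are not taken from CUAD; the field names (title, context, question, answers with text and answer_start) are an assumption about the schema and should be checked against the loaded dataset.

```python
# Hypothetical record in a SQuAD-style layout. All text below is invented
# for illustration; only the general shape is meant to carry over.
context = "This Agreement is governed by the laws of the State of New York."
answer_text = "laws of the State of New York"

record = {
    "title": "EXAMPLE_DISTRIBUTOR_AGREEMENT",  # hypothetical contract name
    "context": context,                         # contract text
    "question": "Which governing law applies to this contract?",
    "answers": {
        "text": [answer_text],
        # Character offset of each answer inside the context.
        "answer_start": [context.index(answer_text)],
    },
}


def extract_spans(rec):
    """Return the context substrings that the answer offsets point to."""
    spans = []
    for text, start in zip(rec["answers"]["text"], rec["answers"]["answer_start"]):
        spans.append(rec["context"][start:start + len(text)])
    return spans


spans = extract_spans(record)
# In an extractive format, each answer is a literal slice of the context.
assert spans == record["answers"]["text"]
print(spans)
```

Running the same slice check over real rows is a quick way to confirm that the answer offsets line up with the contract text before fine-tuning a model.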

Quick Start

from datasets import load_dataset

# Download CUAD from the Hugging Face Hub (requires the `datasets` package).
dataset = load_dataset("theatticusproject/cuad")
train = dataset["train"]
print(f"Train examples: {len(train)}")
print(train[0]["title"])     # Contract name
print(train[0]["question"])  # Clause type question
print(train[0]["answers"])   # Extracted clause text

To add the dataset through the ai-supply CLI:

npx ai-supply add cuad-contract-understanding-dataset
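
Once rows are loaded, a common first step is to see how often each clause type is actually present. The sketch below counts, per question, the rows that carry at least one answer span. It runs on toy rows so it works without a download; the row contents are invented, and it assumes the title, question, and answers fields shown in Quick Start, with an absent clause represented by an empty answer list.

```python
from collections import Counter

# Toy stand-ins for dataset rows; titles, questions, and offsets are invented.
rows = [
    {"title": "CONTRACT_A", "question": "Governing Law",
     "answers": {"text": ["New York law"], "answer_start": [120]}},
    {"title": "CONTRACT_A", "question": "Cap On Liability",
     "answers": {"text": [], "answer_start": []}},  # clause not present
    {"title": "CONTRACT_B", "question": "Governing Law",
     "answers": {"text": ["Delaware law"], "answer_start": [98]}},
]


def count_answered(rows):
    """Count, per question, how many rows have at least one answer span."""
    counts = Counter()
    for row in rows:
        if row["answers"]["text"]:
            counts[row["question"]] += 1
    return counts


answered = count_answered(rows)
print(answered.most_common())
```

If the loaded split follows the same schema, passing `train` in place of `rows` should work, since iterating a `datasets.Dataset` yields one dictionary per row.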

This listing is a curated mirror of the open-source CUAD dataset (CC-BY-4.0).
