Dataset · Legal & Compliance · Free

CUAD — Contract Understanding Atticus Dataset

Expert-labeled dataset of 13,000+ annotations across 510 commercial contracts covering 41 legal clause types for contract review AI.

@ai-supply
Installs: 89k
Rating: ★ 4.7
Reviews: 30

CUAD (Contract Understanding Atticus Dataset) is a large-scale dataset created by The Atticus Project with the help of dozens of legal experts. It contains 13,000+ annotations across 510 real commercial contracts, labeling 41 distinct clause types, including parties, payment terms, termination clauses, IP ownership, and liability caps. It serves as a benchmark for training and evaluating contract review AI systems.

Key Features

  • 510 commercial contracts from EDGAR (SEC filings)
  • 41 clause categories annotated by legal professionals
  • Question-answering format compatible with extractive QA models
  • Benchmark leaderboard for contract understanding research
  • Free for commercial and academic use under CC-BY-4.0
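
The question-answering format mentioned above can be pictured with a small sketch. It assumes a SQuAD-style schema in which each record carries a `context` string and an `answers` dict with parallel `text` and `answer_start` lists; that schema is an assumption, not something this listing confirms. The record below is invented for illustration and is not taken from CUAD, and `answer_spans` is a hypothetical helper.

```python
from typing import Dict, List, Tuple

# Hypothetical record, invented for illustration; NOT taken from CUAD.
# Assumed schema: SQuAD-style "answers" with parallel "text" / "answer_start" lists.
context = (
    "This Agreement is governed by the laws of the State of New York. "
    "Either party may terminate this Agreement upon thirty days written notice."
)
governing = "This Agreement is governed by the laws of the State of New York."
record: Dict = {
    "title": "EXAMPLE_AGREEMENT",
    "context": context,
    "question": "Which clause states the governing law?",
    "answers": {"text": [governing], "answer_start": [context.index(governing)]},
}


def answer_spans(rec: Dict) -> List[Tuple[int, int, str]]:
    """Return (start, end, text) per annotated span, verifying offsets against the context."""
    spans = []
    answers = rec["answers"]
    for text, start in zip(answers["text"], answers["answer_start"]):
        end = start + len(text)
        # An extractive answer must be a literal substring of the context.
        if rec["context"][start:end] != text:
            raise ValueError(f"offset {start} does not match answer text")
        spans.append((start, end, text))
    return spans


spans = answer_spans(record)
print(spans)
```

Under this assumed schema, a record whose clause type does not appear in the contract would carry empty lists, and the helper would return an empty list for it.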

Quick Start

from datasets import load_dataset

# Dataset ID as given in this listing; fetched from the Hugging Face Hub.
dataset = load_dataset("theatticusproject/cuad")
train = dataset["train"]
print(f"Train examples: {len(train)}")

# Inspect the first example.
print(train[0]["title"])  # Contract name
print(train[0]["question"])  # Clause type question
print(train[0]["answers"])  # Extracted clause text

To install through the ai-supply CLI instead:

npx ai-supply add cuad-contract-understanding-dataset
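
Once examples are loaded, a common first step is to see which clause types were actually found in each contract. The sketch below works on rows shaped like the fields printed in the Quick Start (`title`, `question`, `answers`). The rows are invented for illustration, the `answers` layout is an assumed SQuAD-style dict, and `clauses_found` is a hypothetical helper rather than part of any library.

```python
from collections import defaultdict
from typing import Dict, Iterable, List

# Hypothetical rows, invented for illustration; NOT taken from CUAD.
rows = [
    {"title": "EXAMPLE_A", "question": "Governing Law",
     "answers": {"text": ["laws of New York"], "answer_start": [10]}},
    {"title": "EXAMPLE_A", "question": "Cap On Liability",
     "answers": {"text": [], "answer_start": []}},
    {"title": "EXAMPLE_B", "question": "Governing Law",
     "answers": {"text": ["laws of Delaware"], "answer_start": [42]}},
]


def clauses_found(examples: Iterable[Dict]) -> Dict[str, List[str]]:
    """Map each contract title to the questions that have at least one answer span."""
    found = defaultdict(list)
    for ex in examples:
        if ex["answers"]["text"]:  # an empty list means no clause was annotated
            found[ex["title"]].append(ex["question"])
    return dict(found)


summary = clauses_found(rows)
print(summary)
```

If the loaded split yields dict-like rows with these keys, the same function should accept it in place of `rows`.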

Curated mirror of the open-source CUAD dataset (CC-BY-4.0).
