Skip to content
ai-supply.store
探索分类排行榜社区Agent APIFAQ
发布登录
catalog / Legal & Compliance / CUAD — Contract Understanding Atticus Dataset
▣DatasetLegal & ComplianceFree

CUAD — Contract Understanding Atticus Dataset

Expert-labeled dataset of 13,000+ annotations across 510 commercial contracts covering 41 legal clause types for contract review AI.

@ai-supply
安装量89k
评分★ 4.7
评价30
↗ 源代码仓库

CUAD — Contract Understanding Atticus Dataset

CUAD (Contract Understanding Atticus Dataset) is a large-scale dataset created by The Atticus Project with dozens of legal experts. It contains 13,000+ annotations across 510 real commercial contracts, labeling 41 distinct clause types including parties, payment terms, termination clauses, IP ownership, and liability caps. It is the benchmark dataset for training and evaluating contract review AI systems.

Key Features

  • 510 commercial contracts from EDGAR (SEC filings)
  • 41 clause categories annotated by legal professionals
  • Question-answering format compatible with extractive QA models
  • Benchmark leaderboard for contract understanding research
  • Free for commercial and academic use under CC-BY-4.0

Quick Start

from datasets import load_dataset

dataset = load_dataset("theatticusproject/cuad")
train = dataset["train"]
print(f"Train examples: {len(train)}")
print(train[0]["title"])  # Contract name
print(train[0]["question"])  # Clause type question
print(train[0]["answers"])  # Extracted clause text
npx ai-supply add cuad-contract-understanding-dataset

Curated mirror of the open-source CUAD (CC-BY-4.0). Get it from the source.

More from @ai-supply

View profile →
◐Model
llama.cpp
Pure C/C++ LLM inference library — run quantized models on CPU, Metal, CUDA and more.
↓ 900k★ 4.9
⇄Connector
vLLM
High-throughput, memory-efficient LLM inference engine with PagedAttention and continuous batching.
↓ 820k★ 4.9
◉Agent
MetaGPT
Multi-agent framework that assigns GPT roles (PM, engineer, QA) to solve complex software tasks end-to-end.
↓ 820k★ 4.8
◆Skill
NLTK
The Natural Language Toolkit — Python's foundational NLP library for tokenization, POS tagging, parsing, and corpora.
↓ 760k★ 4.7
ai-supply.store

AI 能力市场。技能、MCP、插件、智能体、数据集——人可发现,机器可消费。

api · v3.1status · all green
联系
support@ai-supply.storesecurity@ai-supply.store
市场
  • 探索
  • 分类
  • 排行榜
  • 基准测试
社区
  • 社区
  • FAQ
面向智能体
  • 快速入门 (60s)
  • 授权智能体
  • Agent API
  • OpenAPI 规范
面向开发者
  • 发布
  • 控制台
  • 收益分成
账户
  • 登录
  • 设置
法律条款
  • 条款
  • 发布者协议
  • 可接受使用政策
  • 隐私政策