Skip to content
ai-supply.store
探索分类排行榜社区Agent APIFAQ
发布登录
catalog / DevOps & Infra / DVC
⬡PipelineDevOps & InfraFree

DVC

Git-like version control for ML datasets and pipelines — track experiments, reproduce results, and collaborate on data science projects.

@ai-supply
安装量67k
评分★ 4.6
评价22
↗ 源代码仓库

DVC — Data Version Control

DVC brings Git-style version control to machine learning datasets, models, and pipelines. Define reproducible ML pipelines as code, cache large files in remote storage (S3, GCS, Azure, SSH), and track every experiment with lightweight metafiles committed to Git.

Key features

  • Data versioning — track large files and directories without bloating your Git repo
  • Pipeline DAGs — define stages with dvc.yaml; DVC caches and only re-runs changed stages
  • Experiment tracking — dvc exp run + dvc exp show for a clean experiment table
  • Remote storage — S3, GCS, Azure Blob, SSH, HDFS, and local remotes
  • CI/CD integration — dvc repro in GitHub Actions for reproducible ML pipelines
  • Python API — use programmatically in notebooks or scripts

Quick start

npx ai-supply add dvc-ml-pipeline-versioning

# Or install directly
pip install dvc

# Initialize in a Git repo
git init my-project && cd my-project
dvc init

# Track a dataset
dvc add data/train.csv
git add data/train.csv.dvc .gitignore
git commit -m "Track training data with DVC"

# Define a pipeline stage
dvc run -n train \
  -d data/train.csv -d src/train.py \
  -o model.pkl \
  python src/train.py

# Reproduce the pipeline
dvc repro

Curated mirror of the open-source DVC project (Apache-2.0). Install upstream from the repository.

More from @ai-supply

View profile →
◐Model
llama.cpp
Pure C/C++ LLM inference library — run quantized models on CPU, Metal, CUDA and more.
↓ 900k★ 4.9
⇄Connector
vLLM
High-throughput, memory-efficient LLM inference engine with PagedAttention and continuous batching.
↓ 820k★ 4.9
◉Agent
MetaGPT
Multi-agent framework that assigns GPT roles (PM, engineer, QA) to solve complex software tasks end-to-end.
↓ 820k★ 4.8
◆Skill
NLTK
The Natural Language Toolkit — Python's foundational NLP library for tokenization, POS tagging, parsing, and corpora.
↓ 760k★ 4.7
ai-supply.store

AI 能力市场。技能、MCP、插件、智能体、数据集——人可发现,机器可消费。

api · v3.1status · all green
联系
support@ai-supply.storesecurity@ai-supply.store
市场
  • 探索
  • 分类
  • 排行榜
  • 基准测试
社区
  • 社区
  • FAQ
面向智能体
  • 快速入门 (60s)
  • 授权智能体
  • Agent API
  • OpenAPI 规范
面向开发者
  • 发布
  • 控制台
  • 收益分成
账户
  • 登录
  • 设置
法律条款
  • 条款
  • 发布者协议
  • 可接受使用政策
  • 隐私政策