Skip to content
ai-supply.store
DiscoverCategoriesLeaderboardsCommunityAgent APIFAQ
PublishSign in
catalog / DevOps & Infra / DVC
⬡PipelineDevOps & InfraFree

DVC

Git-like version control for ML datasets and pipelines — track experiments, reproduce results, and collaborate on data science projects.

@ai-supply
Installs67k
Rating★ 4.6
Reviews22
Install (free) to download the source.↗ Source repository

DVC — Data Version Control

DVC brings Git-style version control to machine learning datasets, models, and pipelines. Define reproducible ML pipelines as code, cache large files in remote storage (S3, GCS, Azure, SSH), and track every experiment with lightweight metafiles committed to Git.

Key features

  • Data versioning — track large files and directories without bloating your Git repo
  • Pipeline DAGs — define stages with dvc.yaml; DVC caches and only re-runs changed stages
  • Experiment tracking — dvc exp run + dvc exp show for a clean experiment table
  • Remote storage — S3, GCS, Azure Blob, SSH, HDFS, and local remotes
  • CI/CD integration — dvc repro in GitHub Actions for reproducible ML pipelines
  • Python API — use programmatically in notebooks or scripts

Quick start

npx ai-supply add dvc-ml-pipeline-versioning

# Or install directly
pip install dvc

# Initialize in a Git repo
git init my-project && cd my-project
dvc init

# Track a dataset
dvc add data/train.csv
git add data/train.csv.dvc .gitignore
git commit -m "Track training data with DVC"

# Define a pipeline stage
dvc run -n train \
  -d data/train.csv -d src/train.py \
  -o model.pkl \
  python src/train.py

# Reproduce the pipeline
dvc repro

Curated mirror of the open-source DVC project (Apache-2.0). Install upstream from the repository.

More from @ai-supply

View profile →
◆Skill
OpenCV Python
The world's most popular computer vision library with Python bindings — image processing, video, and ML pipelines.
↓ 500k★ 4.9
◐Model
timm (PyTorch Image Models)
The largest collection of pretrained image models for PyTorch — ViT, ConvNeXt, EfficientNet, Swin, and 900+ more.
↓ 490k★ 4.9
⌬Workflow
Apache Airflow
Apache-2.0 workflow orchestration platform — define, schedule, and monitor data and AI pipelines as Python DAGs.
↓ 395k★ 4.7
◐Model
Segment Anything Model (SAM)
Meta AI's promptable image segmentation model that can segment any object from a single click or bounding box.
↓ 320k★ 4.9
ai-supply.store

The marketplace for AI capabilities. Skills, MCPs, plugins, agents, datasets — discoverable by humans, consumable by machines.

api · v3.1status · all green
Marketplace
  • Discover
  • Categories
  • Leaderboards
  • Benchmarks
Community
  • Community
  • FAQ
For agents
  • Quickstart (60s)
  • Authorize an agent
  • Agent API
  • OpenAPI spec
For builders
  • Publish
  • Dashboard
  • Revenue share
Account
  • Sign in
  • Settings
Legal
  • Terms
  • Publisher Agreement
  • Acceptable Use
  • Privacy