Skip to content
ai-supply.store
DiscoverCategoriesLeaderboardsCommunityAgent APIFAQ
PublishSign in
catalog / Vision & Image / MediaPipe — Cross-Platform ML for Live & Streaming Media
⬡PipelineVision & ImageFree

MediaPipe — Cross-Platform ML for Live & Streaming Media

Google's on-device ML framework for face, hand, pose, and object detection across mobile, desktop, web, and edge.

@ai-supply
Installs350k
Rating★ 4.7
Reviews117
↗ Source repository

MediaPipe

MediaPipe is Google's cross-platform, customisable ML framework for live and streaming media pipelines. It ships optimised, pre-built solutions for the most common computer vision tasks and runs efficiently on-device without a server round-trip.

Key Features

  • Pre-built solutions: face detection, face mesh, hand tracking, pose estimation, holistic, object detection, image segmentation, text classification
  • Targets: Android, iOS, desktop (Linux/macOS/Windows), web (WebAssembly), Edge TPU
  • Python, Java, Swift, Objective-C, JavaScript, and C++ APIs
  • LiteRT (TFLite) runtime for low-latency inference
  • MediaPipe Tasks: new unified API for model-agnostic inference
  • Model Maker: fine-tune built-in solutions on custom data with a few lines of code

Quick Start

import mediapipe as mp
import cv2

mp_hands = mp.solutions.hands
hands = mp_hands.Hands(static_image_mode=False, max_num_hands=2)

cap = cv2.VideoCapture(0)
while cap.isOpened():
    ret, frame = cap.read()
    result = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
    if result.multi_hand_landmarks:
        print(f"{len(result.multi_hand_landmarks)} hand(s) detected")

Install via ai-supply

npx ai-supply add mediapipe-cross-platform-ml-solutions

Curated mirror of the open-source MediaPipe (Apache-2.0). Get it from the source.

More from @ai-supply

View profile →
◐Model
llama.cpp
Pure C/C++ LLM inference library — run quantized models on CPU, Metal, CUDA and more.
↓ 900k★ 4.9
⇄Connector
vLLM
High-throughput, memory-efficient LLM inference engine with PagedAttention and continuous batching.
↓ 820k★ 4.9
⠿Embedding
Sentence Transformers
State-of-the-art sentence and text embeddings — compute semantic similarity, clustering, and dense retrieval.
↓ 750k★ 4.9
⬡Pipeline
Diffusers
Hugging Face's state-of-the-art library for diffusion-based image, video, and audio generation models.
↓ 750k★ 4.9
ai-supply.store

The marketplace for AI capabilities. Skills, MCPs, plugins, agents, datasets — discoverable by humans, consumable by machines.

api · v3.1status · all green
Marketplace
  • Discover
  • Categories
  • Leaderboards
  • Benchmarks
Community
  • Community
  • FAQ
For agents
  • Quickstart (60s)
  • Authorize an agent
  • Agent API
  • OpenAPI spec
For builders
  • Publish
  • Dashboard
  • Revenue share
Account
  • Sign in
  • Settings
Legal
  • Terms
  • Publisher Agreement
  • Acceptable Use
  • Privacy