ai-supply.store
Connector · DevOps & Infra · Free

BentoML

Build, ship, and scale AI services: a unified framework from local development to production Kubernetes.

Publisher: @ai-supply
Installs: 230k
Rating: ★ 4.7
Reviews: 77

BentoML is an open-source, unified model-serving framework for building AI services with any ML framework and deploying them on any infrastructure. It covers the full lifecycle, from packaging models into reproducible Bentos to autoscaling Kubernetes deployments with adaptive batching.

Key Features

  • Framework agnostic: PyTorch, TensorFlow, Keras, XGBoost, scikit-learn, LLMs, diffusion models
  • Adaptive micro-batching: automatically batch requests for optimal GPU throughput
  • Runners API: modular service composition with independent scaling
  • Bento packaging: reproducible bundles with model, code, dependencies, Dockerfile
  • BentoCloud integration: one-command deployment to managed inference infrastructure
  • Built-in OpenTelemetry, Prometheus metrics, and gRPC support
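
To make the adaptive micro-batching bullet concrete, here is a minimal, framework-free sketch of the idea: concurrent requests are held briefly and handed to the model as one batch. This is not BentoML's implementation. The `MicroBatcher` class, its `max_batch_size` and `max_wait_s` parameters, and the `fake_model` function are all hypothetical names chosen for illustration, and the fixed wait window here is simpler than a truly adaptive one.

```python
import asyncio
from typing import Any, Callable, List, Optional


class MicroBatcher:
    """Toy micro-batcher (illustrative only, not BentoML code)."""

    def __init__(self, batch_fn: Callable[[List[Any]], List[Any]],
                 max_batch_size: int = 8, max_wait_s: float = 0.01) -> None:
        self.batch_fn = batch_fn            # called once per batch
        self.max_batch_size = max_batch_size
        self.max_wait_s = max_wait_s        # how long to wait for more requests
        self._queue: Optional[asyncio.Queue] = None
        self._worker: Optional[asyncio.Task] = None

    async def submit(self, item: Any) -> Any:
        """Queue one request and wait for its individual result."""
        if self._worker is None:
            # Create the queue and worker lazily, inside the running event loop.
            self._queue = asyncio.Queue()
            self._worker = asyncio.create_task(self._run())
        future = asyncio.get_running_loop().create_future()
        self._queue.put_nowait((item, future))
        return await future

    async def _run(self) -> None:
        loop = asyncio.get_running_loop()
        while True:
            # Block until the first request arrives, then open a short window.
            item, future = await self._queue.get()
            items, futures = [item], [future]
            deadline = loop.time() + self.max_wait_s
            while len(items) < self.max_batch_size:
                remaining = deadline - loop.time()
                if remaining <= 0:
                    break
                try:
                    item, future = await asyncio.wait_for(self._queue.get(), remaining)
                except asyncio.TimeoutError:
                    break
                items.append(item)
                futures.append(future)
            try:
                # One model call for the whole batch; results map back by position.
                for fut, result in zip(futures, self.batch_fn(items)):
                    fut.set_result(result)
            except Exception as exc:
                for fut in futures:
                    if not fut.done():
                        fut.set_exception(exc)

    def close(self) -> None:
        """Stop the background worker."""
        if self._worker is not None:
            self._worker.cancel()


async def demo():
    batch_sizes = []

    def fake_model(texts: List[str]) -> List[str]:
        # Stand-in for a real model: records the batch size and upper-cases input.
        batch_sizes.append(len(texts))
        return [t.upper() for t in texts]

    batcher = MicroBatcher(fake_model, max_batch_size=4, max_wait_s=0.05)
    results = await asyncio.gather(*(batcher.submit(t) for t in ["a", "b", "c", "d", "e"]))
    batcher.close()
    return results, batch_sizes


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

The trade-off is latency for throughput: a larger `max_wait_s` gives the batch more time to fill, while a smaller one returns results sooner. In BentoML itself, batching is configured on the API definition rather than hand-written like this; consult the upstream documentation for the exact options.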

Quick Start

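import bentoml

@bentoml.service
class SentimentAnalyzer:
    # Reference to the stored model; "sentiment:latest" must already be in the local model store.
    model_ref = bentoml.models.get("sentiment:latest")

    def __init__(self) -> None:
        # bentoml.models.get() returns a model reference, not a predictor, so load it first.
        # Assumption: the model was saved with bentoml.sklearn.save_model; use the loader for your framework.
        self.model = bentoml.sklearn.load_model(self.model_ref)

    @bentoml.api
    def classify(self, text: str) -> str:
        # predict() takes a batch, so wrap the single input and unwrap the single result.
        return str(self.model.predict([text])[0])

# Serve locally (assumes the class above is saved as sentiment_service.py)
bentoml serve sentiment_service:SentimentAnalyzer

# Build a Bento, then containerize it. Pass the Bento tag that `bentoml build` prints;
# sentiment:latest assumes the Bento itself is named "sentiment".
bentoml build && bentoml containerize sentiment:latest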
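
Once the service is running locally, it can be called over HTTP. The sketch below uses only the Python standard library and rests on two assumptions that are not stated above: that `bentoml serve` listens on `http://localhost:3000`, and that the `classify` method is exposed at `POST /classify` with a JSON body keyed by parameter name. The helper names `build_classify_request` and `classify` are hypothetical.

```python
import json
import urllib.request

# Assumed address of the local `bentoml serve` process; adjust to your setup.
BASE_URL = "http://localhost:3000"


def build_classify_request(text: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) the HTTP request for the classify endpoint."""
    # Assumption: the JSON key matches the API method's parameter name, `text`.
    body = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/classify",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def classify(text: str, base_url: str = BASE_URL, timeout: float = 10.0) -> str:
    """Send the request and return the response body as text."""
    request = build_classify_request(text, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

With the service up, `classify("I love this product")` should return whatever label the model predicts. Separating request construction from sending keeps the payload shape easy to inspect without a running server.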

Install via ai-supply

npx ai-supply add bentoml-model-serving-framework

Curated mirror of the open-source BentoML (Apache-2.0). Get it from the source.
