Skip to content
ai-supply.store
DiscoverCategoriesLeaderboardsCommunityAgent APIFAQ
PublishSign in
catalog / Language & NLP / llama.cpp
◐ModelLanguage & NLPFree

llama.cpp

Pure C/C++ LLM inference library — run quantized models on CPU, Metal, CUDA and more.

@ai-supply
Installs900k
Rating★ 4.9
Reviews300
↗ Source repository

llama.cpp

llama.cpp is a pure C/C++ port of Meta's LLaMA model inference, designed for maximum portability and performance across a wide variety of hardware — from MacBook laptops to cloud GPUs. It pioneered 4-bit quantization (GGUF format) that makes running large language models on consumer hardware practical.

Key Features

  • GGUF format: the community standard for quantized LLM weights (4-bit, 5-bit, 8-bit, etc.)
  • Cross-platform: macOS (Metal), Linux, Windows, iOS, Android, WebAssembly
  • Multi-backend: CPU, CUDA, ROCm, Vulkan, OpenCL, SYCL
  • OpenAI-compatible server built-in (llama-server)
  • Python bindings via llama-cpp-python
  • Supports Llama, Mistral, Phi, Gemma, Qwen, Falcon, Starcoder, and dozens more

Quick Start

# Build
git clone https://github.com/ggml-org/llama.cpp && cd llama.cpp
cmake -B build && cmake --build build --config Release -j

# Run inference
./build/bin/llama-cli -m model.gguf -p "Tell me about AI:"

# Or use the Python wrapper
pip install llama-cpp-python

Install via ai-supply

npx ai-supply add llama-cpp-cpu-inference

Curated mirror of the open-source llama.cpp (MIT). Get it from the source.

More from @ai-supply

View profile →
⇄Connector
vLLM
High-throughput, memory-efficient LLM inference engine with PagedAttention and continuous batching.
↓ 820k★ 4.9
⠿Embedding
Sentence Transformers
State-of-the-art sentence and text embeddings — compute semantic similarity, clustering, and dense retrieval.
↓ 750k★ 4.9
⬡Pipeline
Diffusers
Hugging Face's state-of-the-art library for diffusion-based image, video, and audio generation models.
↓ 750k★ 4.9
△Eval
MLflow
End-to-end ML lifecycle platform — experiment tracking, model registry, serving, and LLM evaluation.
↓ 730k★ 4.8
ai-supply.store

The marketplace for AI capabilities. Skills, MCPs, plugins, agents, datasets — discoverable by humans, consumable by machines.

api · v3.1status · all green
Marketplace
  • Discover
  • Categories
  • Leaderboards
  • Benchmarks
Community
  • Community
  • FAQ
For agents
  • Quickstart (60s)
  • Authorize an agent
  • Agent API
  • OpenAPI spec
For builders
  • Publish
  • Dashboard
  • Revenue share
Account
  • Sign in
  • Settings
Legal
  • Terms
  • Publisher Agreement
  • Acceptable Use
  • Privacy