
Mistral-7B-v0.1

Apache-licensed 7B language model from Mistral AI — beats Llama 2 13B on most benchmarks at half the size.

Installs: 320k
Rating: ★ 4.8
Reviews: 107
Source repository


Mistral-7B-v0.1 is a 7-billion-parameter transformer language model released by Mistral AI under the Apache 2.0 license. Despite its compact size, it outperforms Llama 2 13B on standard reasoning, math, and code benchmarks, which makes it a popular open-weight base model for fine-tuning, RAG, and on-premise inference. It is a pretrained base model, not an instruction-tuned one.

Key features

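  • Grouped-Query Attention (GQA): fewer key/value heads than query heads, giving faster inference and a smaller KV cache
  • Sliding Window Attention (SWA): each token attends to a fixed window of recent tokens, which lowers the cost of long sequences
  • 4,096-token attention window, trained with an 8k context length (the released config sets max_position_embeddings to 32,768)
  • 4-bit GPTQ and AWQ quantized variants published by the community on Hugging Face
  • Outperforms Llama 2 13B on MMLU, HumanEval, HellaSwag, and ARC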
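
To make the two attention features above more concrete, here is a minimal sketch in plain Python. It builds a causal sliding-window mask and estimates KV-cache size with and without GQA. The architecture numbers (32 layers, 32 query heads, 8 key/value heads, head dimension 128, window 4,096) are assumptions taken from the publicly released model configuration, not from this page, and the function names are illustrative only. The exact window boundary convention varies between implementations, and the windowed cache figure assumes a runtime that actually keeps a rolling buffer.

```python
GIB = 1024 ** 3


def sliding_window_mask(seq_len, window):
    """mask[i][j] is True when query position i may attend to key position j.

    Causal (j <= i) and limited to the most recent `window` positions.
    """
    return [[(j <= i) and (i - j < window) for j in range(seq_len)]
            for i in range(seq_len)]


def kv_cache_bytes(seq_len, n_layers=32, n_kv_heads=8, head_dim=128,
                   bytes_per_value=2, window=None):
    """Approximate KV-cache size in bytes for a single sequence.

    Defaults are assumed Mistral-7B-style values; bytes_per_value=2
    corresponds to float16 or bfloat16 storage.
    """
    # With a rolling buffer, at most `window` tokens are kept per layer.
    cached_tokens = seq_len if window is None else min(seq_len, window)
    # Factor 2: one tensor for keys and one for values in every layer.
    return 2 * n_layers * n_kv_heads * head_dim * bytes_per_value * cached_tokens


if __name__ == "__main__":
    # Tiny mask so the pattern is visible: 5 tokens, window of 3.
    for row in sliding_window_mask(5, 3):
        print("".join("x" if allowed else "." for allowed in row))

    seq_len = 8192
    full_mha = kv_cache_bytes(seq_len, n_kv_heads=32)       # one KV head per query head
    gqa = kv_cache_bytes(seq_len)                           # 8 shared KV heads
    gqa_window = kv_cache_bytes(seq_len, window=4096)       # GQA plus rolling buffer
    print(f"MHA: {full_mha / GIB:.2f} GiB")
    print(f"GQA: {gqa / GIB:.2f} GiB")
    print(f"GQA + window: {gqa_window / GIB:.2f} GiB")
```

Under these assumptions, an 8,192-token sequence needs about 4 GiB of cache with one key/value head per query head, 1 GiB with 8 shared key/value heads, and 0.5 GiB if the cache is additionally capped at the window size.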

Quick start (transformers)

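import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "mistralai/Mistral-7B-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
# device_map="auto" needs the accelerate package; bfloat16 halves weight memory versus float32
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, device_map="auto")

# Base model, not instruction-tuned: the prompt is continued rather than answered
prompt = "Explain transformers in one paragraph:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
# The tokenizer defines no pad token, so reuse EOS to avoid a warning from generate()
output = model.generate(**inputs, max_new_tokens=200, pad_token_id=tokenizer.eos_token_id)
print(tokenizer.decode(output[0], skip_special_tokens=True))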
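
Before loading the model, it can help to estimate how much memory the weights alone will need at a given precision. The sketch below is back-of-the-envelope arithmetic, not a measurement: it uses the nominal 7 billion parameter count from the description above (the exact count differs slightly), ignores activations, KV cache, and quantization overhead, and the helper name `weight_memory_gb` is hypothetical.

```python
# Approximate storage cost per parameter for common precisions.
BYTES_PER_PARAM = {
    "float32": 4.0,
    "float16/bfloat16": 2.0,
    "int8": 1.0,
    "4-bit": 0.5,
}

# Nominal parameter count taken from the model description (assumption).
N_PARAMS = 7e9


def weight_memory_gb(n_params, bytes_per_param):
    """Weights-only memory in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bytes_per_param / 1e9


if __name__ == "__main__":
    for name, size in BYTES_PER_PARAM.items():
        print(f"{name:>18}: ~{weight_memory_gb(N_PARAMS, size):.1f} GB")
```

By this estimate the weights take roughly 28 GB in float32, 14 GB in half precision, and 3.5 GB at 4 bits, which is why half precision or a quantized variant is the usual choice on a single consumer GPU.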

Install via ai-supply

npx ai-supply add mistral-7b-v01

Curated mirror of the open-source Mistral-7B-v0.1 (Apache-2.0). Get it from the source.
