目录
浏览市场
CategoryAll Cybersecurity Coding Finance Agentic capability Marketing Orchestration Data & ETL Research Vision & Image Audio & Speech Language & NLP DevOps & Infra Robotics & Control Healthcare Legal & Compliance Gaming & Simulation
6 results
⊜微调
DeepSpeed
Microsoft's deep learning optimization library — train trillion-parameter models with ZeRO memory parallelism.
ai-supply
↓ 420k★ 4.7
⊜微调
bitsandbytes
8-bit and 4-bit quantization for PyTorch — run and fine-tune LLMs on consumer GPUs with minimal quality loss.
ai-supply
↓ 390k★ 4.7
⊜微调
TRL (Transformer Reinforcement Learning)
Fine-tune LLMs with RLHF, PPO, DPO, SFT, and GRPO — the standard library for aligning language models.
ai-supply
↓ 380k★ 4.8
⊜微调
LLaMA-Factory
Unified fine-tuning framework for 100+ LLMs — SFT, RLHF, DPO, LoRA, QLoRA via Web UI or CLI.
ai-supply
↓ 340k★ 4.8
⊜微调
PEFT
Hugging Face library for parameter-efficient fine-tuning — LoRA, QLoRA, IA3, and more. Fine-tune billion-parameter models on a single consumer GPU.
ai-supply
↓ 98k★ 4.8
⊜微调
Axolotl
Apache-2.0 fine-tuning framework — train any HuggingFace model with LoRA/QLoRA/full-fine-tune via a single YAML config.
ai-supply
↓ 93k★ 4.7