Pipeline · Audio & Speech · Free

SpeechBrain

All-in-one conversational AI toolkit for ASR, speaker recognition, speech enhancement, and language identification.

@ai-supply
Installs: 67k
Rating: ★ 4.7
Reviews: 22
↗ Source repository

SpeechBrain is an open-source, all-in-one conversational AI platform developed at Mila and Université de Montréal. A single, modular codebase covers automatic speech recognition, speaker recognition and diarisation, speech enhancement and separation, language identification, and spoken language understanding.

Key Features

  • 200+ pretrained models on the Hugging Face Hub across all speech tasks
  • Modular Brain class: compose any pipeline from reusable blocks
  • State-of-the-art ASR with Transformer, Conformer, and hybrid CTC/attention
  • Speaker verification and identification (ECAPA-TDNN, x-vectors)
  • Speech enhancement and separation: MetricGAN+, SEGAN, and ConvTasNet
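
To make the speaker verification feature more concrete, here is a minimal sketch of the scoring idea behind embedding-based verification: two utterances are mapped to fixed-size embeddings (for example by an ECAPA-TDNN encoder) and compared by cosine similarity. This is not SpeechBrain's own code. The function names and the 0.25 threshold are assumptions chosen for illustration, and the embeddings are plain Python lists rather than model outputs.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def same_speaker(
    emb_a: Sequence[float],
    emb_b: Sequence[float],
    threshold: float = 0.25,  # hypothetical value; tune on held-out trials
) -> bool:
    """Accept the pair as the same speaker if the score exceeds the threshold."""
    return cosine_similarity(emb_a, emb_b) > threshold
```

In practice the threshold would be calibrated on a development set, since the right operating point depends on the encoder and on the acceptable false-accept rate.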

Quick Start

# Install from PyPI (shell)
pip install speechbrain

# Transcribe a file with a pretrained model (Python)
from speechbrain.inference.ASR import EncoderDecoderASR

# Fetches the model from the Hugging Face Hub and caches it under savedir
asr_model = EncoderDecoderASR.from_hparams(
    source="speechbrain/asr-conformer-transformerlm-librispeech",
    savedir="pretrained_models/asr-conformer-transformerlm-librispeech",
)
result = asr_model.transcribe_file("audio.wav")
print(result)

# Or install through the ai-supply CLI (shell)
npx ai-supply add speechbrain-audio-toolkit
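
The single-file call in the Quick Start extends naturally to several files. The sketch below is one possible wrapper: `transcribe_many` is a hypothetical helper, not part of SpeechBrain, and it accepts any callable that maps a file path to text, so it can be tried without downloading a model.

```python
from typing import Callable, Dict, Iterable


def transcribe_many(
    paths: Iterable[str],
    transcribe: Callable[[str], str],
) -> Dict[str, str]:
    """Run a path -> text callable over several files.

    Returns a dict mapping each path to its whitespace-trimmed transcript,
    in the order the paths were given.
    """
    results: Dict[str, str] = {}
    for path in paths:
        results[path] = transcribe(path).strip()
    return results
```

With the model from the Quick Start loaded, a call might look like `transcribe_many(["audio.wav"], asr_model.transcribe_file)`.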

Curated mirror of the open-source SpeechBrain (Apache-2.0). Get it from the source.
