Skip to content
ai-supply.store
탐색카테고리리더보드커뮤니티Agent APIFAQ
로그인무료 가입
catalog / Language & NLP / Detoxify
⛨GuardrailLanguage & NLPFree

Detoxify

Pretrained PyTorch models that score text for toxicity, threats, insults, and identity attacks — a drop-in content-moderation guardrail.

@ai-supply
설치 수9.0k
↗ 소스 저장소

Detoxify — toxic comment & content moderation classifier

Detoxify provides trained PyTorch models to detect toxic, obscene, threatening, insulting, and identity-attacking language — the models that placed in Jigsaw's three Toxic Comment Challenges. It works as a drop-in content-moderation guardrail for user- or model-generated text.

Key features

  • Three pretrained checkpoints: original, unbiased (bias-mitigated), and multilingual (7 languages)
  • Per-label probability scores for toxicity, severe toxicity, obscene, threat, insult, and identity attack
  • One-line inference API built on Hugging Face Transformers and PyTorch Lightning
  • The unbiased model is trained to reduce unintended identity bias for fairer moderation
  • Lightweight enough to run inline as an input/output filter for chat and comment pipelines

Detoxify gives you a fast, self-hostable toxicity gate you can place ahead of or behind an LLM, returning granular scores instead of a single opaque yes/no verdict.

Curated mirror of the open-source Detoxify (Apache-2.0). Get it from the source.

More from @ai-supply

View profile →
◇MCP server
GitHub MCP Server
Official GitHub MCP server — give your AI agent full read/write access to repos, issues, PRs, and actions.
↓ 771k
⠿Embedding
Sentence Transformers
State-of-the-art sentence and text embeddings — compute semantic similarity, clustering, and dense retrieval.
↓ 751k
◆Skill
NLTK
The Natural Language Toolkit — Python's foundational NLP library for tokenization, POS tagging, parsing, and corpora.
↓ 641k
◇MCP server
MCP TypeScript SDK
Official TypeScript/JavaScript SDK for building MCP servers and clients — the Node.js foundation for the Model Context Protocol.
↓ 629k
ai-supply.store

무료로 제공하는 보안 검증 AI 역량 — skill, MCP, plugin, agent, 데이터셋을 비롯한 모든 항목에 보안 점수를 매기고 최신성을 추적하며, 사람과 agent 모두를 위해 만들었습니다.

api · v3.1status · all green
문의하기
support@ai-supply.storesecurity@ai-supply.store
카탈로그
  • 탐색
  • 카테고리
  • 리더보드
  • 벤치마크
  • 보안
커뮤니티
  • 커뮤니티
  • FAQ
에이전트용
  • 빠른 시작 (60s)
  • 에이전트 승인
  • Agent API
  • OpenAPI 사양
빌더용
  • 게시
  • 대시보드
계정
  • 계정 만들기
  • 로그인
  • 설정
법적 정보
  • 이용약관
  • 게시자 계약
  • 이용 정책
  • 개인정보 처리방침