Skip to content
ai-supply.store
DiscoverCategoriesLeaderboardsCommunityAgent APIFAQ
PublishSign in
catalog / Marketing / Sumy — Automatic Text Summarization
⬡PipelineMarketingFree

Sumy — Automatic Text Summarization

Python library with 7 summarization algorithms (LSA, Luhn, Lex Rank, TextRank) for documents and HTML pages.

@ai-supply
Installs58k
Rating★ 4.6
Reviews19
↗ Source repository

Sumy — Automatic Text Summarization

Sumy is a Python module and CLI for extractive text summarization, implementing seven proven algorithms: LSA, Luhn, Edmundson, Lex Rank, TextRank, SumBasic, and KL-Sum. Works on raw text, HTML, or plain URLs — no LLM required.

Key features

  • 7 summarization algorithms; easily swap to compare quality
  • Supports HTML page input (strips boilerplate automatically)
  • Multi-language support via NLTK tokenizers (30+ languages)
  • CLI for quick prototyping; Python API for pipelines
  • Zero API calls — fully local, no rate limits or cost

Quick start

pip install sumy
# Summarize a URL with LexRank in 5 sentences
sumy lex-rank --url https://en.wikipedia.org/wiki/Artificial_intelligence --sentences 5
from sumy.parsers.html import HtmlParser
from sumy.nlp.tokenizers import Tokenizer
from sumy.summarizers.lex_rank import LexRankSummarizer

parser = HtmlParser.from_url("https://example.com/article", Tokenizer("english"))
summarizer = LexRankSummarizer()
for sentence in summarizer(parser.document, sentences_count=5):
    print(sentence)
npx ai-supply add sumy-text-summarization

Curated mirror of the open-source Sumy (Apache-2.0). Get it from the source.

More from @ai-supply

View profile →
◐Model
llama.cpp
Pure C/C++ LLM inference library — run quantized models on CPU, Metal, CUDA and more.
↓ 900k★ 4.9
⇄Connector
vLLM
High-throughput, memory-efficient LLM inference engine with PagedAttention and continuous batching.
↓ 820k★ 4.9
◉Agent
MetaGPT
Multi-agent framework that assigns GPT roles (PM, engineer, QA) to solve complex software tasks end-to-end.
↓ 820k★ 4.8
◆Skill
NLTK
The Natural Language Toolkit — Python's foundational NLP library for tokenization, POS tagging, parsing, and corpora.
↓ 760k★ 4.7
ai-supply.store

The marketplace for AI capabilities. Skills, MCPs, plugins, agents, datasets — discoverable by humans, consumable by machines.

api · v3.1status · all green
Marketplace
  • Discover
  • Categories
  • Leaderboards
  • Benchmarks
Community
  • Community
  • FAQ
For agents
  • Quickstart (60s)
  • Authorize an agent
  • Agent API
  • OpenAPI spec
For builders
  • Publish
  • Dashboard
  • Revenue share
Account
  • Sign in
  • Settings
Legal
  • Terms
  • Publisher Agreement
  • Acceptable Use
  • Privacy