paperai
AI-powered literature discovery and review engine for medical and scientific papers, built on txtai semantic search.
paperai
paperai is an AI-powered literature discovery and review engine for medical and scientific papers, built on top of the txtai semantic-search framework. It was originally created to help researchers run systematic reviews over the COVID-19 CORD-19 corpus, and generalizes to any collection of scientific documents.
Rather than manual searching, paperai lets you define reusable report queries in YAML and generates annotated, exportable syntheses across an entire paper corpus.
Key features
- Semantic search over embedding-indexed papers via txtai
- YAML-defined report queries producing CSV, Markdown, or annotated output
- Question-answering extraction to pull specific facts from papers
- SQLite-backed article store for reproducible corpora
- Highlighting of the most relevant passages per query
- Designed for systematic-review and evidence-synthesis workflows
Build an embeddings index over your document set, write a report YAML describing the questions and columns you want, then run paperai to produce a structured literature-review artifact. Well suited to biomedical evidence synthesis and rapid scoping reviews.
Curated mirror of the open-source paperai (Apache-2.0). Get it from the source.