arxiv-sanity-lite
Self-hostable web app to tag arXiv papers and get recommendations of similar papers using SVMs over tf-idf abstract features.
arxiv-sanity-lite
arxiv-sanity-lite is Andrej Karpathy's lightweight rewrite of the original arxiv-sanity project: a self-hostable web application that helps researchers keep up with the flood of new arXiv preprints. You tag papers you find interesting, and the app recommends similar papers by training a support-vector machine over tf-idf features derived from paper abstracts.
It is deliberately minimal and easy to deploy, making it a great template for a personalized research-survey and paper-tracking tool.
Key features
- Tag-based personal library of arXiv papers
- SVM-over-tf-idf recommendations of similar papers per tag
- Keyword search across fetched abstracts
- Optional email digests of new relevant papers
- Lightweight Flask + SQLite backend, easy to self-host
- Pulls fresh listings directly from the arXiv API
Fetch recent papers via the included script, compute the tf-idf features, and launch the Flask server to browse, tag, and receive recommendations. A practical starting point for anyone building a domain-specific literature-monitoring dashboard.
Curated mirror of the open-source arxiv-sanity-lite (MIT). Get it from the source.