ArXiv AutoSumm

Automated research paper summarization from ArXiv with LLM-powered rating, multi-format delivery, and comprehensive configuration management.

English | 中文

Quick Start: GitHub Actions

The fastest way to get started is with GitHub Actions, which automates the entire pipeline. This method reads its configuration from repository secrets, so you do not need to commit any configuration files.

Prerequisites:

A GitHub account
API keys for your chosen LLM providers
An email account with SMTP access for sending summaries

Steps

Fork the Repository

Click the "Fork" button at the top-right of this page to create a copy of this repository in your own GitHub account.

Configure Repository Secrets

Navigate to your forked repository's Settings > Secrets and variables > Actions. Add the following secrets to configure the pipeline. Only SUMMARIZER_API_KEY, RATER_API_KEY, SMTP_SERVER, SENDER_EMAIL, RECIPIENT_EMAIL, and SMTP_PASSWORD are strictly required.

Secret	Required	Description
`SUMMARIZER_API_KEY`	✅	API key for the summarization LLM provider (`modelscope` by default).
`RATER_API_KEY`	✅	API key for the rating LLM provider (`modelscope` by default).
`SMTP_SERVER`	✅	Your email provider's SMTP server (e.g., `smtp.163.com`).
`SENDER_EMAIL`	✅	The email address that sends the summaries.
`RECIPIENT_EMAIL`	✅	The email address that receives the summaries.
`SMTP_PASSWORD`	✅	The password or app password for the sender email account.
`ARXIV_CATEGORIES`	❌	Comma-separated ArXiv categories (e.g., `cs.AI,cs.CV`; default: `cs.AI,cs.CV,cs.RO`).
`MAX_PAPERS`	❌	The maximum number of papers to summarize (default: 5).
`SUMMARIZER_PROVIDER`	❌	The LLM provider for summarization (e.g., `openai`; default: `modelscope`).
`RATER_PROVIDER`	❌	The LLM provider for rating (e.g., `anthropic`; default: `modelscope`).
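
To sanity-check a set of secrets before running the workflow, the table above can be expressed as a small Python sketch. The secret names and defaults are taken from the table; the function names `missing_required` and `resolve_settings` are hypothetical and are not part of ArXiv AutoSumm. Treating an empty value as unset is also an assumption about how the workflow behaves.

```python
# Secret names and defaults are copied from the table above.
# The helper functions are illustrative only, not the project's API.
REQUIRED = (
    "SUMMARIZER_API_KEY",
    "RATER_API_KEY",
    "SMTP_SERVER",
    "SENDER_EMAIL",
    "RECIPIENT_EMAIL",
    "SMTP_PASSWORD",
)
OPTIONAL_DEFAULTS = {
    "ARXIV_CATEGORIES": "cs.AI,cs.CV,cs.RO",
    "MAX_PAPERS": "5",
    "SUMMARIZER_PROVIDER": "modelscope",
    "RATER_PROVIDER": "modelscope",
}


def missing_required(env):
    """Return the required secret names that are unset or empty in env."""
    return [name for name in REQUIRED if not env.get(name)]


def resolve_settings(env):
    """Overlay env on the documented defaults; fail if a required secret is missing."""
    missing = missing_required(env)
    if missing:
        raise ValueError("missing required secrets: " + ", ".join(missing))
    settings = {name: env[name] for name in REQUIRED}
    for name, default in OPTIONAL_DEFAULTS.items():
        # Assumption: an empty value is treated the same as an unset one.
        settings[name] = env.get(name) or default
    # Split the comma-separated category string into a clean list.
    settings["ARXIV_CATEGORIES"] = [
        c.strip() for c in settings["ARXIV_CATEGORIES"].split(",") if c.strip()
    ]
    settings["MAX_PAPERS"] = int(settings["MAX_PAPERS"])
    return settings
```

Calling `missing_required(os.environ)` locally (after `import os`) lists any required names that are not yet set in your shell.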

Enable and Run the Workflow
- Go to the Actions tab in your repository.
- If prompted, enable the workflows.
- Select the ArXiv AutoSumm Daily workflow and click Run workflow.

That's it! The workflow will now run on its schedule, delivering summaries to your inbox. For more advanced setups, including local installation and detailed configuration, see our full documentation.

Features

Automated Paper Processing: Fetches, rates, summarizes, and delivers papers daily.
Multiple Output Formats: Supports PDF, HTML, Markdown, and AZW3 (Kindle).
Advanced Caching: Avoids re-processing papers with an SQLite-based cache.
Flexible Rating: Choose between LLM, embedding, or hybrid rating strategies.
VLM Parsing: Optional Vision Language Model support for enhanced PDF analysis.
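
To illustrate the caching idea, here is a minimal sketch of an SQLite-backed cache that skips papers already summarized. The table layout, the `SeenPapers` class, and the `summarize_once` helper are assumptions made for illustration; the project's actual cache schema may differ.

```python
import sqlite3


class SeenPapers:
    """Hypothetical cache keyed by ArXiv ID; not the project's real schema."""

    def __init__(self, path=":memory:"):
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS seen "
            "(arxiv_id TEXT PRIMARY KEY, summary TEXT)"
        )

    def get(self, arxiv_id):
        """Return the cached summary, or None if the paper is not cached."""
        row = self.conn.execute(
            "SELECT summary FROM seen WHERE arxiv_id = ?", (arxiv_id,)
        ).fetchone()
        return row[0] if row else None

    def put(self, arxiv_id, summary):
        """Store or overwrite the summary for a paper."""
        with self.conn:  # commits on success, rolls back on error
            self.conn.execute(
                "INSERT OR REPLACE INTO seen (arxiv_id, summary) VALUES (?, ?)",
                (arxiv_id, summary),
            )


def summarize_once(cache, arxiv_id, summarize):
    """Call summarize(arxiv_id) only when the cache has no entry for it."""
    cached = cache.get(arxiv_id)
    if cached is not None:
        return cached
    summary = summarize(arxiv_id)
    cache.put(arxiv_id, summary)
    return summary
```

Passing a file path instead of `":memory:"` keeps the cache between runs, which is what makes skipping previously processed papers possible.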

Pipeline Overview

The pipeline processes papers in the following sequence:

Fetch: Downloads metadata from ArXiv.
Rate: Selects the most relevant/interesting papers.
Parse: Extracts content from the PDFs.
Summarize: Generates summaries with the configured summarizer LLM.
Render: Creates outputs in your desired formats.
Deliver: Sends the summaries to you via email.
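
The sequence above can be sketched as a single function that chains the six stages. This is an illustration under stated assumptions, not the project's implementation: `run_pipeline` and its callable parameters are hypothetical, and selecting papers by sorting on a numeric score is only one way to realize the Rate step.

```python
from typing import Callable, List


def run_pipeline(
    fetch: Callable[[], List[dict]],
    rate: Callable[[dict], float],
    parse: Callable[[dict], str],
    summarize: Callable[[str], str],
    render: Callable[[List[str]], bytes],
    deliver: Callable[[bytes], None],
    max_papers: int = 5,
) -> List[dict]:
    """Run the six stages in order and return the papers that were selected."""
    papers = fetch()                                    # Fetch: metadata from ArXiv
    ranked = sorted(papers, key=rate, reverse=True)     # Rate: highest score first
    selected = ranked[:max_papers]                      # keep at most max_papers
    texts = [parse(paper) for paper in selected]        # Parse: PDF content
    summaries = [summarize(text) for text in texts]     # Summarize: one per paper
    document = render(summaries)                        # Render: output format
    deliver(document)                                   # Deliver: e.g. by email
    return selected
```

Passing each stage as a callable keeps the sketch independent of any particular LLM provider, parser, or mail client.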

Documentation

Installation Guide: Detailed setup instructions for GitHub Actions and local environments.
Configuration Guide: Comprehensive reference for all configuration options.
Troubleshooting & Q&A: Solutions for common issues and frequently asked questions.

License

This project is licensed under the MIT License - see the LICENSE file for details.
