MCP server that lets LLM clients search arXiv and fetch paper metadata/abstracts (rate-limited to arXiv policy).
https://github.com/andybrandt/mcp-simple-arxiv
Stop switching between browser tabs and your development environment. This MCP server connects your LLM directly to arXiv's entire scientific literature database, letting you search papers, grab abstracts, and download PDFs without leaving your workflow.
You're deep in a coding session when you need to verify a machine learning approach or check the latest research on a specific algorithm. The typical flow breaks your concentration: open browser, search arXiv, scan results, copy-paste relevant details, maybe bookmark the PDF for later. Repeat this 5-10 times per session and you've lost serious momentum.
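This MCP server eliminates the context switching. Your LLM can now search arXiv and fetch paper metadata and abstracts for you, within arXiv's rate limits. Typical uses: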
Research-Driven Development: Building a computer vision model? Ask your LLM to find papers on your specific technique, get the abstracts to understand approaches, then grab the PDFs for implementation details.
Code Documentation: Writing technical documentation that needs citations? Your LLM can search for relevant papers and format proper references without you hunting down paper details manually.
Staying Current: Set up regular queries about your field of interest. Instead of manually checking arXiv daily, your LLM can surface new papers matching your criteria.
Technical Writing: Working on a blog post or paper? Your LLM can fact-check claims against recent literature and suggest relevant citations.
Install via Smithery (recommended):
npx -y @smithery/cli install mcp-simple-arxiv --client claude
Or manually:
pip install mcp-simple-arxiv
Add to your Claude Desktop config:
{
  "mcpServers": {
    "simple-arxiv": {
      "command": "python",
      "args": ["-m", "mcp_simple_arxiv"]
    }
  }
}
That's it. Restart Claude Desktop and you're connected to arXiv's full database.
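If your Claude Desktop config already lists other servers, the new entry has to be merged into the existing "mcpServers" object rather than replacing it. A minimal sketch of that merge follows. The helper name add_simple_arxiv and the "other-server" entry are hypothetical and only for illustration; the "simple-arxiv" entry itself is copied from the snippet above.

```python
import json
from copy import deepcopy

# Server entry copied from the config snippet in this README.
SIMPLE_ARXIV_ENTRY = {
    "command": "python",
    "args": ["-m", "mcp_simple_arxiv"],
}


def add_simple_arxiv(config: dict) -> dict:
    """Return a copy of `config` with the simple-arxiv server registered.

    Hypothetical helper: other entries under "mcpServers" are left untouched,
    and the input dict is not modified.
    """
    merged = deepcopy(config)
    servers = merged.setdefault("mcpServers", {})
    servers["simple-arxiv"] = deepcopy(SIMPLE_ARXIV_ENTRY)
    return merged


if __name__ == "__main__":
    # Hypothetical existing config with one other server already set up.
    existing = {
        "mcpServers": {
            "other-server": {"command": "node", "args": ["server.js"]}
        }
    }
    print(json.dumps(add_simple_arxiv(existing), indent=2))
```

Load your real config with json.load, pass it through the helper, and write the result back with json.dump. The location of the config file depends on your operating system, so it is deliberately left out here.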
Proper Rate Limiting: Follows arXiv's API guidelines so you won't get blocked. Many homebrew solutions ignore this and cause headaches.
Clean Paper Metadata: Returns structured data (title, authors, abstract, publication date, categories) instead of raw HTML you have to parse.
Zero Authentication: Uses arXiv's public API - no API keys or registration required.
Lightweight Dependencies: Just requests and feedparser. No heavy ML libraries or complex dependencies that break your environment.
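To make the features above concrete, here is a rough sketch of what rate-limited, key-free access to arXiv's public Atom API can look like. It is not this server's actual code. The Throttle class, build_query_url, parse_feed, and the output field names are all hypothetical, the 3-second interval is an assumption based on arXiv's published API guidance, and the sketch uses only the standard library rather than the server's own dependencies.

```python
import time
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

ARXIV_API = "http://export.arxiv.org/api/query"  # public endpoint, no API key
ATOM = "{http://www.w3.org/2005/Atom}"  # Atom XML namespace prefix


class Throttle:
    """Enforce a minimum gap between calls (assumed here: 3 seconds)."""

    def __init__(self, min_interval=3.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous call, None before the first

    def wait(self):
        """Block until at least `min_interval` has passed since the last call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


def build_query_url(terms, max_results=5):
    """Build a plain GET URL for a search across all fields."""
    params = {"search_query": f"all:{terms}", "start": 0, "max_results": max_results}
    return f"{ARXIV_API}?{urlencode(params)}"


def _clean(text):
    """Collapse the line breaks and runs of spaces found in feed text."""
    return " ".join((text or "").split())


def parse_feed(xml_text):
    """Turn an Atom feed string into a list of plain dicts (hypothetical schema)."""
    root = ET.fromstring(xml_text)
    papers = []
    for entry in root.iter(f"{ATOM}entry"):
        papers.append({
            "id": _clean(entry.findtext(f"{ATOM}id")),
            "title": _clean(entry.findtext(f"{ATOM}title")),
            "authors": [_clean(a.findtext(f"{ATOM}name"))
                        for a in entry.iter(f"{ATOM}author")],
            "abstract": _clean(entry.findtext(f"{ATOM}summary")),
            "published": _clean(entry.findtext(f"{ATOM}published")),
            "categories": [c.get("term") for c in entry.iter(f"{ATOM}category")],
        })
    return papers


if __name__ == "__main__":
    # Prints the request URL only; no network call is made here.
    print(build_query_url("vision transformer"))
```

In use, you would call Throttle.wait() before every HTTP request, fetch the URL from build_query_url with any HTTP client, and pass the response body to parse_feed. The clock and sleep parameters exist only so the throttle can be tested without real delays.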
The repository has 108 GitHub stars at the time of writing and is actively maintained, so you're not betting on abandonware. It handles the edge cases and gets out of your way.
Perfect for developers building AI applications, researchers who code, or anyone tired of manual paper hunting interrupting their flow.