mcp-simple-arxiv: Turn Your LLM Into a Research Assistant

Stop switching between browser tabs and your development environment. This MCP server connects your LLM directly to arXiv's entire scientific literature database, letting you search papers, grab abstracts, and download PDFs without leaving your workflow.

The Problem with Current Research Workflows

You're deep in a coding session when you need to verify a machine learning approach or check the latest research on a specific algorithm. The typical flow breaks your concentration: open browser, search arXiv, scan results, copy-paste relevant details, maybe bookmark the PDF for later. Repeat this 5-10 times per session and you've lost serious momentum.

Direct arXiv Integration That Actually Works

This MCP server eliminates the context switching. Your LLM can now:

Search arXiv directly - "Find recent papers on transformer architectures" returns actual results with IDs, authors, and publication dates
Fetch complete paper details - Get abstracts, author lists, and direct PDF links without opening a browser
Respect arXiv's API limits - Built-in rate limiting (1 request per 3 seconds) keeps you compliant with arXiv's usage policies

Real-World Use Cases

Research-Driven Development: Building a computer vision model? Ask your LLM to find papers on your specific technique, get the abstracts to understand approaches, then grab the PDFs for implementation details.

Code Documentation: Writing technical documentation that needs citations? Your LLM can search for relevant papers and format proper references without you hunting down paper details manually.

Staying Current: Set up regular queries about your field of interest. Instead of manually checking arXiv daily, your LLM can surface new papers matching your criteria.

Technical Writing: Working on a blog post or paper? Your LLM can fact-check claims against recent literature and suggest relevant citations.

Setup in Under 2 Minutes

Install via Smithery (recommended):

npx -y @smithery/cli install mcp-simple-arxiv --client claude

Or manually:

pip install mcp-simple-arxiv

Add to your Claude Desktop config:

{
  "mcpServers": {
    "simple-arxiv": {
      "command": "python",
      "args": ["-m", "mcp_simple_arxiv"]
    }
  }
}

That's it. Restart Claude Desktop and you're connected to arXiv's full database.

Why This Implementation Wins

Proper Rate Limiting: Follows arXiv's API guidelines so you won't get blocked. Many homebrew solutions ignore this and cause headaches.

Clean Paper Metadata: Returns structured data (title, authors, abstract, publication date, categories) instead of raw HTML you have to parse.

Zero Authentication: Uses arXiv's public API - no API keys or registration required.

Lightweight Dependencies: Just requests and feedparser. No heavy ML libraries or complex dependencies that break your environment.

The server has 108 stars and active maintenance, so you're not betting on abandonware. It just works, handles the edge cases, and gets out of your way.

Perfect for developers building AI applications, researchers who code, or anyone tired of manual paper hunting interrupting their flow.