home / skills / langchain-ai / deepagents / arxiv-search

arxiv-search skill

/libs/deepagents-cli/examples/skills/arxiv-search

This skill lets you search arXiv for physics, math, CS and biology papers, returning concise titles and summaries to guide research.

npx playbooks add skill langchain-ai/deepagents --skill arxiv-search

Review the files below or copy the command above to add this skill to your agents.

Files (2)
SKILL.md
3.4 KB
---
name: arxiv-search
description: Search arXiv preprint repository for papers in physics, mathematics, computer science, quantitative biology, and related fields
---

# arXiv Search Skill

This skill provides access to arXiv, a free distribution service and open-access archive for scholarly articles in physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering, systems science, and economics.

## When to Use This Skill

Use this skill when you need to:
- Find preprints and recent research papers before journal publication
- Search for papers in computational biology, bioinformatics, or systems biology
- Access mathematical or statistical methods papers relevant to biology
- Find machine learning papers applied to biological problems
- Get the latest research that may not yet be in PubMed

## How to Use

The skill provides a Python script that searches arXiv and returns formatted results.

### Basic Usage

**Note:** Always use the absolute path from your skills directory (shown in the system prompt above).

If running deepagents from a virtual environment:
```bash
.venv/bin/python [YOUR_SKILLS_DIR]/arxiv-search/arxiv_search.py "your search query" [--max-papers N]
```

Or for system Python:
```bash
python3 [YOUR_SKILLS_DIR]/arxiv-search/arxiv_search.py "your search query" [--max-papers N]
```

Replace `[YOUR_SKILLS_DIR]` with the absolute skills directory path from your system prompt (e.g., `~/.deepagents/agent/skills` or the full absolute path).

**Arguments:**
- `query` (required): The search query string (e.g., "neural networks protein structure", "single cell RNA-seq")
- `--max-papers` (optional): Maximum number of papers to retrieve (default: 10)

### Examples

Search for machine learning papers:
```bash
.venv/bin/python ~/.deepagents/agent/skills/arxiv-search/arxiv_search.py "deep learning drug discovery" --max-papers 5
```

Search for computational biology papers:
```bash
.venv/bin/python ~/.deepagents/agent/skills/arxiv-search/arxiv_search.py "protein folding prediction"
```

Search for bioinformatics methods:
```bash
.venv/bin/python ~/.deepagents/agent/skills/arxiv-search/arxiv_search.py "genome assembly algorithms"
```

## Output Format

The script returns formatted results with:
- **Title**: Paper title
- **Summary**: Abstract/summary text

Each paper is separated by blank lines for readability.

## Features

- **Relevance sorting**: Results ordered by relevance to query
- **Fast retrieval**: Direct API access with no authentication required
- **Simple interface**: Clean, easy-to-parse output
- **No API key required**: Free access to arXiv database

## Dependencies

This skill requires the `arxiv` Python package. The script will detect if it's missing and show an error.

**If you see "Error: arxiv package not installed":**

If running deepagents from a virtual environment (recommended), use the venv's Python:
```bash
.venv/bin/python -m pip install arxiv
```

Or for system-wide install:
```bash
python3 -m pip install arxiv
```

The package is not included in deepagents by default since it's skill-specific. Install it on-demand when first using this skill.

## Notes

- arXiv is particularly strong for:
  - Computer science (cs.LG, cs.AI, cs.CV)
  - Quantitative biology (q-bio)
  - Statistics (stat.ML)
  - Physics and mathematics
- Papers are preprints and may not be peer-reviewed
- Results include both recent uploads and older papers
- Best for computational/theoretical work in biology

Overview

This skill searches the arXiv preprint repository to find relevant papers across physics, mathematics, computer science, quantitative biology, and related fields. It returns clean, human-readable results (title and abstract) sorted by relevance and is designed for fast, unauthenticated access. The tool is implemented as a small Python script that can be run from a Deep Agents environment or any Python environment with the arxiv package installed.

How this skill works

The script issues queries to the arXiv API via the arxiv Python package, retrieves matching records, and formats each result with title and abstract. Results are ordered by relevance and limited by an optional --max-papers argument. The script detects if the arxiv dependency is missing and provides an actionable error message with installation commands.

When to use it

  • Locate recent preprints or early versions of papers not yet in journals
  • Search for machine learning, computational biology, or bioinformatics research
  • Gather methodological papers in statistics, math, or theoretical physics
  • Quickly assemble abstracts for literature review or screening
  • Find cross-disciplinary work that appears on arXiv before PubMed or journals

Best practices

  • Use focused queries with keywords and phrases (e.g., "protein structure prediction") for better relevance
  • Specify --max-papers when you only need a small set of top results
  • Run inside the same virtual environment where arxiv is installed to avoid dependency errors
  • Treat results as preprints — verify findings in peer-reviewed sources before relying on them
  • Combine with citation or full-text retrieval tools when you need PDFs or references

Example use cases

  • Quickly find recent ML models applied to drug discovery for a project proposal
  • Scan abstracts for single-cell RNA-seq methods to identify candidate tools
  • Collect theoretical papers on optimization or statistics for a literature review
  • Identify early drafts of influential papers before journal publication
  • Extract concise abstracts for triage when assembling reading lists

FAQ

Do I need an API key to use this skill?

No. arXiv access via the arxiv Python package does not require an API key.

What if the script reports the arxiv package is not installed?

Install it in your environment with `python3 -m pip install arxiv` or use your virtualenv's pip (e.g., `.venv/bin/python -m pip install arxiv`).