home / skills / microck / ordinary-claude-skills / arxiv-search

arxiv-search skill

/skills_all/arxiv-search

This skill lets you search arXiv for papers across physics, mathematics, CS and biology, returning concise titles and abstracts.

This is most likely a fork of the arxiv-search skill from langchain-ai
npx playbooks add skill microck/ordinary-claude-skills --skill arxiv-search

Review the files below or copy the command above to add this skill to your agents.

Files (2)
SKILL.md
3.4 KB
---
name: arxiv-search
description: Search arXiv preprint repository for papers in physics, mathematics, computer science, quantitative biology, and related fields
---

# arXiv Search Skill

This skill provides access to arXiv, a free distribution service and open-access archive for scholarly articles in physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering, systems science, and economics.

## When to Use This Skill

Use this skill when you need to:
- Find preprints and recent research papers before journal publication
- Search for papers in computational biology, bioinformatics, or systems biology
- Access mathematical or statistical methods papers relevant to biology
- Find machine learning papers applied to biological problems
- Get the latest research that may not yet be in PubMed

## How to Use

The skill provides a Python script that searches arXiv and returns formatted results.

### Basic Usage

**Note:** Always use the absolute path from your skills directory (shown in the system prompt above).

If running deepagents from a virtual environment:
```bash
.venv/bin/python [YOUR_SKILLS_DIR]/arxiv-search/arxiv_search.py "your search query" [--max-papers N]
```

Or for system Python:
```bash
python3 [YOUR_SKILLS_DIR]/arxiv-search/arxiv_search.py "your search query" [--max-papers N]
```

Replace `[YOUR_SKILLS_DIR]` with the absolute skills directory path from your system prompt (e.g., `~/.deepagents/agent/skills` or the full absolute path).

**Arguments:**
- `query` (required): The search query string (e.g., "neural networks protein structure", "single cell RNA-seq")
- `--max-papers` (optional): Maximum number of papers to retrieve (default: 10)

### Examples

Search for machine learning papers:
```bash
.venv/bin/python ~/.deepagents/agent/skills/arxiv-search/arxiv_search.py "deep learning drug discovery" --max-papers 5
```

Search for computational biology papers:
```bash
.venv/bin/python ~/.deepagents/agent/skills/arxiv-search/arxiv_search.py "protein folding prediction"
```

Search for bioinformatics methods:
```bash
.venv/bin/python ~/.deepagents/agent/skills/arxiv-search/arxiv_search.py "genome assembly algorithms"
```

## Output Format

The script returns formatted results with:
- **Title**: Paper title
- **Summary**: Abstract/summary text

Each paper is separated by blank lines for readability.

## Features

- **Relevance sorting**: Results ordered by relevance to query
- **Fast retrieval**: Direct API access with no authentication required
- **Simple interface**: Clean, easy-to-parse output
- **No API key required**: Free access to arXiv database

## Dependencies

This skill requires the `arxiv` Python package. The script will detect if it's missing and show an error.

**If you see "Error: arxiv package not installed":**

If running deepagents from a virtual environment (recommended), use the venv's Python:
```bash
.venv/bin/python -m pip install arxiv
```

Or for system-wide install:
```bash
python3 -m pip install arxiv
```

The package is not included in deepagents by default since it's skill-specific. Install it on-demand when first using this skill.

## Notes

- arXiv is particularly strong for:
  - Computer science (cs.LG, cs.AI, cs.CV)
  - Quantitative biology (q-bio)
  - Statistics (stat.ML)
  - Physics and mathematics
- Papers are preprints and may not be peer-reviewed
- Results include both recent uploads and older papers
- Best for computational/theoretical work in biology

Overview

This skill provides a lightweight command-line interface to search the arXiv preprint repository for papers in physics, mathematics, computer science, quantitative biology, and related fields. It returns relevance-sorted results with titles and abstracts and requires no API key. The tool is designed for quick retrieval of preprints and recent research prior to journal publication.

How this skill works

The script queries the arXiv API using the provided search string and returns a formatted list of papers ordered by relevance. Each result includes the title and abstract; pagination or a --max-papers flag controls how many items are returned. The script detects the required Python package and will prompt to install it if missing.

When to use it

  • Find recent preprints before journal publication or indexing
  • Locate methods and theory papers in CS, physics, mathematics, or quantitative biology
  • Survey literature on niche or emerging topics not yet in journals
  • Quickly gather abstracts for reading lists, citations, or inspiration
  • Check for prior work before starting an implementation or experiment

Best practices

  • Use focused queries with keywords, author names, or arXiv categories (e.g., cs.LG, q-bio) to improve relevance
  • Limit results with --max-papers when scanning many queries to save time
  • Combine topic keywords with method names (e.g., "graph neural network protein") to find applied ML work
  • Run in a Python virtual environment and install the arxiv package there to avoid system conflicts
  • Verify paper status and check for peer-reviewed versions before citing

Example use cases

  • Search for the latest machine learning preprints on protein structure prediction
  • Collect abstracts on single-cell RNA-seq analysis methods for a literature review
  • Find statistical or mathematical methods relevant to a modeling project
  • Scan for computational biology preprints that apply deep learning to genomics
  • Quickly gather candidate papers for a reading group or grant background section

FAQ

Do I need an API key or account to use this skill?

No. The arXiv API accessed by the script requires no API key or authentication.

What dependency is required and how do I install it?

The script requires the Python package named arxiv. Install it with python3 -m pip install arxiv or inside your virtual environment with .venv/bin/python -m pip install arxiv.

Are results peer-reviewed?

No. arXiv hosts preprints; many are later peer-reviewed but you should check the published version before relying on final results.