home / skills / openclaw / skills / literature-review

literature-review skill

/skills/weird-aftertaste/literature-review

This skill helps you craft literature reviews by searching multiple databases, extracting DOIs, deduplicating results, and drafting structured sections with

npx playbooks add skill openclaw/skills --skill literature-review

Review the files below or copy the command above to add this skill to your agents.

Files (3)
SKILL.md
3.8 KB
---
name: literature-review
version: 1.2.0
description: Assistance with writing literature reviews by searching for academic sources via Semantic Scholar, OpenAlex, Crossref and PubMed APIs. Use when the user needs to find papers on a topic, get details for specific DOIs, or draft sections of a literature review with proper citations.
---

# Literature Review

Help write academic literature reviews using a multi-engine search integration (S2, OA, CR, PM).

## Capabilities

- **Multi-Source Search**: Find relevant academic papers using Semantic Scholar (S2), OpenAlex (OA), Crossref (CR), and PubMed (PM).
- **Full Abstracts**: All sources now return complete abstracts (PubMed uses `efetch` for full XML records).
- **DOI Extraction**: DOIs are extracted from all sources for cross-referencing and deduplication.
- **Automatic Deduplication**: When searching multiple sources (`--source all` or `--source both`), results are automatically deduplicated by DOI.
- **Polite Access**: Automatic email identification for OpenAlex/Crossref "Polite Pool" (via `USER_EMAIL` env var).
- **Abstract Reconstruction**: Reconstructs abstracts from OpenAlex inverted index format.
- **Synthesis**: Group papers by theme and draft review sections based on metadata.

## Environment Variables

| Variable | Purpose | Default |
|----------|---------|---------|
| `USER_EMAIL` | Email for polite API access | `[email protected]` |
| `CLAWDBOT_EMAIL` | Fallback if USER_EMAIL not set | — |
| `SEMANTIC_SCHOLAR_API_KEY` | Optional S2 API key for higher rate limits | — |
| `OPENALEX_API_KEY` | Optional OpenAlex API key | — |

## Workflows

### 1. Broad Search (All Bases)
Get a comprehensive overview from all major academic databases. Results are automatically deduplicated by DOI.
```bash
python3 scripts/lit_search.py search "impact of glycyrrhiza on bifidobacterium" --limit 5 --source all
```

### 2. Targeted Search
- **OpenAlex** (`oa`): Fast and comprehensive, good abstracts.
- **Semantic Scholar** (`s2`): High-quality citation data and TL;DRs.
- **Crossref** (`cr`): Precise DOI-based metadata (no abstracts).
- **PubMed** (`pm`): Gold standard for biomedical research, full abstracts and PMIDs.

```bash
python3 scripts/lit_search.py search "prebiotic effects of liquorice" --source pm
```

### 3. Comparing Sources
Search both S2 and OA simultaneously to ensure nothing is missed. Deduplicated by default.
```bash
python3 scripts/lit_search.py search "Bifidobacterium infantis growth" --source both
```

### 4. Getting Full Details (S2)
Retrieve detailed metadata including TL;DR summaries.
```bash
python3 scripts/lit_search.py details "DOI:10.1016/j.foodchem.2023.136000"
```

### 5. Writing the Review
1.  **Extract**: Pull key findings from the abstracts found.
2.  **Organize**: Group findings into a logical structure (e.g., chronological or thematic).
3.  **Draft**: Use the "Think step-by-step" approach to synthesize multiple sources into a coherent narrative.

## Output Format

Each result includes:
- `id`: Source-specific identifier (PMID for PubMed, OpenAlex ID, S2 paper ID, DOI for Crossref)
- `doi`: DOI when available (used for deduplication)
- `title`: Paper title
- `year`: Publication year
- `authors`: List of author names
- `abstract`: Full abstract text (when available)
- `venue`: Journal or conference name
- `citationCount`: Citation count (S2, OA)
- `source`: Which database the result came from

## Tips for Success

- **Citations**: Always cross-reference the DOI or PMID for accuracy in bibliography.
- **Filtering**: Focus on papers with higher `citationCount` or recent years for a more modern review.
- **PubMed for Medicine**: Use `--source pm` for the most reliable biomedical literature.
- **Deduplication**: Multi-source searches automatically remove duplicates; use single sources if you need raw counts.

Overview

This skill helps researchers and students locate, synthesize, and draft literature review sections by searching multiple academic sources. It queries Semantic Scholar, OpenAlex, Crossref, and PubMed, returns full abstracts and metadata, and deduplicates results by DOI. The skill can extract DOIs, rebuild OpenAlex abstract text, group papers by theme, and produce draft review paragraphs with citations.

How this skill works

The skill runs parallel searches across selected APIs (S2, OA, CR, PM), collects metadata and full abstracts where available, and normalizes results into a common record format (id, doi, title, year, authors, abstract, venue, citationCount, source). When multiple sources are used it deduplicates by DOI, reconstructs inverted-index abstracts from OpenAlex, and applies simple thematic grouping to support synthesis. It can also fetch detailed records for a specific DOI or PMID and generate draft review text organized by theme or chronology.

When to use it

  • Preparing a literature review for a thesis, grant, or manuscript.
  • When you need a comprehensive search across multiple academic databases.
  • To retrieve full abstracts and metadata for DOI-based cross-referencing.
  • When you want automated deduplication across sources to avoid double-counting.
  • To draft thematic or chronological review sections from aggregated findings.

Best practices

  • Set USER_EMAIL (or CLAWDBOT_EMAIL) so OpenAlex/Crossref requests use polite contact details.
  • Use --source pm for biomedical topics to prioritize PubMed abstracts and PMIDs.
  • Prefer multi-source searches for coverage, then filter by DOI and citationCount for quality.
  • Manually verify critical citations and DOIs before finalizing bibliographies.
  • Group papers by clear themes (methods, findings, population) before drafting synthesis.

Example use cases

  • Run a broad search across all engines to map a field and export deduplicated records for screening.
  • Fetch full metadata and TL;DR from Semantic Scholar for a key DOI to enrich a methods comparison paragraph.
  • Use PubMed-only searches when compiling clinical evidence and extracting PMIDs for systematic reviews.
  • Compare results from Semantic Scholar and OpenAlex to ensure high-coverage literature discovery.
  • Generate a draft thematic section synthesizing 10–20 papers with inline DOI citations.

FAQ

Can this skill find full-text PDFs?

No. It returns metadata and full abstracts where available. For full texts, use the DOI to locate publisher repositories or institutional access.

How does deduplication work across sources?

Deduplication is performed by DOI when present. Records without DOIs are compared by title and author heuristics, but manual review is recommended for ambiguous cases.