home / skills / nealcaren / social-data-analysis / lit-synthesis

This skill transforms a lit-synthesis corpus into a structured theoretical map, reading notes, debate maps, and field synthesis for writing-ready insights.

npx playbooks add skill nealcaren/social-data-analysis --skill lit-synthesis

Review the files below or copy the command above to add this skill to your agents.

Files (8)
SKILL.md
9.0 KB
---
name: lit-synthesis
description: Deep reading and synthesis of literature corpus. Theoretical mapping, thematic clustering, and debate identification using Zotero MCP for full-text access.
---

# Literature Synthesis

You help sociologists move from a corpus of papers to a deep understanding of a field. This is the analytical bridge between finding papers (lit-search) and writing about them (lit-writeup).

## The Lit Trilogy

This skill is the middle step in a three-skill workflow:

| Skill | Role | Key Output |
|-------|------|------------|
| **lit-search** | Find papers via OpenAlex | `database.json`, download checklist |
| **lit-synthesis** | Analyze & organize via Zotero | `field-synthesis.md`, `theoretical-map.md`, `debate-map.md` |
| **lit-writeup** | Draft prose | Publication-ready Theory section |

**Input**: Papers in Zotero (imported from lit-search or user's existing library)
**Output**: Organized understanding of the field ready for writing

## When to Use This Skill

Use this skill when users:
- Have a corpus of papers (from lit-search or their own collection)
- Need to understand the theoretical landscape before writing
- Want to identify debates, tensions, and competing positions
- Need to organize papers thematically or by theoretical tradition
- Want deep reading notes, not just metadata extraction

## Core Principles

1. **Read deeply, not widely**: Better to understand 15 papers thoroughly than 50 superficially.

2. **Theoretical traditions matter**: Papers exist within intellectual lineages. Map who cites whom and why.

3. **Debates are gold**: Competing positions create space for contributions. Find the tensions.

4. **Organization serves writing**: The clusters and maps you create should directly feed lit-writeup's architecture phase.

5. **Full text when possible**: Abstracts tell you *what*; full text tells you *how* and *why*.

## Zotero MCP Integration

This skill uses **Zotero MCP** for accessing your library:

### Setup

Install the Zotero MCP server:
```bash
uv tool install "git+https://github.com/54yyyu/zotero-mcp.git"
zotero-mcp setup
```

See `mcp/zotero-setup.md` for detailed configuration.

### Key Capabilities

| Tool | Purpose |
|------|---------|
| `zotero_search_items` | Find papers by keyword, author, tag |
| `zotero_semantic_search` | Conceptual similarity search |
| `zotero_get_item_metadata` | Retrieve full metadata + BibTeX |
| `zotero_get_annotations` | Extract PDF highlights and notes |
| `zotero_search_notes` | Search your reading notes |

### Workflow Integration

1. **From lit-search**: Import the BibTeX export into Zotero
2. **Acquire PDFs**: Use Zotero's "Find Available PDF" or manual download
3. **Read and annotate**: Highlight key passages, add notes
4. **lit-synthesis reads**: Access annotations via MCP for analysis

## Workflow Phases

### Phase 0: Corpus Audit
**Goal**: Assess what's in the corpus and identify gaps.

**Process**:
- Review the database from lit-search (or user's Zotero collection)
- Count papers by year, journal, author, theoretical tradition
- Identify potential gaps in coverage
- Prioritize which papers need deep reading vs. skimming

**Output**: `corpus-audit.md` with statistics and reading priorities.

> **Pause**: User confirms corpus coverage and reading priorities.

---

### Phase 1: Deep Reading
**Goal**: Close read priority papers and extract analytical insights.

**Process**:
- For each priority paper, read full text via Zotero MCP
- Extract: argument structure, theoretical framework, key concepts, methodological approach
- Note: how theory is deployed, what evidence supports claims, limitations acknowledged
- Create structured reading notes

**Output**: `reading-notes/` directory with per-paper notes.

> **Pause**: User reviews reading notes for key papers.

---

### Phase 2: Theoretical Mapping
**Goal**: Identify intellectual traditions and lineages.

**Process**:
- Identify which theoretical frameworks appear across papers
- Map citation relationships (who cites whom)
- Note foundational texts and their descendants
- Identify "camps" or schools of thought
- Document key concepts and how they're used

**Output**: `theoretical-map.md` with traditions, key theorists, and concept definitions.

> **Pause**: User reviews theoretical landscape.

---

### Phase 3: Thematic Clustering
**Goal**: Organize papers by what they study and how.

**Process**:
- Group papers by empirical focus (population, setting, phenomenon)
- Group papers by theoretical approach
- Group papers by methodological strategy
- Identify papers that bridge multiple clusters
- Note within-cluster consensus and variation

**Output**: `thematic-clusters.md` with organized paper groupings.

> **Pause**: User reviews clustering logic.

---

### Phase 4: Debate Mapping
**Goal**: Identify tensions, disagreements, and competing positions.

**Process**:
- Find explicit disagreements (papers that critique each other)
- Find implicit tensions (contradictory findings or incompatible assumptions)
- Identify unresolved questions the field is grappling with
- Note where evidence is mixed or contested
- Document the "state of the debate" for each tension

**Output**: `debate-map.md` with positions, evidence, and unresolved questions.

> **Pause**: User reviews debates and selects focus areas.

---

### Phase 5: Field Synthesis
**Goal**: Create comprehensive understanding ready for writing.

**Process**:
- Synthesize across phases into coherent field understanding
- Identify the most productive gaps for contribution
- Recommend which lit-writeup cluster (Gap-Filler, Theory-Extender, etc.) fits
- Create the handoff document for lit-writeup

**Output**: `field-synthesis.md` with integrated understanding and writing recommendations.

---

## Output Files

```
lit-synthesis/
├── corpus-audit.md           # Phase 0: What's in the corpus
├── reading-notes/            # Phase 1: Per-paper notes
│   ├── author2020-title.md
│   ├── author2019-title.md
│   └── ...
├── theoretical-map.md        # Phase 2: Traditions and lineages
├── thematic-clusters.md      # Phase 3: Paper groupings
├── debate-map.md             # Phase 4: Tensions and positions
└── field-synthesis.md        # Phase 5: Integrated understanding
```

## Reading Note Template

For each paper in Phase 1:

```markdown
# [Author Year] - [Short Title]

## Bibliographic Info
- Full citation: [from Zotero]
- DOI: [link]

## Core Argument
[1-2 sentences: What is the paper arguing?]

## Theoretical Framework
- Tradition: [e.g., Bourdieusian, institutionalist, interactionist]
- Key concepts used: [list]
- How theory is deployed: [description vs. extension vs. critique]

## Empirical Strategy
- Data: [what kind]
- Methods: [how analyzed]
- Sample: [who/what]

## Key Findings
1. [Finding 1]
2. [Finding 2]
3. [Finding 3]

## Contribution Claim
[What does the paper claim to contribute?]

## Limitations (as noted by authors)
- [Limitation 1]
- [Limitation 2]

## My Notes
[Your analytical observations, connections to other papers, questions raised]

## Key Quotes
> "[Quote 1]" (p. X)

> "[Quote 2]" (p. Y)

## Tags
[theoretical-tradition] [empirical-focus] [method] [relevant-to-my-project]
```

## Model Recommendations

| Phase | Model | Rationale |
|-------|-------|-----------|
| **Phase 0**: Corpus Audit | **Sonnet** | Data processing, statistics |
| **Phase 1**: Deep Reading | **Opus** | Analytical reading, synthesis |
| **Phase 2**: Theoretical Mapping | **Opus** | Pattern recognition, intellectual history |
| **Phase 3**: Thematic Clustering | **Sonnet** | Organization, categorization |
| **Phase 4**: Debate Mapping | **Opus** | Tension identification, nuance |
| **Phase 5**: Field Synthesis | **Opus** | Integration, strategic judgment |

## Starting the Synthesis

When the user is ready to begin:

1. **Check Zotero setup**:
   > "Do you have Zotero MCP configured? If not, let's set that up first (see `mcp/zotero-setup.md`)."

2. **Identify the corpus**:
   > "Where are your papers? A Zotero collection from lit-search? An existing library folder? How many papers total?"

3. **Set priorities**:
   > "Which papers are most central to your project? We'll deep-read those first and skim the rest."

4. **Clarify goals**:
   > "What are you trying to understand about this field? Are you looking for gaps, debates, or a specific theoretical tradition?"

5. **Proceed with Phase 0** to audit the corpus.

## Key Reminders

- **Zotero is the source of truth**: All papers should be in Zotero for consistent access
- **Annotations accelerate**: If you've already highlighted papers, those annotations are accessible via MCP
- **Quality over quantity**: Deep reading 15 papers beats skimming 50
- **Debates are opportunities**: Every tension you find is a potential contribution space
- **This feeds lit-writeup**: The outputs here become inputs there—keep that handoff in mind

Overview

This skill performs deep reading and synthesis of a Zotero-backed literature corpus to produce actionable maps and notes for sociological writing. It turns annotated PDFs into a theoretical map, thematic clusters, debate maps, and a field synthesis that directly feed a manuscript’s theory section. Outputs are structured files and per-paper reading notes ready for handoff to writing workflows.

How this skill works

I connect to your Zotero library via Zotero MCP to access metadata, full texts, annotations, and notes. The workflow audits the corpus, deep-reads prioritized papers, extracts structured reading notes, identifies intellectual lineages, clusters by theme and method, and surfaces explicit and implicit debates. Final deliverables include corpus-audit.md, reading-notes/, theoretical-map.md, thematic-clusters.md, debate-map.md, and field-synthesis.md.

When to use it

  • You have a Zotero collection of papers and need to move from search to analysis
  • You need to understand theoretical traditions and citation lineages before writing
  • You want to find tensions or unresolved debates to motivate contributions
  • You need organized, per-paper deep-reading notes for manuscripts or proposals
  • You want writing-ready syntheses mapping gaps and recommended framing

Best practices

  • Keep Zotero as the single source of truth and import all PDFs and annotations there
  • Prioritize 10–20 core papers for deep reading rather than superficially covering many
  • Highlight and annotate PDFs during reading to accelerate MCP-driven extraction
  • Pause after each phase to review outputs and confirm priorities before proceeding
  • Use the field-synthesis output as the direct outline for the theory section in writing

Example use cases

  • A PhD student consolidating the literature for a dissertation chapter and identifying a novel gap
  • A research team preparing a literature review section for a grant proposal
  • An instructor creating a curated reading map and debate guide for a seminar
  • A scholar comparing competing theoretical camps and needing citation lineage mapping
  • A team translating annotated Zotero collections into structured notes for collaborative writing

FAQ

Do I need Zotero MCP installed to use this skill?

Yes. Zotero MCP provides full-text and annotation access. The synthesis relies on MCP to read highlights and notes.

How many papers should I include for deep reading?

Aim for 10–20 priority papers for in-depth reading; skim additional papers for coverage. Quality deep reads yield richer theoretical insight.