This skill extracts timestamped YouTube transcripts for translation, summarization, and content creation, saving transcripts locally and showing timestamps.
npx playbooks add skill qwenlm/qwen-code-examples --skill youtube-transcript-extractor
---
name: YouTube Transcript Extractor
description: Extracts timestamped transcripts from YouTube videos for translation, summarization, and content creation.
---
# YouTube Transcript Extractor
## What You Get
- **Terminal Output**: Prints the transcript with `[HH:MM:SS]` timestamps.
- **Local File**: Writes `youtube_transcript_{video_id}.txt` in the **current working directory**.
  - Includes the source video URL and the full transcript content.
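
The `[HH:MM:SS]` prefix and the output file name can be sketched as follows. This is only an illustration: it assumes caption start times arrive as seconds (possibly fractional) and that the fractional part is dropped. `format_timestamp` and `transcript_filename` are hypothetical helper names, not necessarily what `get_youtube_transcript.py` uses.

```python
def format_timestamp(seconds: float) -> str:
    """Render a caption start time (in seconds) as [HH:MM:SS]."""
    total = int(seconds)  # assumption: truncate fractions rather than round
    hours, remainder = divmod(total, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"[{hours:02d}:{minutes:02d}:{secs:02d}]"


def transcript_filename(video_id: str) -> str:
    """Build the file name pattern described above."""
    return f"youtube_transcript_{video_id}.txt"
```

For example, a caption starting at 3661.7 seconds would be labeled `[01:01:01]` under these assumptions.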
## Requirements
- Python 3.9+ (Recommended)
- Network environment that can access YouTube transcript APIs.
- Python Dependencies:
  - `youtube-transcript-api`
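
A quick pre-flight check for these requirements might look like the sketch below. It treats Python 3.9 as the recommended minimum rather than a hard limit, and it only checks that the package is importable, not that YouTube is reachable. `check_environment` is a hypothetical helper and is not part of the skill.

```python
import importlib.util
import sys


def check_environment(min_version=(3, 9)) -> list:
    """Return human-readable problems; an empty list means ready to run."""
    problems = []
    if sys.version_info < min_version:
        wanted = ".".join(str(part) for part in min_version)
        problems.append(f"Python {wanted}+ is recommended")
    # The distribution is named youtube-transcript-api, but the importable
    # module uses underscores.
    if importlib.util.find_spec("youtube_transcript_api") is None:
        problems.append(
            "Missing package: python -m pip install youtube-transcript-api"
        )
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print(problem)
```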
## Quick Start
### Step 1: (Optional) Create a Virtual Environment
```bash
python3 -m venv .venv
source .venv/bin/activate
python -m pip install -U pip
```
### Step 2: Install Dependencies
```bash
python -m pip install youtube-transcript-api
```
### Step 3: Extract Transcript
Run from the repository root so that the relative script path below resolves:
```bash
python skills/youtube-transcript-extractor/scripts/get_youtube_transcript.py "https://www.youtube.com/watch?v=IDSAMqip6ms"
```
- The transcript will be saved to `youtube_transcript_{video_id}.txt` in the current working directory.
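
The `{video_id}` in the file name comes from the URL passed on the command line. The sketch below shows one way to derive it, assuming the common URL shapes (`watch?v=`, `youtu.be/`, `/embed/`, `/shorts/`, `/live/`) and the conventional 11-character ID. The bundled script may parse URLs differently, and `extract_video_id` is a hypothetical name.

```python
import re
from urllib.parse import parse_qs, urlparse

# Assumption: video IDs are 11 characters from this alphabet.
_ID_PATTERN = re.compile(r"^[A-Za-z0-9_-]{11}$")


def extract_video_id(url_or_id: str) -> str:
    """Return the video ID from a YouTube URL, or the input if it is an ID."""
    candidate = url_or_id.strip()
    if _ID_PATTERN.match(candidate):
        return candidate  # already a bare ID

    parsed = urlparse(candidate)
    host = (parsed.hostname or "").lower()
    candidate = ""
    if host == "youtu.be":
        # Short links: https://youtu.be/<id>
        candidate = parsed.path.lstrip("/").split("/")[0]
    elif host == "youtube.com" or host.endswith(".youtube.com"):
        if parsed.path == "/watch":
            candidate = parse_qs(parsed.query).get("v", [""])[0]
        else:
            parts = [part for part in parsed.path.split("/") if part]
            if len(parts) >= 2 and parts[0] in {"embed", "shorts", "live"}:
                candidate = parts[1]

    if not _ID_PATTERN.match(candidate):
        raise ValueError(f"Could not find a video ID in: {url_or_id!r}")
    return candidate
```

With the URL from Step 3, this yields `IDSAMqip6ms`, so the saved file would be `youtube_transcript_IDSAMqip6ms.txt`.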
### Step 4: Convert Transcript to Reader-Friendly Markdown
Convert the saved transcript into `youtube_transcript_{video_id}.md`. The transcript content must not be altered or truncated during conversion.
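
One mechanical way to do this conversion is sketched below. It assumes each caption line in the `.txt` file starts with a `[HH:MM:SS]` timestamp and that any other non-empty line (such as the source URL) should be kept as a paragraph. The exact layout of the saved file is not specified here, so treat the pattern and the `transcript_to_markdown` name as assumptions.

```python
import re

# Assumption: caption lines look like "[00:01:23] some text".
TIMESTAMP_LINE = re.compile(r"^(\[\d{2}:\d{2}:\d{2}\])\s*(.*)$")


def transcript_to_markdown(raw: str, title: str = "Transcript") -> str:
    """Wrap transcript lines in Markdown without changing their text."""
    out = [f"# {title}", ""]
    for line in raw.splitlines():
        match = TIMESTAMP_LINE.match(line)
        if match:
            # One list item per caption, with the timestamp in bold.
            out.append(f"- **{match.group(1)}** {match.group(2)}")
        elif line.strip():
            # Keep other lines (e.g. the source URL) as their own paragraph.
            if out[-1] != "":
                out.append("")
            out.append(line)
            out.append("")
    return "\n".join(out) + "\n"
```

Because every caption becomes exactly one list item, comparing the number of list items with the number of timestamped lines in the original file is a cheap check that nothing was truncated.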
This skill extracts timestamped transcripts from YouTube videos and saves them as local files for translation, summarization, and content creation. It produces both terminal output with `[HH:MM:SS]` timestamps and a saved transcript file named `youtube_transcript_{video_id}.txt` in the current working directory. The saved file includes the source video URL and the full transcript content.

The extractor calls YouTube transcript services to retrieve available captions for a given video URL or ID. It formats each caption line with a `[HH:MM:SS]` timestamp, prints the result to the terminal, and writes a complete transcript file in the working directory. Optionally, the output can be converted into reader-friendly Markdown without altering the transcript content.
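
The flow described above (fetch, format, print, save) can be sketched as follows. This is not the bundled script. It assumes newer `youtube-transcript-api` releases expose `YouTubeTranscriptApi().fetch(...)` with `to_raw_data()` while older ones expose the static `get_transcript(...)`, so check the version you have installed. The `Source:` header line, the joining of multi-line captions with spaces, and all function names are illustrative assumptions.

```python
from pathlib import Path


def fetch_captions(video_id: str) -> list:
    """Return captions as dicts with 'text' and 'start' (seconds) keys."""
    # Imported lazily so the formatting helpers work without the package.
    from youtube_transcript_api import YouTubeTranscriptApi

    if hasattr(YouTubeTranscriptApi, "fetch"):
        # Assumed newer API: instance method returning a transcript object.
        return YouTubeTranscriptApi().fetch(video_id).to_raw_data()
    # Assumed older API: static method returning a list of dicts.
    return YouTubeTranscriptApi.get_transcript(video_id)


def render_transcript(url: str, captions: list) -> str:
    """Format captions as '[HH:MM:SS] text' lines under a source header."""
    lines = [f"Source: {url}", ""]
    for caption in captions:
        total = int(caption["start"])
        hh, mm, ss = total // 3600, total % 3600 // 60, total % 60
        text = caption["text"].replace("\n", " ")  # one line per caption
        lines.append(f"[{hh:02d}:{mm:02d}:{ss:02d}] {text}")
    return "\n".join(lines) + "\n"


def save_transcript(url: str, video_id: str, captions: list,
                    directory: str = ".") -> Path:
    """Print the transcript and write it to the given directory."""
    content = render_transcript(url, captions)
    print(content, end="")
    path = Path(directory) / f"youtube_transcript_{video_id}.txt"
    path.write_text(content, encoding="utf-8")
    return path
```

Keeping the network call in `fetch_captions` separate from the formatting means the rendering can be exercised with hand-written caption dicts when YouTube is not reachable.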
**What dependencies and environment do I need?**

You need Python 3.9+ (recommended) and the `youtube-transcript-api` package, plus network access to YouTube transcript endpoints.

**Where is the transcript saved and what format is used?**

A file named `youtube_transcript_{video_id}.txt` is written to the current working directory and includes the source URL and timestamped transcript lines.