home / skills / dkyazzentwatwa / chatgpt-skills / language-detector
This skill detects text language with confidence scores across 50+ languages, supporting batch analysis and CSV inputs.
npx playbooks add skill dkyazzentwatwa/chatgpt-skills --skill language-detectorReview the files below or copy the command above to add this skill to your agents.
---
name: language-detector
description: Detect language of text with confidence scores, support for 50+ languages, and batch text classification.
---
# Language Detector
Identify the language of text with confidence scoring.
## Features
- **50+ Languages**: Wide language support
- **Confidence Scores**: Probability estimates
- **Batch Detection**: Process multiple texts
- **CSV Support**: Analyze text columns
- **Multiple Algorithms**: Character n-gram analysis
## CLI Usage
```bash
python language_detector.py --text "Hello world" --output result.json
python language_detector.py --file texts.csv --column text --output languages.csv
```
## Dependencies
- langdetect>=1.0.9
- pandas>=2.0.0
This skill detects the language of input text and returns per-language confidence scores. It supports 50+ languages, batch processing, and CSV column analysis for quick integration into pipelines. The implementation is lightweight and designed for command-line and programmatic use. It is focused on reliable, tested detection with multiple algorithmic heuristics.
The detector analyzes text using character n-gram patterns and probabilistic models to estimate the most likely language. For each input it returns a ranked list of candidate languages with confidence (probability) scores. It supports single-text queries, batch lists, and CSV files where a specified column contains text to classify. Results can be exported as JSON or CSV for downstream processing.
Which languages are supported?
The detector supports 50+ common languages; it covers major world languages used in typical NLP pipelines.
What formats can I process in batch?
You can process plain text lists or CSV files by specifying the column containing text. Outputs can be JSON or CSV.