ocr

Original：🇺🇸 English

Translated

2 scriptsChecked / no sensitive code detected

Extract text from images using Tesseract OCR

17installs

Sourcetrpc-group/trpc-agent-go

Added on2026-05-16

NPX Install

npx skill4agent add trpc-group/trpc-agent-go ocr

SKILL.md Content

View Translation Comparison →

OCR Image Text Extraction Skill

Extract text from images using Tesseract OCR engine.

Capabilities

Extract text from image files (PNG, JPG, JPEG, GIF, BMP, TIFF)
Support for 100+ languages
Optional image preprocessing for better accuracy
Output in plain text or JSON format with confidence scores

Usage

Basic OCR

bash

python3 scripts/ocr.py <image_file> <output_file>

With Options

bash

# Specify language (default: eng)
python3 scripts/ocr.py image.png text.txt --lang eng

# Chinese text
python3 scripts/ocr.py image.png text.txt --lang chi_sim

# Multiple languages
python3 scripts/ocr.py image.png text.txt --lang eng+chi_sim

# With image preprocessing (improves accuracy)
python3 scripts/ocr.py image.png text.txt --preprocess

# JSON output with confidence scores
python3 scripts/ocr.py image.png output.json --format json

Download and OCR from URL

bash

# OCR from remote image
python3 scripts/ocr_url.py <image_url> <output_file>

# With options
python3 scripts/ocr_url.py https://example.com/image.jpg text.txt --lang eng --preprocess

Parameters

```
image_file
```
/
```
image_url
```
(required): Path to local image or image URL
```
output_file
```
(required): Path to output text/JSON file
```
--lang
```
: Language code (e.g., eng, chi_sim, jpn, fra, deu). Default: eng
```
--preprocess
```
: Apply image preprocessing (grayscale, thresholding) for better accuracy
```
--format
```
: Output format (text/json, default: text)

Common Languages

Language	Code
English	eng
Chinese (Simplified)	chi_sim
Chinese (Traditional)	chi_tra
Japanese	jpn
Korean	kor
French	fra
German	deu
Spanish	spa
Russian	rus
Arabic	ara

Supported Image Formats

PNG, JPG, JPEG, GIF, BMP, TIFF, WEBP

Dependencies

Python 3.8+
pytesseract
Pillow (PIL)
tesseract-ocr (system package)

Installation

bash

# Python packages
pip install pytesseract Pillow

# Tesseract OCR engine
sudo apt-get install tesseract-ocr  # Ubuntu/Debian
sudo yum install tesseract           # CentOS/RHEL
brew install tesseract               # macOS

ocr

NPX Install

Tags

SKILL.md Content

OCR Image Text Extraction Skill

Capabilities

Usage

Basic OCR

With Options

Download and OCR from URL

Parameters

Common Languages

Supported Image Formats

Dependencies

Installation