Loading...
Loading...
Corrects speech-to-text transcription errors in meeting notes, lectures, and interviews using dictionary rules and AI. Learns patterns to build personalized correction databases. Use when working with transcripts containing ASR/STT errors, homophones, or Chinese/English mixed content requiring cleanup.
npx skill4agent add daymade/claude-code-skills transcript-fixeruvuv# macOS/Linux
curl -LsSf https://astral.sh/uv/install.sh | sh
# Windows PowerShell
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"# First time: Initialize database
uv run scripts/fix_transcription.py --init
# Process transcript with enhanced UX
uv run scripts/fix_transcript_enhanced.py input.md --output ./correctedANTHROPIC_BASE_URL# 1. Set API key (if not auto-detected)
export GLM_API_KEY="<api-key>" # From https://open.bigmodel.cn/
# 2. Add common corrections (5-10 terms)
uv run scripts/fix_transcription.py --add "错误词" "正确词" --domain general
# 3. Run full correction pipeline
uv run scripts/fix_transcription.py --input meeting.md --stage 3
# 4. Review learned patterns after 3-5 runs
uv run scripts/fix_transcription.py --review-learned*_stage1.md*_stage2.md*_对比.htmluv run scripts/generate_word_diff.py original.md corrected.md output.htmljapanese 3 proGemini 3 Promeeting.md今天我们讨论了巨升智能的最新进展。
股价系统需要优化,目前性能不够好。meeting_stage1.md今天我们讨论了具身智能的最新进展。 ← "巨升"→"具身" corrected
股价系统需要优化,目前性能不够好。 ← Unchanged (not in dictionary)meeting_stage2.md今天我们讨论了具身智能的最新进展。
框架系统需要优化,目前性能不够好。 ← "股价"→"框架" corrected by AI✓ Detected: "股价" → "框架" (confidence: 85%, count: 1)
Run --review-learned after 2 more occurrences to approve~/.transcript-fixer/corrections.dbuv run scripts/fix_transcription.py --init--add "错误词" "正确词" --domain <domain>--input file.md --stage 3--review-learned--approvegeneralembodied_aifinancemedical火星加速器具身智能references/workflow_guide.mdreferences/script_parameters.mdreferences/team_collaboration.mduv run scripts/fix_transcription.py --add "错误词" "正确词" --domain generalreferences/iteration_workflow.md[CLAUDE_FALLBACK]--addreferences/database_schema.md# View all corrections
sqlite3 ~/.transcript-fixer/corrections.db "SELECT * FROM active_corrections;"
# Check schema version
sqlite3 ~/.transcript-fixer/corrections.db "SELECT value FROM system_config WHERE key='schema_version';"| Stage | Description | Speed | Cost |
|---|---|---|---|
| 1 | Dictionary only | Instant | Free |
| 2 | AI only | ~10s | API calls |
| 3 | Full pipeline | ~10s | API calls |
ensure_deps.pyfix_transcript_enhanced.pyfix_transcription.pygenerate_word_diff.pyexamples/bulk_import.pydatabase_schema.mditeration_workflow.mdinstallation_setup.mdglm_api_setup.mdworkflow_guide.mdquick_reference.mdscript_parameters.mddictionary_guide.mdsql_queries.mdfile_formats.mdarchitecture.mdbest_practices.mdtroubleshooting.mdteam_collaboration.mduv run scripts/fix_transcription.py --validate--initexport GLM_API_KEY="<key>"~/.transcript-fixer/references/troubleshooting.mdreferences/glm_api_setup.md