Add transcription and alignment pipeline

New commands:
- `transcribe`: Uses Whisper to generate word-level timestamps from video
- `align`: Matches manuscript markers to transcript, outputs transcript.csv

Workflow:
1. gnommo transcribe video.mov → video.transcript.json
2. gnommo align project/ → transcript.csv with markers at aligned times

Alignment uses fuzzy text matching to find the first phrase after each
marker in the manuscript, then locates it in the transcript. Applies
configurable offset (default -1s) so slides appear before speech.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
2026-01-12 12:41:10 +01:00
parent 7f7425da46
commit 216131e072
4 changed files with 418 additions and 0 deletions
+1
View File
@@ -8,6 +8,7 @@ venv/
.venv/
*.egg-info/
Video1/*
# OS
.DS_Store
Thumbs.db