Add transcription and alignment pipeline

New commands:
- `transcribe`: Uses Whisper to generate word-level timestamps from video
- `align`: Matches manuscript markers to transcript, outputs transcript.csv

Workflow:
1. gnommo transcribe video.mov → video.transcript.json
2. gnommo align project/ → transcript.csv with markers at aligned times

Alignment uses fuzzy text matching to find the first phrase after each
marker in the manuscript, then locates it in the transcript. Applies
configurable offset (default -1s) so slides appear before speech.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

This commit is contained in:

Glitch University

2026-01-12 12:41:10 +01:00

parent 7f7425da46

commit 216131e072

4 changed files with 418 additions and 0 deletions

.gitignore

View File

@@ -8,6 +8,7 @@ venv/
 .venv/
 *.egg-info/
 Video1/*
 # OS
 .DS_Store
 Thumbs.db