processes/ -> playbooks/ (git mv preserves history; updated cross-refs in ROADMAP, README, subtitles playbook + scripts). playbooks/import-media/README.md v1.0 — 7-step import workflow: stage on onyx -> rsync to nullstone -> chmod -> verify scan -> Items/Counts bump -> optional subtitle pass -> run-log Cross-references docs/05/07/08, ADMIN-GUIDE, README. Mirrors the existing subtitles playbook structure (CHANGELOG + runs/_template). CHANGELOG v1.0 lists known gaps (bin/cleanup-import.sh and bin/normalize.py still doc-only, ROADMAP M6). First run logged: playbooks/import-media/runs/lilo-stitch-2002.md. Lilo & Stitch (2002) imported to /home/user/media/movies/, item c2f4aff133c1b9631500fadf293b0b2f, TMDb 11544, MovieCount 3 -> 4. LibraryMonitor didn't auto-fire — needed manual /Library/Refresh; playbook updated to make this an unconditional step. Source: 1080p BluRay HEVC 10-bit / EAC3 5.1 / 2x PGS embedded subs. Per quality bar (README.md:41) — passes.
46 lines
2 KiB
Markdown
46 lines
2 KiB
Markdown
# Stop-gap subs — pending Whisper cross-ref
|
|
|
|
Shows whose current subtitles ship from a path that explicitly violates
|
|
[`STYLE.md`](STYLE.md). Quality is "acceptable, not great" (~85 %). When
|
|
v4 WhisperX (ROADMAP H5) lands on the friend RTX 4080 node, **regenerate
|
|
every show on this list** with proper-noun-prompted transcription and
|
|
replace the sidecars in place. Keep this file as the v4 worklist.
|
|
|
|
## Active stop-gaps
|
|
|
|
| Show | Eps subbed | Source path | Why stop-gap | Owner verdict | Logged |
|
|
|---|---|---|---|---|---|
|
|
| Sassy the Sasquatch (2022) | S01 5/5 | v3.5 YouTube auto-CC | lowercase, no punctuation, names mangled (`Sassy → sasha`), profanity = `[ __ ]` | "85 % the way there, acceptable, fine" — keep until v4 | 2026-05-10 |
|
|
|
|
## When more Big Lez universe shows ship via v3.5
|
|
|
|
Same channel hosts these — when subbed via the v3.5 yt-dlp path, append
|
|
to the table above:
|
|
|
|
- The Donny & Clarence Show (2024)
|
|
- The Big Lez Saga (2022)
|
|
- The Mike Nolan Show (2016) — but **try the YT "COMPLETE SEASON | SUBTITLES"
|
|
upload first** for hand-typed CCs before falling back to auto-CC
|
|
|
|
## v4 WhisperX rebuild plan
|
|
|
|
When the friend node (`100.64.0.3`, per memory `project_friend_gpu.md`) is
|
|
back online:
|
|
|
|
1. Install WhisperX on the node (CUDA 12 + cuDNN 9 + faster-whisper +
|
|
pyannote VAD).
|
|
2. For each show in the table above, write
|
|
`playbooks/subtitles/prompts/<show>.yaml` with the recurring proper
|
|
nouns the YT auto-CC mangled.
|
|
3. Run `lib/sub-whisperx-fetch.py` (TBD, ROADMAP H5) per show. Each
|
|
episode: pull mkv → ffmpeg extract 16k mono wav → WhisperX large-v3
|
|
with `--initial_prompt` from the yaml → SRT → SSH push to nullstone
|
|
with library filename, **overwriting the v3.5 sidecar in place**.
|
|
4. Tick off the row from the table; move it to a "Cleared via v4" archive
|
|
section below this one (kept as record).
|
|
5. Library scan; verify Jellyfin still reports 1 external eng sub stream
|
|
per ep (no dupes from v3.5 + v4 stacking).
|
|
|
|
## Cleared via v4 (archive)
|
|
|
|
(empty — populate as v4 rebuilds land)
|