xiaoyuzhou-transcribe

📁 weifenghuang/skillvault 📅 Feb 13, 2026

总安装量

周安装量

#58572

全站排名

安装命令

npx skills add https://github.com/weifenghuang/skillvault --skill xiaoyuzhou-transcribe

Agent 安装分布

github-copilot 3

codex 3

kimi-cli 3

gemini-cli 3

amp 3

opencode 3

Skill 文档

Xiaoyuzhou Transcribe

Overview

Generate SRT and TXT transcripts from a Xiaoyuzhou episode URL by downloading the audio and running faster-whisper locally. Use the bundled script to keep the workflow deterministic and repeatable.

Quick Start

Install dependency: python3 -m pip install -U faster-whisper
Run: python3 scripts/xiaoyuzhou_transcribe.py "<episode-url>" --output-dir .
Expect outputs: xiaoyuzhou-<eid>.mp3, xiaoyuzhou-<eid>.srt, xiaoyuzhou-<eid>.txt

Workflow

Fetch the episode page and parse __NEXT_DATA__ to locate the audio URL.
Download the audio (resume supported) unless --audio-path is provided.
Transcribe audio with faster-whisper and write SRT + TXT.

Script Usage

scripts/xiaoyuzhou_transcribe.py accepts:

--output-dir: write outputs to a specific directory
--model: whisper model size (tiny by default; use base or small for higher accuracy)
--language: force a language code or allow auto-detect
--audio-path: transcribe a local audio file instead of downloading
--force-download: re-download the audio even if it exists
--no-vad: disable VAD filtering if segments are too aggressive

Notes

If the episode is private or requires login, the script cannot access it.
Xiaoyuzhou does not expose public transcripts for many episodes; this workflow generates subtitles via speech-to-text instead.

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台