xiaoyuzhou-transcribe

📁 weifenghuang/skillvault 📅 Feb 13, 2026
3
总安装量
3
周安装量
#58572
全站排名
安装命令
npx skills add https://github.com/weifenghuang/skillvault --skill xiaoyuzhou-transcribe

Agent 安装分布

github-copilot 3
codex 3
kimi-cli 3
gemini-cli 3
amp 3
opencode 3

Skill 文档

Xiaoyuzhou Transcribe

Overview

Generate SRT and TXT transcripts from a Xiaoyuzhou episode URL by downloading the audio and running faster-whisper locally. Use the bundled script to keep the workflow deterministic and repeatable.

Quick Start

  • Install dependency: python3 -m pip install -U faster-whisper
  • Run: python3 scripts/xiaoyuzhou_transcribe.py "<episode-url>" --output-dir .
  • Expect outputs: xiaoyuzhou-<eid>.mp3, xiaoyuzhou-<eid>.srt, xiaoyuzhou-<eid>.txt

Workflow

  • Fetch the episode page and parse __NEXT_DATA__ to locate the audio URL.
  • Download the audio (resume supported) unless --audio-path is provided.
  • Transcribe audio with faster-whisper and write SRT + TXT.

Script Usage

scripts/xiaoyuzhou_transcribe.py accepts:

  • --output-dir: write outputs to a specific directory
  • --model: whisper model size (tiny by default; use base or small for higher accuracy)
  • --language: force a language code or allow auto-detect
  • --audio-path: transcribe a local audio file instead of downloading
  • --force-download: re-download the audio even if it exists
  • --no-vad: disable VAD filtering if segments are too aggressive

Notes

  • If the episode is private or requires login, the script cannot access it.
  • Xiaoyuzhou does not expose public transcripts for many episodes; this workflow generates subtitles via speech-to-text instead.