good-ttvideo2text

📁 imgoodbai/goodable 📅 14 days ago

Site ranking: #48742

Install command

npx skills add https://github.com/imgoodbai/goodable --skill good-ttvideo2text

Agent install distribution

mcpjam 2

claude-code 2

kilo 2

replit 2

junie 2

windsurf 2

Skill documentation

good-TTvideo2text

Extract audio from Douyin/TikTok videos and transcribe it to text using an ASR service.

Task Objective

Convert Douyin/TikTok video audio to text with timestamps, supporting both an interactive web UI and a command-line workflow.

Capabilities: Video parsing, audio extraction, ASR transcription, timestamp generation

Trigger: User provides Douyin/TikTok URL and requests transcription

Usage Modes

Mode 1: Web UI (Recommended)

Visual interface for transcription management:

# Install dependencies
cd skills/good-TTvideo2text
pip install -r requirements.txt

# Start service (default port 8000)
python app/main.py

# Browser access
http://localhost:8000

Features:

Paste video URL for instant transcription
View results with timestamps
Cookie management for restricted videos
Real-time progress updates

Mode 2: Script (Command Line)

Suitable for automation and AI workflow integration:

# Basic usage
python scripts/transcribe.py "https://v.douyin.com/xxx"

# Extract URL from share text (sample Douyin share text: "Copy and open Douyin to see [username]'s post")
python scripts/transcribe.py "7.47 复制打开抖音，看看【用户名】的作品 https://v.douyin.com/xxx"

# JSON output
python scripts/transcribe.py "https://v.douyin.com/xxx" --output json

# Text output (default)
python scripts/transcribe.py "https://v.douyin.com/xxx" --output text

Output Format (Text):

=== Video Info ===
Title: Video title
Author: Author name
Duration: 30s

=== Transcription ===
Full Text:
Complete transcription text...

Sentences with Timestamps:
[00:00-00:03] First sentence
[00:03-00:06] Second sentence

Output Format (JSON):

{
  "success": true,
  "video_info": {
    "title": "Video title",
    "author": "Author name",
    "duration": 30,
    "create_time": 1234567890
  },
  "transcription": {
    "text": "Complete transcription...",
    "sentences": [
      {
        "start_ms": 0,
        "end_ms": 3000,
        "text": "First sentence"
      }
    ]
  }
}
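The JSON result carries sentence boundaries in milliseconds, while the text output shows them as [MM:SS-MM:SS]. A minimal sketch of that conversion follows. The helper names `format_ms` and `sentences_to_lines` are hypothetical, and truncating to whole seconds is an assumption, not necessarily what scripts/transcribe.py does.

```python
def format_ms(ms: int) -> str:
    """Render milliseconds as MM:SS, truncating fractional seconds."""
    total_seconds = ms // 1000
    return f"{total_seconds // 60:02d}:{total_seconds % 60:02d}"


def sentences_to_lines(result: dict) -> list[str]:
    """Build '[MM:SS-MM:SS] text' lines from a parsed JSON result."""
    sentences = result.get("transcription", {}).get("sentences", [])
    return [
        f"[{format_ms(s['start_ms'])}-{format_ms(s['end_ms'])}] {s['text']}"
        for s in sentences
    ]


# Sample shaped like the JSON output documented above.
sample = {
    "success": True,
    "transcription": {
        "text": "Complete transcription...",
        "sentences": [{"start_ms": 0, "end_ms": 3000, "text": "First sentence"}],
    },
}

for line in sentences_to_lines(sample):
    print(line)  # [00:00-00:03] First sentence
```

For audio longer than an hour, the minutes field simply grows past 59 in this sketch.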

Prerequisites

Dependencies:

fastapi>=0.120.3, uvicorn>=0.35.0, httpx>=0.28.1
TikTokDownloader dependencies (see requirements.txt)

ASR Configuration:

Environment variables are auto-injected by the Goodable platform:
GOODABLE_ASR_SUBMIT_URL – ASR task submission endpoint
GOODABLE_ASR_QUERY_URL_TEMPLATE – ASR result query endpoint (with {task_id} placeholder)
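Before starting a transcription it can help to confirm that both variables are present and to see how the {task_id} placeholder is filled. The sketch below is an illustration under assumptions: the helper names and the example endpoint are hypothetical, and the skill's own code may validate its configuration differently.

```python
import os

REQUIRED_VARS = ("GOODABLE_ASR_SUBMIT_URL", "GOODABLE_ASR_QUERY_URL_TEMPLATE")


def missing_asr_vars(env=None) -> list[str]:
    """Return the names of required ASR variables that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


def build_query_url(template: str, task_id: str) -> str:
    """Substitute the {task_id} placeholder in the query URL template."""
    # str.replace is used instead of str.format so that any other braces
    # in the URL are left untouched.
    return template.replace("{task_id}", task_id)


# Hypothetical endpoint, for illustration only.
example_env = {
    "GOODABLE_ASR_SUBMIT_URL": "https://asr.example.invalid/submit",
    "GOODABLE_ASR_QUERY_URL_TEMPLATE": "https://asr.example.invalid/tasks/{task_id}",
}

print(missing_asr_vars(example_env))  # []
print(build_query_url(example_env["GOODABLE_ASR_QUERY_URL_TEMPLATE"], "abc123"))
```

A non-empty list from `missing_asr_vars()` corresponds to the "ASR not configured" case.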

Cookie Configuration (Optional):

Some videos require login cookies
Configure via Web UI Settings or edit TikTokDownloader/settings.json
Cookie format: Douyin web cookie string

Standard Workflow (AI Usage)

When User Provides URL

Extract URL from input:
- User may provide raw URL or share text with URL
- Use regex to extract actual video URL

Call transcribe script:

python scripts/transcribe.py "USER_PROVIDED_TEXT"

Handle results:
- Success: Present transcription text and timestamps
- Error: Check error message for troubleshooting
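The three steps above can be sketched as a small wrapper around the script. This assumes that `--output json` prints a single JSON document on stdout; the `error` field name, the 120-second timeout, and the function names are hypothetical and not confirmed by the skill's documentation.

```python
import json
import subprocess
import sys


def build_command(user_text: str, script: str = "scripts/transcribe.py") -> list[str]:
    """Build the argv list; the raw user text is passed as a single argument."""
    return [sys.executable, script, user_text, "--output", "json"]


def summarize(result: dict) -> str:
    """Turn a parsed result into a one-line summary for the user."""
    if not result.get("success"):
        # "error" is a hypothetical field name for the failure message.
        return f"Transcription failed: {result.get('error', 'unknown error')}"
    info = result.get("video_info", {})
    text = result.get("transcription", {}).get("text", "")
    return f"{info.get('title', '')} ({info.get('author', '')}): {text}"


def transcribe(user_text: str, timeout: float = 120.0) -> dict:
    """Run the script and parse its stdout as JSON (assumed output channel)."""
    proc = subprocess.run(
        build_command(user_text), capture_output=True, text=True, timeout=timeout
    )
    try:
        return json.loads(proc.stdout)
    except json.JSONDecodeError:
        return {"success": False, "error": proc.stderr.strip() or "non-JSON output"}
```

Passing the text as one list element avoids shell quoting problems when the share text contains spaces or punctuation.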

Common Error Handling

Error	Cause	Solution
Invalid video URL	URL format incorrect	Ask user to provide valid Douyin/TikTok URL
Video not found	Video deleted or requires login	Ask user to check video or provide cookies
No audio found	Video has no background music	Inform user this video has no audio track
ASR not configured	Environment variables missing	Run via Goodable platform (auto-injects vars)
ASR timeout	Long audio or slow service	Retry or use Web UI for monitoring
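The table above can be encoded as a lookup so that an agent picks a consistent next step. This is a sketch that assumes the script's error message contains the strings in the Error column; the substring matching and the fallback text are illustrative choices.

```python
# Error text (first column of the table) mapped to the suggested next step.
ERROR_ACTIONS = {
    "Invalid video URL": "Ask the user to provide a valid Douyin/TikTok URL.",
    "Video not found": "Ask the user to check the video or provide cookies.",
    "No audio found": "Inform the user that this video has no audio track.",
    "ASR not configured": "Run via the Goodable platform, which injects the variables.",
    "ASR timeout": "Retry, or use the Web UI to monitor progress.",
}

DEFAULT_ACTION = "Show the raw error message to the user."


def suggest_action(error_message: str) -> str:
    """Return the suggested action for the first table entry found in the message."""
    lowered = error_message.lower()
    for key, action in ERROR_ACTIONS.items():
        if key.lower() in lowered:
            return action
    return DEFAULT_ACTION


print(suggest_action("ASR timeout after 60s"))
```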

Resource Index

app/main.py – FastAPI application entry
scripts/transcribe.py – Command-line transcription script
TikTokDownloader/ – Video parsing library (source code)
TikTokDownloader/settings.json – Cookie and configuration
static/index.html – Web UI interface
requirements.txt – Python dependencies
downloads/ – Temporary file directory

Important Notes

Cookie Requirement: Public videos work without cookies; restricted videos need login cookies
ASR Platform: This skill requires the Goodable platform's ASR service integration
URL Formats Supported:
- Short URL: https://v.douyin.com/xxx
- Full URL: https://www.douyin.com/video/1234567890
- TikTok: https://vm.tiktok.com/xxx, https://www.tiktok.com/@user/video/xxx
Audio Extraction: Extracts the background music URL directly (no download needed)
ASR Processing Time: Typically 5-30 seconds, depending on audio length
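The supported URL formats listed above can be recognized in raw share text with a regular expression plus a host check. The sketch below is an assumption-based illustration, not the pattern used inside scripts/transcribe.py; the labels such as `douyin-short` and the function name are hypothetical.

```python
import re
from urllib.parse import urlparse

# ASCII-only character class, so Chinese text adjacent to a link is not captured.
URL_PATTERN = re.compile(r"https?://[\w\-./?=&%@]+", re.ASCII)

# Hosts taken from the supported formats; the labels are illustrative.
SUPPORTED_HOSTS = {
    "v.douyin.com": "douyin-short",
    "www.douyin.com": "douyin-full",
    "vm.tiktok.com": "tiktok-short",
    "www.tiktok.com": "tiktok-full",
}


def extract_video_url(text: str):
    """Return (url, kind) for the first supported link in text, or None."""
    for candidate in URL_PATTERN.findall(text):
        host = urlparse(candidate).hostname
        if host in SUPPORTED_HOSTS:
            return candidate, SUPPORTED_HOSTS[host]
    return None


print(extract_video_url("7.47 复制打开抖音 https://v.douyin.com/abc123"))
# ('https://v.douyin.com/abc123', 'douyin-short')
```

A `None` result corresponds to the "Invalid video URL" case, where the user should be asked for a valid link.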

Example AI Usage

User: "Please transcribe this Douyin video: 7.47 复制打开抖音 https://v.douyin.com/abc123"

AI: Let me transcribe this video for you.

[Calls transcribe.py script with the provided text]

Based on the transcription:

Video: "Video title"
Author: Author name

Transcription:
[00:00-00:03] First sentence
[00:03-00:08] Second sentence
...

Full text:
Complete transcription text...
