good-ttvideo2text
4
总安装量
2
周安装量
#48742
全站排名
安装命令
npx skills add https://github.com/imgoodbai/goodable --skill good-ttvideo2text
Agent 安装分布
mcpjam
2
claude-code
2
kilo
2
replit
2
junie
2
windsurf
2
Skill 文档
good-TTvideo2text
Extract audio from Douyin/TikTok videos and transcribe to text using ASR service.
Task Objective
Convert Douyin/TikTok video audio to text with timestamps, supporting both interactive UI and command-line workflow.
Capabilities: Video parsing, audio extraction, ASR transcription, timestamp generation
Trigger: User provides Douyin/TikTok URL and requests transcription
Usage Modes
Mode 1: Web UI (Recommended)
Visual interface for transcription management:
# Install dependencies
cd skills/good-TTvideo2text
pip install -r requirements.txt
# Start service (default port 8000)
python app/main.py
# Browser access
http://localhost:8000
Features:
- Paste video URL for instant transcription
- View results with timestamps
- Cookie management for restricted videos
- Real-time progress updates
Mode 2: Script (Command Line)
Suitable for automation, AI workflow integration:
# Basic usage
python scripts/transcribe.py "https://v.douyin.com/xxx"
# Extract URL from share text
python scripts/transcribe.py "7.47 å¤å¶æå¼æé³ï¼ççãç¨æ·åãçä½å https://v.douyin.com/xxx"
# JSON output
python scripts/transcribe.py "https://v.douyin.com/xxx" --output json
# Text output (default)
python scripts/transcribe.py "https://v.douyin.com/xxx" --output text
Output Format (Text):
=== Video Info ===
Title: Video title
Author: Author name
Duration: 30s
=== Transcription ===
Full Text:
Complete transcription text...
Sentences with Timestamps:
[00:00-00:03] First sentence
[00:03-00:06] Second sentence
Output Format (JSON):
{
"success": true,
"video_info": {
"title": "Video title",
"author": "Author name",
"duration": 30,
"create_time": 1234567890
},
"transcription": {
"text": "Complete transcription...",
"sentences": [
{
"start_ms": 0,
"end_ms": 3000,
"text": "First sentence"
}
]
}
}
Prerequisites
Dependencies:
- fastapi>=0.120.3, uvicorn>=0.35.0, httpx>=0.28.1
- TikTokDownloader dependencies (see requirements.txt)
ASR Configuration:
- Environment variables auto-injected by Goodable platform
GOODABLE_ASR_SUBMIT_URL– ASR task submission endpointGOODABLE_ASR_QUERY_URL_TEMPLATE– ASR result query endpoint (with {task_id} placeholder)
Cookie Configuration (Optional):
- Some videos require login cookies
- Configure via Web UI Settings or edit
TikTokDownloader/settings.json - Cookie format: Douyin web cookie string
Standard Workflow (AI Usage)
When User Provides URL
-
Extract URL from input:
- User may provide raw URL or share text with URL
- Use regex to extract actual video URL
-
Call transcribe script:
python scripts/transcribe.py "USER_PROVIDED_TEXT" -
Handle results:
- Success: Present transcription text and timestamps
- Error: Check error message for troubleshooting
Common Error Handling
| Error | Cause | Solution |
|---|---|---|
| Invalid video URL | URL format incorrect | Ask user to provide valid Douyin/TikTok URL |
| Video not found | Video deleted or requires login | Ask user to check video or provide cookies |
| No audio found | Video has no background music | Inform user this video has no audio track |
| ASR not configured | Environment variables missing | Run via Goodable platform (auto-injects vars) |
| ASR timeout | Long audio or service slow | Retry or use Web UI for monitoring |
Resource Index
app/main.py– FastAPI application entryscripts/transcribe.py– Command-line transcription scriptTikTokDownloader/– Video parsing library (source code)TikTokDownloader/settings.json– Cookie and configurationstatic/index.html– Web UI interfacerequirements.txt– Python dependenciesdownloads/– Temporary file directory
Important Notes
- Cookie Requirement: Public videos work without cookies, restricted videos need login cookies
- ASR Platform: This skill requires Goodable platform’s ASR service integration
- URL Formats Supported:
- Short URL:
https://v.douyin.com/xxx - Full URL:
https://www.douyin.com/video/1234567890 - TikTok:
https://vm.tiktok.com/xxx,https://www.tiktok.com/@user/video/xxx
- Short URL:
- Audio Extraction: Extracts background music URL directly (no download needed)
- ASR Processing Time: Typically 5-30 seconds depending on audio length
Example AI Usage
User: "帮æè½¬åè¿ä¸ªæé³è§é¢ 7.47 å¤å¶æå¼æé³ https://v.douyin.com/abc123"
AI: Let me transcribe this video for you.
[Calls transcribe.py script with the provided text]
Based on the transcription:
Video: "è§é¢æ é¢"
Author: ä½è
å
Transcription:
[00:00-00:03] 第ä¸å¥è¯å
容
[00:03-00:08] 第äºå¥è¯å
容
...
Full text:
宿´çè½¬åææ¬å
容...