alicloud-ai-audio-tts-voice-design
53
总安装量
53
周安装量
#7451
全站排名
安装命令
npx skills add https://github.com/cinience/alicloud-skills --skill alicloud-ai-audio-tts-voice-design
Agent 安装分布
qoder
52
github-copilot
52
codex
52
kimi-cli
52
gemini-cli
52
cursor
52
Skill 文档
Category: provider
Model Studio Qwen TTS Voice Design
Use voice design models to create controllable synthetic voices from natural language descriptions.
Critical model names
Use one of these exact model strings:
qwen3-tts-vd-2026-01-26qwen3-tts-vd-realtime-2026-01-15
Prerequisites
- Install SDK in a virtual environment:
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
- Set
DASHSCOPE_API_KEYin your environment, or adddashscope_api_keyto~/.alibabacloud/credentials.
Normalized interface (tts.voice_design)
Request
voice_prompt(string, required) target voice descriptiontext(string, required)stream(bool, optional)
Response
audio_url(string) or streaming PCM chunksvoice_id(string)request_id(string)
Operational guidance
- Write voice prompts with tone, pace, emotion, and timbre constraints.
- Build a reusable voice prompt library for product consistency.
- Validate generated voice in short utterances before long scripts.
Local helper script
Prepare a normalized request JSON and validate response schema:
.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \
--voice-prompt "A warm female host voice, clear articulation, medium pace" \
--text "è¿æ¯é³è²è®¾è®¡æ¼ç¤º"
Output location
- Default output:
output/ai-audio-tts-voice-design/audio/ - Override base dir with
OUTPUT_DIR.
References
references/sources.md