alicloud-ai-audio-tts-voice-design

📁 cinience/alicloud-skills 📅 3 days ago

总安装量

周安装量

#7451

全站排名

安装命令

npx skills add https://github.com/cinience/alicloud-skills --skill alicloud-ai-audio-tts-voice-design

Agent 安装分布

qoder 52

github-copilot 52

codex 52

kimi-cli 52

gemini-cli 52

cursor 52

Skill 文档

Category: provider

Model Studio Qwen TTS Voice Design

Use voice design models to create controllable synthetic voices from natural language descriptions.

Critical model names

Use one of these exact model strings:

qwen3-tts-vd-2026-01-26
qwen3-tts-vd-realtime-2026-01-15

Prerequisites

Install SDK in a virtual environment:

python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.

Normalized interface (tts.voice_design)

Request

voice_prompt (string, required) target voice description
text (string, required)
stream (bool, optional)

Response

audio_url (string) or streaming PCM chunks
voice_id (string)
request_id (string)

Operational guidance

Write voice prompts with tone, pace, emotion, and timbre constraints.
Build a reusable voice prompt library for product consistency.
Validate generated voice in short utterances before long scripts.

Local helper script

Prepare a normalized request JSON and validate response schema:

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-design/scripts/prepare_voice_design_request.py \
  --voice-prompt "A warm female host voice, clear articulation, medium pace" \
  --text "è¿æ¯é³è²è®¾è®¡æ¼ç¤º"

Output location

Default output: output/ai-audio-tts-voice-design/audio/
Override base dir with OUTPUT_DIR.

References

references/sources.md

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台