alicloud-ai-audio-tts-voice-clone

📁 cinience/alicloud-skills 📅 3 days ago

总安装量

周安装量

#7082

全站排名

安装命令

npx skills add https://github.com/cinience/alicloud-skills --skill alicloud-ai-audio-tts-voice-clone

Agent 安装分布

github-copilot 55

codex 55

kimi-cli 55

gemini-cli 55

cursor 55

amp 55

Skill 文档

Category: provider

Model Studio Qwen TTS Voice Clone

Use voice cloning models to replicate timbre from enrollment audio samples.

Critical model names

Use one of these exact model strings:

qwen3-tts-vc-2026-01-22
qwen3-tts-vc-realtime-2026-01-15

Prerequisites

Install SDK in a virtual environment:

python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.

Normalized interface (tts.voice_clone)

Request

text (string, required)
voice_sample (string | bytes, required) enrollment sample
voice_name (string, optional)
stream (bool, optional)

Response

audio_url (string) or streaming PCM chunks
voice_id (string)
request_id (string)

Operational guidance

Use clean speech samples with low background noise.
Respect consent and policy requirements for cloned voices.
Persist generated voice_id and reuse for future synthesis requests.

Local helper script

Prepare a normalized request JSON and validate response schema:

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts/prepare_voice_clone_request.py \
  --text "æ¬¢è¿æ¥å°è¯é³å¤å»æ¼ç¤º" \
  --voice-sample "https://example.com/voice-sample.wav"

Output location

Default output: output/ai-audio-tts-voice-clone/audio/
Override base dir with OUTPUT_DIR.

References

references/sources.md

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台