alicloud-ai-audio-tts-voice-clone

📁 cinience/alicloud-skills 📅 3 days ago
56
总安装量
56
周安装量
#7082
全站排名
安装命令
npx skills add https://github.com/cinience/alicloud-skills --skill alicloud-ai-audio-tts-voice-clone

Agent 安装分布

github-copilot 55
codex 55
kimi-cli 55
gemini-cli 55
cursor 55
amp 55

Skill 文档

Category: provider

Model Studio Qwen TTS Voice Clone

Use voice cloning models to replicate timbre from enrollment audio samples.

Critical model names

Use one of these exact model strings:

  • qwen3-tts-vc-2026-01-22
  • qwen3-tts-vc-realtime-2026-01-15

Prerequisites

  • Install SDK in a virtual environment:
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.

Normalized interface (tts.voice_clone)

Request

  • text (string, required)
  • voice_sample (string | bytes, required) enrollment sample
  • voice_name (string, optional)
  • stream (bool, optional)

Response

  • audio_url (string) or streaming PCM chunks
  • voice_id (string)
  • request_id (string)

Operational guidance

  • Use clean speech samples with low background noise.
  • Respect consent and policy requirements for cloned voices.
  • Persist generated voice_id and reuse for future synthesis requests.

Local helper script

Prepare a normalized request JSON and validate response schema:

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts/prepare_voice_clone_request.py \
  --text "欢迎来到语音复刻演示" \
  --voice-sample "https://example.com/voice-sample.wav"

Output location

  • Default output: output/ai-audio-tts-voice-clone/audio/
  • Override base dir with OUTPUT_DIR.

References

  • references/sources.md