voice-agent
0
Total installs
1
Weekly installs
Install command
npx skills add https://github.com/ypyt1/all-skills --skill voice-agent
Agent install distribution
amp: 1
cline: 1
opencode: 1
cursor: 1
continue: 1
kimi-cli: 1
Skill documentation
Voice Agent
This skill lets you speak to and listen to the user through the local Voice Agent API.
Behavior Guidelines
- Audio First: When the user communicates via audio files, your PRIMARY mode of response is an audio file.
- Silent Delivery: When sending an audio response, DO NOT send a text explanation like “I sent an audio”. Just send the audio file.
- Workflow:
  - User sends audio.
  - You use transcribe to read it.
  - You think of a response.
  - You use synthesize to generate the audio file.
  - You send the file.
  - STOP. Do not add text commentary.
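The workflow above can be sketched as a small Python wrapper around the client script. This is only a minimal sketch under stated assumptions: `BASE_DIR` is a stand-in for `{baseDir}`, the `transcribe` command is assumed to print the transcript to stdout, and `run`, `reply_to_audio`, and `compose_reply` are hypothetical names that the skill itself does not define.

```python
import subprocess
from pathlib import Path

# Assumption: stand-in for the skill's {baseDir}; adjust to the real install path.
BASE_DIR = Path("/path/to/voice-agent")
CLIENT = BASE_DIR / "scripts" / "client.py"


def run(args):
    """Run the client script with the given arguments and return its stdout."""
    result = subprocess.run(
        ["python3", str(CLIENT), *args],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout.strip()


def reply_to_audio(audio_path, output_path, compose_reply, runner=run):
    """Transcribe the user's audio, compose a reply, and synthesize it.

    compose_reply is a hypothetical callback that turns the transcript
    into reply text. The function returns only the path of the audio file
    to send, so no text commentary accompanies the response.
    """
    # Assumption: the transcribe command prints the transcript to stdout.
    transcript = runner(["transcribe", audio_path])
    reply_text = compose_reply(transcript)
    runner(["synthesize", reply_text, "--output", output_path])
    return output_path
```

The `runner` parameter exists so the flow can be exercised without a running Voice Agent API, for example by passing a function that records the arguments it receives.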
Tools
Transcribe File
To transcribe an audio file (Speech-to-Text), run the client script with the transcribe command.
python3 {baseDir}/scripts/client.py transcribe "/path/to/audio/file.ogg"
Synthesize to File
To generate audio from text and save it to a file (Text-to-Speech), run the client script with the synthesize command.
python3 {baseDir}/scripts/client.py synthesize "Text to speak" --output "/path/to/output.mp3"
Health Check
To check if the voice agent API is running and healthy:
python3 {baseDir}/scripts/client.py health
Service Management
If the Health Check fails or you receive a connection error, the service may be stopped.
You can attempt to start it by running:
{baseDir}/scripts/start.sh
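The check-then-start procedure described above could be automated roughly as follows. This is a sketch, not the skill's own logic. It assumes that the `health` command exits with a non-zero status when the API is down or unreachable, and that `start.sh` returns after launching the service in the background; neither is stated in this document. `BASE_DIR`, `is_healthy`, `start_service`, and `ensure_running` are hypothetical names.

```python
import subprocess
import time
from pathlib import Path

# Assumption: stand-in for the skill's {baseDir}; adjust to the real install path.
BASE_DIR = Path("/path/to/voice-agent")
SCRIPTS = BASE_DIR / "scripts"


def is_healthy():
    """Return True if the health command exits with status 0.

    Assumption: client.py reports an unhealthy or unreachable API
    through a non-zero exit code.
    """
    result = subprocess.run(
        ["python3", str(SCRIPTS / "client.py"), "health"],
        capture_output=True,
        text=True,
    )
    return result.returncode == 0


def start_service():
    """Attempt to start the service.

    Assumption: start.sh returns once the service has been launched.
    """
    subprocess.run([str(SCRIPTS / "start.sh")], check=False)


def ensure_running(check=is_healthy, start=start_service,
                   retries=5, delay=1.0, sleep=time.sleep):
    """Start the service if the health check fails, then poll until healthy."""
    if check():
        return True
    start()
    for _ in range(retries):
        sleep(delay)  # give the service time to come up before re-checking
        if check():
            return True
    return False
```

Calling `ensure_running()` before `transcribe` or `synthesize` would surface a stopped service early. The retry count and delay are arbitrary defaults and should be tuned to how long the service actually takes to start.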