发现和使用优秀的技能扩展
通过您的OpenClaw代理进行的实时AI视频聊天。使用Groq Whisper(云语音转文本)、edge-tts(通过微软的云文本转语音)和OpenClaw chatCompletions API进行对话。您的代理可以看到您的摄像头画面,听到您的声音,并以其自身的个性和记忆进行回应。
要求:需要GROQ_API_KEY用于语音识别。读取~/.openclaw/openclaw.json获取网关端口和身份验证令牌。
数据流向:音频→Groq云(语音转文本),文本转语音文本→微软云(edge-tts),摄像头帧(base64格式)+文本→OpenClaw网关→您配置的大型语言模型提供商(可能是云服务——如果使用云大型语言模型,帧会离开本地机器)。
安装持久化的launchd服务(可选)。触发短语:“video chat”、“voice call”、“call me”、“视频一下”、“语音”、“打电话给我”、“我要和你视频”、“videochat-withme”。
Real-time AI video chat that routes through your OpenClaw agent. Uses Groq Whisper (cloud STT),
edge-tts (cloud TTS via Microsoft), and OpenClaw chatCompletions API for conversation. Your agent
sees your camera, hears your voice, and responds with its own personality and memory.
Requires: GROQ_API_KEY for speech recognition. Reads ~/.openclaw/openclaw.json for gateway port and auth token.
Data flows: audio → Groq cloud (STT), TTS text → Microsoft cloud (edge-tts), camera frames (base64) + text
→ OpenClaw gateway → your configured LLM provider (may be cloud — frames leave the machine if using a cloud LLM).
Installs a persistent launchd service (optional). Trigger phrases: "video chat", "voice call",
"call me", "视频一下", "语音", "打电话给我", "我要和你视频", "videochat-withme".