发现和使用优秀的技能扩展
使用OpenAI的TTS API进行文本到语音转换,以生成高质量、自然流畅的音频。支持6种声音(alloy、echo、fable、onyx、nova、shimmer)、速度控制(0.25倍-4.0倍)、高清质量模型、多种输出格式(mp3、opus、aac、flac),以及针对长内容的自动文本分块(每个请求的字符限制为4096个)。在以下情况使用:(1)用户请求音频/语音输出,触发词包括“读给我听”、“转换为音频”、“生成语音”、“文本转语音”、“tts”、“旁白”、“朗读”,或出现关键词“openai tts”、“声音”、“播客”时。(2)内容需要听而不是读(如多任务处理、辅助功能)。(3)用户有特定的声音偏好,如“alloy”、“echo”、“fable”、“onyx”、“nova”、“shimmer”,或需要调整速度时。
Text-to-speech conversion using OpenAI's TTS API for generating high-quality, natural-sounding audio.
Supports 6 voices (alloy, echo, fable, onyx, nova, shimmer), speed control (0.25x-4.0x),
HD quality model, multiple output formats (mp3, opus, aac, flac), and automatic text chunking
for long content (4096 char limit per request).
Use when: (1) User requests audio/voice output with triggers like "read this to me",
"convert to audio", "generate speech", "text to speech", "tts", "narrate", "speak",
or when keywords "openai tts", "voice", "podcast" appear. (2) Content needs to be spoken
rather than read (multitasking, accessibility). (3) User wants specific voice preferences
like "alloy", "echo", "fable", "onyx", "nova", "shimmer" or speed adjustments.