发现和使用优秀的技能扩展
从图片和用户自己的音频录制生成唇同步视频。
✅ 适用场景:
- 用户提供自己的音频文件(语音录制)
- 希望将图片与特定音频/语音同步
- 用户自己录制了脚本
- 需要精确保留音频时间
❌ 不适用场景:
- 用户提供文本脚本(非音频)→ 使用veed-ugc
- 需要AI生成语音 → 使用veed-ugc
- 尚未拥有音频文件 → 使用带脚本的veed-ugc
输入:图片 + 音频文件(用户录制)
输出:与所提供音频唇同步的MP4视频
主要区别:veed-ugc = 脚本 → AI语音 → 视频
ugc-manual = 用户音频 → 视频(无语音生成)
Generate lip-sync video from image + user's own audio recording.
✅ USE WHEN:
- User provides their OWN audio file (voice recording)
- Want to sync image to specific audio/voice
- User recorded the script themselves
- Need exact audio timing preserved
❌ DON'T USE WHEN:
- User provides text script (not audio) → use veed-ugc
- Need AI to generate the voice → use veed-ugc
- Don't have audio file yet → use veed-ugc with script
INPUT: Image + audio file (user's recording)
OUTPUT: MP4 video with lip-sync to provided audio
KEY DIFFERENCE: veed-ugc = script → AI voice → video
ugc-manual = user audio → video (no voice generation)