VibeVideo Generation
vibevideo-generate
by bytevirts
Generate images and videos using the VibeVideo API. Use when the user
Install
claude skill add --url https://github.com/openclaw/skills
Documentation
VibeVideo Generation Skill
Prerequisites
- Environment variable VIBEVIDEO_API_KEY must be set with a valid API key
- Get your API key from: Dashboard → Settings → API Keys
API Endpoint
This skill always uses the official VibeVideo API endpoint: https://vibevideo.app
Generate Image
Create an image generation task:
curl -s -X POST https://vibevideo.app/api/ai/generate \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "mediaType": "image",
    "scene": "text-to-image",
    "model": "nano-banana-2",
    "prompt": "A cat sitting on a rainbow",
    "options": {
      "aspect_ratio": "1:1",
      "quality": "2K"
    }
  }'
For image-to-image, set "scene": "image-to-image" and add "image_url": "..." in options.
Response:
{ "code": 0, "data": { "id": "task_id", "status": "pending", "taskId": "...", "costCredits": 5 } }
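The task ID in this response is needed for the later query call. A minimal shell sketch for capturing it, assuming the response is shaped like the sample above and that the taskId field (rather than id) is the value the query endpoint expects. extract_task_id is a hypothetical helper, and the grep/sed extraction is deliberately naive; a JSON-aware tool such as jq would be more robust.

```shell
# Hypothetical helper: read a generate response on stdin and print data.taskId.
# Naive text extraction; assumes the value contains no escaped quotes.
extract_task_id() {
  grep -o '"taskId"[[:space:]]*:[[:space:]]*"[^"]*"' \
    | head -n 1 \
    | sed 's/.*:[[:space:]]*"\(.*\)"/\1/'
}

# Example (not run here): pipe the generate call into the helper.
#   task_id="$(curl -s -X POST https://vibevideo.app/api/ai/generate ... | extract_task_id)"
```

If the field is absent, for example in an error response, the helper prints nothing, so an empty result is worth checking before polling.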
Generate Video
Create a video generation task:
curl -s -X POST https://vibevideo.app/api/ai/generate \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "mediaType": "video",
    "scene": "text-to-video",
    "model": "seedance-2-0",
    "prompt": "A dog playing in a park",
    "options": {
      "resolution": "720p",
      "duration": "5s",
      "aspect_ratio": "16:9"
    }
  }'
For image-to-video, set "scene": "image-to-video" and add "image_url": "..." in options.
For frames-to-video, add "start_image_url": "..." and "end_image_url": "..." in options.
Query Task Status
Tasks are asynchronous. Poll until status is success, failed, or canceled:
curl -s -X POST https://vibevideo.app/api/ai/query \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{ "taskId": "YOUR_TASK_ID" }'
Response includes status, taskInfo, taskResult, and taskUrls (JSON string of media URLs).
Calculate Cost
Check credit cost before generating:
curl -s -X POST https://vibevideo.app/api/ai/cost \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "seedance-2-0",
    "mediaType": "video",
    "scene": "text-to-video",
    "options": { "resolution": "720p", "duration": "5s" }
  }'
Cancel Task
curl -s -X DELETE https://vibevideo.app/api/ai/tasks/YOUR_TASK_ID \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY"
Workflow
- If the user doesn't specify a model, pick the default for the scene (see tables below)
- Call the generate endpoint
- Poll the query endpoint every 5 seconds until status is terminal (success/failed/canceled)
- Parse taskUrls from the response and report the media URL(s) to the user
- If code is not 0, handle the error (see Error Handling below)
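The polling step above can be sketched as a small shell loop. This is a minimal sketch, not an official client: POLL_INTERVAL, MAX_ATTEMPTS, query_task, extract_status, and wait_for_task are hypothetical names, the 60-attempt cap is an arbitrary choice, and the status is pulled out with a naive grep/sed match that assumes the first "status" field in the response is the task status.

```shell
# Illustrative settings, not part of the VibeVideo API.
POLL_INTERVAL="${POLL_INTERVAL:-5}"   # seconds between polls
MAX_ATTEMPTS="${MAX_ATTEMPTS:-60}"    # give up after this many polls

# Send one query request and print the raw JSON response.
query_task() {
  curl -s -X POST https://vibevideo.app/api/ai/query \
    -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
    -H "Content-Type: application/json" \
    -d "{ \"taskId\": \"$1\" }"
}

# Print the first "status" value found in the JSON on stdin (naive extraction).
extract_status() {
  grep -o '"status"[[:space:]]*:[[:space:]]*"[^"]*"' \
    | head -n 1 \
    | sed 's/.*:[[:space:]]*"\(.*\)"/\1/'
}

# Poll until the task reaches a terminal status, then print the final response.
# Returns 0 on success, 1 on failed or canceled, 2 on timeout.
wait_for_task() {
  local task_id="$1"
  local attempt=0
  local response status
  while [ "$attempt" -lt "$MAX_ATTEMPTS" ]; do
    response="$(query_task "$task_id")"
    status="$(printf '%s' "$response" | extract_status)"
    case "$status" in
      success)
        printf '%s\n' "$response"
        return 0
        ;;
      failed|canceled)
        printf '%s\n' "$response"
        return 1
        ;;
    esac
    attempt=$((attempt + 1))
    sleep "$POLL_INTERVAL"
  done
  echo "Timed out waiting for task $task_id" >&2
  return 2
}
```

With these functions defined, wait_for_task YOUR_TASK_ID prints the final response once the task is terminal. In this sketch an error envelope that carries no status is simply polled until the timeout.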
Image Models
| ID | Label | Vendor | Scenes | Qualities |
|---|---|---|---|---|
| nano-banana-2 | Nano Banana 2 | | text-to-image, image-to-image | 1K, 2K, 4K |
| gpt-image-1-5 | GPT Image 1.5 | OpenAI | text-to-image, image-to-image | Medium, High |
| grok-imagine | Grok Imagine | Grok | text-to-image, image-to-image | — |
| seedream-5-0 | Seedream 5.0 | ByteDance | text-to-image, image-to-image | Basic, High |
| qwen-image | Qwen Image | Qwen | text-to-image, image-to-image | — |
| wan-2-7-image | Wan 2.7 Image | Qwen/Alibaba | text-to-image, image-to-image | 1K, 2K |
| wan-2-7-image-pro | Wan 2.7 Image Pro | Qwen/Alibaba | text-to-image, image-to-image | 1K, 2K, 4K |
Default for text-to-image: nano-banana-2
Video Models
| ID | Label | Vendor | Scenes | Resolutions | Durations |
|---|---|---|---|---|---|
| seedance-2-0 | Seedance 2.0 | ByteDance | text-to-video, image-to-video, frames-to-video, reference-to-video | 720p, 1080p | 5s, 10s, 15s |
| seedance-2-0-fast | Seedance 2.0 Fast | ByteDance | text-to-video, image-to-video, frames-to-video, reference-to-video | 720p, 1080p | 5s, 10s, 15s |
| seedance-1-5-pro | Seedance 1.5 Pro | ByteDance | text-to-video, image-to-video | 480p, 720p, 1080p | 4s, 8s, 12s |
| grok-imagine | Grok Imagine | Grok | text-to-video, image-to-video | 480p, 720p | 6s, 10s, 15s |
| kling-2-6 | Kling 2.6 | Kling | text-to-video, image-to-video | — | 5s, 10s |
| runway | Runway | Runway | text-to-video, image-to-video | 720p, 1080p | 5s, 10s |
| veo-3-1 | Veo 3.1 | | text-to-video, image-to-video, frames-to-video, reference-to-video | 720p, 1080p, 4k | — |
| veo-3-1-fast | Veo 3.1 Fast | | text-to-video, image-to-video, frames-to-video, reference-to-video | 720p, 1080p, 4k | — |
| sora-2-pro | Sora 2 Pro | OpenAI | text-to-video, image-to-video | — | 10s, 15s |
| sora-2 | Sora 2 | OpenAI | text-to-video, image-to-video | — | 10s, 15s |
| seedence-1-0-pro | Seedence 1.0 Pro | ByteDance | text-to-video, image-to-video | 480p, 720p, 1080p | 5s, 10s |
| seedence-1-0-pro-fast | Seedence 1.0 Pro Fast | ByteDance | image-to-video | 720p, 1080p | 5s, 10s |
| seedence-1-0-lite | Seedence 1.0 Lite | ByteDance | text-to-video, image-to-video | 480p, 720p, 1080p | 5s, 10s |
Default for text-to-video: seedance-2-0
Error Handling
- code: -1 with "no auth": User's API key is missing or invalid. Remind them to set VIBEVIDEO_API_KEY.
- code: -1002 with "insufficient credits": User needs to purchase credits in the VibeVideo dashboard.
- code: -1 with "invalid": Wrong model ID, scene, or mediaType. Check against the model tables above.
- Task stuck in "processing": Polling timeout. The task may still complete, so suggest the user wait and query again later.
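The error cases above could be mapped to user-facing hints with a small case statement. This is a sketch under assumptions: explain_error is a hypothetical helper, and it assumes the message field contains the quoted phrases ("no auth", "invalid") literally and in lowercase, which the API may not guarantee.

```shell
# Hypothetical helper: map an error code and message to a user-facing hint.
# Usage: explain_error CODE MESSAGE
explain_error() {
  local code="$1"
  local message="$2"
  case "$code:$message" in
    -1002:*)
      echo "Insufficient credits: purchase credits in the VibeVideo dashboard." ;;
    -1:*"no auth"*)
      echo "API key missing or invalid: set VIBEVIDEO_API_KEY." ;;
    -1:*invalid*)
      echo "Invalid request: check the model ID, scene, and mediaType against the model tables." ;;
    *)
      echo "Unexpected error (code $code): $message" ;;
  esac
}
```

The "no auth" branch is listed before the "invalid" branch so that a message mentioning both is treated as an authentication problem first.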
API Response Envelope
All endpoints return:
{ "code": 0, "message": "ok", "data": { ... } }
code: 0 means success. Non-zero code means error (check message).
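A minimal sketch of checking the envelope before using a response. check_envelope is a hypothetical helper, and the code is extracted with a naive grep/sed match that assumes the first "code" field in the body is the envelope code.

```shell
# Hypothetical helper: read an API response on stdin. On code 0, print the
# response to stdout and return 0; otherwise report it on stderr and return 1.
check_envelope() {
  local body code
  body="$(cat)"
  code="$(printf '%s' "$body" \
    | grep -o '"code"[[:space:]]*:[[:space:]]*-\{0,1\}[0-9][0-9]*' \
    | head -n 1 \
    | sed 's/.*:[[:space:]]*//')"
  if [ "$code" = "0" ]; then
    printf '%s\n' "$body"
    return 0
  fi
  printf 'API error (code %s): %s\n' "${code:-unknown}" "$body" >&2
  return 1
}

# Example (not run here):
#   curl -s -X POST https://vibevideo.app/api/ai/query ... | check_envelope
```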