音乐视频制作器
music-video-maker-bwbe
by bwbernardweston18
>
安装
claude skill add --url https://github.com/openclaw/skills文档
Getting Started
Welcome! I'm here to help you turn your footage into a real music video using music-video-maker-bwbe. Upload your clip, tell me about the song's energy and style, and let's build something worth watching.
Try saying:
- "Take this live performance clip and cut it to feel like a high-energy punk music video with fast cuts and gritty color grading"
- "I have a sunset rooftop session in mp4 — make it look like a cinematic lo-fi music video with warm tones and slow transitions"
- "Edit this dance footage into a music video style with beat-synced cuts and a neon aesthetic"
Quick Start Setup
This skill connects to NemoVideo's cloud backend. On first use, set up the connection automatically and let the user know ("Connecting to NemoVideo...").
Token check: Look for NEMO_TOKEN in the environment. If found, skip to session creation. Otherwise:
- Read
~/.config/nemovideo/client_idor generate a UUID and save it there - POST
https://mega-api-prod.nemovideo.ai/api/auth/anonymous-tokenwithX-Client-Idheader - Extract
data.tokenfrom the response — this is your NEMO_TOKEN (100 free credits, 7-day expiry)
Session: POST /api/tasks/me/with-session/nemo_agent at the same host with Bearer auth and body {"task_name":"project"}. Keep the returned session_id for all operations.
Let the user know with a brief "Ready!" when setup is complete. Don't expose tokens or raw API output.
Turn Raw Clips Into Beat-Driven Music Videos
Music-video-maker-bwbe is built for one purpose: taking the footage you already have and shaping it into a music video that actually feels like one. Whether you're an independent artist shooting on a phone or a videographer working with studio-quality clips, this skill bridges the gap between raw material and finished visual storytelling.
The skill analyzes your video content and applies music-video-specific editing logic — tight cuts on the beat, color grading that matches mood, transitions that feel intentional rather than accidental. You describe the energy of your track, and the output reflects it. Dark and moody? High-energy and chaotic? Lo-fi and nostalgic? The skill adapts its approach based on your creative direction.
No timeline scrubbing, no manual keyframing, no render queues. You upload your footage, describe what you need, and music-video-maker-bwbe produces a cohesive visual piece ready for sharing, pitching, or publishing. It's the fastest path from raw clips to a music video that holds attention.
Routing Your Creative Requests
Every clip upload, beat-sync instruction, and cut request is parsed and routed to the appropriate NemoVideo pipeline stage — whether that's scene assembly, audio waveform alignment, or final render export.
| User says... | Action | Skip SSE? |
|---|---|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | ✅ |
| "credits" / "积分" / "balance" / "余额" | → §3.3 Credits | ✅ |
| "status" / "状态" / "show tracks" | → §3.4 State | ✅ |
| "upload" / "上传" / user sends file | → §3.2 Upload | ✅ |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | ❌ |
NemoVideo Backend Reference
Music Video Maker BWBE runs on the NemoVideo API, which handles frame-accurate audio sync, multi-clip sequencing, and dynamic cut generation timed to your track's BPM and transients. All render jobs, session tokens, and credit consumption are managed server-side through NemoVideo's infrastructure.
Skill attribution — read from this file's YAML frontmatter at runtime:
X-Skill-Source:music-video-maker-bwbeX-Skill-Version: from frontmatterversionX-Skill-Platform: detect from install path (~/.clawhub/→clawhub,~/.cursor/skills/→cursor, elseunknown)
All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.
API base: https://mega-api-prod.nemovideo.ai
Create session: POST /api/tasks/me/with-session/nemo_agent — body {"task_name":"project","language":"<lang>"} — returns task_id, session_id. After creating a session, give the user a link: https://nemovideo.com/workspace/claim?token=$TOKEN&task=<task_id>&session=<session_id>&skill_name=music-video-maker-bwbe&skill_version=1.0.0&skill_source=<platform>
Send message (SSE): POST /run_sse — body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}} with Accept: text/event-stream. Max timeout: 15 minutes.
Upload: POST /api/upload-video/nemo_agent/me/<sid> — file: multipart -F "files=@/path", or URL: {"urls":["<url>"],"source_type":"url"}
Credits: GET /api/credits/balance/simple — returns available, frozen, total
Session state: GET /api/state/nemo_agent/me/<sid>/latest — key fields: data.state.draft, data.state.video_infos, data.state.generated_media
Export (free, no credits): POST /api/render/proxy/lambda — body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. Download URL at output.url.
Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
SSE Event Handling
| Event | Action |
|---|---|
| Text response | Apply GUI translation (§4), present to user |
| Tool call/result | Process internally, don't forward |
heartbeat / empty data: | Keep waiting. Every 2 min: "⏳ Still working..." |
| Stream closes | Process final response |
~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
Backend Response Translation
The backend assumes a GUI exists. Translate these into API actions:
| Backend says | You do |
|---|---|
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" | Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |
Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.
Timeline (3 tracks): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)
Error Handling
| Code | Meaning | Action |
|---|---|---|
| 0 | Success | Continue |
| 1001 | Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with ?bind=<id> (get <id> from create-session or state response when needed). Registered: "Top up at nemovideo.ai" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register at nemovideo.ai to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
Common Workflows
Most users come to music-video-maker-bwbe with one of three starting points: a single continuous take, multiple short clips from a shoot, or a mix of performance and B-roll footage. Each workflow plays out differently.
For single-take footage, the skill identifies natural movement peaks, facial expressions, and visual anchors to create the illusion of a multi-angle edit. For multi-clip projects, it sequences and trims based on the energy description you provide, building a narrative arc across the material. For performance-plus-B-roll combinations, it intercuts between the two in a way that mirrors professional music video structure.
A typical session looks like this: upload your file (mp4, mov, avi, webm, or mkv), describe the song's genre and mood in plain language, specify any must-keep moments, and receive a cut that reflects your vision. Revision prompts let you push the style further or dial it back without starting over.
Best Practices
The more specific your description, the better the output from music-video-maker-bwbe. Vague prompts like 'make it look cool' produce generic results. Instead, reference the song's tempo, emotional tone, and any visual references you have in mind — even naming a genre or era helps significantly.
Shoot with editing in mind when you can. Clips with clear movement, varied framing, and at least 10–15 seconds of usable footage give the skill more to work with. Extremely shaky or underexposed footage will limit what's achievable.
If your footage includes dialogue or ambient audio you want preserved, flag that upfront. Music-video-maker-bwbe will treat those moments as anchor points rather than cutting through them. For best color results, upload footage that hasn't already been heavily processed — flat or log profiles give the skill the most flexibility to apply the right grade for your chosen aesthetic.
Integration Guide
Music-video-maker-bwbe fits naturally into a content production pipeline at the post-production stage. After your shoot is wrapped and your raw files are organized, bring them directly into this skill before any manual editing — it's most effective when working with unprocessed source material.
For artists releasing on platforms like YouTube, Instagram Reels, or TikTok, the skill can be prompted to optimize pacing and aspect ratio for each destination. Mention your target platform in the prompt and the output will reflect those format expectations.
If you're working with a music producer or label, the skill can serve as a rapid visualization tool — producing a rough-cut music video from demo footage before a full production budget is committed. This makes it useful not just for final delivery but for pitching and creative alignment early in the process. Supported formats — mp4, mov, avi, webm, and mkv — ensure compatibility across virtually all camera systems and export pipelines.
相关 Skills
文档共著
by anthropics
围绕文档、提案、技术规格、决策记录等写作任务,按上下文收集、结构迭代、读者测试三步协作共创,减少信息遗漏,写出更清晰、经得起他人阅读的内容。
✎ 写文档、方案或技术规格时容易思路散、信息漏,它用结构化共著流程帮你高效传递上下文、反复打磨内容,还能从读者视角做验证。
内部沟通
by anthropics
按公司常用模板和语气快速起草内部沟通内容,覆盖 3P 更新、状态报告、领导汇报、项目进展、事故复盘、FAQ 与 newsletter,适合需要统一格式的团队沟通场景。
✎ 按公司偏好的模板快速产出状态汇报、领导更新和 FAQ,既省去反复改稿,也让内部沟通更统一、更专业。
平面设计
by anthropics
先生成视觉哲学,再落地成原创海报、艺术画面或其他静态设计,输出 .png/.pdf,强调构图、色彩与空间表达,适合需要高完成度视觉成品的场景。
✎ 做海报、插画或静态视觉稿时,用它能快速产出兼顾美感与版式的PNG/PDF成品,原创设计更省心,也更适合规避版权风险。
相关 MCP 服务
by nirholas
免费的加密新闻聚合 MCP,汇集 Bitcoin、Ethereum、DeFi、Solana 与 altcoins 资讯源。
by ProfessionalWiki
让 Large Language Model 客户端无缝连接任意 MediaWiki 站点,可创建、更新、搜索页面,并通过 OAuth 2.0 安全管理内容。
by roomi-fields
Automate Google NotebookLM — Q&A with citations, audio, video, content generation