AI Avatar Generator

ai-avatar-generator

by bwbernardweston18


4.5k · AI & Agents · Not scanned · April 20, 2026

Installation

claude skill add --url https://github.com/openclaw/skills

Documentation

Getting Started

Send me a photo or describe the avatar you want and I'll generate a custom, styled result instantly. No image? Just tell me your preferred look, art style, and personality.

Try saying:

  • "Create a cyberpunk-style avatar of me with neon blue hair, glowing eyes, and a dark urban background"
  • "Generate a professional LinkedIn headshot avatar — clean background, business casual, friendly expression, realistic style"
  • "Make an anime-style avatar of a female character with silver hair, violet eyes, wearing fantasy armor"

First-Time Connection

When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").

Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.

  1. Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response data.token is your NEMO_TOKEN — 100 free credits, valid 7 days.
  2. Create a session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer <token>, Content-Type: application/json, and body {"task_name":"project","language":"<detected>"}. Store the returned session_id for all subsequent requests.

Keep setup communication brief. Don't display raw API responses or token values to the user.
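
The two setup steps above can be sketched as plain request descriptions. This is a minimal sketch, not the skill's actual implementation: the helper names (`anonymous_token_request`, `create_session_request`, `setup_requests`) and the dict layout are hypothetical, and the requests are only described, not sent. The URLs, headers, and body come from the steps above.

```python
import json
import uuid

API_BASE = "https://mega-api-prod.nemovideo.ai"


def anonymous_token_request(client_id: str) -> dict:
    """Describe the step 1 request: POST with an X-Client-Id header."""
    return {
        "method": "POST",
        "url": f"{API_BASE}/api/auth/anonymous-token",
        "headers": {"X-Client-Id": client_id},
        "body": None,
    }


def create_session_request(token: str, language: str) -> dict:
    """Describe the step 2 request, whose response carries session_id."""
    return {
        "method": "POST",
        "url": f"{API_BASE}/api/tasks/me/with-session/nemo_agent",
        "headers": {
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        "body": json.dumps({"task_name": "project", "language": language}),
    }


def setup_requests(env: dict, language: str = "en") -> list:
    """Return the requests still needed; step 1 is skipped when NEMO_TOKEN is set."""
    token = env.get("NEMO_TOKEN")
    if token:
        return [create_session_request(token, language)]
    client_id = str(uuid.uuid4())  # random UUID used as the client identifier
    # The real token is data.token from the step 1 response; "<token>" is a
    # placeholder (assumption) until that response has been received.
    return [
        anonymous_token_request(client_id),
        create_session_request("<token>", language),
    ]
```

Passing `dict(os.environ)` as `env` would mirror the environment check described above. The default language `"en"` is an assumption standing in for the detected language.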

Your Face, Your Style, Infinite Possibilities

Creating a great avatar used to mean hiring an illustrator, wrestling with complex design software, or settling for a generic cartoon filter. The AI Avatar Generator changes that entirely. Whether you want a sleek professional headshot for LinkedIn, a fantasy warrior for your gaming profile, or a stylized illustration for social media, this skill generates it from a simple description or uploaded photo.

You stay in full creative control. Describe your preferred art style — anime, watercolor, pixel art, photorealistic, comic book — and the generator adapts. Want specific features like eye color, hairstyle, or outfit? Just say so. The skill interprets natural language instructions and turns them into visually compelling results without any design knowledge required.

This tool is built for creators, professionals, streamers, Discord communities, and anyone who wants a distinctive visual identity online. Stop recycling old selfies or using placeholder icons — generate an avatar that actually represents who you are or who you want to be.

Routing Your Avatar Requests

Each request — whether text prompt, uploaded photo, or style transfer — is parsed and routed to the appropriate generation pipeline based on input type and selected avatar style.

| User says... | Action | Skip SSE? |
| --- | --- | --- |
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | |
| "credits" / "积分" / "balance" / "余额" | → §3.3 Credits | |
| "status" / "状态" / "show tracks" | → §3.4 State | |
| "upload" / "上传" / user sends file | → §3.2 Upload | |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | |
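
One way to implement the routing rules above is a small keyword matcher. This is a sketch under assumptions: the source lists the trigger phrases but not how they are matched, so case-insensitive substring matching, the `route` function, and the action labels are all hypothetical.

```python
# Trigger phrases are taken from the routing rules; the action labels
# ("export", "credits", "state", "upload", "sse") are illustrative names.
ROUTES = [
    ("export", ("export", "导出", "download", "send me the video")),
    ("credits", ("credits", "积分", "balance", "余额")),
    ("state", ("status", "状态", "show tracks")),
    ("upload", ("upload", "上传")),
]


def route(message: str, has_file: bool = False) -> str:
    """Pick an action for a user message; anything unmatched goes to SSE."""
    if has_file:
        return "upload"  # a file sent by the user always means upload
    text = message.lower()
    for action, keywords in ROUTES:
        # Assumption: substring matching; the source does not say how
        # the phrases are detected.
        if any(keyword in text for keyword in keywords):
            return action
    return "sse"
```

Substring matching is deliberately loose, so a generation prompt that happens to contain a word like "download" would be misrouted. A stricter matcher may be preferable in practice.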

Avatar API Backend Reference

Avatar generation runs on a distributed cloud inference backend that processes diffusion model requests asynchronously, returning base64-encoded image outputs once rendering completes. Latency varies by resolution tier and queue depth, typically ranging from 8 to 45 seconds per avatar.

Skill attribution — read from this file's YAML frontmatter at runtime:

  • X-Skill-Source: ai-avatar-generator
  • X-Skill-Version: from frontmatter version
  • X-Skill-Platform: detect from install path (~/.clawhub/ → clawhub, ~/.cursor/skills/ → cursor, else unknown)

All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.
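
The required headers could be assembled as follows. This sketch reads the install-path rule as a mapping from path fragment to platform name (a path containing `.clawhub/` gives `clawhub`, one containing `.cursor/skills/` gives `cursor`). That reading, and the helper names `detect_platform` and `build_headers`, are assumptions. Reading the version from YAML frontmatter is left to the caller.

```python
def detect_platform(install_path: str) -> str:
    """Map the skill's install path to a platform label."""
    normalized = install_path.replace("\\", "/")  # tolerate Windows separators
    if "/.clawhub/" in normalized:
        return "clawhub"
    if "/.cursor/skills/" in normalized:
        return "cursor"
    return "unknown"


def build_headers(token: str, version: str, install_path: str) -> dict:
    """Build the four headers every request must carry."""
    return {
        "Authorization": f"Bearer {token}",
        "X-Skill-Source": "ai-avatar-generator",
        "X-Skill-Version": version,  # caller supplies the frontmatter version
        "X-Skill-Platform": detect_platform(install_path),
    }
```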

API base: https://mega-api-prod.nemovideo.ai

Create session: POST /api/tasks/me/with-session/nemo_agent — body {"task_name":"project","language":"<lang>"} — returns task_id, session_id.

Send message (SSE): POST /run_sse — body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}} with Accept: text/event-stream. Max timeout: 15 minutes.
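
As an illustration, the send-message request might be assembled like this. The body shape, the `Accept` header, and the 15-minute ceiling come from the line above. The `Content-Type` header, the `sse_message_request` helper, and the returned dict layout are assumptions.

```python
import json

API_BASE = "https://mega-api-prod.nemovideo.ai"


def sse_message_request(session_id: str, text: str, headers: dict) -> dict:
    """Describe a POST /run_sse request carrying one user message."""
    payload = {
        "app_name": "nemo_agent",
        "user_id": "me",
        "session_id": session_id,
        "new_message": {"parts": [{"text": text}]},
    }
    return {
        "method": "POST",
        "url": f"{API_BASE}/run_sse",
        "headers": {
            **headers,  # auth and attribution headers supplied by the caller
            "Accept": "text/event-stream",
            # Assumption: the JSON body is sent with this content type.
            "Content-Type": "application/json",
        },
        "body": json.dumps(payload),
        "timeout_seconds": 15 * 60,  # documented maximum timeout
    }
```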

Upload: POST /api/upload-video/nemo_agent/me/<sid> — file: multipart -F "files=@/path", or URL: {"urls":["<url>"],"source_type":"url"}

Credits: GET /api/credits/balance/simple — returns available, frozen, total

Session state: GET /api/state/nemo_agent/me/<sid>/latest — key fields: data.state.draft, data.state.video_infos, data.state.generated_media

Export (free, no credits): POST /api/render/proxy/lambda — body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. Download URL at output.url.
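
The export flow above amounts to one POST followed by a polling loop. The sketch below keeps the network out of the picture: `fetch_status` is a hypothetical callable standing in for GET /api/render/proxy/lambda/<id>. It assumes `status` and `output.url` sit at the top level of the parsed response and that `<ts>` is a Unix timestamp in seconds. The `max_polls` cap is also an addition, since the source gives no upper bound.

```python
import time
from typing import Callable, Optional


def render_request_body(session_id: str, draft: dict, now: Optional[float] = None) -> dict:
    """Build the export body; the id is render_<ts> (ts assumed to be seconds)."""
    ts = int(now if now is not None else time.time())
    return {
        "id": f"render_{ts}",
        "sessionId": session_id,
        "draft": draft,
        "output": {"format": "mp4", "quality": "high"},
    }


def poll_export(
    fetch_status: Callable[[str], dict],
    render_id: str,
    interval: float = 30.0,
    max_polls: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll until status is 'completed', then return the download URL."""
    for _ in range(max_polls):
        result = fetch_status(render_id)
        if result.get("status") == "completed":
            return result["output"]["url"]
        sleep(interval)  # documented polling interval is 30 seconds
    raise TimeoutError(f"render {render_id} did not complete after {max_polls} polls")
```

Injecting `sleep` and `fetch_status` keeps the loop testable without waiting or touching the backend.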

Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

SSE Event Handling

| Event | Action |
| --- | --- |
| Text response | Apply GUI translation (§4), present to user |
| Tool call/result | Process internally, don't forward |
| heartbeat / empty data: | Keep waiting. Every 2 min: "⏳ Still working..." |
| Stream closes | Process final response |

~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
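
The handling rules above can be sketched as a small stream reader. It follows the generic server-sent events framing (`event:` and `data:` fields, a blank line ending each event). The backend's payload schema is not documented here, so payloads are kept as raw strings. The names `read_stream` and `next_step`, and treating an event named `heartbeat` as a heartbeat, are assumptions.

```python
from typing import Iterable, List, Tuple


def read_stream(lines: Iterable[str]) -> Tuple[List[str], int]:
    """Return (non-empty data payloads, heartbeat count) for an SSE stream."""
    payloads: List[str] = []
    heartbeats = 0
    event_name, data_parts, seen_field = "", [], False
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":  # a blank line ends the current event
            if seen_field:
                data = "\n".join(data_parts).strip()
                if event_name == "heartbeat" or not data:
                    heartbeats += 1  # keep waiting, nothing to show
                else:
                    payloads.append(data)
            event_name, data_parts, seen_field = "", [], False
        elif line.startswith("event:"):
            event_name, seen_field = line[len("event:"):].strip(), True
        elif line.startswith("data:"):
            data_parts.append(line[len("data:"):].strip())
            seen_field = True
    # An event without a closing blank line is dropped, as in standard SSE.
    return payloads, heartbeats


def next_step(text_chunks: List[str]) -> str:
    """'present' when the stream carried text, else 'poll_state'."""
    return "present" if any(chunk.strip() for chunk in text_chunks) else "poll_state"
```

When `next_step` returns `"poll_state"`, the caller would fetch the session state to confirm the edit was applied before summarizing it to the user.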

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

| Backend says | You do |
| --- | --- |
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" | Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |

Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.

Timeline (3 tracks):
  1. Video: city timelapse (0-10s)
  2. BGM: Lo-fi (0-10s, 35%)
  3. Title: "Urban Dreams" (0-3s)
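
To make the draft field mapping easier to work with, the abbreviated keys can be expanded into readable names before building a track summary. This is a sketch: the nesting of the sample draft (tracks containing segments) is an assumption, and `expand`, `FIELD_NAMES`, and `TRACK_TYPES` are hypothetical names. Only the key meanings come from the mapping above.

```python
FIELD_NAMES = {
    "t": "tracks",
    "tt": "track_type",
    "sg": "segments",
    "d": "duration_ms",
    "m": "metadata",
}
TRACK_TYPES = {0: "video", 1: "audio", 7: "text"}


def expand(node):
    """Recursively rename abbreviated draft keys and decode track types."""
    if isinstance(node, list):
        return [expand(item) for item in node]
    if isinstance(node, dict):
        out = {}
        for key, value in node.items():
            name = FIELD_NAMES.get(key, key)  # unknown keys pass through
            if key == "tt":
                out[name] = TRACK_TYPES.get(value, value)
            elif key == "m":
                # Metadata is copied untouched: its inner keys are not
                # covered by the mapping and may reuse the same letters.
                out[name] = value
            else:
                out[name] = expand(value)
        return out
    return node
```

For example, `expand({"t": [{"tt": 7, "sg": [{"d": 3000}]}]})` yields one text track with a single 3000 ms segment.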

Error Handling

| Code | Meaning | Action |
| --- | --- | --- |
| 0 | Success | Continue |
| 1001 | Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with ?bind=<id> (get <id> from create-session or state response when needed). Registered: "Top up credits in your account" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
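
The error codes above lend themselves to a lookup table. In this sketch the action labels are illustrative names rather than backend values, and the fallback for unlisted codes is an assumption. The retry rule encodes the single 30-second retry documented for 429.

```python
from typing import Optional

# Codes come from the error list; action labels are hypothetical.
ERROR_ACTIONS = {
    0: "continue",
    1001: "reauthenticate",
    1002: "create_session",
    2001: "prompt_for_credits",
    4001: "show_supported_formats",
    4002: "suggest_compress_or_trim",
    400: "generate_client_id_and_retry",
    402: "prompt_register_or_upgrade",
    429: "retry_once_after_delay",
}


def action_for(code: int) -> str:
    """Look up the handling action; unlisted codes get a generic fallback."""
    return ERROR_ACTIONS.get(code, "report_unknown_error")  # fallback is an assumption


def retry_delay(code: int, attempts_so_far: int) -> Optional[int]:
    """Seconds to wait before retrying, or None when no retry applies."""
    if code == 429 and attempts_so_far == 0:
        return 30  # rate limit: retry once after 30 seconds
    return None
```

Keeping 402 and 2001 as separate actions reflects the distinction drawn above: 402 is a subscription-tier issue, not a credits shortfall.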

Tips and Tricks

The more specific your description, the better your avatar will turn out. Instead of saying 'make me look cool,' try 'create a noir detective avatar with a fedora, sharp jawline, and a moody city background in black and white.' Art style keywords like 'oil painting,' 'flat vector,' 'Studio Ghibli-inspired,' or 'hyper-realistic' dramatically shape the output.

If you're uploading a reference photo, make sure it's well-lit and front-facing for the best facial feature recognition. You can layer instructions — start with a base style, then refine with follow-up prompts like 'make the background darker' or 'add a hood to the outfit.' Iterating in small steps gives you much finer control over the final look.

For branded avatars or team sets, describe a consistent style guide across multiple requests — same color palette, same art style, same background treatment — to keep everything cohesive.

Common Workflows

A popular workflow is the 'photo-to-style' pipeline: upload a selfie, specify an art style (e.g., 'turn this into a Pixar-style 3D character'), and receive a stylized avatar that still resembles you. This is widely used by streamers setting up channel branding and professionals refreshing their social profiles.

Another common use case is building a full avatar set — same character rendered in multiple styles or outfits for different platforms. Start with a base avatar, then prompt variations: 'same character but in winter gear,' 'same character in a formal suit,' 'same character as a pixel art sprite.'

For community managers and Discord server owners, the batch persona workflow is useful: define a character archetype and generate multiple unique members of a 'crew' or 'team' with shared visual DNA but distinct individual looks. This creates a cohesive branded aesthetic across an entire community.

Performance Notes

Avatar generation quality scales with the clarity and detail of your input. Vague prompts produce generic results; specific prompts with style references, color preferences, and mood descriptors consistently yield sharper, more personalized outputs.

Highly complex scenes with multiple characters, intricate backgrounds, or conflicting style instructions may require a follow-up refinement prompt to dial in the details. For photorealistic avatars, the skill performs best when given a clear reference image alongside your description — relying on text alone for realistic portraits can occasionally produce stylized interpretations.

Standard avatar requests typically complete in 8 to 45 seconds, depending on resolution tier and queue depth. Highly detailed or large-format outputs may take longer. If a result misses the mark, a single clarifying instruction is usually enough to correct course without starting over.
