Image Tools - Background Removal, Upscaling, Face Restoration

内容与创意

by fasuizu-br

提供图像背景移除(BiRefNet)、超分辨率放大与人脸修复功能,并支持 GPU 加速处理。

什么是 Image Tools - Background Removal, Upscaling, Face Restoration

提供图像背景移除(BiRefNet)、超分辨率放大与人脸修复功能,并支持 GPU 加速处理。

README

Brainiall AI APIs

API Status License: MIT MCP Servers Azure Marketplace Models

Production AI APIs for speech, text, image, and LLM inference. Available as REST endpoints and MCP servers for AI agents.

Base URL: https://apim-ai-apis.azure-api.net Full API reference for LLMs: llms-full.txt | llms.txt

Products

ProductEndpointsLatencyNotes
Pronunciation Assessment/v1/pronunciation/assess/base64<500ms17MB ONNX, per-phoneme scoring (39 ARPAbet)
Text-to-Speech/v1/tts/synthesize<1s12 voices (American + British), 24kHz WAV
Speech-to-Text/v1/stt/transcribe/base64<500msCompact 17MB model, English, word timestamps
Whisper Pro/v1/whisper/transcribe/base64<3s99 languages, speaker diarization
NLP Suite/v1/nlp/{toxicity,sentiment,entities,pii,language}<50msCPU-only, ONNX, 5 endpoints
Image Processing/v1/image/{remove-background,upscale,restore-face}/base64<3sGPU (A10), BiRefNet + ESRGAN + GFPGAN
LLM Gateway/v1/chat/completionsvaries113+ models, OpenAI-compatible, streaming

Authentication

Include ONE of these headers in every request:

code
Ocp-Apim-Subscription-Key: YOUR_KEY
Authorization: Bearer YOUR_KEY
api-key: YOUR_KEY

Get API keys at the portal (GitHub sign-in, purchase credits, create key).

Quick Start

Python — LLM Gateway (OpenAI SDK)

python
from openai import OpenAI

client = OpenAI(
    base_url="https://apim-ai-apis.azure-api.net/v1",
    api_key="YOUR_KEY"
)

response = client.chat.completions.create(
    model="claude-sonnet",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

Python — Pronunciation Assessment

python
import requests, base64

audio_b64 = base64.b64encode(open("audio.wav", "rb").read()).decode()
r = requests.post(
    "https://apim-ai-apis.azure-api.net/v1/pronunciation/assess/base64",
    headers={"Ocp-Apim-Subscription-Key": "YOUR_KEY"},
    json={"audio": audio_b64, "text": "Hello world", "format": "wav"}
)
print(r.json()["overallScore"])  # 0-100

Python — NLP Pipeline

python
import requests

headers = {"Ocp-Apim-Subscription-Key": "YOUR_KEY"}
base = "https://apim-ai-apis.azure-api.net/v1/nlp"

# Sentiment
r = requests.post(f"{base}/sentiment", headers=headers, json={"text": "I love this!"})
print(r.json())  # {"label": "positive", "score": 0.9987}

# PII detection with redaction
r = requests.post(f"{base}/pii", headers=headers, json={"text": "Email john@acme.com", "redact": True})
print(r.json()["redacted_text"])  # "Email [EMAIL]"

Node.js — LLM Gateway

javascript
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://apim-ai-apis.azure-api.net/v1",
  apiKey: "YOUR_KEY"
});

const res = await client.chat.completions.create({
  model: "claude-sonnet",
  messages: [{ role: "user", content: "Hello!" }]
});
console.log(res.choices[0].message.content);

curl — Image Background Removal

bash
curl -X POST https://apim-ai-apis.azure-api.net/v1/image/remove-background/base64 \
  -H "Ocp-Apim-Subscription-Key: YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d "{\"image\": \"$(base64 -i photo.jpg)\"}"

LLM Gateway — Popular Models

ModelAliasPrice ($/MTok in/out)
Claude Opus 4.6claude-opus$5 / $25
Claude Sonnet 4.6claude-sonnet$3 / $15
Claude Haiku 4.5claude-haiku$1 / $5
DeepSeek R1deepseek-r1$1.35 / $5.40
DeepSeek V3deepseek-v3$0.27 / $1.10
Llama 3.3 70Bllama-3.3-70b$0.72 / $0.72
Amazon Nova Pronova-pro$0.80 / $3.20
Amazon Nova Micronova-micro$0.035 / $0.14
Mistral Large 3mistral-large-3$2 / $6
Qwen3 32Bqwen3-32b$0.35 / $0.35

Full list: GET /v1/models (113+ models from 17 providers).

Supports: streaming SSE, tool calling, structured output (json_object/json_schema), extended thinking.

Works with: OpenAI SDK, LiteLLM, LangChain, Cline, Cursor, Aider, Continue, SillyTavern, Open WebUI.

MCP Servers (for AI Agents)

3 MCP servers with 20 tools total. Streamable HTTP transport.

ServerURLTools
Speech AIhttps://apim-ai-apis.azure-api.net/mcp/pronunciation/mcp10 tools + 8 resources + 3 prompts
NLP Toolshttps://apim-ai-apis.azure-api.net/mcp/nlp/mcp6 tools + 3 resources + 3 prompts
Image Toolshttps://apim-ai-apis.azure-api.net/mcp/image/mcp4 tools + 3 resources + 2 prompts

MCP Configuration (Claude Desktop / Cursor / Cline)

json
{
  "mcpServers": {
    "brainiall-speech": {
      "url": "https://apim-ai-apis.azure-api.net/mcp/pronunciation/mcp",
      "headers": { "Ocp-Apim-Subscription-Key": "YOUR_KEY" }
    },
    "brainiall-nlp": {
      "url": "https://apim-ai-apis.azure-api.net/mcp/nlp/mcp",
      "headers": { "Ocp-Apim-Subscription-Key": "YOUR_KEY" }
    },
    "brainiall-image": {
      "url": "https://apim-ai-apis.azure-api.net/mcp/image/mcp",
      "headers": { "Ocp-Apim-Subscription-Key": "YOUR_KEY" }
    }
  }
}

Also available on: Smithery (score 95/100) | MCPize | Apify ($0.02/call) | MCP Registry

Examples

FileDescription
python/basic_usage.pySpeech APIs — assess, transcribe, synthesize
python/pronunciation_tutor.pyInteractive pronunciation tutor
javascript/basic_usage.jsNode.js examples for speech APIs
curl/examples.shcurl commands for every endpoint
mcp/claude-desktop-config.jsonMCP config for Claude Desktop
mcp/cursor-config.jsonMCP config for Cursor IDE
llms-full.txtComplete API reference for LLM consumption

Pricing

ProductPriceUnit
Pronunciation$0.02per call
TTS$0.01-0.03per 1K chars
STT (compact)$0.01per request
Whisper Pro$0.02per minute
NLP (any)$0.001-0.002per call
Image (any)$0.003-0.005per image
LLM Gatewaycompetitive pricingper MTok

Credit packages: $5, $10, $25, $50, $100. Portal | Azure Marketplace (search "Brainiall").

License

MIT — Brainiall

常见问题

Image Tools - Background Removal, Upscaling, Face Restoration 是什么?

提供图像背景移除(BiRefNet)、超分辨率放大与人脸修复功能,并支持 GPU 加速处理。

相关 Skills

文档共著

by anthropics

Universal
热门

围绕文档、提案、技术规格、决策记录等写作任务,按上下文收集、结构迭代、读者测试三步协作共创,减少信息遗漏,写出更清晰、经得起他人阅读的内容。

写文档、方案或技术规格时容易思路散、信息漏,它用结构化共著流程帮你高效传递上下文、反复打磨内容,还能从读者视角做验证。

内容与创意
未扫描152.6k

内部沟通

by anthropics

Universal
热门

按公司常用模板和语气快速起草内部沟通内容,覆盖 3P 更新、状态报告、领导汇报、项目进展、事故复盘、FAQ 与 newsletter,适合需要统一格式的团队沟通场景。

按公司偏好的模板快速产出状态汇报、领导更新和 FAQ,既省去反复改稿,也让内部沟通更统一、更专业。

内容与创意
未扫描152.6k

平面设计

by anthropics

Universal
热门

先生成视觉哲学,再落地成原创海报、艺术画面或其他静态设计,输出 .png/.pdf,强调构图、色彩与空间表达,适合需要高完成度视觉成品的场景。

做海报、插画或静态视觉稿时,用它能快速产出兼顾美感与版式的PNG/PDF成品,原创设计更省心,也更适合规避版权风险。

内容与创意
未扫描152.6k

相关 MCP Server

免费的加密新闻聚合 MCP,汇集 Bitcoin、Ethereum、DeFi、Solana 与 altcoins 资讯源。

内容与创意
238

用于Adobe Photoshop自动化的MCP server,让AI assistants直接控制Photoshop。

内容与创意
112

by ProfessionalWiki

让 Large Language Model 客户端无缝连接任意 MediaWiki 站点,可创建、更新、搜索页面,并通过 OAuth 2.0 安全管理内容。

内容与创意16 个工具
97

评论