Gemini Vision

Name: Gemini Vision
Rating: 0.1 (2 reviews)
Author: Artin0123

Content & Creativity

Analyze images and videos with Gemini to get fast visual insights. Handle content from image URLs and YouTube links. Summarize scenes, identify objects, and extract key details for reports or automation. This is the remote version; see the local branch on GitHub to use the local tools.

What is Gemini Vision?

README

image-mcp-server-gemini

This is the remote server. Use the local version to analyze local images and videos.

Features

Analyze one or more image URLs with a single tool call.
Analyze YouTube videos without downloading files locally.
Supply an API key and optionally override the Gemini model via environment variables.
File size limit: Images are limited to 16 MB each to keep processing fast.
YouTube videos: No size limit, because they are streamed directly by the Gemini API.

Installation

Installing via Smithery

Install the server in Claude Desktop:

bash

npx -y @smithery/cli install @Artin0123/gemini-image-mcp-server --client claude

Manual Installation

bash

# Clone the repository
git clone https://github.com/Artin0123/gemini-vision-mcp.git
cd gemini-vision-mcp

# Install dependencies
npm install

# Compile TypeScript to dist/
npm run build

Configuration

Create a Gemini API key in Google AI Studio and provide GEMINI_API_KEY to the server.

json

{
  "mcpServers": {
    "gemini-media": {
      "command": "node",
      "args": ["/absolute/path/to/gemini-vision-mcp/dist/index.js"],
      "env": {
        "GEMINI_API_KEY": "your_api_key_here",
        "GEMINI_MODEL": "models/gemini-flash-lite-latest"
      }
    }
  }
}

If no key is supplied, the server can still start (handy for automated scans), but any tool invocation will return a configuration error until a valid API key is configured.
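The deferred key check described above can be sketched roughly as follows. This is a minimal illustration, not the server's actual code: `loadConfig`, `runTool`, `ServerConfig`, and `ToolResult` are hypothetical names, and the error text is made up. The environment is passed in as a plain object so the sketch stays self-contained.

```typescript
// Hypothetical names throughout; none of these are taken from the server's source.
type Env = Record<string, string | undefined>;

interface ServerConfig {
  apiKey: string | undefined;
}

interface ToolResult {
  isError: boolean;
  text: string;
}

// Reading the configuration never throws, so startup succeeds without a key.
function loadConfig(env: Env): ServerConfig {
  const key = env["GEMINI_API_KEY"]?.trim();
  return { apiKey: key ? key : undefined };
}

// The key is only checked when a tool is actually invoked.
function runTool(config: ServerConfig, run: (apiKey: string) => string): ToolResult {
  if (config.apiKey === undefined) {
    // Assumed wording; the real server's message may differ.
    return { isError: true, text: "Configuration error: GEMINI_API_KEY is not set." };
  }
  return { isError: false, text: run(config.apiKey) };
}
```

With this shape, a registry scanner can start the server and list its tools without credentials, while a real call made without a key returns an error result instead of crashing the process.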

Model override

The server defaults to models/gemini-flash-lite-latest. Override it by either:

Setting the GEMINI_MODEL environment variable, or
Providing modelName in the Smithery/SDK configuration schema.
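One way the model lookup could work is sketched below. The README does not say which source wins when both are set, so the precedence here (modelName first, then GEMINI_MODEL, then the default) is an assumption, and `resolveModel` is a hypothetical helper rather than a function from the server.

```typescript
const DEFAULT_MODEL = "models/gemini-flash-lite-latest";

// Hypothetical helper. Assumed precedence: modelName, then GEMINI_MODEL, then the default.
function resolveModel(
  env: Record<string, string | undefined>,
  modelName?: string,
): string {
  const fromConfig = modelName?.trim();
  if (fromConfig) return fromConfig;

  const fromEnv = env["GEMINI_MODEL"]?.trim();
  if (fromEnv) return fromEnv;

  // Nothing was supplied, or only blank values were.
  return DEFAULT_MODEL;
}
```

Blank or whitespace-only values are treated as unset, so an empty GEMINI_MODEL entry in a client config still falls back to the default.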

Available tools

analyze_image: Analyze one or more image URLs. Maximum file size: 16 MB per image.
analyze_youtube_video: Analyze a YouTube video from URL. No size limit.

Image URLs are downloaded and processed with a 16 MB size limit to keep response times short. A file that exceeds this limit produces an error message that reports the actual file size.

YouTube videos are streamed directly by the Gemini API without being downloaded, so there is no size restriction.
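The image size check could look something like the sketch below. It assumes 16 MB means 16 × 1024 × 1024 bytes, which the README does not specify. `checkImageSize` and `formatMegabytes` are hypothetical helpers, and the error wording is illustrative only.

```typescript
// Assumption: "16 MB" is interpreted as 16 * 1024 * 1024 bytes.
const MAX_IMAGE_BYTES = 16 * 1024 * 1024;

// Format a byte count as megabytes with two decimal places.
function formatMegabytes(bytes: number): string {
  return (bytes / (1024 * 1024)).toFixed(2);
}

// Returns null when the image is within the limit, otherwise an error
// message that includes the actual size (hypothetical wording).
function checkImageSize(byteLength: number): string | null {
  if (byteLength <= MAX_IMAGE_BYTES) return null;
  return `Image is ${formatMegabytes(byteLength)} MB, which exceeds the 16 MB limit.`;
}
```

For example, `checkImageSize(20 * 1024 * 1024)` returns a message containing `20.00 MB`, while a file of exactly 16 MB passes.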

Prompt examples

code

Please analyze this product photo: https://teimg-bgr.pages.dev/file/mvYT6KeF.webp

code

Extract the main talking points from this clip: https://www.youtube.com/watch?v=dQw4w9WgXcQ

Development

bash

npm install
npm test
npm run build

The test suite exercises URL forwarding, MIME handling, and configuration fallbacks.

License

MIT
