Glazyr Viz
AI 与智能体by glazyr
Glazyr Viz 直接将 Chromium 原始内存帧提供给 AI agent,绕过脆弱的 DOM scraping 与 Cloudflare 封锁,实现 177 FPS 零拷贝视觉与原生 USDC 结算。
什么是 Glazyr Viz?
Glazyr Viz 直接将 Chromium 原始内存帧提供给 AI agent,绕过脆弱的 DOM scraping 与 Cloudflare 封锁,实现 177 FPS 零拷贝视觉与原生 USDC 结算。
核心功能 (10 个工具)
get_optic_nerve_statusReturns a high-level dashboard of the agent's visual health, including FPS, latency, and Aquarium population metrics.
browser_navigateDispatches a navigation command to the agent's browser, used to switch between benchmarks or sites.
browser_set_fish_countControls the hardware load by setting the number of active WebGL fish in the Aquarium simulation.
peek_vision_bufferPerforms a low-latency 'peek' at the raw vision stream, returning resolution, sequence numbers, and optional base64 frame data.
browser_evaluate_jsEvaluates arbitrary JavaScript in the GCP Big Iron browser context.
run_dogfood_surgeExecutes the standardized dogfooding sequence: sets baseline, triggers a 30k fish surge, and returns status.
verify_paymentVerifies a USDC transfer on the Base network to grant vision credits to the current session (1 USDC = 1,000,000 frames).
get_remaining_creditsRetrieve the current balance of cognitive frames available for this session.
browser_clickLegacy control: Dispatches a mouse click to the specified coordinates.
browser_typeLegacy control: Types the specified text into the active browser element.
README
Glazyr Viz: 7.35ms Perception. 90%+ Token Savings. 🚀
Ditch the screenshot loop. Glazyr Viz is a high-performance Chromium fork that provides agents with Zero-Copy Vision—direct, raw memory access to the frame buffer for sub-10ms perception.
🎯 Real-World Use Cases
- High-Density Data Extraction: Navigating complex tables, Canvas-based charts, and WebGL interfaces where DOM scrapers fail.
- Latency-Critical Automation: Executing multi-step workflows (checkout bots, form filling) at human or super-human speeds.
- Large-Scale Scraping: Reducing API tokens by 99%, allowing for thousands of perception cycles at a fraction of the cost.
- Anti-Bot Resilience: Interacting with raw coordinates to bypass detection systems that flag standard WebDriver behavior.
⚡ Performance Floor
- 7.35ms Latency: Sub-10ms frame-to-data conversion floor.
- 99% Token Savings: 12-16 tokens per perception cycle via the
vision.jsonschema. - Zero-Jitter: Synchronous frame access directly from the Chromium Viz subsystem.
Installation
# Copy the skill to your OpenClaw skills directory
cp -r glazyr-viz ~/.openclaw/workspace/skills/glazyr-viz
# Install dependencies
cd ~/.openclaw/workspace/skills/glazyr-viz/scripts
npm install
Quick Start
# Navigate to a page
node skills/glazyr-viz/scripts/navigate.js https://news.ycombinator.com
# Extract data (Ah-Ha Demo)
node skills/glazyr-viz/scripts/showcase.js
Pricing (Launch Tiers)
| Tier | Frames | Price |
|---|---|---|
| Free | 2,500 | $0 |
| Developer | 100,000 | $3 |
| Professional | 500,000 | $15 |
Get your API key at glazyr.com/dashboard.
📘 Technical FAQ: Zero-Copy Vision
Q: How do you achieve 99% token savings?
Most agents use "Pixel-Pushing"—they capture a screenshot, encode it to Base64, and send the entire image to an LLM. This consumes roughly 1,200–1,600 tokens per frame. Glazyr Viz uses the vision.json schema to extract semantic UI metadata and raw coordinate vectors directly from the Chromium Viz subsystem’s frame buffer. This reduces the payload to 12–16 tokens per perception cycle.
Q: Is any "intelligence" lost by not sending a full screenshot?
None. In fact, you gain precision. Standard vision models often "guess" coordinates from pixels, leading to click hallucinations. vision.json provides the exact [x, y] coordinates and semantic metadata (ARIA roles, labels, states) directly from the Chromium render tree. Your agent doesn't have to guess; it knows.
Q: How does this eliminate "Jitter"?
Traditional "screenshot" methods are asynchronous. Because Glazyr Viz is baked into the Chromium source, frame access is synchronous. The agent perceives the UI state at the exact moment the frame is committed to the GPU. Build fast. Stop serializing. Built by MAGNETAR SENTIENT L.L.C. // V1.0.0 General Release
常见问题
Glazyr Viz 是什么?
Glazyr Viz 直接将 Chromium 原始内存帧提供给 AI agent,绕过脆弱的 DOM scraping 与 Cloudflare 封锁,实现 177 FPS 零拷贝视觉与原生 USDC 结算。
Glazyr Viz 提供哪些工具?
提供 10 个工具,包括 get_optic_nerve_status、browser_navigate、browser_set_fish_count 等。
相关 Skills
Claude接口
by anthropics
面向接入 Claude API、Anthropic SDK 或 Agent SDK 的开发场景,自动识别项目语言并给出对应示例与默认配置,快速搭建 LLM 应用。
✎ 想把Claude能力接进应用或智能体,用claude-api上手快、兼容Anthropic与Agent SDK,集成路径清晰又省心
RAG架构师
by alirezarezvani
聚焦生产级RAG系统设计与优化,覆盖文档切块、检索链路、索引构建、召回评估等关键环节,适合搭建可扩展、高准确率的知识库问答与检索增强应用。
✎ 面向RAG落地,把知识库、向量检索和生成链路系统串联起来,做架构设计时更清晰,也更少踩坑。
多智能体架构
by alirezarezvani
聚焦多智能体系统架构设计,梳理 Supervisor、Swarm、分层和 Pipeline 等模式,覆盖角色定义、通信协作与性能评估,适合规划稳健可扩展的 AI agent 编排方案。
✎ 帮你系统解决多智能体应用的架构设计与协同编排难题,适合构建复杂 AI 工作流,成熟度高、社区认可也很亮眼。
相关 MCP Server
知识图谱记忆
编辑精选by Anthropic
Memory 是一个基于本地知识图谱的持久化记忆系统,让 AI 记住长期上下文。
✎ 帮 AI 和智能体补上“记不住”的短板,用本地知识图谱沉淀长期上下文,连续对话更聪明,数据也更可控。
顺序思维
编辑精选by Anthropic
Sequential Thinking 是让 AI 通过动态思维链解决复杂问题的参考服务器。
✎ 这个服务器展示了如何让 Claude 像人类一样逐步推理,适合开发者学习 MCP 的思维链实现。但注意它只是个参考示例,别指望直接用在生产环境里。
by deusdata
持久化的代码库知识图谱,可跨会话保留上下文,在 session 重启或上下文压缩后仍能继续使用。
✎ 专治 AI 编程助手“会话失忆”,把代码库沉淀为持久知识图谱,重启或压缩上下文后也能无缝续上开发状态。