io.github.openpoem/spec-score-mcp

平台与服务

by openpoem

Score your specs before feeding them to an LLM. MCP server with radar chart visualization.

什么是 io.github.openpoem/spec-score-mcp

Score your specs before feeding them to an LLM. MCP server with radar chart visualization.

README

Spec Score MCP

Score your specs before Claude builds from them.

A balanced spec produces balanced code. An unbalanced spec produces creative fiction.

The problem

When your spec is detailed on some axes but vague on others, Claude doesn't ask for clarification — it fills in the blanks. The result compiles, the tests pass, but it's not what you meant.

This tool catches that before you start building. It scores your spec on 4 axes, tells you which one is weakest, and gives you a concrete tip to fix it.

4 axes

AxisQuestion it answers
completenessCan Claude understand the full scope of what to build?
clarityIs there only one way to interpret this spec?
constraintsDoes Claude know what NOT to build?
specificityAre there concrete, testable details?

Each axis scores 0.0 to 1.0. The balance score measures how evenly the 4 axes are covered.

Balance matters more than individual scores. A spec scoring 0.50 on all 4 axes (balance: 0.97) will produce better output than one scoring 0.95 / 0.95 / 0.20 / 0.90 (balance: 0.58). Why? That one weak axis — constraints at 0.20 — is exactly where Claude will improvise. You described what to build in detail, but forgot to say what's out of scope. So Claude builds everything you asked for, plus features you didn't.

On the radar chart: an even diamond beats a sharp spike.

Verdicts

VerdictWhat it means
SHIP ITSpec is ready — Claude knows what to build and what not to
ALMOSTOne axis needs a small fix before you start
DRAFTMultiple axes need work, but the structure is there
VAGUEWell-organized but too abstract to act on
UNBOUNDEDClear goal but no boundaries — Claude will over-build
OVER-CONSTRAINEDLots of rules but unclear what the actual goal is
SKETCHStarting point — needs detail on most axes

Not SHIP IT yet? The tool tells you which axis is weakest and what to add. Fix that axis, re-score, repeat. Most specs reach SHIP IT in 2-3 rounds.

Install

bash
git clone https://github.com/openpoem/spec-score-mcp.git
cd spec-score-mcp && npm install && npm run build
claude mcp add spec-score -- node $(pwd)/dist/mcp.js

The 3 tools are now available in every Claude Code session.

Usage

Slash commands

Clone this repo to get the built-in slash commands:

code
/project:scan my-feature-spec.md

Reads the file, scores it, and writes a my-feature-spec.md.scored.md with scores, verdict, tip, and a radar chart.

code
/project:compare blueprint.md implementation.md

Scores both files and writes a compared.scored.md with side-by-side radar charts.

Direct tool use

The 3 MCP tools work in any Claude Code conversation:

ToolWhat it does
spec_scoreScores a spec on 4 axes, returns balance score and verdict
spec_visualizeGenerates an SVG radar chart from scores
spec_compareSide-by-side comparison of two scored specs

Ask Claude: "Score this spec", "Show me the radar chart", or "Compare these two specs".

Example: from UNBOUNDED to SHIP IT

This tool scoring its own spec — four rounds, each fixing the weakest axis:

<p align="center"> <img src="docs/compare-before-after.jpg" alt="From UNBOUNDED to SHIP IT" width="1000" /> </p>

Round 1: the idea

Build a spec scoring tool

<img src="docs/round1-the-idea.jpg" alt="Round 1: UNBOUNDED (0.12)" width="480" />
code
UNBOUNDED  0.12  Tip: What does 'scoring' mean? What axes? What output?

One axis is high (clarity — the goal is clear), everything else is near zero. Claude would build... anything. A web app? A CLI? A VS Code extension? No way to know.

Round 2: add context

Build an MCP server that scores specs on 4 axes: completeness, clarity, constraints, specificity. Each axis is 0.0-1.0. Returns a balance score and verdict.

<img src="docs/round2-add-context.jpg" alt="Round 2: ALMOST (0.67)" width="480" />
code
ALMOST  0.67  Tip: What are the verdicts? What does the tool NOT do?

Now Claude knows what to build. But constraints are still weak — it might add auto-fixing, CI integration, a database.

Round 3: add boundaries

Three tools: spec_score, spec_visualize, spec_compare. Non-goals: no auto-fixing, no CI integration, no storage.

<img src="docs/round3-add-boundaries.jpg" alt="Round 3: SHIP IT (0.84)" width="480" />
code
SHIP IT  0.84  Tip: Add testable criteria — what balance maps to which verdict?

Crossed the threshold. Claude now knows what to build AND what not to build. Specificity is still the weakest axis.

Round 4: add testable details

Balance = 1 - sqrt(variance)/mean. SHIP IT > 0.75, ALMOST > 0.60, plus pattern-based verdicts. Node.js, MCP SDK, stdio transport.

<img src="docs/round4-ship-it.jpg" alt="Round 4: SHIP IT (0.95)" width="480" />
code
SHIP IT  0.95  Spec is ready for implementation.

Four rounds: 0.12 → 0.67 → 0.84 → 0.95. Each round fixed exactly one thing.

The math

  1. Claude scores each axis (0.0 - 1.0)
  2. Normalize the vector: v / ||v||
  3. Balance: 1 - sqrt(variance) / mean
  4. Verdict: balance threshold + axis pattern matching

The scoring intelligence comes from Claude, not from the algorithm. The algorithm only measures balance.

Project structure

code
src/
  mcp.ts        # MCP server (3 tools)
  score.ts      # Scoring engine
  visualize.ts  # SVG radar charts
.claude/
  commands/
    scan.md     # /project:scan command
    compare.md  # /project:compare command

OpenPoem — spec-score-mcp

MIT License.

© 2026 OpenPoem. info@openpoem.org

常见问题

io.github.openpoem/spec-score-mcp 是什么?

Score your specs before feeding them to an LLM. MCP server with radar chart visualization.

相关 Skills

MCP构建

by anthropics

Universal
热门

聚焦高质量 MCP Server 开发,覆盖协议研究、工具设计、错误处理与传输选型,适合用 FastMCP 或 MCP SDK 对接外部 API、封装服务能力。

想让 LLM 稳定调用外部 API,就用 MCP构建:从 Python 到 Node 都有成熟指引,帮你更快做出高质量 MCP 服务器。

平台与服务
未扫描114.1k

Slack动图

by anthropics

Universal
热门

面向Slack的动图制作Skill,内置emoji/消息GIF的尺寸、帧率和色彩约束、校验与优化流程,适合把创意或上传图片快速做成可直接发送的Slack动画。

帮你快速做出适配 Slack 的动图,内置约束规则和校验工具,少踩上传与播放坑,做表情包和演示都更省心。

平台与服务
未扫描114.1k

MCP服务构建器

by alirezarezvani

Universal
热门

从 OpenAPI 一键生成 Python/TypeScript MCP server 脚手架,并校验 tool schema、命名规范与版本兼容性,适合把现有 REST API 快速发布成可生产演进的 MCP 服务。

帮你快速搭建 MCP 服务与后端 API,脚手架完善、扩展顺手,尤其适合想高效验证服务能力的开发者。

平台与服务
未扫描10.2k

相关 MCP Server

Slack 消息

编辑精选

by Anthropic

热门

Slack 是让 AI 助手直接读写你的 Slack 频道和消息的 MCP 服务器。

这个服务器解决了团队协作中需要 AI 实时获取 Slack 信息的痛点,特别适合开发团队让 Claude 帮忙汇总频道讨论或发送通知。不过,它目前只是参考实现,文档有限,不建议在生产环境直接使用——更适合开发者学习 MCP 如何集成第三方服务。

平台与服务
83.4k

by netdata

热门

io.github.netdata/mcp-server 是让 AI 助手实时监控服务器指标和日志的 MCP 服务器。

这个工具解决了运维人员需要手动检查系统状态的痛点,最适合 DevOps 团队让 Claude 自动分析性能数据。不过,它依赖 NetData 的现有部署,如果你没用过这个监控平台,得先花时间配置。

平台与服务
78.4k

by d4vinci

热门

Scrapling MCP Server 是专为现代网页设计的智能爬虫工具,支持绕过 Cloudflare 等反爬机制。

这个工具解决了爬取动态网页和反爬网站时的头疼问题,特别适合需要批量采集电商价格或新闻数据的开发者。不过,它依赖外部浏览器引擎,资源消耗较大,不适合轻量级任务。

平台与服务
35.4k

评论