io.github.mickelsamuel/migrationpilot

数据与存储

by mickelsamuel

在生产前用 80 条规则拦截危险的 PostgreSQL migration,支持 lock analysis 与自动修复。

什么是 io.github.mickelsamuel/migrationpilot

在生产前用 80 条规则拦截危险的 PostgreSQL migration,支持 lock analysis 与自动修复。

README

MigrationPilot

npm version CI License: MIT

Know exactly what your PostgreSQL migration will do to production — before you merge.

MigrationPilot is a static analysis tool for PostgreSQL schema migrations. It parses your SQL with the actual PostgreSQL parser (libpg-query), classifies every lock acquired, flags dangerous patterns with 83 safety rules, scores overall risk, and suggests safe alternatives — all without touching your database. Works as a CLI, a GitHub Action, and a Node.js library.


Quick Start

Analyze any migration file — no install required:

bash
npx migrationpilot analyze migration.sql

That's it. One command, instant results. MigrationPilot parses your SQL with the real PostgreSQL parser, checks 83 safety rules, and tells you exactly what's dangerous.

Example

Given this migration:

sql
-- 002_add_index_and_constraint.sql
CREATE INDEX idx_users_email ON users (email);
ALTER TABLE users ADD CONSTRAINT users_email_unique UNIQUE (email);

MigrationPilot catches:

code
  MigrationPilot — 002_add_index_and_constraint.sql

  Risk:  RED   Score: 80/100

  Violations:

  ✗ [MP001] CRITICAL
    CREATE INDEX blocks writes on "users". Use CREATE INDEX CONCURRENTLY.
    Safe alternative:
    CREATE INDEX CONCURRENTLY idx_users_email ON users (email);

  ✗ [MP027] CRITICAL
    UNIQUE constraint without USING INDEX scans full table under ACCESS EXCLUSIVE.

  ⚠ [MP004] WARNING
    No SET lock_timeout before DDL on "users".
    Auto-fixable: run with --fix

  83 rules checked in 23ms

More CLI usage

bash
# Check all migrations in a directory
npx migrationpilot check migrations/ --pattern "*.sql"

# Auto-fix what's fixable
npx migrationpilot analyze migration.sql --fix

# Pipe from Django, Prisma, Knex, or any framework
python manage.py sqlmigrate myapp 0042 | npx migrationpilot analyze --stdin

GitHub Action

Add migration safety checks to every PR in 30 seconds:

yaml
# .github/workflows/migration-check.yml
name: Migration Safety Check
on: [pull_request]

jobs:
  check:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: mickelsamuel/migrationpilot@v1
        with:
          migration-path: "migrations/*.sql"
          fail-on: critical

Posts a safety report as a PR comment, fails the check on critical violations, and generates SARIF for GitHub Code Scanning.


CLI Commands

CommandDescription
analyze <file>Analyze a single SQL migration file
check <dir>Check all migration files in a directory
plan <file>Show execution plan with timeline visualization
initGenerate a .migrationpilotrc.yml config file
detectAuto-detect migration framework (14 supported)
watch <dir>Watch migration files and re-analyze on change
hookInstall/uninstall git pre-commit hook
list-rulesList all 83 safety rules with metadata
doctorRun diagnostic checks on your environment
completion <shell>Generate shell completion scripts (bash/zsh/fish)
driftCompare two database schemas for differences
trendsView historical analysis trends
explain <rule>Show detailed information about a specific rule
rollback <file>Generate reverse DDL for migration rollback

CLI Options

OptionCommandsDescriptionDefault
--pg-version <n>analyze, checkTarget PostgreSQL version (9-20)17
--format <fmt>analyze, checkOutput: text, json, sarif, markdowntext
--fail-on <level>analyze, checkExit code threshold: critical, warning, nevercritical
--fixanalyzeAuto-fix violations in-place (12 rules)
--dry-runanalyzePreview auto-fix changes without writing
--quietanalyze, checkOne-line-per-violation (gcc-style) output
--verboseanalyzePer-statement PASS/FAIL for all rules
--stdinanalyzeRead SQL from stdin instead of file
--exclude <rules>analyze, checkComma-separated rule IDs to skip
--database-url <url>analyze, checkPostgreSQL connection for production context (Pro)
--license-key <key>analyze, checkLicense key for Pro features
--pattern <glob>checkFile pattern for directory scanning*.sql
--output <file>analyze, checkWrite report to file instead of stdout
--offlineanalyze, checkAir-gapped mode: skip network access
--no-configanalyze, checkIgnore config file
--no-colorallDisable colored output
--jsonlist-rulesOutput rules as JSON array

Environment variables: MIGRATIONPILOT_LICENSE_KEY, NO_COLOR, TERM=dumb.


All 80 Rules

Lock Safety (Critical)

RuleNameAuto-fixWhat it catches
MP001require-concurrent-indexYesCREATE INDEX without CONCURRENTLY blocks writes
MP002require-check-not-nullSET NOT NULL without CHECK pattern scans full table
MP003volatile-default-rewriteADD COLUMN DEFAULT now() rewrites table on PG < 11
MP004require-lock-timeoutYesDDL without SET lock_timeout blocks queue
MP005require-not-valid-fkADD CONSTRAINT FK without NOT VALID scans full table
MP006no-vacuum-fullVACUUM FULL rewrites table under ACCESS EXCLUSIVE
MP007no-column-type-changeALTER COLUMN TYPE rewrites table
MP008no-multi-ddl-transactionMultiple DDL in one transaction compound lock time
MP025ban-concurrent-in-transactionCONCURRENTLY ops inside a transaction = runtime ERROR
MP026ban-drop-tableDROP TABLE permanently removes table and data
MP027disallowed-unique-constraintUNIQUE constraint without USING INDEX scans full table
MP030require-not-valid-checkYesCHECK constraint without NOT VALID scans full table
MP031ban-exclusion-constraintEXCLUSION constraint builds GiST index under ACCESS EXCLUSIVE
MP032ban-clusterCLUSTER rewrites table under ACCESS EXCLUSIVE
MP034ban-drop-databaseDROP DATABASE in a migration file
MP035ban-drop-schemaDROP SCHEMA permanently removes schema + objects
MP036ban-truncate-cascadeTRUNCATE CASCADE silently truncates FK-referencing tables
MP046require-concurrent-detach-partitionYesDETACH PARTITION without CONCURRENTLY (PG 14+)
MP047ban-set-logged-unloggedSET LOGGED/UNLOGGED rewrites entire table
MP049require-partition-key-in-pkPartitioned table PK doesn't include partition key columns
MP055drop-pk-replica-identity-breakDropping PK breaks logical replication
MP057rls-enabled-without-policyENABLE RLS without CREATE POLICY = silent deny-all
MP060alter-type-rename-valueRENAME VALUE breaks logical replication subscribers
MP064ban-disable-triggerDISABLE TRIGGER ALL/USER breaks replication + FK enforcement
MP065ban-lock-tableExplicit LOCK TABLE blocks queries and can cause deadlocks
MP062ban-add-generated-stored-columnStored generated column causes full table rewrite
MP069warn-fk-lock-both-tablesFK constraint locks both source and referenced table
MP072warn-partition-default-scanATTACH PARTITION scans DEFAULT partition under lock
MP073ban-superuser-roleALTER SYSTEM / CREATE ROLE SUPERUSER in migrations

Warnings

RuleNameAuto-fixWhat it catches
MP009require-drop-index-concurrentlyYesDROP INDEX without CONCURRENTLY
MP010no-rename-columnRENAME COLUMN breaks app queries
MP011unbatched-backfillUPDATE without WHERE (full table scan)
MP012no-enum-add-in-transactionALTER TYPE ADD VALUE inside transaction
MP015no-add-column-serialSERIAL creates implicit sequence (use IDENTITY)
MP016require-fk-indexFK columns without index = slow cascading deletes
MP017no-drop-columnDROP COLUMN under ACCESS EXCLUSIVE
MP018no-force-set-not-nullSET NOT NULL without CHECK pre-validation
MP020require-statement-timeoutYesLong-running DDL without statement_timeout
MP021require-concurrent-reindexYesREINDEX without CONCURRENTLY (PG 12+)
MP022no-drop-cascadeDROP CASCADE silently drops dependents
MP023require-if-not-existsYesCREATE TABLE/INDEX without IF NOT EXISTS
MP024no-enum-value-removalDROP TYPE destroys enum + dependent columns
MP028no-rename-tableRENAME TABLE breaks queries, views, FKs
MP029ban-drop-not-nullDROP NOT NULL may break app assumptions
MP033require-concurrent-refresh-matviewYesREFRESH MATERIALIZED VIEW without CONCURRENTLY
MP037prefer-text-over-varcharYesVARCHAR(n) has no benefit over TEXT in PostgreSQL
MP038prefer-bigint-over-intINT PK/FK columns can overflow (use BIGINT)
MP039prefer-identity-over-serialSERIAL quirks — use GENERATED ALWAYS AS IDENTITY
MP040prefer-timestamptzYesTIMESTAMP without timezone causes timezone bugs
MP041ban-char-fieldYesCHAR(n) wastes space, causes comparison bugs
MP042require-index-nameUnnamed indexes are hard to reference
MP043ban-domain-constraintDomain constraints validate against ALL using columns
MP044no-data-loss-type-narrowingNarrowing column type (e.g., BIGINT → INT) can lose data
MP045require-primary-keyTables without PK break replication
MP048ban-alter-default-volatileVolatile SET DEFAULT on existing column is misleading
MP050prefer-hnsw-over-ivfflatpgvector: HNSW index preferred over IVFFlat
MP051require-spatial-indexPostGIS: spatial columns need GIST/SPGIST index
MP052warn-dependent-objectsDROP/ALTER COLUMN may break views, functions, triggers
MP053ban-uncommitted-transactionBEGIN without matching COMMIT leaves open transaction
MP054alter-type-add-value-in-transactionNew enum value not visible until COMMIT
MP056gin-index-on-jsonb-without-expressionPlain GIN on JSONB useless for ->> queries
MP058multi-alter-table-same-tableMultiple ALTER TABLE on same table = extra lock cycles
MP059sequence-not-reset-after-data-migrationINSERT with explicit IDs without setval()
MP061suboptimal-column-orderVariable-length columns before fixed-size wastes padding
MP063warn-do-block-ddlDO block contains DDL that bypasses static analysis
MP066warn-autovacuum-disabledautovacuum_enabled = false causes bloat + wraparound risk
MP067warn-backfill-no-batchingDELETE without WHERE clause = full table lock + WAL bloat
MP068warn-integer-pk-capacityCREATE SEQUENCE AS integer risks overflow — use bigint
MP070warn-concurrent-index-invalidCONCURRENTLY can leave invalid index on failure
MP071ban-rename-in-use-columnRENAME COLUMN without updating dependent views/functions
MP074require-deferrable-fkFK constraints should be DEFERRABLE for bulk loading
MP075warn-toast-bloat-riskUPDATE on TOAST columns causes bloat from full-row copies
MP076warn-xid-consuming-retrySAVEPOINT creates subtransactions consuming XIDs
MP077prefer-lz4-toast-compressionUse lz4 over pglz for TOAST compression (PG 14+)
MP078warn-extension-version-pinCREATE EXTENSION without VERSION = non-deterministic
MP079warn-rls-policy-completenessRLS policies don't cover all operations (SELECT/INSERT/UPDATE/DELETE)
MP080ban-data-in-migrationDML (INSERT/UPDATE/DELETE) mixed with DDL in same migration

Production Context (Pro)

RuleNameWhat it catches
MP013high-traffic-table-ddlDDL on tables with 10K+ queries
MP014large-table-ddlLong-held locks on tables with 1M+ rows
MP019no-exclusive-lock-high-connectionsACCESS EXCLUSIVE with many active connections

Auto-fix: 12 rules can be automatically fixed with --fix: MP001, MP004, MP009, MP020, MP021, MP023, MP030, MP033, MP037, MP040, MP041, MP046.


Features

Auto-fix

bash
migrationpilot analyze migration.sql --fix         # Fix in-place
migrationpilot analyze migration.sql --fix --dry-run  # Preview changes

Automatically fixes: missing CONCURRENTLY, lock_timeout, statement_timeout, NOT VALID on CHECK constraints, CONCURRENTLY on materialized view refresh.

Framework Detection

bash
migrationpilot detect
# → Detected: Prisma (prisma/migrations/)

Detects 14 frameworks: Flyway, Liquibase, Alembic, Django, Knex, Prisma, TypeORM, Drizzle, Sequelize, goose, dbmate, Sqitch, Rails, Ecto.

Watch Mode

bash
migrationpilot watch migrations/ --pattern "*.sql"

Re-analyzes on file changes with intelligent debouncing.

Pre-commit Hook

bash
migrationpilot hook install   # Installs git hook (supports Husky)
migrationpilot hook uninstall

Execution Plan

bash
migrationpilot plan migration.sql

Visual timeline showing lock type, duration estimate, blocking impact, and transaction boundaries for each statement.

Schema Drift Detection

bash
migrationpilot drift \
  --source postgresql://localhost/production \
  --target postgresql://localhost/staging

Compares tables, columns, indexes, constraints, and sequences between two databases.

MCP Server

MigrationPilot includes a Model Context Protocol server for AI assistant integration:

json
{
  "mcpServers": {
    "migrationpilot": {
      "command": "npx",
      "args": ["migrationpilot-mcp"]
    }
  }
}

Exposes 4 tools: analyze_migration, suggest_fix, explain_lock, list_rules.

Historical Trends

bash
migrationpilot trends

Stores analysis results in ~/.migrationpilot/history/ and tracks risk scores, violation counts, and improvement trends over time.

Air-Gapped Mode

bash
migrationpilot analyze migration.sql --offline

Skips update checks and network access. Ed25519 license keys validate entirely client-side — no phone-home, no telemetry.

Config File

yaml
# .migrationpilotrc.yml
extends: "migrationpilot:strict"   # Built-in preset
pgVersion: 16
failOn: warning
rules:
  MP037: false                     # Disable a rule
  MP004:
    severity: warning              # Downgrade severity
  MP013:
    threshold: 5000                # Custom threshold
ignore:
  - "migrations/seed_*.sql"

Five built-in presets: migrationpilot:recommended (default), migrationpilot:strict (all rules at critical, fail on warning), migrationpilot:ci, migrationpilot:startup (relaxed for early-stage), migrationpilot:enterprise (maximum safety with audit logging).

Inline Disable

sql
-- migrationpilot-disable MP001
CREATE INDEX idx_users_email ON users (email);

-- migrationpilot-disable-next-line MP004
ALTER TABLE users ADD COLUMN bio TEXT;

Severity Overrides

Override any rule's severity via config:

yaml
rules:
  MP009:
    severity: critical   # Upgrade from warning to critical
  MP007:
    severity: warning    # Downgrade from critical to warning

PG Version Awareness

Rules adapt their advice based on --pg-version:

  • PG 10+: Recommends GENERATED ALWAYS AS IDENTITY over SERIAL
  • PG 11+: Non-volatile ADD COLUMN DEFAULT is safe (no rewrite)
  • PG 12+: REINDEX CONCURRENTLY available
  • PG 14+: DETACH PARTITION CONCURRENTLY available
  • PG 18+: SET NOT NULL NOT VALID + VALIDATE NOT NULL pattern

Output Formats

FormatFlagUse case
Text--format text (default)Terminal with colors, tables, risk bars
JSON--format jsonScripting, CI pipelines (versioned $schema)
SARIF--format sarifGitHub Code Scanning, VS Code, IntelliJ
Markdown--format markdownDocs, wikis, Notion
Quiet--quietOne-line-per-violation (gcc-style)
Verbose--verbosePer-statement PASS/FAIL for all 83 rules

JSON Schema

JSON output includes a $schema URL and version field for reliable parsing:

json
{
  "$schema": "https://migrationpilot.dev/schemas/report-v1.json",
  "version": "1.5.0",
  "file": "migrations/001.sql",
  "riskLevel": "RED",
  "riskScore": 80,
  "violations": [...]
}

SARIF

bash
# CLI
migrationpilot analyze migration.sql --format sarif > results.sarif

# GitHub Action (auto-generated)
- uses: github/codeql-action/upload-sarif@v3
  with:
    sarif_file: ${{ steps.migration-check.outputs.sarif-file }}

Production Context (Pro)

With a Pro license + --database-url, MigrationPilot connects read-only to query system catalogs:

Data SourceWhat it provides
pg_classTable row counts, total size, index count
pg_stat_statementsAffected queries by call frequency
pg_stat_activityActive connections on target tables

This data feeds into risk scoring (table size 0-30 pts, query frequency 0-30 pts) and unlocks rules MP013, MP014, MP019.

Safety: Only reads pg_stat_* and pg_class. Never reads user data. Never runs DDL. Single read-only connection with timeouts.


Risk Scoring

FactorWeightFreePro
Lock Severity0-40YesYes
Table Size0-30Yes
Query Frequency0-30Yes
LevelScoreMeaning
GREEN0-24Safe to deploy
YELLOW25-49Review recommended
RED50-100Dangerous — use safe alternative

GitHub Action

Inputs

InputDescriptionRequiredDefault
migration-pathGlob pattern for SQL filesYes
github-tokenGitHub token for PR commentsNo${{ github.token }}
license-keyPro license keyNo
database-urlPostgreSQL connection (Pro)No
pg-versionTarget PostgreSQL versionNo17
fail-onFail threshold: critical, warning, neverNocritical

Outputs

OutputDescription
risk-levelOverall: RED, YELLOW, or GREEN
violationsTotal violation count
sarif-filePath to SARIF file for Code Scanning upload

With Production Context + Code Scanning (Pro)

yaml
- uses: mickelsamuel/migrationpilot@v1
  id: migration-check
  with:
    migration-path: "migrations/*.sql"
    license-key: ${{ secrets.MIGRATIONPILOT_LICENSE_KEY }}
    database-url: ${{ secrets.DATABASE_URL }}
    fail-on: critical

- uses: github/codeql-action/upload-sarif@v3
  if: always()
  with:
    sarif_file: ${{ steps.migration-check.outputs.sarif-file }}

PostgreSQL Lock Reference

DDL OperationLock TypeBlocks ReadsBlocks WritesLong-held
CREATE INDEXSHARENoYesYes
CREATE INDEX CONCURRENTLYSHARE UPDATE EXCLUSIVENoNoNo
DROP INDEXACCESS EXCLUSIVEYesYesNo
DROP INDEX CONCURRENTLYSHARE UPDATE EXCLUSIVENoNoNo
ADD COLUMN (no default)ACCESS EXCLUSIVEYesYesNo
ADD COLUMN DEFAULT now()ACCESS EXCLUSIVEYesYesYes*
ALTER COLUMN TYPEACCESS EXCLUSIVEYesYesYes
SET NOT NULLACCESS EXCLUSIVEYesYesYes
ADD CONSTRAINT FKACCESS EXCLUSIVEYesYesYes
ADD CONSTRAINT FK NOT VALIDACCESS EXCLUSIVEYesYesNo
VALIDATE CONSTRAINTSHARE UPDATE EXCLUSIVENoNoNo
RENAME COLUMN/TABLEACCESS EXCLUSIVEYesYesNo
DROP TABLEACCESS EXCLUSIVEYesYesNo
TRUNCATEACCESS EXCLUSIVEYesYesNo
CLUSTERACCESS EXCLUSIVEYesYesYes
VACUUM FULLACCESS EXCLUSIVEYesYesYes
VACUUMSHARE UPDATE EXCLUSIVENoNoNo
REINDEXACCESS EXCLUSIVEYesYesYes
REINDEX CONCURRENTLYSHARE UPDATE EXCLUSIVENoNoNo
REFRESH MATERIALIZED VIEWACCESS EXCLUSIVEYesYesYes
REFRESH MATERIALIZED VIEW CONCURRENTLYSHARE UPDATE EXCLUSIVENoNoNo
SET LOGGED/UNLOGGEDACCESS EXCLUSIVEYesYesYes
DETACH PARTITIONACCESS EXCLUSIVEYesYesNo
DETACH PARTITION CONCURRENTLYSHARE UPDATE EXCLUSIVENoNoNo

* On PG < 11, volatile defaults cause full table rewrite. On PG 11+, evaluated per-row at read time.


Comparison

MigrationPilotSquawkAtlas
Total rules8031~15
Free rules77310 (paywalled since v0.38)
Auto-fix12 rules00
Output formats6 (text, JSON, SARIF, markdown, quiet, verbose)32
Framework detection14 frameworks00
Watch modeYesNoNo
Pre-commit hooksYesNoNo
Execution planYesNoNo
Config file + presets5 presets00
PG-version-awareFull (9-20)PartialPartial
Programmatic APINode.jsNoGo
GitHub ActionYesYesYes
SARIF for Code ScanningYesNoNo
Inline disable commentsYesYesNo
Open sourceMITApache 2.0Proprietary

Pricing

FeatureFreePro ($19/mo)Team ($49/mo)Enterprise
80 free safety rules (83 total)YesYesYesYes
All output formats + GitHub ActionYesYesYesYes
Auto-fix (12 rules)YesYesYesYes
Config file + 5 presetsYesYesYesYes
Production contextYesYesYes
Production rules (MP013, MP014, MP019)YesYesYes
Custom rules plugin APIYesYes
Team seats1Up to 10Unlimited
Team seat managementYesYes
Audit loggingYesYes
Policy enforcementYes
SSO / SAML + air-gapped modeYes

Get a license key at migrationpilot.dev.


Architecture

code
src/
├── parser/        # DDL parsing with libpg-query WASM (actual PG parser)
├── locks/         # Lock type classification (pure lookup table)
├── rules/         # 83 safety rules (MP001-MP083), engine, registry, helpers
├── production/    # Production context queries (Pro: pg_stat_*, pg_class)
├── scoring/       # Risk scoring (RED/YELLOW/GREEN, 0-100)
├── generator/     # Safe migration SQL generation
├── output/        # CLI, JSON, SARIF, markdown, PR comment, execution plan
├── analysis/      # Shared pipeline, transaction boundaries, migration ordering
├── fixer/         # Auto-fix engine (12 rules)
├── frameworks/    # Migration framework detection (14 frameworks)
├── watch/         # File watcher with debounce
├── hooks/         # Git pre-commit hook installer
├── config/        # Config loader (.migrationpilotrc.yml, presets)
├── license/       # Ed25519 license key validation
├── billing/       # Stripe checkout + webhook + email delivery
├── action/        # GitHub Action entry point
├── doctor/        # Diagnostic checks (node version, config, latest version)
├── completion/    # Shell completion generators (bash/zsh/fish)
├── update/        # npm update checker with cache
├── prompts/       # One-time prompts (GitHub star)
├── drift/         # Schema drift detection (compare two DBs)
├── history/       # Historical analysis storage and trends
├── audit/         # Enterprise audit logging (JSONL)
├── mcp/           # MCP server (Model Context Protocol)
├── usage/         # Free usage tracking (3 production analyses/month)
├── team/          # Team management (seats, members, activity)
├── policy/        # Policy enforcement (required rules, severity floors)
├── auth/          # SSO authentication (device code flow, API keys)
├── index.ts       # Programmatic API (45+ exports)
└── cli.ts         # CLI entry point (20 commands)

Programmatic API

typescript
import { analyzeSQL, allRules, parseMigration, classifyLock } from 'migrationpilot';

const result = await analyzeSQL(sql, 'migration.sql', 16, allRules);
console.log(result.violations);
console.log(result.overallRisk);

Full TypeScript types included.


Development

bash
pnpm install
pnpm test          # 970+ tests across 54 files
pnpm dev analyze path/to/migration.sql
pnpm build         # CLI 1.0MB, Action 1.6MB, API 390KB, MCP 1.2MB
pnpm lint
pnpm typecheck

Contributing

See CONTRIBUTING.md for development setup and guidelines.

Security

See SECURITY.md for our security policy.

License

MIT

常见问题

io.github.mickelsamuel/migrationpilot 是什么?

在生产前用 80 条规则拦截危险的 PostgreSQL migration,支持 lock analysis 与自动修复。

相关 Skills

迁移架构师

by alirezarezvani

Universal
热门

为数据库、API 与基础设施迁移制定分阶段零停机方案,提前校验兼容性与风险,生成回滚策略、验证关卡和时间线,适合复杂系统平滑切换。

做数据库与存储迁移时,用它统一梳理表结构和数据搬迁流程,架构视角更完整,复杂迁移也更稳。

数据与存储
未扫描9.0k

数据库建模

by alirezarezvani

Universal
热门

把需求梳理成关系型数据库表结构,自动生成迁移脚本、TypeScript/Python 类型、种子数据、RLS 策略和索引方案,适合多租户、审计追踪、软删除等后端建模与 Schema 评审场景。

把数据库结构设计、ER图梳理和SQL建模放到一处,复杂业务也能快速统一数据模式,少走不少返工弯路。

数据与存储
未扫描9.0k

资深数据工程师

by alirezarezvani

Universal
热门

聚焦生产级数据工程,覆盖 ETL/ELT、批处理与流式管道、数据建模、Airflow/dbt/Spark 优化和数据质量治理,适合设计数据架构、搭建现代数据栈与排查性能问题。

复杂数据管道、ETL/ELT 和治理难题交给它,凭 Spark、Airflow、dbt 等现代数据栈经验,能更稳地搭起可扩展的数据基础设施。

数据与存储
未扫描9.0k

相关 MCP Server

by Anthropic

热门

PostgreSQL 是让 Claude 直接查询和管理你的数据库的 MCP 服务器。

这个服务器解决了开发者需要手动编写 SQL 查询的痛点,特别适合数据分析师或后端开发者快速探索数据库结构。不过,由于是参考实现,生产环境使用前务必评估安全风险,别指望它能处理复杂事务。

数据与存储
82.9k

SQLite 数据库

编辑精选

by Anthropic

热门

SQLite 是让 AI 直接查询本地数据库进行数据分析的 MCP 服务器。

这个服务器解决了 AI 无法直接访问 SQLite 数据库的问题,适合需要快速分析本地数据集的开发者。不过,作为参考实现,它可能缺乏生产级的安全特性,建议在受控环境中使用。

数据与存储
82.9k

by Firecrawl

热门

Firecrawl 是让 AI 直接抓取网页并提取结构化数据的 MCP 服务器。

它解决了手动写爬虫的麻烦,让 Claude 能直接访问动态网页内容。最适合需要实时数据的研究者或开发者,比如监控竞品价格或抓取新闻。但要注意,它依赖第三方 API,可能涉及隐私和成本问题。

数据与存储
5.9k

评论