CLIO Compression

效率与工作流

by iowarp

Compression MCP server implementation using Model Context Protocol

什么是 CLIO Compression

Compression MCP server implementation using Model Context Protocol

README

CLIO Kit

<!-- mcp-name: io.github.iowarp/adios-mcp --> <!-- mcp-name: io.github.iowarp/arxiv-mcp --> <!-- mcp-name: io.github.iowarp/chronolog-mcp --> <!-- mcp-name: io.github.iowarp/compression-mcp --> <!-- mcp-name: io.github.iowarp/darshan-mcp --> <!-- mcp-name: io.github.iowarp/geo-mcp --> <!-- mcp-name: io.github.iowarp/geojson-mcp --> <!-- mcp-name: io.github.iowarp/hdf5-mcp --> <!-- mcp-name: io.github.iowarp/jarvis-mcp --> <!-- mcp-name: io.github.iowarp/lmod-mcp --> <!-- mcp-name: io.github.iowarp/ndp-mcp --> <!-- mcp-name: io.github.iowarp/node-hardware-mcp --> <!-- mcp-name: io.github.iowarp/pandas-mcp --> <!-- mcp-name: io.github.iowarp/parallel-sort-mcp --> <!-- mcp-name: io.github.iowarp/paraview-mcp --> <!-- mcp-name: io.github.iowarp/parquet-mcp --> <!-- mcp-name: io.github.iowarp/plot-mcp --> <!-- mcp-name: io.github.iowarp/sac-mcp --> <!-- mcp-name: io.github.iowarp/seismic-mcp --> <!-- mcp-name: io.github.iowarp/slurm-mcp --> <!-- mcp-name: io.github.iowarp/terrain-mcp -->

License: BSD-3-Clause PyPI version Python FastMCP CI Coverage

MCP Servers Ruff Type Checked Package Manager Security Audit

CLIO Kit - Part of the IoWarp platform's tooling layer for AI agents. A comprehensive collection of tools, skills, plugins, and extensions. Currently featuring 15+ Model Context Protocol (MCP) servers for scientific computing, with plans to expand to additional agent capabilities. Enables AI agents to interact with HPC resources, scientific data formats, and research datasets.

Website | IOWarp

Chat with us on Zulip or join us

Developed by <img src="https://grc.iit.edu/img/logo.png" alt="GRC Logo" width="18" height="18"> Gnosis Research Center


❌ Without CLIO Kit

Working with scientific data and HPC resources requires manual scripting and tool-specific knowledge:

  • ❌ Write custom scripts for every HDF5/Parquet file exploration
  • ❌ Manually craft Slurm job submission scripts
  • ❌ Switch between multiple tools for data analysis
  • ❌ No AI assistance for scientific workflows
  • ❌ Repetitive coding for common research tasks

✅ With CLIO Kit

AI agents handle scientific computing tasks through natural language:

  • "Analyze the temperature dataset in this HDF5 file" - HDF5 MCP does it
  • "Submit this simulation to Slurm with 32 cores" - Slurm MCP handles it
  • "Find papers on neural networks from ArXiv" - ArXiv MCP searches
  • "Plot the results from this CSV file" - Plot MCP visualizes
  • "Optimize memory usage for this pandas DataFrame" - Pandas MCP optimizes
  • "Find all documents where pressure exceeds 200 kPa" - Agentic Search retrieves

One unified interface. 16 MCP servers. Hybrid search engine. 150+ specialized tools. Built for research.

CLIO Kit is part of the IoWarp platform's comprehensive tooling ecosystem for AI agents. It brings AI assistance to your scientific computing workflow—whether you're analyzing terabytes of HDF5 data, managing Slurm jobs across clusters, or exploring research papers. Built by researchers, for researchers, at Illinois Institute of Technology with NSF support.

Part of IoWarp Platform: CLIO Kit is the tooling layer of the IoWarp platform, providing skills, plugins, and extensions for AI agents working in scientific computing environments.

One simple command. Production-ready, fully typed, MIT licensed, and beta-tested in real HPC environments.

🚀 Quick Installation

One Command for Any Server

bash
# List all 16 available MCP servers
uvx clio-kit mcp-servers

# Run any server instantly
uvx clio-kit mcp-server hdf5
uvx clio-kit mcp-server pandas
uvx clio-kit mcp-server slurm

# Agentic search — hybrid retrieval for scientific corpora
uvx clio-kit search serve               # Start search API server
uvx clio-kit search query --namespace local_fs --q "pressure > 200 kPa"

# AI prompts also available
uvx clio-kit prompts                    # List all prompts
uvx clio-kit prompt code-coverage-prompt # Use a prompt
<details> <summary><b>Install in Cursor</b></summary>

Add to your Cursor ~/.cursor/mcp.json:

json
{
  "mcpServers": {
    "hdf5-mcp": {
      "command": "uvx",
      "args": ["clio-kit", "mcp-server", "hdf5"]
    },
    "pandas-mcp": {
      "command": "uvx",
      "args": ["clio-kit", "mcp-server", "pandas"]
    },
    "slurm-mcp": {
      "command": "uvx",
      "args": ["clio-kit", "mcp-server", "slurm"]
    }
  }
}

See Cursor MCP docs for more info.

</details> <details> <summary><b>Install in Claude Code</b></summary>
bash
# Add HDF5 MCP
claude mcp add hdf5-mcp -- uvx clio-kit mcp-server hdf5

# Add Pandas MCP
claude mcp add pandas-mcp -- uvx clio-kit mcp-server pandas

# Add Slurm MCP
claude mcp add slurm-mcp -- uvx clio-kit mcp-server slurm

See Claude Code MCP docs for more info.

</details> <details> <summary><b>Install in VS Code</b></summary>

Add to your VS Code MCP config:

json
"mcp": {
  "servers": {
    "hdf5-mcp": {
      "type": "stdio",
      "command": "uvx",
      "args": ["clio-kit", "mcp-server", "hdf5"]
    },
    "pandas-mcp": {
      "type": "stdio",
      "command": "uvx",
      "args": ["clio-kit", "mcp-server", "pandas"]
    }
  }
}

See VS Code MCP docs for more info.

</details> <details> <summary><b>Install in Claude Desktop</b></summary>

Edit claude_desktop_config.json:

json
{
  "mcpServers": {
    "hdf5-mcp": {
      "command": "uvx",
      "args": ["clio-kit", "mcp-server", "hdf5"]
    },
    "arxiv-mcp": {
      "command": "uvx",
      "args": ["clio-kit", "mcp-server", "arxiv"]
    }
  }
}

See Claude Desktop MCP docs for more info.

</details>

Available Packages

<div align="center">
📦 Package📌 Ver🔧 System📋 DescriptionInstall Command
adios2.0.1Data I/ORead data using ADIOS2 engineuvx clio-kit mcp-server adios
arxiv2.0.1ResearchFetch research papers from ArXivuvx clio-kit mcp-server arxiv
chronolog2.0.1LoggingLog and retrieve data from ChronoLoguvx clio-kit mcp-server chronolog
compression2.0.1UtilitiesFile compression with gzipuvx clio-kit mcp-server compression
darshan2.0.1PerformanceI/O performance trace analysisuvx clio-kit mcp-server darshan
hdf52.0.1Data I/OHPC-optimized scientific data with 27 tools, AI insights, caching, streaminguvx clio-kit mcp-server hdf5
jarvis2.0.1WorkflowData pipeline lifecycle managementuvx clio-kit mcp-server jarvis
lmod2.0.1EnvironmentEnvironment module managementuvx clio-kit mcp-server lmod
ndp2.0.1Data ProtocolSearch and discover datasets across CKAN instancesuvx clio-kit mcp-server ndp
node-hardware2.0.1SystemSystem hardware informationuvx clio-kit mcp-server node-hardware
pandas2.0.1Data AnalysisCSV data loading and filteringuvx clio-kit mcp-server pandas
parallel-sort2.0.1ComputingLarge file sortinguvx clio-kit mcp-server parallel-sort
paraview2.0.1VisualizationScientific 3D visualization and analysisuvx clio-kit mcp-server paraview
parquet2.0.1Data I/ORead Parquet file columnsuvx clio-kit mcp-server parquet
plot2.0.1VisualizationGenerate plots from CSV datauvx clio-kit mcp-server plot
slurm2.0.1HPCJob submission and managementuvx clio-kit mcp-server slurm
</div>

Agentic Search

Hybrid retrieval engine for scientific corpora — combines lexical (BM25), vector, graph, and scientific search (numeric range, unit matching, formula targeting) over namespaced document collections. DuckDB storage, FastAPI, async job queue, OpenTelemetry tracing, Prometheus metrics.

bash
# Start the search API server
uvx clio-kit search serve

# Index documents from a namespace
uvx clio-kit search index --namespace local_fs

# Query with scientific operators
uvx clio-kit search query --namespace local_fs --q "pressure between 190 and 360 kPa"

# List indexed documents
uvx clio-kit search list --namespace local_fs

API endpoints: /query, /jobs/index, /documents, /health, /metricsfull docs


📖 Usage Examples

HDF5: Scientific Data Analysis

code
"What datasets are in climate_simulation.h5? Show me the temperature field structure and read the first 100 timesteps."

Tools used: open_file, analyze_dataset_structure, read_partial_dataset, list_attributes

Slurm: HPC Job Management

code
"Submit simulation.py to Slurm with 32 cores, 64GB memory, 24-hour runtime. Monitor progress and retrieve output when complete."

Tools used: submit_slurm_job, check_job_status, get_job_output

ArXiv: Research Discovery

code
"Find the latest papers on diffusion models from ArXiv, get details on the top 3, and export citations to BibTeX."

Tools used: search_arxiv, get_paper_details, export_to_bibtex, download_paper_pdf

Pandas: Data Processing

code
"Load sales_data.csv, clean missing values, compute statistics by region, and save as Parquet with compression."

Tools used: load_data, handle_missing_data, groupby_operations, save_data

Plot: Data Visualization

code
"Create a line plot showing temperature trends over time from weather.csv with proper axis labels."

Tools used: line_plot, data_info

Agentic Search: Scientific Retrieval

code
"Find all chunks mentioning pressure above 200 kPa in the local_fs namespace."

CLI: uvx clio-kit search query --namespace local_fs --q "pressure > 200 kPa"


🚨 Troubleshooting

<details> <summary><b>Server Not Found Error</b></summary>

If uvx clio-kit mcp-server <server-name> fails:

bash
# Verify server name is correct
uvx clio-kit mcp-servers

# Common names: hdf5, pandas, slurm, arxiv (not hdf5-mcp, pandas-mcp)
</details> <details> <summary><b>Import Errors or Missing Dependencies</b></summary>

For development or local testing:

bash
cd clio-kit-mcp-servers/hdf5
uv sync --all-extras --dev
uv run hdf5-mcp
</details> <details> <summary><b>uvx Command Not Found</b></summary>

Install uv package manager:

bash
# Linux/macOS
curl -LsSf https://astral.sh/uv/install.sh | sh

# Windows
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"

# Or via pip
pip install uv
</details>

Team

Sponsored By

<img src="https://www.nsf.gov/themes/custom/nsf_theme/components/molecules/logo/logo-desktop.png" alt="NSF Logo" width="24" height="24"> NSF (National Science Foundation) - Supporting scientific computing research and AI integration initiatives

we welcome more sponsorships. please contact the Principal Investigator

Ways to Contribute

  • Submit Issues: Report bugs or request features via GitHub Issues
  • Develop New MCPs: Add servers for your research tools (CONTRIBUTING.md)
  • Improve Documentation: Help make guides clearer
  • Share Use Cases: Tell us how you're using CLIO Kit in your research

Full Guide: CONTRIBUTING.md

Community & Support


常见问题

CLIO Compression 是什么?

Compression MCP server implementation using Model Context Protocol

相关 Skills

技能工坊

by anthropics

Universal
热门

覆盖 Skill 从创建到迭代优化全流程:起草能力、补测试提示、跑评测与基准方差分析,并持续改写内容和描述,提升效果与触发准确率。

技能工坊把技能从创建、迭代到评测串成闭环,方差分析加描述优化,特别适合把触发准确率打磨得更稳。

效率与工作流
未扫描150.9k

PPT处理

by anthropics

Universal
热门

处理 .pptx 全流程:创建演示文稿、提取和解析幻灯片内容、批量修改现有文件,支持模板套用、合并拆分、备注评论与版式调整。

涉及PPTX的创建、解析、修改到合并拆分都能一站搞定,连备注、模板和评论也能处理,做演示文稿特别省心。

效率与工作流
未扫描150.9k

PDF处理

by anthropics

Universal
热门

遇到 PDF 读写、文本表格提取、合并拆分、旋转加水印、表单填写或加解密时直接用它,也能提取图片、生成新 PDF,并把扫描件通过 OCR 变成可搜索文档。

PDF杂活别再来回切工具了,文本表格提取、合并拆分到OCR识别一次搞定,连扫描件也能变可搜索。

效率与工作流
未扫描150.9k

相关 MCP Server

文件系统

编辑精选

by Anthropic

热门

Filesystem 是 MCP 官方参考服务器,让 LLM 安全读写本地文件系统。

这个服务器解决了让 Claude 直接操作本地文件的痛点,比如自动整理文档或生成代码文件。适合需要自动化文件处理的开发者,但注意它只是参考实现,生产环境需自行加固安全。

效率与工作流
87.2k

by wonderwhy-er

热门

Desktop Commander 是让 AI 直接执行终端命令、管理文件和进程的 MCP 服务器。

这工具解决了 AI 无法直接操作本地环境的痛点,适合需要自动化脚本调试或文件批量处理的开发者。它能让你用自然语言指挥终端,但权限控制需谨慎,毕竟让 AI 执行 rm -rf 可不是闹着玩的。

效率与工作流
6.2k

by stickerdaniel

热门

LinkedIn Profile and Job Scraper 是让 Claude 直接抓取 LinkedIn 个人资料、公司信息和职位详情的工具。

这个服务器解决了招聘和商业调研中手动复制粘贴 LinkedIn 数据的痛点,适合猎头或市场分析师快速获取候选人背景和公司动态。不过,LinkedIn 反爬机制频繁更新,数据稳定性需要持续维护,使用时建议搭配人工验证。

效率与工作流
2.4k

评论