CLIO Compression
效率与工作流by iowarp
Compression MCP server implementation using Model Context Protocol
什么是 CLIO Compression?
Compression MCP server implementation using Model Context Protocol
README
CLIO Kit
<!-- mcp-name: io.github.iowarp/adios-mcp --> <!-- mcp-name: io.github.iowarp/arxiv-mcp --> <!-- mcp-name: io.github.iowarp/chronolog-mcp --> <!-- mcp-name: io.github.iowarp/compression-mcp --> <!-- mcp-name: io.github.iowarp/darshan-mcp --> <!-- mcp-name: io.github.iowarp/geo-mcp --> <!-- mcp-name: io.github.iowarp/geojson-mcp --> <!-- mcp-name: io.github.iowarp/hdf5-mcp --> <!-- mcp-name: io.github.iowarp/jarvis-mcp --> <!-- mcp-name: io.github.iowarp/lmod-mcp --> <!-- mcp-name: io.github.iowarp/ndp-mcp --> <!-- mcp-name: io.github.iowarp/node-hardware-mcp --> <!-- mcp-name: io.github.iowarp/pandas-mcp --> <!-- mcp-name: io.github.iowarp/parallel-sort-mcp --> <!-- mcp-name: io.github.iowarp/paraview-mcp --> <!-- mcp-name: io.github.iowarp/parquet-mcp --> <!-- mcp-name: io.github.iowarp/plot-mcp --> <!-- mcp-name: io.github.iowarp/sac-mcp --> <!-- mcp-name: io.github.iowarp/seismic-mcp --> <!-- mcp-name: io.github.iowarp/slurm-mcp --> <!-- mcp-name: io.github.iowarp/terrain-mcp -->CLIO Kit - Part of the IoWarp platform's tooling layer for AI agents. A comprehensive collection of tools, skills, plugins, and extensions. Currently featuring 15+ Model Context Protocol (MCP) servers for scientific computing, with plans to expand to additional agent capabilities. Enables AI agents to interact with HPC resources, scientific data formats, and research datasets.
Chat with us on Zulip or join us
Developed by <img src="https://grc.iit.edu/img/logo.png" alt="GRC Logo" width="18" height="18"> Gnosis Research Center
❌ Without CLIO Kit
Working with scientific data and HPC resources requires manual scripting and tool-specific knowledge:
- ❌ Write custom scripts for every HDF5/Parquet file exploration
- ❌ Manually craft Slurm job submission scripts
- ❌ Switch between multiple tools for data analysis
- ❌ No AI assistance for scientific workflows
- ❌ Repetitive coding for common research tasks
✅ With CLIO Kit
AI agents handle scientific computing tasks through natural language:
- ✅ "Analyze the temperature dataset in this HDF5 file" - HDF5 MCP does it
- ✅ "Submit this simulation to Slurm with 32 cores" - Slurm MCP handles it
- ✅ "Find papers on neural networks from ArXiv" - ArXiv MCP searches
- ✅ "Plot the results from this CSV file" - Plot MCP visualizes
- ✅ "Optimize memory usage for this pandas DataFrame" - Pandas MCP optimizes
- ✅ "Find all documents where pressure exceeds 200 kPa" - Agentic Search retrieves
One unified interface. 16 MCP servers. Hybrid search engine. 150+ specialized tools. Built for research.
CLIO Kit is part of the IoWarp platform's comprehensive tooling ecosystem for AI agents. It brings AI assistance to your scientific computing workflow—whether you're analyzing terabytes of HDF5 data, managing Slurm jobs across clusters, or exploring research papers. Built by researchers, for researchers, at Illinois Institute of Technology with NSF support.
Part of IoWarp Platform: CLIO Kit is the tooling layer of the IoWarp platform, providing skills, plugins, and extensions for AI agents working in scientific computing environments.
One simple command. Production-ready, fully typed, MIT licensed, and beta-tested in real HPC environments.
🚀 Quick Installation
One Command for Any Server
# List all 16 available MCP servers
uvx clio-kit mcp-servers
# Run any server instantly
uvx clio-kit mcp-server hdf5
uvx clio-kit mcp-server pandas
uvx clio-kit mcp-server slurm
# Agentic search — hybrid retrieval for scientific corpora
uvx clio-kit search serve # Start search API server
uvx clio-kit search query --namespace local_fs --q "pressure > 200 kPa"
# AI prompts also available
uvx clio-kit prompts # List all prompts
uvx clio-kit prompt code-coverage-prompt # Use a prompt
Add to your Cursor ~/.cursor/mcp.json:
{
"mcpServers": {
"hdf5-mcp": {
"command": "uvx",
"args": ["clio-kit", "mcp-server", "hdf5"]
},
"pandas-mcp": {
"command": "uvx",
"args": ["clio-kit", "mcp-server", "pandas"]
},
"slurm-mcp": {
"command": "uvx",
"args": ["clio-kit", "mcp-server", "slurm"]
}
}
}
See Cursor MCP docs for more info.
</details> <details> <summary><b>Install in Claude Code</b></summary># Add HDF5 MCP
claude mcp add hdf5-mcp -- uvx clio-kit mcp-server hdf5
# Add Pandas MCP
claude mcp add pandas-mcp -- uvx clio-kit mcp-server pandas
# Add Slurm MCP
claude mcp add slurm-mcp -- uvx clio-kit mcp-server slurm
See Claude Code MCP docs for more info.
</details> <details> <summary><b>Install in VS Code</b></summary>Add to your VS Code MCP config:
"mcp": {
"servers": {
"hdf5-mcp": {
"type": "stdio",
"command": "uvx",
"args": ["clio-kit", "mcp-server", "hdf5"]
},
"pandas-mcp": {
"type": "stdio",
"command": "uvx",
"args": ["clio-kit", "mcp-server", "pandas"]
}
}
}
See VS Code MCP docs for more info.
</details> <details> <summary><b>Install in Claude Desktop</b></summary>Edit claude_desktop_config.json:
{
"mcpServers": {
"hdf5-mcp": {
"command": "uvx",
"args": ["clio-kit", "mcp-server", "hdf5"]
},
"arxiv-mcp": {
"command": "uvx",
"args": ["clio-kit", "mcp-server", "arxiv"]
}
}
}
See Claude Desktop MCP docs for more info.
</details>Available Packages
<div align="center">| 📦 Package | 📌 Ver | 🔧 System | 📋 Description | ⚡ Install Command |
|---|---|---|---|---|
adios | 2.0.1 | Data I/O | Read data using ADIOS2 engine | uvx clio-kit mcp-server adios |
arxiv | 2.0.1 | Research | Fetch research papers from ArXiv | uvx clio-kit mcp-server arxiv |
chronolog | 2.0.1 | Logging | Log and retrieve data from ChronoLog | uvx clio-kit mcp-server chronolog |
compression | 2.0.1 | Utilities | File compression with gzip | uvx clio-kit mcp-server compression |
darshan | 2.0.1 | Performance | I/O performance trace analysis | uvx clio-kit mcp-server darshan |
hdf5 | 2.0.1 | Data I/O | HPC-optimized scientific data with 27 tools, AI insights, caching, streaming | uvx clio-kit mcp-server hdf5 |
jarvis | 2.0.1 | Workflow | Data pipeline lifecycle management | uvx clio-kit mcp-server jarvis |
lmod | 2.0.1 | Environment | Environment module management | uvx clio-kit mcp-server lmod |
ndp | 2.0.1 | Data Protocol | Search and discover datasets across CKAN instances | uvx clio-kit mcp-server ndp |
node-hardware | 2.0.1 | System | System hardware information | uvx clio-kit mcp-server node-hardware |
pandas | 2.0.1 | Data Analysis | CSV data loading and filtering | uvx clio-kit mcp-server pandas |
parallel-sort | 2.0.1 | Computing | Large file sorting | uvx clio-kit mcp-server parallel-sort |
paraview | 2.0.1 | Visualization | Scientific 3D visualization and analysis | uvx clio-kit mcp-server paraview |
parquet | 2.0.1 | Data I/O | Read Parquet file columns | uvx clio-kit mcp-server parquet |
plot | 2.0.1 | Visualization | Generate plots from CSV data | uvx clio-kit mcp-server plot |
slurm | 2.0.1 | HPC | Job submission and management | uvx clio-kit mcp-server slurm |
Agentic Search
Hybrid retrieval engine for scientific corpora — combines lexical (BM25), vector, graph, and scientific search (numeric range, unit matching, formula targeting) over namespaced document collections. DuckDB storage, FastAPI, async job queue, OpenTelemetry tracing, Prometheus metrics.
# Start the search API server
uvx clio-kit search serve
# Index documents from a namespace
uvx clio-kit search index --namespace local_fs
# Query with scientific operators
uvx clio-kit search query --namespace local_fs --q "pressure between 190 and 360 kPa"
# List indexed documents
uvx clio-kit search list --namespace local_fs
API endpoints: /query, /jobs/index, /documents, /health, /metrics — full docs
📖 Usage Examples
HDF5: Scientific Data Analysis
"What datasets are in climate_simulation.h5? Show me the temperature field structure and read the first 100 timesteps."
Tools used: open_file, analyze_dataset_structure, read_partial_dataset, list_attributes
Slurm: HPC Job Management
"Submit simulation.py to Slurm with 32 cores, 64GB memory, 24-hour runtime. Monitor progress and retrieve output when complete."
Tools used: submit_slurm_job, check_job_status, get_job_output
ArXiv: Research Discovery
"Find the latest papers on diffusion models from ArXiv, get details on the top 3, and export citations to BibTeX."
Tools used: search_arxiv, get_paper_details, export_to_bibtex, download_paper_pdf
Pandas: Data Processing
"Load sales_data.csv, clean missing values, compute statistics by region, and save as Parquet with compression."
Tools used: load_data, handle_missing_data, groupby_operations, save_data
Plot: Data Visualization
"Create a line plot showing temperature trends over time from weather.csv with proper axis labels."
Tools used: line_plot, data_info
Agentic Search: Scientific Retrieval
"Find all chunks mentioning pressure above 200 kPa in the local_fs namespace."
CLI: uvx clio-kit search query --namespace local_fs --q "pressure > 200 kPa"
🚨 Troubleshooting
<details> <summary><b>Server Not Found Error</b></summary>If uvx clio-kit mcp-server <server-name> fails:
# Verify server name is correct
uvx clio-kit mcp-servers
# Common names: hdf5, pandas, slurm, arxiv (not hdf5-mcp, pandas-mcp)
For development or local testing:
cd clio-kit-mcp-servers/hdf5
uv sync --all-extras --dev
uv run hdf5-mcp
Install uv package manager:
# Linux/macOS
curl -LsSf https://astral.sh/uv/install.sh | sh
# Windows
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
# Or via pip
pip install uv
Team
- Gnosis Research Center (GRC) - Illinois Institute of Technology | Lead
- HDF Group - Data format and library developers | Industry Partner
- University of Utah - Research collaboration | Domain Science Partner
Sponsored By
<img src="https://www.nsf.gov/themes/custom/nsf_theme/components/molecules/logo/logo-desktop.png" alt="NSF Logo" width="24" height="24"> NSF (National Science Foundation) - Supporting scientific computing research and AI integration initiatives
we welcome more sponsorships. please contact the Principal Investigator
Ways to Contribute
- Submit Issues: Report bugs or request features via GitHub Issues
- Develop New MCPs: Add servers for your research tools (CONTRIBUTING.md)
- Improve Documentation: Help make guides clearer
- Share Use Cases: Tell us how you're using CLIO Kit in your research
Full Guide: CONTRIBUTING.md
Community & Support
- Chat: Zulip Community
- Join: Invitation Link
- Issues: GitHub Issues
- Discussions: GitHub Discussions
- Website: https://docs.iowarp.ai/
- Project: IOWarp Project
常见问题
CLIO Compression 是什么?
Compression MCP server implementation using Model Context Protocol
相关 Skills
技能工坊
by anthropics
覆盖 Skill 从创建到迭代优化全流程:起草能力、补测试提示、跑评测与基准方差分析,并持续改写内容和描述,提升效果与触发准确率。
✎ 技能工坊把技能从创建、迭代到评测串成闭环,方差分析加描述优化,特别适合把触发准确率打磨得更稳。
PPT处理
by anthropics
处理 .pptx 全流程:创建演示文稿、提取和解析幻灯片内容、批量修改现有文件,支持模板套用、合并拆分、备注评论与版式调整。
✎ 涉及PPTX的创建、解析、修改到合并拆分都能一站搞定,连备注、模板和评论也能处理,做演示文稿特别省心。
PDF处理
by anthropics
遇到 PDF 读写、文本表格提取、合并拆分、旋转加水印、表单填写或加解密时直接用它,也能提取图片、生成新 PDF,并把扫描件通过 OCR 变成可搜索文档。
✎ PDF杂活别再来回切工具了,文本表格提取、合并拆分到OCR识别一次搞定,连扫描件也能变可搜索。
相关 MCP Server
文件系统
编辑精选by Anthropic
Filesystem 是 MCP 官方参考服务器,让 LLM 安全读写本地文件系统。
✎ 这个服务器解决了让 Claude 直接操作本地文件的痛点,比如自动整理文档或生成代码文件。适合需要自动化文件处理的开发者,但注意它只是参考实现,生产环境需自行加固安全。
by wonderwhy-er
Desktop Commander 是让 AI 直接执行终端命令、管理文件和进程的 MCP 服务器。
✎ 这工具解决了 AI 无法直接操作本地环境的痛点,适合需要自动化脚本调试或文件批量处理的开发者。它能让你用自然语言指挥终端,但权限控制需谨慎,毕竟让 AI 执行 rm -rf 可不是闹着玩的。
by stickerdaniel
LinkedIn Profile and Job Scraper 是让 Claude 直接抓取 LinkedIn 个人资料、公司信息和职位详情的工具。
✎ 这个服务器解决了招聘和商业调研中手动复制粘贴 LinkedIn 数据的痛点,适合猎头或市场分析师快速获取候选人背景和公司动态。不过,LinkedIn 反爬机制频繁更新,数据稳定性需要持续维护,使用时建议搭配人工验证。