What is CLIO Darshan?
Darshan I/O profiler MCP server for analyzing I/O trace files
CLIO Kit
<!-- mcp-name: io.github.iowarp/adios-mcp --> <!-- mcp-name: io.github.iowarp/arxiv-mcp --> <!-- mcp-name: io.github.iowarp/chronolog-mcp --> <!-- mcp-name: io.github.iowarp/compression-mcp --> <!-- mcp-name: io.github.iowarp/darshan-mcp --> <!-- mcp-name: io.github.iowarp/hdf5-mcp --> <!-- mcp-name: io.github.iowarp/jarvis-mcp --> <!-- mcp-name: io.github.iowarp/lmod-mcp --> <!-- mcp-name: io.github.iowarp/ndp-mcp --> <!-- mcp-name: io.github.iowarp/node-hardware-mcp --> <!-- mcp-name: io.github.iowarp/pandas-mcp --> <!-- mcp-name: io.github.iowarp/parallel-sort-mcp --> <!-- mcp-name: io.github.iowarp/paraview-mcp --> <!-- mcp-name: io.github.iowarp/parquet-mcp --> <!-- mcp-name: io.github.iowarp/plot-mcp --> <!-- mcp-name: io.github.iowarp/slurm-mcp -->
CLIO Kit is part of the IoWarp platform's tooling layer for AI agents: a collection of tools, skills, plugins, and extensions. It currently features 16 Model Context Protocol (MCP) servers for scientific computing, with plans to expand to additional agent capabilities. It enables AI agents to interact with HPC resources, scientific data formats, and research datasets.
Chat with us on Zulip, or join through the invitation link.
Developed by <img src="https://grc.iit.edu/img/logo.png" alt="GRC Logo" width="18" height="18"> Gnosis Research Center
❌ Without CLIO Kit
Working with scientific data and HPC resources requires manual scripting and tool-specific knowledge:
- ❌ Write custom scripts for every HDF5/Parquet file exploration
- ❌ Manually craft Slurm job submission scripts
- ❌ Switch between multiple tools for data analysis
- ❌ No AI assistance for scientific workflows
- ❌ Repetitive coding for common research tasks
✅ With CLIO Kit
AI agents handle scientific computing tasks through natural language:
- ✅ "Analyze the temperature dataset in this HDF5 file" - HDF5 MCP does it
- ✅ "Submit this simulation to Slurm with 32 cores" - Slurm MCP handles it
- ✅ "Find papers on neural networks from ArXiv" - ArXiv MCP searches
- ✅ "Plot the results from this CSV file" - Plot MCP visualizes
- ✅ "Optimize memory usage for this pandas DataFrame" - Pandas MCP optimizes
- ✅ "Find all documents where pressure exceeds 200 kPa" - Agentic Search retrieves
One unified interface. 16 MCP servers. Hybrid search engine. 150+ specialized tools. Built for research.
CLIO Kit brings AI assistance to your scientific computing workflow, whether you're analyzing terabytes of HDF5 data, managing Slurm jobs across clusters, or exploring research papers. Built by researchers, for researchers, at Illinois Institute of Technology with NSF support.
Part of IoWarp Platform: CLIO Kit is the tooling layer of the IoWarp platform, providing skills, plugins, and extensions for AI agents working in scientific computing environments.
One simple command. Production-ready, fully typed, MIT licensed, and beta-tested in real HPC environments.
🚀 Quick Installation
One Command for Any Server
# List all 16 available MCP servers
uvx clio-kit mcp-servers
# Run any server instantly
uvx clio-kit mcp-server hdf5
uvx clio-kit mcp-server pandas
uvx clio-kit mcp-server slurm
# Agentic search — hybrid retrieval for scientific corpora
uvx clio-kit search serve # Start search API server
uvx clio-kit search query --namespace local_fs --q "pressure > 200 kPa"
# AI prompts also available
uvx clio-kit prompts # List all prompts
uvx clio-kit prompt code-coverage-prompt # Use a prompt
<details> <summary><b>Install in Cursor</b></summary>
Add to your Cursor config at ~/.cursor/mcp.json:
{
"mcpServers": {
"hdf5-mcp": {
"command": "uvx",
"args": ["clio-kit", "mcp-server", "hdf5"]
},
"pandas-mcp": {
"command": "uvx",
"args": ["clio-kit", "mcp-server", "pandas"]
},
"slurm-mcp": {
"command": "uvx",
"args": ["clio-kit", "mcp-server", "slurm"]
}
}
}
See Cursor MCP docs for more info.
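The entries above all follow the same pattern, so the file can be generated for any subset of servers. The following is a minimal Python sketch, assuming each entry key is the server name plus `-mcp`, as in the example above. `build_mcp_config` is a hypothetical helper for illustration and is not part of CLIO Kit.

```python
import json


def build_mcp_config(server_names):
    """Build an mcpServers mapping in the shape shown above.

    Assumption: each entry is keyed "<name>-mcp" and launched through
    `uvx clio-kit mcp-server <name>`, mirroring the hand-written example.
    """
    return {
        "mcpServers": {
            f"{name}-mcp": {
                "command": "uvx",
                "args": ["clio-kit", "mcp-server", name],
            }
            for name in server_names
        }
    }


if __name__ == "__main__":
    # Print a config for three servers; paste the output into ~/.cursor/mcp.json.
    print(json.dumps(build_mcp_config(["hdf5", "pandas", "slurm"]), indent=2))
```

Generating the file avoids copy-paste mistakes when many servers are enabled at once.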
</details>
<details> <summary><b>Install in Claude Code</b></summary>
# Add HDF5 MCP
claude mcp add hdf5-mcp -- uvx clio-kit mcp-server hdf5
# Add Pandas MCP
claude mcp add pandas-mcp -- uvx clio-kit mcp-server pandas
# Add Slurm MCP
claude mcp add slurm-mcp -- uvx clio-kit mcp-server slurm
See Claude Code MCP docs for more info.
</details>
<details> <summary><b>Install in VS Code</b></summary>
Add to your VS Code MCP config:
"mcp": {
"servers": {
"hdf5-mcp": {
"type": "stdio",
"command": "uvx",
"args": ["clio-kit", "mcp-server", "hdf5"]
},
"pandas-mcp": {
"type": "stdio",
"command": "uvx",
"args": ["clio-kit", "mcp-server", "pandas"]
}
}
}
See VS Code MCP docs for more info.
</details>
<details> <summary><b>Install in Claude Desktop</b></summary>
Edit claude_desktop_config.json:
{
"mcpServers": {
"hdf5-mcp": {
"command": "uvx",
"args": ["clio-kit", "mcp-server", "hdf5"]
},
"arxiv-mcp": {
"command": "uvx",
"args": ["clio-kit", "mcp-server", "arxiv"]
}
}
}
See Claude Desktop MCP docs for more info.
</details>
Available Packages
<div align="center">

| 📦 Package | 📌 Ver | 🔧 System | 📋 Description | ⚡ Install Command |
|---|---|---|---|---|
| adios | 2.0.1 | Data I/O | Read data using ADIOS2 engine | uvx clio-kit mcp-server adios |
| arxiv | 2.0.1 | Research | Fetch research papers from ArXiv | uvx clio-kit mcp-server arxiv |
| chronolog | 2.0.1 | Logging | Log and retrieve data from ChronoLog | uvx clio-kit mcp-server chronolog |
| compression | 2.0.1 | Utilities | File compression with gzip | uvx clio-kit mcp-server compression |
| darshan | 2.0.1 | Performance | I/O performance trace analysis | uvx clio-kit mcp-server darshan |
| hdf5 | 2.0.1 | Data I/O | HPC-optimized scientific data with 27 tools, AI insights, caching, streaming | uvx clio-kit mcp-server hdf5 |
| jarvis | 2.0.1 | Workflow | Data pipeline lifecycle management | uvx clio-kit mcp-server jarvis |
| lmod | 2.0.1 | Environment | Environment module management | uvx clio-kit mcp-server lmod |
| ndp | 2.0.1 | Data Protocol | Search and discover datasets across CKAN instances | uvx clio-kit mcp-server ndp |
| node-hardware | 2.0.1 | System | System hardware information | uvx clio-kit mcp-server node-hardware |
| pandas | 2.0.1 | Data Analysis | CSV data loading and filtering | uvx clio-kit mcp-server pandas |
| parallel-sort | 2.0.1 | Computing | Large file sorting | uvx clio-kit mcp-server parallel-sort |
| paraview | 2.0.1 | Visualization | Scientific 3D visualization and analysis | uvx clio-kit mcp-server paraview |
| parquet | 2.0.1 | Data I/O | Read Parquet file columns | uvx clio-kit mcp-server parquet |
| plot | 2.0.1 | Visualization | Generate plots from CSV data | uvx clio-kit mcp-server plot |
| slurm | 2.0.1 | HPC | Job submission and management | uvx clio-kit mcp-server slurm |

</div>
Agentic Search
Hybrid retrieval engine for scientific corpora. It combines lexical (BM25), vector, graph, and scientific search (numeric range, unit matching, formula targeting) over namespaced document collections. The stack uses DuckDB storage and FastAPI, with an async job queue, OpenTelemetry tracing, and Prometheus metrics.
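The description above does not say how the rankings from the different retrievers are merged. One common technique for fusing ranked lists is reciprocal rank fusion (RRF). The sketch below illustrates that technique only and is not taken from CLIO Kit's implementation; the function name, the document IDs, and the constant `k=60` are assumptions for illustration.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    where rank starts at 1. Illustrative only; not CLIO Kit's actual method.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


if __name__ == "__main__":
    lexical = ["doc_a", "doc_b", "doc_c"]  # hypothetical BM25 ranking
    vector = ["doc_b", "doc_c", "doc_a"]   # hypothetical vector ranking
    print(reciprocal_rank_fusion([lexical, vector]))
```

RRF needs only ranks, not comparable scores, which is why it is often used when retrievers produce scores on different scales.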
# Start the search API server
uvx clio-kit search serve
# Index documents from a namespace
uvx clio-kit search index --namespace local_fs
# Query with scientific operators
uvx clio-kit search query --namespace local_fs --q "pressure between 190 and 360 kPa"
# List indexed documents
uvx clio-kit search list --namespace local_fs
API endpoints: /query, /jobs/index, /documents, /health, /metrics. See the full docs for details.
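To make the "numeric range" and "unit matching" operators more concrete, here is a small Python sketch of how query strings like the ones in this README could be turned into a structured filter. It is an illustration under stated assumptions, not the engine's parser: `RangeFilter` and `parse_range_query` are hypothetical names, and only the two phrasings that appear in this README are handled.

```python
import re
from typing import NamedTuple, Optional


class RangeFilter(NamedTuple):
    quantity: str
    low: Optional[float]   # None means no lower bound
    high: Optional[float]  # None means no upper bound
    unit: str


_NUM = r"(\d+(?:\.\d+)?)"
_BETWEEN = re.compile(rf"^\s*(\w+)\s+between\s+{_NUM}\s+and\s+{_NUM}\s+(\S+)\s*$", re.IGNORECASE)
_COMPARE = re.compile(rf"^\s*(\w+)\s*(>=|<=|>|<)\s*{_NUM}\s+(\S+)\s*$")


def parse_range_query(text):
    """Parse '<qty> between A and B <unit>' or '<qty> > A <unit>'.

    Returns None when the text matches neither form. Strict and non-strict
    comparisons are not distinguished in this simplified sketch.
    """
    match = _BETWEEN.match(text)
    if match:
        quantity, low, high, unit = match.groups()
        return RangeFilter(quantity, float(low), float(high), unit)
    match = _COMPARE.match(text)
    if match:
        quantity, op, value, unit = match.groups()
        bound = float(value)
        if op in (">", ">="):
            return RangeFilter(quantity, bound, None, unit)
        return RangeFilter(quantity, None, bound, unit)
    return None


if __name__ == "__main__":
    print(parse_range_query("pressure between 190 and 360 kPa"))
    print(parse_range_query("pressure > 200 kPa"))
```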
📖 Usage Examples
HDF5: Scientific Data Analysis
"What datasets are in climate_simulation.h5? Show me the temperature field structure and read the first 100 timesteps."
Tools used: open_file, analyze_dataset_structure, read_partial_dataset, list_attributes
Slurm: HPC Job Management
"Submit simulation.py to Slurm with 32 cores, 64GB memory, 24-hour runtime. Monitor progress and retrieve output when complete."
Tools used: submit_slurm_job, check_job_status, get_job_output
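For comparison, the request above corresponds roughly to the following hand-written batch script. This Python sketch only renders the script text and does not submit anything. `render_sbatch_script` is a hypothetical helper; mapping "32 cores" to `--cpus-per-task` and launching the script with `python` are assumptions, and how the Slurm MCP builds its jobs internally is not shown in this README.

```python
def render_sbatch_script(script, cores, memory_gb, hours, job_name="simulation"):
    """Render a Slurm batch script as text (nothing is submitted)."""
    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={job_name}",
        "#SBATCH --ntasks=1",
        # Assumption: "32 cores" means one task with 32 CPU cores.
        f"#SBATCH --cpus-per-task={cores}",
        f"#SBATCH --mem={memory_gb}G",
        f"#SBATCH --time={hours:02d}:00:00",
        "",
        # Assumption: the simulation is a Python script run with the default interpreter.
        f"python {script}",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_sbatch_script("simulation.py", cores=32, memory_gb=64, hours=24))
```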
ArXiv: Research Discovery
"Find the latest papers on diffusion models from ArXiv, get details on the top 3, and export citations to BibTeX."
Tools used: search_arxiv, get_paper_details, export_to_bibtex, download_paper_pdf
Pandas: Data Processing
"Load sales_data.csv, clean missing values, compute statistics by region, and save as Parquet with compression."
Tools used: load_data, handle_missing_data, groupby_operations, save_data
Plot: Data Visualization
"Create a line plot showing temperature trends over time from weather.csv with proper axis labels."
Tools used: line_plot, data_info
Agentic Search: Scientific Retrieval
"Find all chunks mentioning pressure above 200 kPa in the local_fs namespace."
CLI: uvx clio-kit search query --namespace local_fs --q "pressure > 200 kPa"
🚨 Troubleshooting
<details> <summary><b>Server Not Found Error</b></summary>
If uvx clio-kit mcp-server <server-name> fails:
# Verify server name is correct
uvx clio-kit mcp-servers
# Common names: hdf5, pandas, slurm, arxiv (not hdf5-mcp, pandas-mcp)
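The naming rule above (short names such as hdf5, not hdf5-mcp) can be checked before launching a server. The following is a minimal Python sketch; `server_command` is a hypothetical helper, and the set of names is copied from the Available Packages table rather than queried from `uvx clio-kit mcp-servers`.

```python
# Server names as listed in the Available Packages table of this README.
KNOWN_SERVERS = {
    "adios", "arxiv", "chronolog", "compression", "darshan", "hdf5",
    "jarvis", "lmod", "ndp", "node-hardware", "pandas", "parallel-sort",
    "paraview", "parquet", "plot", "slurm",
}


def server_command(name):
    """Return the argv list for launching a server, accepting 'hdf5' or 'hdf5-mcp'."""
    short = name[:-len("-mcp")] if name.endswith("-mcp") else name
    if short not in KNOWN_SERVERS:
        raise ValueError(f"unknown server name: {name!r}")
    return ["uvx", "clio-kit", "mcp-server", short]


if __name__ == "__main__":
    # The registry-style name is normalized to the short CLI name.
    print(" ".join(server_command("hdf5-mcp")))
```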
For development or local testing:
cd clio-kit-mcp-servers/hdf5
uv sync --all-extras --dev
uv run hdf5-mcp
Install uv package manager:
# Linux/macOS
curl -LsSf https://astral.sh/uv/install.sh | sh
# Windows
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
# Or via pip
pip install uv
</details>
Team
- Gnosis Research Center (GRC) - Illinois Institute of Technology | Lead
- HDF Group - Data format and library developers | Industry Partner
- University of Utah - Research collaboration | Domain Science Partner
Sponsored By
<img src="https://www.nsf.gov/themes/custom/nsf_theme/components/molecules/logo/logo-desktop.png" alt="NSF Logo" width="24" height="24"> NSF (National Science Foundation) - Supporting scientific computing research and AI integration initiatives
We welcome additional sponsors. Please contact the Principal Investigator.
Ways to Contribute
- Submit Issues: Report bugs or request features via GitHub Issues
- Develop New MCPs: Add servers for your research tools (CONTRIBUTING.md)
- Improve Documentation: Help make guides clearer
- Share Use Cases: Tell us how you're using CLIO Kit in your research
Full Guide: CONTRIBUTING.md
Community & Support
- Chat: Zulip Community
- Join: Invitation Link
- Issues: GitHub Issues
- Discussions: GitHub Discussions
- Website: https://docs.iowarp.ai/
- Project: IOWarp Project