io.github.MusaddiqueHussainLabs/mhlabs_mcp_tools

平台与服务

by musaddiquehussainlabs

提供文本预处理、NLP components 与文档分析能力的 MCP server,适合 AI 流程集成。

什么是 io.github.MusaddiqueHussainLabs/mhlabs_mcp_tools

提供文本预处理、NLP components 与文档分析能力的 MCP server,适合 AI 流程集成。

README

mhlabs-mcp-tools

mcp-name: io.github.MusaddiqueHussainLabs/mhlabs_mcp_tools

🧠 mhlabs-mcp-tools

mhlabs-mcp-tools is a Modular MCP Tools Server built using FastMCP.
It provides an extendable AI tool ecosystem organized into functional categories (Text Preprocessing, NLP Components, Document Analysis, etc.) that can be dynamically loaded and served through MCP (Model Context Protocol) via STDIO transport.

This project is part of the MHLabs AI Agentic Ecosystem, designed to work with mhlabs-mcp-server, mhlabs-mcp-agents, and downstream A2A agent frameworks.


Features

  • FastMCP Server: Pure FastMCP implementation supporting multiple transport protocols
  • Factory Pattern: Reusable MCP tools factory for easy service management
  • Domain-Based Organization: Services organized by business domains (HR, Tech Support, etc.)
  • Authentication: Optional Azure AD authentication support
  • Multiple Transports: STDIO, HTTP (Streamable), and SSE transport support
  • VS Code Integration: Debug configurations and development settings
  • Comprehensive Testing: Unit tests with pytest
  • Flexible Configuration: Environment-based configuration management

Architecture

code
mhlabs_mcp_tools/
├── .gitignore
├── .vscode/
│   └── settings.json
├── CHANGELOG.md
├── LICENSE
├── README.md
├── docs/
│   └── index.md
├── examples/
│   ├── example_client.py
│   └── example_client_http.py
├── mkdocs.yml
├── pyproject.toml
├── requirements.txt
├── server.json
└── src/
    ├── __init__.py
    ├── main.py
    └── mhlabs_mcp_tools/
        ├── __init__.py
        ├── core/
        │   ├── __init__.py
        │   ├── config.py
        │   ├── constants.py
        │   ├── factory.py
        │   └── prompts.py
        ├── data/
        │   ├── __init__.py
        │   ├── external/
        │   │   └── __init__.py
        │   ├── interim/
        │   │   └── __init__.py
        │   ├── processed/
        │   │   └── __init__.py
        │   └── raw/
        │       ├── __init__.py
        │       ├── contractions_dict.json
        │       ├── custom_substitutions.csv
        │       ├── leftovers_dict.json
        │       └── slang_dict.json
        ├── handlers/
        │   ├── __init__.py
        │   ├── custom_exceptions.py
        │   └── output_generator.py
        ├── mcp_server.py
        ├── models/
        │   └── __init__.py
        ├── nlp_components/
        │   ├── __init__.py
        │   └── nlp_model.py
        ├── services/
        │   ├── __init__.py
        │   ├── langchain_framework.py
        │   └── spacy_extractor.py
        └── text_preprocessing/
            ├── __init__.py
            ├── contractions.py
            ├── emo_unicode.py
            ├── slang_text.py
            └── text_preprocessing.py

Available Services

Currently the package is organized into three primary modules:

1. NLP Components

Component TypeDescription
tokenizeText tokenization
posPart-of-Speech tagging
lemmaWord lemmatization
morphologyStudy of word forms
depDependency parsing
nerNamed Entity Recognition
normText normalization

2. Text Preprocessing

This module equips users with an extensive set of text preprocessing tools:

FunctionDescription
to_lowerConvert text to lowercase
to_upperConvert text to uppercase
remove_numberRemove numerical characters
remove_itemized_bullet_and_numberingEliminate itemized/bullet-point numbering
remove_urlRemove URLs from text
remove_punctuationRemove punctuation marks
remove_special_characterRemove special characters
keep_alpha_numericKeep only alphanumeric characters
remove_whitespaceRemove excess whitespace
normalize_unicodeNormalize Unicode characters
remove_stopwordEliminate common stopwords
remove_freqwordsRemove frequently occurring words
remove_rarewordsRemove rare words
remove_emailRemove email addresses
remove_phone_numberRemove phone numbers
remove_ssnRemove Social Security Numbers (SSN)
remove_credit_card_numberRemove credit card numbers
remove_emojiRemove emojis
remove_emoticonsRemove emoticons
convert_emoticons_to_wordsConvert emoticons to words
convert_emojis_to_wordsConvert emojis to words
remove_htmlRemove HTML tags
chat_words_conversionConvert chat language to standard English
expand_contractionExpand contractions (e.g., "can't" to "cannot")
tokenize_wordTokenize words
tokenize_sentenceTokenize sentences
stem_wordStem words
lemmatize_wordLemmatize words
preprocess_textCombine multiple preprocessing steps into one function

Quick Start

Development Setup

  1. Clone and Navigate:

    bash
    cd src/mhlabs_mcp_tools
    
  2. Install Dependencies:

    bash
    pip install -r requirements.txt
    
  3. Configure Environment:

    bash
    cp .env.example .env
    # Edit .env with your configuration
    
  4. Start the Server:

    bash
    # Default STDIO transport (for local MCP clients)
    python mcp_server.py
    
    # HTTP transport (for web-based clients)
    python mcp_server.py --transport http --port 9000
    or
    after installed mhlabs-mcp-tools
    python -m mhlabs_mcp_tools.mcp_server --transport http --port 9000
    
    # Using FastMCP CLI (recommended)
    fastmcp run mcp_server.py -t streamable-http --port 9000 -l DEBUG
    
    # Debug mode with authentication disabled
    python mcp_server.py --transport http --debug --no-auth
    

Transport Options

1. STDIO Transport (default)

  • 🔧 Perfect for: Local tools, command-line integrations, Claude Desktop
  • 🚀 Usage: python mcp_server.py or python mcp_server.py --transport stdio

2. HTTP (Streamable) Transport

  • 🌐 Perfect for: Web-based deployments, microservices, remote access
  • 🚀 Usage: python mcp_server.py --transport http --port 9000
  • 🌐 URL: http://127.0.0.1:9000/mcp/

3. SSE Transport (deprecated)

  • ⚠️ Legacy support only - use HTTP transport for new projects
  • 🚀 Usage: python mcp_server.py --transport sse --port 9000

FastMCP CLI Usage

bash
# Standard HTTP server
fastmcp run mcp_server.py -t streamable-http --port 9000 -l DEBUG

# With custom host
fastmcp run mcp_server.py -t streamable-http --host 0.0.0.0 --port 9000 -l DEBUG

# STDIO transport (for local clients)
fastmcp run mcp_server.py -t stdio

# Development mode with MCP Inspector
fastmcp dev mcp_server.py -t streamable-http --port 9000

VS Code Development

  1. Open in VS Code:

    bash
    code .
    
  2. Use Debug Configurations:

    • Debug MCP Server (STDIO): Run with STDIO transport
    • Debug MCP Server (HTTP): Run with HTTP transport
    • Debug Tests: Run the test suite

Configuration

Environment Variables

Create a .env file based on .env.example:

env
# Server Settings
MCP_HOST=0.0.0.0
MCP_PORT=9000
MCP_DEBUG=false
MCP_SERVER_NAME=MHLABS MCP Server

# Authentication Settings
MCP_ENABLE_AUTH=true
AZURE_TENANT_ID=your-tenant-id-here
AZURE_CLIENT_ID=your-client-id-here
AZURE_JWKS_URI=https://login.microsoftonline.com/your-tenant-id/discovery/v2.0/keys
AZURE_ISSUER=https://sts.windows.net/your-tenant-id/
AZURE_AUDIENCE=api://your-client-id

Authentication

When MCP_ENABLE_AUTH=true, the server expects Azure AD Bearer tokens. Configure your Azure App Registration with the appropriate settings.

For development, set MCP_ENABLE_AUTH=false to disable authentication.

Adding New Services

  1. Create Service Class:

    python
    from core.factory import MCPToolBase, Domain
    
    class MyService(MCPToolBase):
        def __init__(self):
            super().__init__(Domain.MY_DOMAIN)
    
        def register_tools(self, mcp):
            @mcp.tool(tags={self.domain.value})
            async def my_tool(param: str) -> str:
                # Tool implementation
                pass
    
        @property
        def tool_count(self) -> int:
            return 1  # Number of tools
    
  2. Register in Server:

    python
    # In mcp_server.py (gets registered automatically from services/ directory)
    factory.register_service(MyService())
    
  3. Add Domain (if new):

    python
    # In core/factory.py
    class Domain(Enum):
        # ... existing domains
        MY_DOMAIN = "my_domain"
    

MCP Client Usage

Python Client

python
import asyncio
from fastmcp import Client

client = Client("http://localhost:9000/mcp")

async def main():
    async with client:
        tools = await client.list_tools()
        # tools -> list[mcp.types.Tool]
        # print(tools)
        for tool in tools:
            print(f"Tool: {tool.name}")
        
        result = await client.call_tool("textprep.expand_contraction", {"input_text": "The must've SSN is 859-98-0987. The employee's phone number is 555-555-5555."})
        print("Result:", result)

asyncio.run(main())

Command Line Testing

bash
# Test the server is running
curl http://localhost:9000/mcp/

# With FastMCP CLI for testing
fastmcp dev mcp_server.py -t streamable-http --port 9000

Quick Test

Test STDIO Transport:

bash
# Start server in STDIO mode
python mcp_server.py --debug --no-auth

# Test with client_example.py
python client_example.py

Test HTTP Transport:

bash
# Start HTTP server
python mcp_server.py --transport http --port 9000 --debug --no-auth

# Test with FastMCP client
python -c "
from fastmcp import Client
import asyncio
async def test():
    async with Client('http://localhost:9000/mcp') as client:
        result = await client.call_tool("textprep.expand_contraction", {"input_text": "The must've SSN is 859-98-0987. The employee's phone number is 555-555-5555."})
        print(result)
asyncio.run(test())
"

Test with FastMCP CLI:

bash
# Start with FastMCP CLI
fastmcp run mcp_server.py -t streamable-http --port 9000 -l DEBUG

# Server will be available at: http://127.0.0.1:9000/mcp/

Troubleshooting

Common Issues

  1. Import Errors: Make sure you're in the correct directory and dependencies are installed
  2. Authentication Errors: Check your Azure AD configuration and tokens
  3. Port Conflicts: Change the port in configuration if 9000 is already in use
  4. Missing fastmcp: Install with pip install fastmcp

Debug Mode

Enable debug mode for detailed logging:

bash
python mcp_server.py --debug --no-auth

Or set in environment:

env
MCP_DEBUG=true

Server Arguments

bash
usage: mcp_server.py [-h] [--transport {stdio,http,streamable-http,sse}]
                     [--host HOST] [--port PORT] [--debug] [--no-auth]

MHLABS MCP Server

options:
  -h, --help            show this help message and exit
  --transport, -t       Transport protocol (default: stdio)
  --host HOST           Host to bind to for HTTP transport (default: 127.0.0.1)
  --port, -p PORT       Port to bind to for HTTP transport (default: 9000)
  --debug               Enable debug mode
  --no-auth             Disable authentication

📄 License

MIT License © 2025 MusaddiqueHussain Labs


🤝 Contributing

  1. Follow the existing code structure and patterns
  2. Add tests for new functionality
  3. Update documentation for new features
  4. Use the provided VS Code configurations for development

🧠 Learn More


💡 Tip

If you want to embed mhlabs-mcp-tools into a larger MCP-based orchestrator:

python
from fastmcp import StdioServerParameters
server_params = StdioServerParameters(
    command="python",
    args=["-m", "mhlabs_mcp_tools.server"],
    //env={"MHLABS_MCP_CATEGORY": "textprep,nlp"}
)

Developed with ❤️ by MusaddiqueHussain Labs

常见问题

io.github.MusaddiqueHussainLabs/mhlabs_mcp_tools 是什么?

提供文本预处理、NLP components 与文档分析能力的 MCP server,适合 AI 流程集成。

相关 Skills

MCP构建

by anthropics

Universal
热门

聚焦高质量 MCP Server 开发,覆盖协议研究、工具设计、错误处理与传输选型,适合用 FastMCP 或 MCP SDK 对接外部 API、封装服务能力。

想让 LLM 稳定调用外部 API,就用 MCP构建:从 Python 到 Node 都有成熟指引,帮你更快做出高质量 MCP 服务器。

平台与服务
未扫描114.1k

Slack动图

by anthropics

Universal
热门

面向Slack的动图制作Skill,内置emoji/消息GIF的尺寸、帧率和色彩约束、校验与优化流程,适合把创意或上传图片快速做成可直接发送的Slack动画。

帮你快速做出适配 Slack 的动图,内置约束规则和校验工具,少踩上传与播放坑,做表情包和演示都更省心。

平台与服务
未扫描114.1k

MCP服务构建器

by alirezarezvani

Universal
热门

从 OpenAPI 一键生成 Python/TypeScript MCP server 脚手架,并校验 tool schema、命名规范与版本兼容性,适合把现有 REST API 快速发布成可生产演进的 MCP 服务。

帮你快速搭建 MCP 服务与后端 API,脚手架完善、扩展顺手,尤其适合想高效验证服务能力的开发者。

平台与服务
未扫描10.2k

相关 MCP Server

Slack 消息

编辑精选

by Anthropic

热门

Slack 是让 AI 助手直接读写你的 Slack 频道和消息的 MCP 服务器。

这个服务器解决了团队协作中需要 AI 实时获取 Slack 信息的痛点,特别适合开发团队让 Claude 帮忙汇总频道讨论或发送通知。不过,它目前只是参考实现,文档有限,不建议在生产环境直接使用——更适合开发者学习 MCP 如何集成第三方服务。

平台与服务
83.4k

by netdata

热门

io.github.netdata/mcp-server 是让 AI 助手实时监控服务器指标和日志的 MCP 服务器。

这个工具解决了运维人员需要手动检查系统状态的痛点,最适合 DevOps 团队让 Claude 自动分析性能数据。不过,它依赖 NetData 的现有部署,如果你没用过这个监控平台,得先花时间配置。

平台与服务
78.4k

by d4vinci

热门

Scrapling MCP Server 是专为现代网页设计的智能爬虫工具,支持绕过 Cloudflare 等反爬机制。

这个工具解决了爬取动态网页和反爬网站时的头疼问题,特别适合需要批量采集电商价格或新闻数据的开发者。不过,它依赖外部浏览器引擎,资源消耗较大,不适合轻量级任务。

平台与服务
35.4k

评论