io.github.roshunsunder/filesift
编码与调试by roshunsunder
Local file search for AI agents — semantic + keyword indexing over your codebase.
什么是 io.github.roshunsunder/filesift?
Local file search for AI agents — semantic + keyword indexing over your codebase.
README
FileSift lets your AI coding agent search across a codebase based on what code does, rather than what it looks like. Instead of sifting through entire files after a grep, your agent can jump straight to the most relevant code using natural language queries like "authentication middleware" or "database connection pooling". Everything runs locally on your machine — your code never leaves your environment.
Key benefits:
- Smarter search — hybrid keyword + semantic search finds code by intent, not just string matching
- Less context wasted — agents get pointed to the right files immediately, saving token budget on exploration
Installation
pip install filesift
Usage
There are three ways to use FileSift, depending on your workflow:
1. CLI
The most straightforward approach. Good for testing queries, managing indexes, and configuring settings.
# Index a project
filesift index /path/to/your/project
# Search for files by what they do
filesift find "authentication and session handling"
# Search in a specific directory
filesift find "retry logic for API calls" --path /path/to/project
2. MCP Server
<!-- mcp-name: io.github.roshunsunder/filesift -->Installing FileSift also provides a filesift-mcp command — a lightweight MCP server that exposes indexing and search as tools over STDIO. This works with most popular coding agents including Claude Code, Cursor, Copilot, and more.
Add it to your agent's MCP configuration:
{
"mcpServers": {
"filesift": {
"command": "filesift-mcp"
}
}
}
The MCP server exposes four tools:
filesift_search— search an indexed codebase by natural language queryfilesift_find_related— find files related to a given file via imports and semantic similarityfilesift_index— index a directory to enable searchingfilesift_status— check indexing status of a directory
3. Skills
FileSift ships with a search-codebase skill that can be installed directly into your coding agent's skill directory. This lets the agent interact with the FileSift CLI through bash, without requiring MCP support.
# Install for Claude Code (default)
filesift skill install
# Install for other agents
filesift skill install --agent cursor
filesift skill install --agent copilot
filesift skill install --agent codex
Supported agents: claude, codex, cursor, copilot, gemini, roo, windsurf.
How It Works
FileSift uses a daemonized embedding model to keep searches fast. At its core, it generates embeddings from code descriptions and performs searches against small vector stores called indexes.
-
Indexing —
filesift indexfirst builds a fast keyword/structural index (completes in seconds), then triggers background semantic indexing that generates embeddings for each file. -
Daemon — A background daemon loads indexes into memory and automatically shuts down after a configurable period of inactivity. After the first cold-start search, subsequent searches are near-instant.
-
Search — Queries are matched using both keyword (BM25) and semantic (FAISS) search, then combined via Reciprocal Rank Fusion for the best of both approaches.
Indexes are stored in a .filesift directory within each indexed project.
Configuration
FileSift uses a TOML configuration file, manageable via the CLI:
# View all settings
filesift config list --all
# Set a value
filesift config set search.MAX_RESULTS 20
filesift config set daemon.INACTIVITY_TIMEOUT 600
# Manage ignore patterns
filesift config add-ignore "node_modules" ".venv"
filesift config list-ignore
Configuration sections: search, indexing, daemon, models, paths.
Contributing
Contributions are welcome! To get started:
git clone https://github.com/roshunsunder/filesift.git
cd filesift
pip install -e .
- Fork the repository
- Create a feature branch (
git checkout -b feature/my-feature) - Commit your changes and open a pull request
License
Apache 2.0 — see LICENSE for details.
常见问题
io.github.roshunsunder/filesift 是什么?
Local file search for AI agents — semantic + keyword indexing over your codebase.
相关 Skills
网页构建器
by anthropics
面向复杂 claude.ai HTML artifact 开发,快速初始化 React + Tailwind CSS + shadcn/ui 项目并打包为单文件 HTML,适合需要状态管理、路由或多组件交互的页面。
✎ 在 claude.ai 里做复杂网页 Artifact 很省心,多组件、状态和路由都能顺手搭起来,React、Tailwind 与 shadcn/ui 组合效率高、成品也更精致。
前端设计
by anthropics
面向组件、页面、海报和 Web 应用开发,按鲜明视觉方向生成可直接落地的前端代码与高质感 UI,适合做 landing page、Dashboard 或美化现有界面,避开千篇一律的 AI 审美。
✎ 想把页面做得既能上线又有设计感,就用前端设计:组件到整站都能产出,难得的是能避开千篇一律的 AI 味。
网页应用测试
by anthropics
用 Playwright 为本地 Web 应用编写自动化测试,支持启动开发服务器、校验前端交互、排查 UI 异常、抓取截图与浏览器日志,适合调试动态页面和回归验证。
✎ 借助 Playwright 一站式验证本地 Web 应用前端功能,调 UI 时还能同步查看日志和截图,定位问题更快。
相关 MCP Server
GitHub
编辑精选by GitHub
GitHub 是 MCP 官方参考服务器,让 Claude 直接读写你的代码仓库和 Issues。
✎ 这个参考服务器解决了开发者想让 AI 安全访问 GitHub 数据的问题,适合需要自动化代码审查或 Issue 管理的团队。但注意它只是参考实现,生产环境得自己加固安全。
Context7 文档查询
编辑精选by Context7
Context7 是实时拉取最新文档和代码示例的智能助手,让你告别过时资料。
✎ 它能解决开发者查找文档时信息滞后的问题,特别适合快速上手新库或跟进更新。不过,依赖外部源可能导致偶尔的数据延迟,建议结合官方文档使用。
by tldraw
tldraw 是让 AI 助手直接在无限画布上绘图和协作的 MCP 服务器。
✎ 这解决了 AI 只能输出文本、无法视觉化协作的痛点——想象让 Claude 帮你画流程图或白板讨论。最适合需要快速原型设计或头脑风暴的开发者。不过,目前它只是个基础连接器,你得自己搭建画布应用才能发挥全部潜力。