网络挖掘
Pattern
by bytesagain
Web mining module for Python, with tools for scraping, natural language processing, machine learning text-mining, python, machine-learning.
安装
claude skill add --url github.com/openclaw/skills/tree/main/skills/bytesagain/text-mining文档
Text Mining
A finance-oriented toolkit for recording, categorizing, and analyzing financial data from the command line. Each command logs timestamped entries to dedicated log files, with built-in statistics, multi-format export, search, and health-check capabilities.
Why Text Mining?
- Works entirely offline — financial data stays on your machine
- Purpose-built commands for finance workflows (record, categorize, balance, forecast, budget-check, tax-note)
- Each command type maintains its own log file for clean data separation
- Built-in multi-format export (JSON, CSV, plain text)
- Full activity history with timestamped audit trail
- Summary statistics with entry counts and disk usage
Commands
Finance Operations
| Command | Description |
|---|---|
text-mining record <input> | Record a financial entry (no args: show recent) |
text-mining categorize <input> | Categorize a transaction (no args: show recent) |
text-mining balance <input> | Log a balance entry (no args: show recent) |
text-mining trend <input> | Record a trend observation (no args: show recent) |
text-mining forecast <input> | Log a forecast entry (no args: show recent) |
text-mining export-report <input> | Record an export report entry (no args: show recent) |
text-mining budget-check <input> | Log a budget check (no args: show recent) |
text-mining summary <input> | Record a summary entry (no args: show recent) |
text-mining alert <input> | Log a financial alert (no args: show recent) |
text-mining history <input> | Record a history entry (no args: show recent) |
text-mining compare <input> | Log a comparison entry (no args: show recent) |
text-mining tax-note <input> | Record a tax note (no args: show recent) |
Utility Commands
| Command | Description |
|---|---|
text-mining stats | Show summary statistics (entry counts per type, total, disk usage) |
text-mining export <fmt> | Export all data in json, csv, or txt format |
text-mining search <term> | Search across all log files (case-insensitive) |
text-mining recent | Show the 20 most recent activity log entries |
text-mining status | Health check (version, entries, disk, last activity) |
text-mining help | Display all available commands |
text-mining version | Print version string |
Each finance command works in two modes:
- With arguments: Saves a timestamped entry to
<command>.logand logs tohistory.log - Without arguments: Displays the 20 most recent entries from that command's log
Data Storage
All data is stored locally in ~/.local/share/text-mining/. The directory contains:
record.log,categorize.log,balance.log,trend.log,forecast.log, etc. — One log file per command type, storingYYYY-MM-DD HH:MM|inputentrieshistory.log— Unified activity log with timestamped records of every command executedexport.json/export.csv/export.txt— Generated export files
Requirements
- Bash 4.0+ with
set -euo pipefailstrict mode - Standard Unix utilities:
grep,cat,tail,wc,du,date,sed - No external dependencies or network access required
When to Use
- Tracking daily expenses — Use
text-mining record "lunch 45 CNY"to log transactions with automatic timestamps - Budget monitoring — Run
text-mining budget-check "March: 3200/5000 CNY spent"to track spending against limits - Financial trend analysis — Record observations with
text-mining trend "Q1 savings rate up 12%"and review withtext-mining search "savings" - Tax preparation — Keep tax-related notes with
text-mining tax-note "deductible: home office 1200 CNY"for year-end review - Exporting financial summaries — Run
text-mining export csvto get all recorded entries across all categories in spreadsheet-ready CSV format
Examples
# Record financial transactions
text-mining record "salary received 15000 CNY"
text-mining record "rent payment 3500 CNY"
text-mining categorize "groceries: 280 CNY weekly average"
# Balance and budget tracking
text-mining balance "checking: 12,500 CNY | savings: 45,000 CNY"
text-mining budget-check "food budget: 1800/2000 CNY (90%)"
text-mining forecast "projected savings by June: 55,000 CNY"
# Tax and comparison notes
text-mining tax-note "freelance income Q1: 8,000 CNY"
text-mining compare "Feb vs Mar spending: -15% reduction"
# Search, review, and export
text-mining search "rent"
text-mining recent
text-mining stats
text-mining export json
text-mining export csv
Configuration
The data directory defaults to ~/.local/share/text-mining/. All log files are plain text with pipe-delimited fields (timestamp|value), making them easy to parse with standard Unix tools.
Powered by BytesAgain | bytesagain.com | hello@bytesagain.com
相关 Skills
Claude API
by anthropic
Build, debug, and optimize Claude API / Anthropic SDK apps. Apps built with this skill should include prompt caching. Also handles migrating existing Claude API code between Claude model versions (4.5 → 4.6, 4.6 → 4.7, retired-model replacements). TRIGGER when: code imports `anthropic`/`@anthropic-ai/sdk`; user asks for the Claude API, Anthropic SDK, or Managed Agents; user adds/modifies/tunes a Claude feature (caching, thinking, compaction, tool use, batch, files, citations, memory) or model (Opus/Sonnet/Haiku) in a file; questions about prompt caching / cache hit rate in an Anthropic SDK project. SKIP: file imports `openai`/other-provider SDK, filename like `*-openai.py`/`*-generic.py`, provider-neutral code, general programming/ML.
并行代理
by axelhu
Use when facing 2 or more independent tasks that can be worked on without shared state - dispatches parallel subagents using sessions_spawn for concurrent investigation and execution, adapted for OpenClaw
高光制作器
by bwbernardweston18
>
相关资讯
本研究提出信息论探测框架,分析统一多模态模型(UMMs)的编码与生成过程。发现伪统一现象源于模态不对称编码和模式分裂响应,仅通过上下文预测实现信息流一致性的模型能获得更真实的统一。该工作首次从模型内部探测统一性,表明真实多模态协同需要信息流一致性而非仅共享参数。
本文针对工具集成推理场景中现有效率指标无法反映真实推理延迟的问题,提出了硬件感知的PTE效率度量方法。实验验证表明PTE比标准token计数更符合实际延迟,并识别出四种低效模式。该研究为优化工具集成推理系统提供了新的评估框架和效率洞察。
本文通过‘画一只猫’的比喻,阐述了提示工程的核心在于减少输出方差。介绍了系统指令的权重层级,并详细解析了少样本提示、思维链提示、角色提示和工具增强提示四大模式。最后强调评估和验证的重要性。