网络挖掘

Pattern

by bytesagain

Web mining module for Python, with tools for scraping, natural language processing, machine learning text-mining, python, machine-learning.

4.5k其他未扫描2026年3月23日

安装

claude skill add --url github.com/openclaw/skills/tree/main/skills/bytesagain/text-mining

文档

Text Mining

A finance-oriented toolkit for recording, categorizing, and analyzing financial data from the command line. Each command logs timestamped entries to dedicated log files, with built-in statistics, multi-format export, search, and health-check capabilities.

Why Text Mining?

  • Works entirely offline — financial data stays on your machine
  • Purpose-built commands for finance workflows (record, categorize, balance, forecast, budget-check, tax-note)
  • Each command type maintains its own log file for clean data separation
  • Built-in multi-format export (JSON, CSV, plain text)
  • Full activity history with timestamped audit trail
  • Summary statistics with entry counts and disk usage

Commands

Finance Operations

CommandDescription
text-mining record <input>Record a financial entry (no args: show recent)
text-mining categorize <input>Categorize a transaction (no args: show recent)
text-mining balance <input>Log a balance entry (no args: show recent)
text-mining trend <input>Record a trend observation (no args: show recent)
text-mining forecast <input>Log a forecast entry (no args: show recent)
text-mining export-report <input>Record an export report entry (no args: show recent)
text-mining budget-check <input>Log a budget check (no args: show recent)
text-mining summary <input>Record a summary entry (no args: show recent)
text-mining alert <input>Log a financial alert (no args: show recent)
text-mining history <input>Record a history entry (no args: show recent)
text-mining compare <input>Log a comparison entry (no args: show recent)
text-mining tax-note <input>Record a tax note (no args: show recent)

Utility Commands

CommandDescription
text-mining statsShow summary statistics (entry counts per type, total, disk usage)
text-mining export <fmt>Export all data in json, csv, or txt format
text-mining search <term>Search across all log files (case-insensitive)
text-mining recentShow the 20 most recent activity log entries
text-mining statusHealth check (version, entries, disk, last activity)
text-mining helpDisplay all available commands
text-mining versionPrint version string

Each finance command works in two modes:

  • With arguments: Saves a timestamped entry to <command>.log and logs to history.log
  • Without arguments: Displays the 20 most recent entries from that command's log

Data Storage

All data is stored locally in ~/.local/share/text-mining/. The directory contains:

  • record.log, categorize.log, balance.log, trend.log, forecast.log, etc. — One log file per command type, storing YYYY-MM-DD HH:MM|input entries
  • history.log — Unified activity log with timestamped records of every command executed
  • export.json / export.csv / export.txt — Generated export files

Requirements

  • Bash 4.0+ with set -euo pipefail strict mode
  • Standard Unix utilities: grep, cat, tail, wc, du, date, sed
  • No external dependencies or network access required

When to Use

  1. Tracking daily expenses — Use text-mining record "lunch 45 CNY" to log transactions with automatic timestamps
  2. Budget monitoring — Run text-mining budget-check "March: 3200/5000 CNY spent" to track spending against limits
  3. Financial trend analysis — Record observations with text-mining trend "Q1 savings rate up 12%" and review with text-mining search "savings"
  4. Tax preparation — Keep tax-related notes with text-mining tax-note "deductible: home office 1200 CNY" for year-end review
  5. Exporting financial summaries — Run text-mining export csv to get all recorded entries across all categories in spreadsheet-ready CSV format

Examples

bash
# Record financial transactions
text-mining record "salary received 15000 CNY"
text-mining record "rent payment 3500 CNY"
text-mining categorize "groceries: 280 CNY weekly average"

# Balance and budget tracking
text-mining balance "checking: 12,500 CNY | savings: 45,000 CNY"
text-mining budget-check "food budget: 1800/2000 CNY (90%)"
text-mining forecast "projected savings by June: 55,000 CNY"

# Tax and comparison notes
text-mining tax-note "freelance income Q1: 8,000 CNY"
text-mining compare "Feb vs Mar spending: -15% reduction"

# Search, review, and export
text-mining search "rent"
text-mining recent
text-mining stats
text-mining export json
text-mining export csv

Configuration

The data directory defaults to ~/.local/share/text-mining/. All log files are plain text with pipe-delimited fields (timestamp|value), making them easy to parse with standard Unix tools.


Powered by BytesAgain | bytesagain.com | hello@bytesagain.com

相关 Skills

Claude API

by anthropic

热门

Build, debug, and optimize Claude API / Anthropic SDK apps. Apps built with this skill should include prompt caching. Also handles migrating existing Claude API code between Claude model versions (4.5 → 4.6, 4.6 → 4.7, retired-model replacements). TRIGGER when: code imports `anthropic`/`@anthropic-ai/sdk`; user asks for the Claude API, Anthropic SDK, or Managed Agents; user adds/modifies/tunes a Claude feature (caching, thinking, compaction, tool use, batch, files, citations, memory) or model (Opus/Sonnet/Haiku) in a file; questions about prompt caching / cache hit rate in an Anthropic SDK project. SKIP: file imports `openai`/other-provider SDK, filename like `*-openai.py`/`*-generic.py`, provider-neutral code, general programming/ML.

其他
安全139.0k

并行代理

by axelhu

热门

Use when facing 2 or more independent tasks that can be worked on without shared state - dispatches parallel subagents using sessions_spawn for concurrent investigation and execution, adapted for OpenClaw

其他
未扫描4.5k

高光制作器

by bwbernardweston18

热门

>

其他
未扫描4.5k

相关资讯

本研究提出信息论探测框架,分析统一多模态模型(UMMs)的编码与生成过程。发现伪统一现象源于模态不对称编码和模式分裂响应,仅通过上下文预测实现信息流一致性的模型能获得更真实的统一。该工作首次从模型内部探测统一性,表明真实多模态协同需要信息流一致性而非仅共享参数。

深度·4月15日·26 分钟

本文针对工具集成推理场景中现有效率指标无法反映真实推理延迟的问题,提出了硬件感知的PTE效率度量方法。实验验证表明PTE比标准token计数更符合实际延迟,并识别出四种低效模式。该研究为优化工具集成推理系统提供了新的评估框架和效率洞察。

深度·4月7日·19 分钟

本文通过‘画一只猫’的比喻,阐述了提示工程的核心在于减少输出方差。介绍了系统指令的权重层级,并详细解析了少样本提示、思维链提示、角色提示和工具增强提示四大模式。最后强调评估和验证的重要性。

指南The New Stack·3月23日·4 分钟

评论