zhihu-search-api-skill

by browseract-ai

This skill helps users automatically extract structured article details and full content from Zhihu via the BrowserAct API. Agent should proactively apply this skill when users express needs like: searching for Zhihu articles on a specific topic, tracking industry trends on Zhihu, monitoring public relations or sentiment on Zhihu, collecting competitor updates, getting the latest reports on specific keywords, monitoring brand exposure in Zhihu media, researching market hot topics, summarizing daily Zhihu industry news, retrieving hot events from the past week, extracting structured data for market research, finding full Zhihu articles for AI agents, extracting full article body from Zhihu links.

3.7k搜索与获取未扫描2026年3月23日

安装

claude skill add --url github.com/openclaw/skills/tree/main/skills/browseract-ai/zhihu-search-api-skill

文档

Zhihu Search API Automated Extraction Skill

📖 Brief

This skill uses BrowserAct's Zhihu Search API template to provide a one-stop article extraction service. It extracts structured article details and full content from Zhihu article search results based on keywords and publication date filters.

✨ Features

  1. No hallucinations, ensuring stable and precise data extraction: Pre-set workflows avoid AI generative hallucinations.
  2. No CAPTCHA issues: No need to handle reCAPTCHA or other verification challenges.
  3. No IP access restrictions and geo-fencing: No need to deal with regional IP limits.
  4. Faster execution speed: Compared to pure AI-driven browser automation solutions, task execution is much faster.
  5. Extremely high cost-effectiveness: Significantly reduces data acquisition costs compared to AI solutions that consume a large number of tokens.

🔑 API Key Guide

Before running, you need to check the BROWSERACT_API_KEY environment variable. If it is not set, do not take other actions; wait for the user to provide it. The Agent must inform the user at this time:

"Since you have not configured the BrowserAct API Key yet, please go to the BrowserAct Console first to get your Key."

🛠️ Input Parameters

The Agent should flexibly configure the following parameters according to user needs when calling the script:

  1. keyword (Search Keywords)

    • Type: string
    • Description: Search keywords used to find Zhihu articles. Can be company name, industry term, etc.
    • Example: AI agent, openclaw
  2. Publish_date (Publication Date Range)

    • Type: string
    • Description: Filter articles by publication date.
    • Options:
      • 7d: Past 7 days
      • 30d: Past 30 days
      • 90d: Past 90 days
      • 1y: Past year
      • all: Any time
    • Default: 7d
  3. Date_limit (Extraction Limit)

    • Type: number
    • Description: Maximum number of articles to extract.
    • Default: 10

🚀 Recommended Usage

The Agent should execute the following independent script to achieve "one command gets results":

bash
# Example call
python -u ./scripts/zhihu_search_api.py "keyword" "Publish_date" limit

⏳ Execution Status Monitoring

Because this task involves automated browser operations, it may take a long time (several minutes). The script will continuously output status logs with timestamps (e.g., [14:30:05] Task Status: running) while running. Agent Must Know:

  • While waiting for the script to return a result, keep monitoring the terminal output.
  • As long as the terminal is still outputting new status logs, it means the task is running normally, do not mistake it for a deadlock or unresponsiveness.
  • If the status remains unchanged for a long time or the script stops outputting without returning a result, then consider triggering the retry mechanism.

📊 Data Output

Upon successful execution, the script will directly parse and print the result from the API response. The result includes:

  • title: Full article title
  • body_content: Full body content of the article
  • image_url: Main image URL or article cover image URL
  • author: Article author or publishing account name
  • publication_date: Article publication date
  • url_link: Original article URL

⚠️ Error Handling & Retry

During the execution of the script, if an error is encountered (such as network fluctuations or task failure), the Agent should follow this logic:

  1. Check the output content:

    • If the output contains "Invalid authorization", it means the API Key is invalid or expired. Do not retry at this time, and guide the user to check and provide the correct API Key.
    • If the output does not contain "Invalid authorization" but the task execution fails (for example, the output starts with Error: or the return result is empty), the Agent should automatically try to execute the script once more.
  2. Retry limits:

    • Automatic retry is limited to one time. If the second attempt still fails, stop retrying and report the specific error information to the user.

🌟 Typical Use Cases

  1. Industry Trend Tracking: Find the latest industry dynamics on specific topics like "low-altitude economy" or "generative AI" on Zhihu.
  2. Public Relations Monitoring: Monitor the media exposure of a specific brand or company on Zhihu over the past 30 days.
  3. Competitor Intelligence Gathering: Collect recent product information or market activities published by competitors on Zhihu.
  4. Market Hotspot Research: Get popular Zhihu reports on specific keywords across different time dimensions.
  5. Character Dynamics Tracking: Retrieve the latest Zhihu articles and interviews of industry leaders or public figures.
  6. Daily Briefing Summary: Automatically extract and summarize daily industry news briefings from Zhihu.
  7. Global Event Monitoring: Real-time access to major breaking news and discussions on Zhihu.
  8. Structured Data Extraction: Extract structured information such as article titles, authors, and links from Zhihu for market research analysis.
  9. Media Exposure Analysis: Evaluate the spread and popularity of a specific project or event on Zhihu.
  10. Long-term Thematic Research: Retrieve in-depth reports and discussions on a specific technical topic from the past year.

相关 Skills

agent-browser

by chulla-ceja

热门

Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.

搜索与获取
未扫描3.7k

接口规范

by alexxxiong

热门

API 规范管理工具 - 跨项目 API 文档的初始化、更新、查询与搜索。Triggers: 'API文档', 'API规范', '接口文档', '路由解析', 'apispec', 'API lookup', 'API search'.

搜索与获取
未扫描3.7k

investment-research

by caijichang212

热门

Perform structured investment research (投研分析) for a company/stock/ETF/sector using a repeatable framework: fundamentals (basic/财务报表与商业模式), technical analysis (技术指标与关键价位), industry research (行业景气与竞争格局), valuation (估值对比/情景), catalysts and risks, and produce a professional research report + actionable plan. Use when the user asks for: equity/ETF analysis, earnings/financial statement breakdown, peer/industry comparison, valuation ranges, bull/base/bear scenarios, technical trend/support-resistance, or a full research memo.

搜索与获取
未扫描3.7k

相关 MCP 服务

by Anthropic

热门

Puppeteer 是让 Claude 自动操作浏览器进行网页抓取和测试的 MCP 服务器。

这个服务器解决了手动编写 Puppeteer 脚本的繁琐问题,适合需要自动化网页交互的开发者,比如抓取动态内容或做端到端测试。不过,作为参考实现,它可能缺少生产级的安全防护,建议在可控环境中使用。

搜索与获取
82.9k

网页抓取

编辑精选

by Anthropic

热门

Fetch 是 MCP 官方参考服务器,让 AI 能抓取网页并转为 Markdown 格式。

这个服务器解决了 AI 直接处理网页内容时格式混乱的问题,适合需要让 Claude 分析在线文档或新闻的开发者。不过作为参考实现,它缺乏生产级的安全配置,你得自己处理反爬虫和隐私风险。

搜索与获取
82.9k

Brave 搜索

编辑精选

by Anthropic

热门

Brave Search 是让 Claude 直接调用 Brave 搜索 API 获取实时网络信息的 MCP 服务器。

如果你想让 AI 助手帮你搜索最新资讯或技术文档,这个工具能绕过传统搜索的限制,直接返回结构化数据。特别适合需要实时信息的开发者,比如查 API 更新或竞品动态。不过它依赖 Brave 的 API 配额,高频使用可能受限。

搜索与获取
82.9k

评论