browser-read
by bill492
Extract readable content from browser pages as markdown. Use when web_fetch fails (bot protection, auth-required pages, Twitter/X, LinkedIn) and you already have the page open in the browser.
安装
claude skill add --url github.com/openclaw/skills/tree/main/skills/bill492/browser-read文档
browser-read
Extract readable text from an already-open browser page and return markdown, suitable for pages where web_fetch is blocked or missing auth context.
When to use
web_fetchreturned an error or empty content.- Page requires authentication/cookies/session state available only in the browser.
- You need text extraction from Twitter/X or LinkedIn timelines/articles where screenshot/OCR was previously used.
When NOT to use
web_fetchalready returns good markdown/text (faster and cheaper).- Purely static pages where normal fetch is sufficient.
Steps
- Navigate to the URL with
browser navigate. - Read extraction script from
~/clawd/skills/browser-read/extract.js. - Run
browser actwithkind=evaluateand pass the script contents asfn. - Script returns
{title, content, excerpt, byline, siteName, length}wherecontentis markdown. - If extraction fails or returns empty content, script falls back to
document.body.innerText.
Example (tool calls)
json
{
"action": "navigate",
"targetId": "...",
"url": "https://example.com"
}
{
"action": "act",
"targetId": "...",
"kind": "evaluate",
"fn": "(() => { ... return {title, content, excerpt, byline, siteName, length}; })()"
}
Notes
extract.jsis a self-contained IIFE so it can be passed directly as thefnvalue tobrowser act.- Keep in mind this is a lightweight extractor; it intentionally strips script/style/nav/header/footer/aside/cookie/ad elements before conversion.