browser

@iamzhihuix/browser

iamzhihuix

102

7 forks

Updated 1/6/2026

View on GitHub

Minimal Chrome DevTools Protocol tools for browser automation and scraping. Use when you need to start Chrome, navigate pages, execute JavaScript, take screenshots, or interactively pick DOM elements. Triggers include "browse website", "scrape page", "take screenshot", "automate browser", "extract DOM", "web scraping".

Installation

$skills install @iamzhihuix/browser

Claude Code

Cursor

Copilot

Codex

Antigravity

Details

Repositoryiamzhihuix/happy-claude-skills

Pathskills/browser/SKILL.md

Branchmain

Scoped Name@iamzhihuix/browser

Usage

After installing, this skill will be available to your AI coding assistant.

Verify installation:

skills list

Skill Instructions

name: browser description: Minimal Chrome DevTools Protocol tools for browser automation and scraping. Use when you need to start Chrome, navigate pages, execute JavaScript, take screenshots, or interactively pick DOM elements. Triggers include "browse website", "scrape page", "take screenshot", "automate browser", "extract DOM", "web scraping".

Browser Tools

Minimal CDP tools for collaborative site exploration and scraping.

Credits: Based on Mario Zechner's article What if you don't need MCP?, adapted from Factory.ai.

Setup

Before first use, install dependencies:

npm install --prefix skills/browser

Start Chrome

./skills/browser/scripts/start.js              # Fresh profile
./skills/browser/scripts/start.js --profile    # Copy your profile (cookies, logins)

Start Chrome on :9222 with remote debugging.

Navigate

./skills/browser/scripts/nav.js https://example.com
./skills/browser/scripts/nav.js https://example.com --new

Navigate current tab or open new tab.

Evaluate JavaScript

./skills/browser/scripts/eval.js 'document.title'
./skills/browser/scripts/eval.js 'document.querySelectorAll("a").length'

Execute JavaScript in active tab (async context).

IMPORTANT: The code must be a single expression or use IIFE for multiple statements:

Single expression: 'document.title'
Multiple statements: '(() => { const x = 1; return x + 1; })()'
Avoid newlines in the code string - keep it on one line

Screenshot

./skills/browser/scripts/screenshot.js

Screenshot current viewport, returns temp file path.

Pick Elements

./skills/browser/scripts/pick.js "Click the submit button"

Interactive element picker. Click to select, Cmd/Ctrl+Click for multi-select, Enter to finish.

Workflow

Start Chrome with start.js --profile to mirror your authenticated state.
Drive navigation via nav.js https://target.app or open secondary tabs with --new.
Inspect the DOM using eval.js for quick counts, attribute checks, or extracting JSON payloads.
Capture artifacts with screenshot.js for visual proof or pick.js when you need precise selectors or text snapshots.

Usage Notes

Start Chrome first before using other tools
The --profile flag syncs your actual Chrome profile so you're logged in everywhere
JavaScript evaluation runs in an async context in the page
Pick tool allows you to visually select DOM elements by clicking on them

More by iamzhihuix

View all

docx-format-replicator

102

Extract formatting from existing Word documents and generate new documents with the same format but different content. Use this skill when users need to create multiple documents with consistent formatting, replicate document templates, or maintain corporate document standards across different content.

video-processor

102

Download and process videos from YouTube and other platforms. Supports video download, audio extraction, format conversion (mp4, webm), and Whisper transcription. Use when user mentions YouTube download, video conversion, audio extraction, transcription, mp4, webm, ffmpeg, yt-dlp, or whisper transcription.

wechat-article-writer

102

公众号文章自动化写作流程。支持资料搜索、文章撰写、爆款标题生成、排版优化。当用户提到写公众号、微信文章、自媒体写作、爆款文章、内容创作时使用此 skill。