browser

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Browser Automation

浏览器自动化

Automate browser interactions using Stagehand CLI with Claude.
通过Stagehand CLI与Claude实现浏览器交互自动化。

First: Environment Selection (Local vs Remote)

第一步:环境选择(本地 vs 远程)

The skill automatically selects between local and remote browser environments:
  • If Browserbase API keys exist (BROWSERBASE_API_KEY and BROWSERBASE_PROJECT_ID in .env file): Uses remote Browserbase environment
  • If no Browserbase API keys: Falls back to local Chrome browser
  • No user prompting: The selection happens automatically based on available configuration
该Skill会自动在本地和远程浏览器环境间切换:
  • 若存在Browserbase API密钥(.env文件中配置了BROWSERBASE_API_KEY和BROWSERBASE_PROJECT_ID):使用远程Browserbase环境
  • 若无Browserbase API密钥:回退到本地Chrome浏览器
  • 无需用户手动选择:根据现有配置自动完成环境选择

Setup (First Time Only)

首次设置

Check
setup.json
in this directory. If
setupComplete: false
:
bash
npm install    # Install dependencies
npm link       # Create global 'browser' command
检查当前目录下的
setup.json
。若
setupComplete: false
bash
npm install    # 安装依赖
npm link       # 创建全局的'browser'命令

Commands

命令

All commands work identically in both modes:
bash
browser navigate <url>                    # Go to URL
browser act "<action>"                    # Natural language action
browser extract "<instruction>" ['{}']    # Extract data (optional schema)
browser observe "<query>"                 # Discover elements
browser screenshot                        # Take screenshot
browser close                             # Close browser
所有命令在两种模式下的工作方式完全相同:
bash
browser navigate <url>                    # 跳转到指定URL
browser act "<action>"                    # 执行自然语言描述的操作
browser extract "<instruction>" ['{}']    # 提取数据(可选配置Schema)
browser observe "<query>"                 # 发现页面元素
browser screenshot                        # 截图
browser close                             # 关闭浏览器

Quick Example

快速示例

bash
browser navigate https://example.com
browser act "click the Sign In button"
browser extract "get the page title"
browser close
bash
browser navigate https://example.com
browser act "click the Sign In button"
browser extract "get the page title"
browser close

Mode Comparison

模式对比

FeatureLocalBrowserbase
SpeedFasterSlightly slower
SetupChrome requiredAPI key required
Stealth modeNoYes
Proxy/CAPTCHANoYes
Best forDevelopmentProduction/scraping
特性本地模式Browserbase模式
速度更快稍慢
配置要求需要安装Chrome需要API密钥
隐身模式不支持支持
代理/CAPTCHA处理不支持支持
最佳适用场景开发阶段生产环境/数据爬取

Best Practices

最佳实践

  1. Always navigate first before interacting
  2. View screenshots after each command to verify
  3. Be specific in action descriptions
  4. Close browser when done
  1. 先导航再交互:执行交互操作前务必先导航到目标页面
  2. 操作后查看截图:每次命令执行后查看截图以验证结果
  3. 操作描述要具体:自然语言指令需明确具体
  4. 使用完毕关闭浏览器:完成操作后关闭浏览器

Troubleshooting

故障排除

  • Chrome not found: Install Chrome or use Browserbase mode
  • Action fails: Use
    browser observe
    to discover available elements
  • Browserbase fails: Verify API key and project ID are set
For detailed examples, see EXAMPLES.md. For API reference, see REFERENCE.md.
  • Chrome未找到:安装Chrome或切换到Browserbase模式
  • 操作执行失败:使用
    browser observe
    命令发现可用页面元素
  • Browserbase模式失败:验证API密钥和项目ID是否配置正确
更多详细示例,请查看EXAMPLES.md。 API参考文档,请查看REFERENCE.md