browser
Browser Automation
Automate browser interactions using Stagehand CLI with Claude.
First: Environment Selection (Local vs Remote)
The skill automatically selects between local and remote browser environments:
- If Browserbase API keys exist (BROWSERBASE_API_KEY and BROWSERBASE_PROJECT_ID in .env file): Uses remote Browserbase environment
- If no Browserbase API keys: Falls back to local Chrome browser
- No user prompting: The selection happens automatically based on available configuration
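
The selection rule above can be sketched as a small shell check. This is only an illustration, not the skill's actual code: `select_browser_env` is a hypothetical helper name, and it assumes the `.env` file holds plain `KEY=value` lines.

```shell
#!/usr/bin/env bash
# Hypothetical helper: prints which environment the rule above would pick.
# Assumes .env contains simple KEY=value lines (no quoting or export keywords).
select_browser_env() {
  local env_file="${1:-.env}"
  if [ -f "$env_file" ] \
    && grep -Eq '^BROWSERBASE_API_KEY=.+' "$env_file" \
    && grep -Eq '^BROWSERBASE_PROJECT_ID=.+' "$env_file"; then
    echo "remote"   # both keys present: Browserbase
  else
    echo "local"    # any key missing: local Chrome
  fi
}
```

Calling `select_browser_env` with no argument inspects `.env` in the current directory. Both keys must be present for the remote environment to be chosen.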
Setup (First Time Only)
Check `setup.json` in this directory. If it contains `setupComplete: false`, run:

```bash
npm install   # Install dependencies
npm link      # Create the global 'browser' command
```
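
The first-time check can also be scripted. The sketch below assumes `setup.json` is ordinary JSON with a `"setupComplete"` boolean field and treats a missing file as "setup needed"; both are assumptions, and `needs_setup` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeeds (exit 0) when first-time setup still has to run.
# Assumes setup.json is JSON containing "setupComplete": true or false.
needs_setup() {
  local file="${1:-setup.json}"
  if [ ! -f "$file" ]; then
    return 0   # assumption: no setup.json means setup has not been done
  fi
  grep -Eq '"setupComplete"[[:space:]]*:[[:space:]]*false' "$file"
}
```

A possible use is `needs_setup && npm install && npm link`, which runs the two setup commands only when the flag is still false.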
Commands
All commands work identically in both modes:
```bash
browser navigate <url>                    # Go to URL
browser act "<action>"                    # Natural language action
browser extract "<instruction>" ['{}']    # Extract data (optional schema)
browser observe "<query>"                 # Discover elements
browser screenshot                        # Take screenshot
browser close                             # Close browser
```
Quick Example
```bash
browser navigate https://example.com
browser act "click the Sign In button"
browser extract "get the page title"
browser close
```

Mode Comparison
| Feature | Local | Browserbase |
|---|---|---|
| Speed | Faster | Slightly slower |
| Setup | Chrome required | API key required |
| Stealth mode | No | Yes |
| Proxy/CAPTCHA | No | Yes |
| Best for | Development | Production/scraping |
Best Practices
- Always navigate first before interacting
- View screenshots after each command to verify
- Be specific in action descriptions
- Close browser when done
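
One way to follow these practices in a script is sketched below. `run_session` is a hypothetical wrapper name; only the `browser` subcommands come from the command list in this document. Whether the CLI signals failure through a non-zero exit status is an assumption.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper: navigate first, screenshot after each step, always close.
# Assumes the browser CLI exits non-zero when a command fails.
run_session() {
  local url="$1" action="$2" status=0
  if ! browser navigate "$url"; then   # navigate before any interaction
    browser close
    return 1
  fi
  browser screenshot                   # check that the page loaded as expected
  browser act "$action" || status=$?   # pass a specific action description
  browser screenshot                   # check the result of the action
  browser close                        # close even if the action failed
  return "$status"
}
```

For example, `run_session https://example.com "click the Sign In button"` mirrors the quick example while adding the verification screenshots and a guaranteed close.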
Troubleshooting
- Chrome not found: Install Chrome or use Browserbase mode
- Action fails: Use `browser observe` to discover available elements
- Browserbase fails: Verify API key and project ID are set
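
When Browserbase mode fails, a quick diagnostic can show which required key is absent. This sketch assumes a `.env` file of plain `KEY=value` lines; `missing_browserbase_keys` is a hypothetical helper, not part of the CLI.

```shell
#!/usr/bin/env bash
# Hypothetical diagnostic: prints each required Browserbase key that is
# missing or empty in the given .env file (default: ./.env).
missing_browserbase_keys() {
  local env_file="${1:-.env}" key
  for key in BROWSERBASE_API_KEY BROWSERBASE_PROJECT_ID; do
    if ! grep -Eq "^${key}=.+" "$env_file" 2>/dev/null; then
      echo "$key"
    fi
  done
}
```

Empty output means both keys are set. The check does not validate the values themselves, so an expired or mistyped key would still need to be verified against the Browserbase account.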
For detailed examples, see EXAMPLES.md.
For API reference, see REFERENCE.md.