agent-browser
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseAgent Browser
Agent Browser
Browser automation using the CLI - a fast, headless browser automation tool for AI agents.
agent-browser使用 CLI实现浏览器自动化——一款面向AI Agent的快速无头浏览器自动化工具。
agent-browserInstallation
安装
bash
npm install -g agent-browser
agent-browser install # Install browser binariesbash
npm install -g agent-browser
agent-browser install # Install browser binariesQuick Start
快速开始
bash
undefinedbash
undefinedNavigate to a URL
Navigate to a URL
agent-browser open https://example.com
agent-browser open https://example.com
Get accessibility snapshot (shows refs like @e1, @e2)
Get accessibility snapshot (shows refs like @e1, @e2)
agent-browser snapshot -i
agent-browser snapshot -i
Click using ref from snapshot
Click using ref from snapshot
agent-browser click @e2
agent-browser click @e2
Type into an element
Type into an element
agent-browser fill @e3 "hello world"
agent-browser fill @e3 "hello world"
Take screenshot
Take screenshot
agent-browser screenshot output.png
undefinedagent-browser screenshot output.png
undefinedWorkflow Pattern
工作流模式
- Open - Navigate to the target URL
- Snapshot - Get the accessibility tree to see available elements
- Interact - Use refs (@e1, @e2, etc.) to interact with elements
- Verify - Take a snapshot or screenshot to verify state
- 打开 - 导航至目标URL
- 快照 - 获取可访问性树以查看可用元素
- 交互 - 使用引用(@e1、@e2等)与元素交互
- 验证 - 拍摄快照或截图以验证状态
Core Commands
核心命令
See references/commands.md for the complete command reference.
完整命令参考请查看references/commands.md。
Navigation
导航
bash
agent-browser open <url> # Navigate to URL
agent-browser back # Go back
agent-browser forward # Go forward
agent-browser reload # Reload pagebash
agent-browser open <url> # Navigate to URL
agent-browser back # Go back
agent-browser forward # Go forward
agent-browser reload # Reload pageInteraction
交互
bash
agent-browser click <sel> # Click element (or @ref)
agent-browser fill <sel> <text> # Clear and fill
agent-browser press <key> # Press key (Enter, Tab, etc.)
agent-browser select <sel> <val> # Select dropdown optionbash
agent-browser click <sel> # Click element (or @ref)
agent-browser fill <sel> <text> # Clear and fill
agent-browser press <key> # Press key (Enter, Tab, etc.)
agent-browser select <sel> <val> # Select dropdown optionGetting Information
获取信息
bash
agent-browser snapshot # Accessibility tree with refs
agent-browser snapshot -i # Interactive elements only
agent-browser get text <sel> # Get element text
agent-browser get url # Get current URLbash
agent-browser snapshot # Accessibility tree with refs
agent-browser snapshot -i # Interactive elements only
agent-browser get text <sel> # Get element text
agent-browser get url # Get current URLCapture
捕获
bash
agent-browser screenshot [path] # Take screenshot
agent-browser screenshot --full # Full page screenshot
agent-browser pdf <path> # Save as PDFbash
agent-browser screenshot [path] # Take screenshot
agent-browser screenshot --full # Full page screenshot
agent-browser pdf <path> # Save as PDFSessions
会话
Use sessions to maintain browser state across commands:
bash
agent-browser --session myproject open https://example.com
agent-browser --session myproject snapshot
agent-browser --session myproject click @e1使用会话在多个命令间维持浏览器状态:
bash
agent-browser --session myproject open https://example.com
agent-browser --session myproject snapshot
agent-browser --session myproject click @e1Selectors
选择器
- Refs: ,
@e1(from snapshot output) - preferred@e2 - CSS: ,
#id,.classdiv > span - Text:
text=Submit - Role:
role=button[name="Submit"]
- 引用(Refs):、
@e1(来自快照输出) - 推荐使用@e2 - CSS选择器:、
#id、.classdiv > span - 文本选择器:
text=Submit - 角色选择器:
role=button[name="Submit"]
Best Practices
最佳实践
- Always snapshot first - Get the accessibility tree before interacting
- Use refs - Prefer refs from snapshot over CSS selectors
@e1 - Use sessions - Maintain state across multiple commands
- Wait appropriately - Use for dynamic content
wait - Verify actions - Snapshot or screenshot after interactions
- 始终先获取快照 - 交互前先获取可访问性树
- 使用引用 - 优先使用快照中的引用而非CSS选择器
@e1 - 使用会话 - 在多个命令间维持状态
- 适当等待 - 针对动态内容使用命令
wait - 验证操作 - 交互后拍摄快照或截图