Loading...
Loading...
Automate desktop GUI workflows via Claude computer use API with screenshot capture and mouse/keyboard control.
npx skill4agent add athola/claude-night-market computer-controlWhy this stays opt-in. Per docs/inclusive-defaults.md (TRUE-exception category 4), Computer Use takes screenshots and synthesizes keyboard/mouse input — cross-process side effects that must always be explicitly invoked, never default-on.
phantom.displayphantom.loopphantom.cliUser Task
|
v
Agent Loop <----> Claude API (beta)
| |
v v
Display Toolkit tool_use responses
| (click, type, screenshot)
v
OS Commands (xdotool, scrot)
|
v
Display (X11 / Xvfb / WSLg)cd plugins/phantom
uv run python -m phantom.cli --checkexport ANTHROPIC_API_KEY="sk-ant-..."
uv run python -m phantom.cli "Open Firefox and search for Claude AI"from phantom.display import DisplayConfig, DisplayToolkit
from phantom.loop import LoopConfig, run_loop
result = run_loop(
task="Take a screenshot of the desktop",
api_key="sk-ant-...",
loop_config=LoopConfig(
model="claude-sonnet-4-6",
max_iterations=10,
),
display_config=DisplayConfig(width=1920, height=1080),
)
print(f"Done in {result.iterations} iterations")
print(result.final_text)| Model | Tool Version | Beta Flag |
|---|---|---|
| Opus 4.6, Sonnet 4.6, Opus 4.5 | | |
| Sonnet 4.5, Haiku 4.5, older | | |
resolve_tool_version()screenshotleft_click[x, y]typekeyctrl+smouse_movescrollleft_click_dragright_clickmiddle_clickdouble_clicktriple_clickhold_keywaitzoommax_iterationson_actionsudo apt install xdotool scrot xclip# Install Xvfb for virtual display
sudo apt install xvfb xdotool scrot xclip
Xvfb :1 -screen 0 1920x1080x24 &
export DISPLAY=:1