Loading...
Loading...
Agent-first media toolkit for image, video, and audio processing. Use when you need to resize, convert, generate images, remove backgrounds, extract audio, transcribe speech, or generate videos. All commands return deterministic JSON output.
npx skill4agent add agntswrm/agent-media agent-mediaagent-media image resizeagent-media image convertagent-media image remove-backgroundagent-media image generateagent-media audio extractagent-media audio transcribeagent-media video generate{
"ok": true,
"media_type": "image",
"action": "resize",
"provider": "local",
"output_path": "output_123.webp",
"mime": "image/webp",
"bytes": 12345
}{
"ok": false,
"error": {
"code": "INVALID_INPUT",
"message": "input file not found"
}
}--provider <name>AGENT_MEDIA_DIRFAL_API_KEYREPLICATE_API_TOKENRUNPOD_API_KEYAI_GATEWAY_API_KEY