Call the VSS agent to run video understanding on a video and answer a text question about it. Use when the user asks about video content, or about visual details that cannot be answered from conversation history, search hits, or metadata alone.
Install the skill:
npx skill4agent add nvidia/skills video-understanding

Tool name: video_understanding

Check that a VSS profile is running:
curl -sf --max-time 5 "http://${HOST_IP}:8000/docs" >/dev/null

If the check fails, ask the user:
"No VSS profile is running on $HOST_IP. Shall I deploy base (recommended for per-clip VLM QnA) using the /deploy skill? If you prefer lvs, say so."

Deploy with /deploy -p base (or -p lvs if the user prefers lvs).

With base running, query the agent through /generate:
# Set from deployment (compose / .env / host where vss-agent listens)
export VSS_AGENT_BASE_URL="http://localhost:8000"
curl -s -X POST "${VSS_AGENT_BASE_URL}/generate" \
-H "Content-Type: application/json" \
-d '{"input_message": "Call video_understanding tool to answer the following question about <sensor-id>: <user query>"}' | jq .
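
The request above embeds the sensor ID and the user's question directly in a JSON string, so a question containing double quotes or backslashes would produce invalid JSON. A minimal sketch of one way to build the payload safely is shown below. The helper names `json_escape` and `build_payload` are hypothetical and not part of the skill, and the escaping covers only backslashes and double quotes, not newlines or other control characters.

```shell
# Hypothetical helpers (not part of the skill): build the /generate payload
# with the sensor ID and user query escaped for use inside a JSON string.

json_escape() {
  # Escape backslashes first, then double quotes.
  # Assumption: the input contains no newlines or other control characters.
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

build_payload() {
  # $1 = sensor ID, $2 = user query
  printf '{"input_message": "Call video_understanding tool to answer the following question about %s: %s"}' \
    "$(json_escape "$1")" "$(json_escape "$2")"
}

# Usage (assumes VSS_AGENT_BASE_URL is exported as in the command above):
#   curl -s -X POST "${VSS_AGENT_BASE_URL}/generate" \
#     -H "Content-Type: application/json" \
#     -d "$(build_payload "<sensor-id>" "<user query>")" | jq .
```

Escaping backslashes before quotes matters: doing it in the other order would double the backslashes that the quote escaping just added.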