Use when the user wants to set up, scale, validate, or harden NVIDIA physical AI infrastructure for synthetic data generation workflows across local MicroK8s or Azure AKS, including Kubernetes clusters, inference endpoint deployment, OSMO deployment, workload submission readiness, and infrastructure failure recovery. Trigger keywords: physical ai infrastructure, resilient scaling, SDG infrastructure, microk8s, azure aks, NVCF deployment, NIM Operator, OSMO deploy, workflow scaling. Don't trigger for: OSMO log summarization or workload-only operations unless infrastructure setup, scaling, validation, or recovery is requested.

`npx skill4agent add nvidia/skills physical-ai-infrastructure-setup-and-resilient-scaling`

`${REPO_ROOT}/.env`, `.env`, `secretKeyRef`, `scripts/scan_transcript_secrets.py`, `git rev-parse --show-toplevel`

| Concern | Load | Assets |
|---|---|---|
| Stage matrix and old driver notes | | None |
| MicroK8s cluster | | |
| Azure AKS cluster | | |
| NIM Operator inference | | |
| NVCF inference | | |
| Azure AI Foundry inference | | |
| MicroK8s OSMO | | |
| Azure OSMO | | |
| Azure access setup | | None |
| OSMO CLI and workflow operations | | |
| OpenClaw Azure device login | | None |

| File | Read when |
|---|---|
| | Spawning a workflow-generation or workflow-failure subagent. |
| | Spawning a log summarization subagent for OSMO workflow failures. |
| | Exact OSMO CLI flags, payloads, or command syntax are needed. |
| | Workflow YAML schema, credentials, outputs, or provider fields are needed. |
| | Multi-task, data dependency, Jinja, serial, or parallel workflow design is needed. |
| | Checkpointing, retry/exit behavior, or node exclusion is needed. |
| | Validating or debugging the OSMO orchestration review pattern. |

MicroK8s, Azure, MicroK8s OSMO, Azure OSMO, NIM Operator, NVCF, Azure AI Foundry, None

| Cluster | NIM Operator | NVCF | Azure AI Foundry |
|---|---|---|---|
| MicroK8s | yes | yes | no, Foundry requires Azure identities |
| Azure | yes | yes | yes |
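
The compatibility table above can be encoded as a small lookup so that an unsupported pairing is rejected before any deployment step runs. This is only a sketch: the names `SUPPORTED` and `is_supported` are hypothetical and do not come from the skill's scripts, and the values are transcribed directly from the table.

```python
# Hypothetical helper, not part of the skill: values mirror the table above.
SUPPORTED = {
    "MicroK8s": {"NIM Operator": True, "NVCF": True, "Azure AI Foundry": False},
    "Azure": {"NIM Operator": True, "NVCF": True, "Azure AI Foundry": True},
}


def is_supported(cluster: str, inference: str) -> bool:
    """Return True if the inference option is listed as 'yes' for the cluster."""
    try:
        return SUPPORTED[cluster][inference]
    except KeyError as exc:
        # Unknown names are treated as an error rather than silently unsupported.
        raise ValueError(f"unknown cluster or inference option: {exc}") from None
```

For example, `is_supported("MicroK8s", "Azure AI Foundry")` returns `False`, matching the table's note that Foundry requires Azure identities.
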

`components/openclaw-azure-login/reference.md`, `components/azure-access/reference.md`, `scripts/preflight.sh`, `preflight_credentials.sh`, `pre_submit_guard.py`, `--set`, `components/osmo-cli/reference.md`, `*.osmo-nims.svc.cluster.local`, `api.nvcf.nvidia.com/*`, `*.inference.ai.azure.com`, `*.cognitiveservices.azure.com`, `components/inference-nim-operator/nims/`, `components/inference-azure/scripts/install.sh`

| Stage | Gate |
|---|---|
| Kubernetes | Cluster API reachable, nodes Ready, GPU capacity advertised for GPU paths, and CPU+NVCF paths have |
| Inference | Every endpoint referenced by the workload is reachable. NIM readiness uses |
| OSMO | OSMO pods Ready, pool ONLINE, port-forward watchdogs alive, storage credentials configured, and verify-hello workflow COMPLETED. |
| Workload | Selected workload pre-submit guards pass before submit. |
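
One way to read the gate table above is as an ordered checklist in which a later stage is not evaluated until the earlier ones pass. The sketch below assumes that ordering (the table lists the stages in this sequence but does not state it as a rule), and the function name and the per-stage callables are hypothetical placeholders, not the skill's actual preflight scripts.

```python
from typing import Callable, Dict, Optional

# Stage order as listed in the gate table; treating it as strict is an assumption.
STAGES = ["Kubernetes", "Inference", "OSMO", "Workload"]


def first_failing_stage(checks: Dict[str, Callable[[], bool]]) -> Optional[str]:
    """Run each stage's check in order and return the first stage that fails.

    A stage with no registered check counts as failing, so a missing gate
    cannot be skipped by accident. Returns None when every gate passes.
    """
    for stage in STAGES:
        check = checks.get(stage)
        if check is None or not check():
            return stage
    return None
```

A caller would pass one callable per stage, for instance a function that confirms nodes are Ready for `"Kubernetes"`, and stop before submitting a workload whenever the result is not `None`.
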

`terraform apply`, `skills/physical-ai-video-data-augmentation/SKILL.md`, `skills/physical-ai-defect-image-generation/SKILL.md`, `skills/carline-adaptation/SKILL.md`, `skills/INDEX.md`