NVIDIA Certified Professional - AI Networking
Validates the ability to deploy and configure environments leveraging NVIDIA advanced networking technologies for AI data centers, including AI data center network design and optimization, NVIDIA Spectrum Ethernet networking with Spectrum-4 switches and BlueField SuperNICs, NVIDIA InfiniBand networking with ConnectX adapters and Subnet Manager, troubleshooting tools for network diagnostics, automation and configuration management, and Kubernetes networking integration with GPU Operator and Network Operator. The exam covers six domains: NVIDIA Spectrum Networking (30%), NVIDIA InfiniBand Networking (30%), Troubleshooting Tools (20%), Automation and Configuration (10%), AI Data Center Design and Optimization (5%), and Kubernetes Integration (5%). Format: 70-75 multiple-choice questions, 120 minutes, proctored online.
Exam domains
- NVIDIA Spectrum Networking30%
Spectrum-X end-to-end Ethernet for AI: Spectrum-4 SN5600 switches plus BlueField-3 SuperNICs delivering RoCEv2 with adaptive routing, programmable congestion control, and lossless AI east-west traffic. Covers ECN/PFC, ECMP vs packet spraying, and Cumulus Linux / NVOS operation.
- NVIDIA InfiniBand Networking30%
Quantum-2 NDR 400Gb/s and Quantum-3 XDR 800Gb/s InfiniBand with ConnectX-7/8 HCAs: SDR/EDR/HDR/NDR signaling, OpenSM/UFM Subnet Manager routing, SHARP in-network reductions for NCCL allreduce/allgather, and adaptive routing on non-blocking fat-tree topologies.
- Troubleshooting Tools20%
Diagnose fabric health with ibdiagnet, ibnetdiscover, ibstat, ibportstate, mlxlink and mlxconfig; correlate link errors, symbol errors, and CRCs with cable/transceiver issues. Includes UFM telemetry, NCCL debug logs (NCCL_DEBUG=INFO), and GPUDirect RDMA verification with ib_write_bw / nccl-tests.
- Automation and Configuration10%
Day-0/Day-1 fabric automation with NVOS / Cumulus Linux, NetQ, Ansible playbooks and switch APIs; image and firmware management with MFT (mlxfwmanager), nv_fw_update, and Subnet Manager configuration files driving consistent PFC, ECN, and QoS policies across the fabric.
Sources
Questions are grounded in 50 references from official and authoritative materials.