The NVIDIA® Network Operator is a Kubernetes application designed for managing and optimizing software components for networking between NVIDIA GPUs in the cloud. The operator automates many tasks related to network setup, including the configuration of high-performance networking features like RDMA (Remote Direct Memory Access) and GPUDirect, which are crucial for applications requiring low latency and high throughput. This tool is particularly beneficial for environments where NVIDIA GPUs are deployed for compute-intensive tasks, as it ensures that the network can support the high data transfer demands of such applications.
You can deploy the NVIDIA Network Operator in your Nebius AI Managed Service for Kubernetes clusters using this Marketplace product. Your cluster must have a node group attached to a Compute Cloud GPU cluster.
Before installing this product:
- Create a GPU cluster in Compute Cloud.
- Create a Kubernetes cluster and a node group in it. When creating the group, select the created GPU cluster for it.
- Install kubectl and configure it to work with the created cluster.
To install the product:
-
On the cluster page in the management console, go to the Marketplace tab, select the product, and click Install.
-
Configure the application:
- Namespace: Select a namespace or create a new one.
- Application name: Enter an application name.
-
Click Install.
-
Wait for the application to change its status to
Deployed
. -
To check that the NVIDIA Network Operator is working, check that its pods are running:
kubectl get pods -n <namespace>
- Automating management of software components for GPU networking in Kubernetes clusters.
- Building fast infrastructures for high-performance computing (HPC) and AI workloads.
Nebius AI does not provide technical support for the product. If you have any issues, please refer to the developer’s information resources.
Helm chart | Version | Pull-command | Documentation |
---|---|---|---|
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/chart/network-operator | 23.7.0 | Open |
Docker image | Version | Pull-command |
---|---|---|
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/sriov-network-operator1708945733664526440984944757808484354342988005976 | network-operator-23.7.0 | |
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/sriov-network-operator-config-daemon1708945733664526440984944757808484354342988005976 | network-operator-23.7.0 | |
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/sriov-cni1708945733664526440984944757808484354342988005976 | v2.7.0 | |
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/ib-sriov-cni1708945733664526440984944757808484354342988005976 | v1.0.3 | |
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/sriov-network-device-plugin1708945733664526440984944757808484354342988005976 | 7e7f979087286ee950bd5ebc89d8bbb6723fc625 | |
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/network-resources-injector1708945733664526440984944757808484354342988005976 | v1.4 | |
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/sriov-network-operator-webhook1708945733664526440984944757808484354342988005976 | v1.1.0 | |
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/busybox1708945733664526440984944757808484354342988005976 | 1.27.2 | |
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/containerd-fixer1708945733664526440984944757808484354342988005976 | 0.0.1 | |
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/node-feature-discovery1708945733664526440984944757808484354342988005976 | v0.13.2 | |
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/network-operator1708945733664526440984944757808484354342988005976 | v23.7.0 | |
cr.nemax.nebius.cloud/yc-marketplace/nebius/network-operator/mofed1708945733664526440984944757808484354342988005976 | 23.04-0.5.3.3.1-ubuntu20.04-amd64 |