Marketplace

NVIDIA® Network Operator

Updated August 12, 2024

The NVIDIA® Network Operator is a Kubernetes application designed for managing and optimizing software components for networking between NVIDIA GPUs in the cloud. The operator automates many tasks related to network setup, including the configuration of high-performance networking features like RDMA (Remote Direct Memory Access) and GPUDirect, which are crucial for applications requiring low latency and high throughput. This tool is particularly beneficial for environments where NVIDIA GPUs are deployed for compute-intensive tasks, as it ensures that the network can support the high data transfer demands of such applications.

You can deploy the NVIDIA Network Operator in your Nebius AI Managed Service for Kubernetes clusters using this Marketplace product. Your cluster must have a node group attached to a Compute Cloud GPU cluster.

Deployment instructions

Before installing this product:

  1. Create a GPU cluster in Compute Cloud.
  2. Create a Kubernetes cluster and a node group in it. When creating the group, select the created GPU cluster for it.
  3. Install kubectl and configure it to work with the created cluster.

To install the product:

  1. On the cluster page in the management console, go to the Marketplace tab, select the product, and click Install.

  2. Configure the application:

    • Namespace: Select a namespace or create a new one.
    • Application name: Enter an application name.
  3. Click Install.

  4. Wait for the application to change its status to Deployed.

  5. To check that the NVIDIA Network Operator is working, check that its pods are running:

    kubectl get pods -n <namespace>
    
Billing type
Free
Type
Kubernetes® Application
Category
Developer tools
Dataset preparation
Training
Inference
Publisher
Nebius
Use cases
  • Automating management of software components for GPU networking in Kubernetes clusters.
  • Building fast infrastructures for high-performance computing (HPC) and AI workloads.
Technical support

Nebius AI does not provide technical support for the product. If you have any issues, please refer to the developer’s information resources.

Product composition
Helm chartVersion
Pull-command
Documentation
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/chart/network-operator24.4.0Open
Docker imageVersion
Pull-command
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/sriov-network-operatornetwork-operator-24.4.0
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/sriov-network-operator-config-daemonnetwork-operator-24.4.0
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/sriov-cni3e6368077716f6b8368b0e036a1290d1c64cf1fb
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/ib-sriov-cnifc002af57a81855542759d0f77d16dacd7e1aa38
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/ovs-cni-plugin6f8174b1a47c47657fe9e59fe448f2a452bb6960
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/rdma-cniv1.1.0
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/sriov-network-device-plugine6ead1e8f76a407783430ee2666b403db2d76f64
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/network-resources-injector8810e6a127366cc1eb829d3f7cb3f866d096946e
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/sriov-network-operator-webhooknetwork-operator-24.4.0
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/busybox1.27.2
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/containerd-fixer0.0.1
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/node-feature-discoveryv0.13.2
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/network-operatorv24.4.0
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/mofed23.10-2.1.3.1-10-ubuntu20.04-amd64
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/mofed23.10-2.1.3.1-10-ubuntu22.04-amd64
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/network-operator-init-containerv0.0.2
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/k8s-rdma-shared-dev-plugin1.4.0
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/sriov-network-device-plugine6ead1e8f76a407783430ee2666b403db2d76f64
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/ib-kubernetesv1.0.2
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/nvidia-k8s-ipamv0.1.2
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/pluginsv1.3.0
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/multus-cniv3.9.3
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/ipoib-cni428715a57c0b633e48ec7620f6e3af6863149ccf
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/whereaboutsv0.7.0
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/nic-feature-discoveryv0.0.1
cr.nemax.nebius.cloud/yc-marketplace/nebius/nvidia-network-operator/image/doca_telemetry1.16.5-doca2.6.0-host
Terms
By using this product you agree to the Nebius AI Marketplace Terms of Service and the terms and conditions of the following software: Apache 2.0
Billing type
Free
Type
Kubernetes® Application
Category
Developer tools
Dataset preparation
Training
Inference
Publisher
Nebius