Skip to content
Neutree Documentation

Managing Kubernetes clusters

Port requirements

If there is a firewall between your standard Kubernetes cluster and the Neutree management plane, open the corresponding ports on the destination side according to the following list to ensure Neutree can manage your Kubernetes cluster. Unless otherwise specified, all ports listed below are TCP ports.

SourceDestinationPortPurpose
Control planeKubernetes cluster nodes6443Manage and deploy Neutree cluster components and inference endpoints.
LoadBalancer IP associated with Kubernetes cluster nodes8000Forward requests from the AI gateway to the specific cluster.
Control Plane virtual IP of Kubernetes cluster nodesAssigned NodePortForward requests from the AI gateway to the specific cluster.
Kubernetes cluster nodesNode where monitoring components are deployed8480Required when monitoring components are deployed on a server or VM to upload monitoring metrics.
LoadBalancer IP allocated to monitoring components deployed on Kubernetes8480Required when monitoring components are deployed on a Kubernetes cluster to upload monitoring metrics.

Configuring accelerators

If the Kubernetes nodes include accelerators, complete the corresponding configuration based on the accelerator type.

When used with AKE, enable the NVIDIA GPU Operator plugin for the AKE workload cluster in Arcfra Operation Center. For details, see the Configuring cluster addons section in the Arcfra Kubernetes Engine Administration Guide.

When used with other standard Kubernetes clusters, refer to the Installing the NVIDIA GPU Operator section in NVIDIA GPU Operator from NVIDIA’s official documentation.

Creating a cluster

Follow the steps below to create a cluster. If the cluster nodes cannot access Docker Hub or the connection is slow, you can manually import cluster images.

  1. Log in to the Neutree management interface, click Clusters in the left navigation pane, then click Create on the right.

  2. Fill in the configuration.

    • Basic Information

      ParameterDescriptionEditable after creation
      NameThe name of the cluster.No
      WorkspaceThe workspace to which the cluster belongs.No
    • Image Registry

      Select a container registry for the cluster to store cluster-related container images. If no registry is available, see Creating a container registry; if no registry is available in your environment, see Setting up a temporary container registry. This field is not editable after creation.

    • Cluster Type

      The cluster type. Select Kubernetes. Not editable after creation.

    • Version

      The cluster version. The system automatically retrieves available versions from the selected registry. Can be updated after creation via Upgrading the cluster version.

    • Provider

      Enter the Kubeconfig string for the cluster to access the Kubernetes cluster. Not editable after creation.

    • Router

      ParameterDescriptionEditable after creation
      Access ModeRouting component access mode: LoadBalancer or NodePort.

      When selecting LoadBalancer, ensure the Kubernetes cluster supports LoadBalancer services.

      Yes
      ReplicasNumber of replicas for the routing component. Recommended: at least 2 for high availability.Yes
      CPUNumber of CPUs for the routing component.Yes
      MemoryMemory capacity for the routing component.Yes
    • Model Caches

      ParameterDescriptionEditable after creation
      NameThe name of the model cache.No
      Cache TypeSupported cache types: Host Path (local cache); NFS (NFS cache); PVC (persistent storage, ReadWriteMany only).Yes
      Cache PathThe path for model caching. When cache type is Host Path, specify the host path; when NFS, specify the NFS server path; when PVC, this field is not required.Yes
      NFS Server AddressThe IP address or domain name of the NFS server. Required only when cache type is NFS.Yes
      StorageSpecify the storage capacity for model caching. Required only when cache type is PVC.Yes
      Storage Class NameSpecify the storage class name for model caching. Required only when cache type is PVC.No

      If model cache is not configured during creation, it can be added after the cluster is created.

  3. After confirming the configuration is correct, click Save to complete creation.

Manually importing a cluster image

When upgrading the cluster version or when the network environment is restricted, you can manually import the required cluster images into the Neutree container registry.

Procedure

  1. Download version 1.0.1 of the Neutree CLI tool and the cluster offline image for the specified accelerator type, based on the server CPU architecture.

  2. Upload the cluster offline image to the specified registry using the CLI tool:

    Terminal window
    ./neutree-cli-<arch> import cluster \
    --package <cluster_package> \
    --mirror-registry <mirror_registry> \
    [--registry-project <registry_project>] \
    --registry-username <registry_username> \
    --registry-password <registry_password>
    ParameterDescription
    <arch>CPU architecture of the server: amd64 or aarch64.
    <cluster_package>Cluster offline image name, format: neutree-cluster-k8s-v1.0.1-<arch>.tar.gz.
    <mirror_registry>Registry address. Must match the address used when uploading images with the CLI tool during Neutree management plane deployment. Enter an OCI-compatible registry address without the https:// prefix.
    --registry-project <registry_project>Optional. Registry project name. Ensure the corresponding project has been pre-created in the registry.
    <registry_username>Registry username. Must have upload permissions.
    <registry_password>Registry password or access key (such as a token).

Viewing clusters

Log in to the Neutree management interface, click Clusters in the left navigation pane. The cluster list on the right shows all current clusters. Click a cluster name to view details. On the details page, you can view Basic and Monitor as needed.

The possible statuses during cluster operation and their descriptions are as follows:

StatusDescription
InitializingThe cluster is performing initial initialization.
RunningThe cluster is operating normally.
UpdatingThe cluster configuration has changed and the new configuration is being applied.
UpgradingThe cluster is undergoing a version upgrade.
FailedThe cluster is experiencing an error. Check node status and logs.
DeletingThe cluster is being deleted; resources are being cleaned up.

If cluster monitoring shows No data, see Kubernetes cluster monitoring shows No data to install the required components.

Editing a cluster

After creation, you can modify the routing and model cache configuration of the cluster as needed.

  1. Log in to the Neutree management interface, click the menu icon () in the cluster list or details page, and select Edit.

  2. On the configuration page, modify as needed. For parameter descriptions, see Creating a cluster.

  3. After confirming the configuration is correct, click Save to complete editing.

Deleting clusters

You can delete one or more clusters at a time.

  1. Log in to the Neutree management interface, click the menu icon () in the cluster list or details page, and select Delete; or select multiple clusters in the list and click Delete above the list.

  2. In the dialog that appears, confirm again and click Delete. The selected clusters will be permanently deleted.