
Nutanix Enterprise AI 2.4
A powerful step forward in performance, control, and simplicity.

Nutanix Enterprise AI 2.4 introduces improvements that make managing inference endpoints faster, smarter, and more flexible across edge, on-prem, and cloud environments.

Review the detailed release notes before deploying any product updates.

🧠 Smarter Performance, Built-In

- CPU and GPU Acceleration: Choose between GPU and Intel® AMX-enabled CPU acceleration for performance tailored to your workload.
- Expanded Model Support: Pre-validated support for new Hugging Face and NVIDIA models, including Llama 3, Mistral, Stable Diffusion, and more.
- Now with NVIDIA H200 GPU Support: Tap into the next generation of high-performance AI acceleration.

🛠️ A Simpler, Smoother Workflow

- New Endpoint Creation Wizard: A guided, intuitive flow for building endpoints, with no YAML needed.
- Create Endpoints from Imported Models: Go from model import to deployment in a single click.
- Auto Model Size Detection: Model size is now detected automatically when importing from Hugging Face.
- HTTP Proxy Support: Connect to external services through an HTTP proxy, even from air-gapped or dark sites.

🔐 More Control, More Confidence

- Fine-Grained Access to Models: Control exactly which models your users can import or upload, whether via catalog, direct URLs, or manual upload.
- Search, Filter, and Discover: Improved model and endpoint search with case-insensitive matching.

📊 Better Insights at a Glance

- Enhanced Endpoint Metrics: Get visibility into model performance with new metrics:
    - Time to First Token (TTFT)
    - Time Per Output Token (TPOT)
    - Output Tokens per Second
- Kubernetes Monitoring Made Easy: A unified view of your cluster, nodes, and usage, all in one place.
- Click-Through Model Links: Quickly access Hugging Face or NIM model source pages right from your dashboard.

🧰 Support Gets Smarter

- Support Bundles On Demand: Generate support bundles from your Kubernetes cluster to streamline troubleshooting with Nutanix Support.

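For readers new to the endpoint metrics listed under Better Insights at a Glance, here is a minimal sketch of how TTFT, TPOT, and output tokens per second are commonly derived from token arrival timestamps on a streaming response. The function name, the input format, and the exact definitions (for example, measuring throughput over the whole request) are assumptions for illustration only; Nutanix Enterprise AI may compute its dashboard values differently.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class StreamMetrics:
    ttft_s: float               # time to first token, in seconds
    tpot_s: float               # mean time per output token after the first, in seconds
    output_tokens_per_s: float  # output tokens divided by total request time


def compute_stream_metrics(request_start: float,
                           token_times: Sequence[float]) -> StreamMetrics:
    """Derive latency metrics from a request start time and token arrival times.

    All times are seconds on the same clock (hypothetical input format).
    """
    if not token_times:
        raise ValueError("at least one output token is required")

    n = len(token_times)
    first, last = token_times[0], token_times[-1]

    # TTFT: delay between sending the request and receiving the first token.
    ttft = first - request_start

    # TPOT: average gap between consecutive tokens, excluding the first token.
    tpot = (last - first) / (n - 1) if n > 1 else 0.0

    # Throughput: assumed here to cover the whole request, including TTFT.
    total = last - request_start
    throughput = n / total if total > 0 else float("inf")

    return StreamMetrics(ttft, tpot, throughput)


if __name__ == "__main__":
    # Five tokens: the first after 0.5 s, then one every 0.25 s.
    print(compute_stream_metrics(0.0, [0.5, 0.75, 1.0, 1.25, 1.5]))
```

A high TTFT with a low TPOT usually points at queueing or prompt processing, while a high TPOT points at slow decoding, which is why the two are worth tracking separately.
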
Minimum Requirements

- Software: Kubernetes 1.31, NVIDIA GPU Operator v25.3.0, NVIDIA Toolkit v1.17.5, and more.
- Supported GPUs: NVIDIA L40S, A100, H100, H100-NVL, and H200.

For full software and GPU requirements, see the Nutanix Enterprise AI Guide.

🎯 Final Thought

With this release, Nutanix Enterprise AI becomes even more robust, giving enterprises the tools they need to securely deploy and monitor GenAI workloads at scale, with greater control and visibility than ever before.

Continue the conversation on the Nutanix Cloud Platform for AI forum.