How It Works
Have questions about how the Nutanix Platform works? Looking to get started - start here!
- 1,178 Topics
- 1,838 Replies
Hello Community! I'd like to discuss an ambiguous issue. There is a 3-node hybrid-storage AHV cluster set to RF2. Every node has the same storage capacity and is at 70% storage usage, so overall usage is also 70%. The data resiliency status is critical. The cluster has enough CPU and memory. The problem starts here: if one node goes down, will a data rebuild occur? If it does, total cluster usage will exceed 95%, which would cause problems with the cluster's I/O. Can anyone tell me clearly how the cluster behaves when a node fails in this situation?
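The capacity concern in this question can be checked with simple arithmetic. The sketch below is only that: it assumes equal node capacities, ignores reserved space, compression, and any other savings, and says nothing about how AOS actually decides whether to rebuild. The function names are made up for illustration.

```python
def usage_after_node_loss(nodes: int, usage_fraction: float, failed: int = 1) -> float:
    """Usage of the surviving nodes if all currently stored data had to fit on them.

    Assumption: every node has the same capacity, and the stored data
    (all replicas included) stays the same size after the rebuild.
    """
    if not 0 < failed < nodes:
        raise ValueError("failed must be between 1 and nodes - 1")
    total_data = nodes * usage_fraction      # measured in "node capacities"
    return total_data / (nodes - failed)


def max_usage_for_full_rebuild(nodes: int, failed: int = 1) -> float:
    """Highest per-node usage at which the data still fits on the survivors."""
    if not 0 < failed < nodes:
        raise ValueError("failed must be between 1 and nodes - 1")
    return (nodes - failed) / nodes


if __name__ == "__main__":
    # The scenario from the post: 3 nodes, each 70% full, one node lost.
    print(f"{usage_after_node_loss(3, 0.70):.0%}")    # 105%
    print(f"{max_usage_for_full_rebuild(3):.1%}")     # 66.7%
```

By this arithmetic alone, the data from three nodes at 70% would need about 105% of the two surviving nodes, so a complete rebuild would not fit; usage would have to be at or below roughly 66.7% per node for it to fit.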
Self-Service Restore (SSR) lets you open and copy a previous version of a file. For SMB, you can use SSR to restore files. Enable or disable SSR through the Prism Web Console. SSR for SMB does not restore streams or attributes in directories. Files does not support SSR at the root of distributed shares or exports. Files takes snapshots of the stored cluster data at the share/export level. SSR exposes these snapshots to the share or export and lets you restore a file from any of the previous snapshots without an administrator. SSR is disabled by default, but you can enable it during or after share or export creation. By default, Files creates a snapshot every hour and retains the most recent 24 snapshots. By default, Files deletes the oldest SSR snapshot after exceeding the retention count for the snapshot type. The snapshot retention count corresponds to the retention period. SSR has the following retention periods: 24 hours of hourly snapshots. 7 days for daily s
Below are new knowledge base articles published on the week of April 11-17, 2021.
- KB 10388 - Alert - A801102 - L2StretchInvalidAncVersion
- KB 10390 - Alert - A801104 - L2StretchVpnConnectionNotFound
- KB 10391 - Alert - A801105 - L2StretchRemoteAzUnreachable
- KB 10392 - Alert - A801106 - L2StretchSubnetNotFound
- KB 10393 - Alert - A801107 - L2StretchCidrMismatch
- KB 10394 - Alert - A801108 - L2StretchDhcpPoolOverlap
- KB 10395 - Alert - A801109 - L2StretchLocalIfConflict
- KB 10396 - Alert - A801110 - L2StretchRemoteIfConflict
- KB 10604 - Alert - A801111 - L2StretchIpConflict
- KB 10718 - NCC Health Check: node_storage_tier_skew_check
- KB 11034 - How to query the CVM, backplane, host, and IPMI IP address mappings from CVM CLI
- KB 11063 - How to convert the certificate from PKCS #7 to PEM format if openssl commands fail
- KB 11091 - "java.lang.NullPointerException" error during update of virtual switch in Prism UI
- KB 11101 - How to disable Prism Ultimate Trial license in Prism Central
- KB 11106 - Xi-Frame
Below are new knowledge base articles published on the week of April 18-24, 2021.
- KB 8185 - NCC Health Check: bond_uplink_usability_check
- KB 9366 - Alert - A802001 - AncServiceUnresponsive
- KB 10089 - NCC Health Check: ovs_bond_config
- KB 10128 - NCC Health Check: node_schedulable_check
- KB 10389 - Alert - A801103 - L2StretchInvalidVpnVersion
- KB 10783 - Enabling IAMv2 with Prism Central (PC) version pc.2021.3 when upgrading from a previous version of PC with only CMSP enabled
- KB 11020 - Cloning or storage VMotion operations to different destination hosts cause long stun times
- KB 11080 - Prism Central - Role Assignment is not visible for an existing role
- KB 11086 - Failed to Perfom Inventory with LCM 2.4.1 for XC Series due to parsing error of /bin/python /opt/dell/DellPTAgent/tools/pta_cfg get rest_ip.
- KB 11115 - Nutanix Files - Cerebro to Smart DR Migration
- KB 11138 - Nutanix Files - Potential loss of connectivity to FSVM Remote Console Login and SSH
- KB 11157 - Foundation 5.0 - Lenovo/Fuj
Hi! We are running about 100 VMs in our cluster, so finding a specific piece of information manually for each VM is pretty hard. I want to find out which VMs are stored in a specific storage container, using acli on a CVM. I know that I could run acli vm.get <vm-name> and look for the source_nfs_path of the vmdisks, but that is far too much effort for 100 VMs. Another way would be to connect via SSH to /storage-container/.acropolis/vmdisk on the CVM, but I only see the vmdisk UUIDs there, not the VM names I need. Is there any possibility, maybe with a for loop over each VM and a grep filter?
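One way to approach the "loop plus filter" idea is to collect the text output of acli vm.get for each VM and scan it for source_nfs_path. The sketch below covers only the filtering step and rests on assumptions: that each path appears as `source_nfs_path: "/<container>/..."`, and that every disk of interest reports that field. The sample text and the helper names `containers_used` and `vms_in_container` are invented for illustration and are not real acli output or Nutanix functions.

```python
import re
from typing import Dict, List, Set

# Assumed line shape: source_nfs_path: "/<container>/.acropolis/vmdisk/<uuid>"
_PATH_RE = re.compile(r'source_nfs_path:\s*"?/([^/"\s]+)/')


def containers_used(vm_get_output: str) -> Set[str]:
    """Return the container names found in one VM's `acli vm.get` text."""
    return set(_PATH_RE.findall(vm_get_output))


def vms_in_container(outputs: Dict[str, str], container: str) -> List[str]:
    """Return the names of VMs with at least one disk in `container`.

    `outputs` maps a VM name to the captured text of `acli vm.get <vm-name>`.
    """
    return sorted(
        name for name, text in outputs.items()
        if container in containers_used(text)
    )


if __name__ == "__main__":
    # Made-up sample text, shaped like the assumed output format.
    sample = {
        "web01": 'disk_list {\n  source_nfs_path: "/ctr-a/.acropolis/vmdisk/1111"\n}',
        "db01": 'disk_list {\n  source_nfs_path: "/ctr-b/.acropolis/vmdisk/2222"\n}',
        "app01": 'disk_list {\n  source_nfs_path: "/ctr-a/.acropolis/vmdisk/3333"\n}',
    }
    print(vms_in_container(sample, "ctr-a"))  # ['app01', 'web01']
```

The `outputs` dictionary would have to be filled by running acli vm.get once per VM on the CVM and capturing the text; how the VM names are enumerated is left open here.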
Below are the top knowledge base articles for the month of April 2021.
- KB 7503 - NX Hardware [Memory] – G6, G7 platforms - DIMM Error handling and replacement policy
- KB 4141 - Alert - A1046 - PowerSupplyDown
- KB 1540 - [AOS Only] What to do when /home partition or /home/nutanix directory on a Controller VM (CVM) is full
- KB 4409 - LCM: (Life Cycle Manager) Troubleshooting Guide
- KB 1113 - HDD or SSD disk troubleshooting
- KB 4158 - Alert - A1104 - PhysicalDiskBad
- KB 2090 - AHV host networking
- KB 2473 - NCC Health Check: cvm_memory_usage_check
- KB 4519 - NCC Health Check: check_ntp
- KB 6945 - How Upgrades Work at Nutanix
- KB 4872 - LCM: How to cancel an ongoing LCM update operation using the "Stop Update" feature
- KB 1863 - NCC Health Check: sufficient_disk_space_check
- KB 7386 - NCC Health Check: power_supply_check
- KB 9937 - Alert ID 111066 - Failed to send alert emails
- KB 9261 - NCC Health Check: dimm_size_reduction_check
- KB 10908 - CVM in Boot Loop After Reboot / Upgrade
- KB 3741 - Nutanix Guest
Below are new knowledge base articles published on the week of May 2-8, 2021.
- KB 10315 - Unable to boot to Windows after P2V conversion to AHV
- KB 11030 - Node add fails with VMD-enabled nodes on AOS 5.19.1 or 18.104.22.168
- KB 11040 - Alerts page in Prism displays entity type in Source Entity column instead of the name of the entity
- KB 11121 - "Host restore VM locality" task disappears in Prism if the VM migrations fail
- KB 11173 - Xi Frame - Cannot send email on port 25 in a Frame VDI session hosted in AWS
- KB 11204 - Metadata IO latency spikes intermittently after Nutanix Files upgrade to 3.8
- KB 11215 - CVM fails to boot after UEFI firmware upgrade on Lenovo nodes
- KB 11218 - Memory hot-add to a UVM running on AHV is rounded up to the next GB
- KB 11228 - High metadata IOPS observed after Nutanix Files upgrade to 3.8
- KB 11248 - LCM 22.214.171.124 fails with error "Foundation Api [reboot_to_phoenix] failed" after Foundation 5.0 upgrade
- KB 11256 - LCM: Lenovo SATADOM FW upgrades not showing in InventoryN
Below are new knowledge base articles published on the week of May 9-15, 2021.
- KB 10162 - NCC Health Check: disk_power_check
- KB 10204 - NCC Health Check: esx_product_locker_setting_check
- KB 10414 - NCC Health Check: ahv_file_integrity_check
- KB 10420 - NCC Health Check: stale_recovery_points_check
- KB 10461 - NCC Health Check: aide_check
- KB 10524 - NCC Health Check: marker_files_check
- KB 10532 - NCC Health Check: recovery_plan_consistency_group_checks
- KB 10735 - NCC Health Check: rsyslog_forwarding_check
- KB 10811 - NCC Health Check: remote_log_forwarding_check
- KB 10864 - Alert - A130347 - ExternalDatasourceStorageMigrationSuspendedAlert
- KB 11141 - Secure Boot Causes Device Re-enumeration in Windows Guest VMs on AHV
- KB 11212 - Nutanix Move fails to prepare Windows Server 2008 R2 VM for migration from Azure
- KB 11266 - Cassandra fails to be active after one metadata SSD goes bad in a multi SSD node
- KB 11273 - Hyper-V 2019 upgrade - SKU option is missing in Upgrade Software page
- KB 11323 - Nu
AOS 5.20 delivers performance enhancements that build on the breakthroughs in AOS 5.19 and expands on built-in key management capabilities for keeping data encrypted and secure. AOS 5.20 also increases the portability of VMs running on the built-in hypervisor AHV, streamlines advanced management capabilities, and more. A complete list can be found in the Release Notes.
Nutanix Insights: Available with all new AOS releases is Nutanix Insights. Most enterprise IT solutions rely on a reactive approach to system maintenance and issue resolution. For example, when a technical issue arises, vendor support teams typically capture detailed system data from the customer and recreate the issue in a separate environment; only then can the actual debugging begin. This approach consumes unnecessary time and resources and ultimately delays resolution. Nutanix simplifies and streamlines this process through two important support services: Nutanix Pulse and Nutanix Remote Diagnostics. When enabled, Puls
Below are new knowledge base articles published on the week of May 16-22, 2021.
- KB 11189 - How to secure the bootloader with a user-defined password
- KB 11190 - RHEL STIG requirement for actions when audit storage is full
- KB 11191 - How to configure CVM to use nutanix user on AHV instead of root
- KB 11192 - How Envoy handles downstream service failures
- KB 11193 - New users are added to AHV: "nutanix" and "admin"
- KB 11252 - SSL Certificates and the Secure Gateway Appliance for Frame
- KB 11312 - AOS upgrade notification shows 5.20.x as Short Term Support (STS) release instead of Long Term Support (LTS) release
- KB 11321 - Calm VM image upload to vCenter7 fails with error "Unable to retrieve manifest or certificate file."
- KB 11333 - Adding Nutanix Objects as Secondary Storage with Veritas Enterprise Vault
- KB 11334 - AHV/ESXi Standalone Foundation deployment failure on HPE DX AMD platforms with Mellanox NIC CX4 or CX5
- KB 11346 - Nutanix Files - Inode usage high on FSVM
- KB 11364 - CVE-2020-13946
Below are new knowledge base articles published on the week of March 21-27, 2021.
- KB 10516 - [ Karbon ] PE cluster is showing alerts for VGs used by the Kubernetes cluster(s)
- KB 10651 - NCC Health Check: metering_rest_connection_check
- KB 10768 - A number of VMs may be missing from the list when monitoring cluster using SNMP protocol
- KB 10813 - "UnicodeDecodeError" and "UnicodeEncodeError" for VM operation
- KB 10936 - Duplicate scheduled reports triggered after Daylight Savings Time (DST) change
- KB 10946 - Identifying the source IP generating TCP Reset packets in a network path
- KB 10953 - Using Nutanix Objects Self-Signed Certificate with Veritas Enterprise Vault
- KB 10954 - SMCIPMITool commands output "The node product key needs to be activated for this device" on BMC 7.10
- KB 10967 - Cloning a Secure boot enabled VM on AHV with the "Custom Script" option enabled fails with "q35 machine type does not support ide bus type" error
- KB 10976 - Cluster instability after upgrading both primary an
Failures are a part of everything, and Nutanix clusters are not immune to them. But how we plan for failures determines the versatility of a product, or of a person for that matter! Nutanix groups failures into availability domains based on the type of failure. Nutanix provides the ability to tolerate rack failure for extended data availability, in addition to drive, node, block, and network link failure.
Node Failure
A Nutanix node comprises a physical host and a Controller VM (CVM). Either of these components can fail without any impact on the Nutanix cluster.
CVM Failure
When a CVM fails, an alert is generated in Prism and the storage path on the affected host is redirected to another CVM. Reads and writes occur over the 10GbE network until the CVM comes back online. It is business as usual for the end customer, with perhaps a slight performance decrease.
Physical Host Failure
If a node fails, all HA-protected VMs can be automatically restarted on other nod
Hardware failures are a part of any datacenter lifecycle. The Nutanix architecture was designed with this inevitability in mind. A cluster can tolerate one or two failures (depending on the replication factor of the cluster or container) of a variety of hardware components while still running guest VMs and responding to commands through the management console. Many of these failures also trigger an alert through that same management console in order to give the administrator a chance to respond to the situation.
Nutanix provides the ability to tolerate rack failures for extended data availability, in addition to drive, node, block, and network link failure.
Block fault tolerance lets a Nutanix cluster make redundant copies of data and metadata and place the copies on nodes in different blocks.
A block is a rack-mountable enclosure that contains one to four Nutanix nodes. All nodes in a block share power supplies, front control panels (ears), backplane, and fans.
Nutanix offers block fault
Below are the top knowledge base articles for the month of March 2021.
- KB 7503 - NX Hardware [Memory] – G6, G7 platforms - DIMM Error handling and replacement policy
- KB 4141 - Alert - A1046 - PowerSupplyDown
- KB 1540 - What to do when /home partition or /home/nutanix directory on a Controller VM is full
- KB 1113 - HDD/SSD Troubleshooting
- KB 4409 - LCM: (Life Cycle Manager) Troubleshooting Guide
- KB 9937 - Alert ID 111066 - Failed to send alert emails
- KB 4158 - Alert - A1104 - PhysicalDiskBad
- KB 6945 - How Upgrades Work at Nutanix
- KB 2090 - AHV host networking
- KB 2473 - NCC Health Check: cvm_memory_usage_check
- KB 3784 - Alert - A1030 - StargateTemporarilyDown
- KB 4519 - NCC Health Check: check_ntp
- KB 5582 - NCC Health Check: idf_db_to_db_sync_heartbeat_status_check
- KB 3741 - Nutanix Guest Tools Troubleshooting Guide
- KB 1863 - NCC Health Check: sufficient_disk_space_check
- KB 7386 - NCC Health Check: power_supply_check
- KB 6153 - NCC Health Check: default_password_check and pc_default_password_
The recycle bin feature is available from AOS 5.18 onwards. With help from Nutanix Support, the recycle bin tool helps you to restore deleted storage entities (guest VMs and volume group vDisks) and manage the recycle bin itself.
After you delete a guest VM, the configuration file and disk remain in the recycle bin for up to 24 hours. After 24 hours, these files are deleted. The files are deleted in less than 24 hours if your cluster is unable to maintain sufficient free disk space.
Recycle Bin Limitations and Guidelines
- The recycle bin stores vDisk and configuration data for up to 24 hours. After 24 hours, these files are deleted. The files are deleted in less than 24 hours if your cluster is unable to maintain sufficient free disk space.
- Recycle bin is not supported on storage containers where metro availability is enabled.
- Recycle bin is not available for recovering protection domain snapshots.
- You can disable and enable the recycle bin or clear its contents.
- As a default, the recycle
Below are new knowledge base articles published on the week of March 28-April 3, 2021.
- KB 8378 - NCC Health Check: Check Interface Configuration Files
- KB 10075 - NCC Health Check: ngt_client_cert_expiry_check
- KB 10743 - NCC Health Check: pulse_enablement_checks
- KB 10966 - Using storage_container_reference functionality with Nutanix provider for Terraform fails with error "SPEC_INCOMPATIBLE_ERROR" or "INVALID_REQUEST"
- KB 10980 - Hyper-V VM created in Windows Server 2008 R2 or older may fail to be migrated by Move
- KB 10998 - NCC ipmi_checksf fail after upgrading ESXi to version 7.0
- KB 11018 - LCM-driven AHV upgrade may fail due to missing /bin/bash binary
- KB 11026 - Fujitsu hardware platform BIOS power setting preventing NVIDIA AHV vGPU driver loading
- KB 11031 - Agentless Management Service (AMS) version 11.4.0 filling up /tmp on HPE ESXi hosts
Note: You may need to log in to the Support Portal to view some of these articles.
I want to gather data for the following stats metrics using the REST API v2. I understand that I need to pass the metric IDs as parameters; is that correct? In addition, can anyone tell me the IDs for the following stats metrics?
- Disk Usage (%)
- GPU Framebuffer Usage
- GPU Usage
- GPU Video Decoder Usage
- GPU Video Encoder Usage
- Memory Usage (%)
- Network Rx Bytes
- Network Tx Bytes
Thanks!
Hi, I'm looking to automate some reports using the REST API. So far I can successfully pull CPU, RAM, and physical and logical storage cluster-wide, plus snapshot details, but I am unable to calculate the values shown on the capacity runway when viewing a cluster in PC, specifically the storage usage figures for 'snapshots' and 'system'. Does anyone know how these are calculated? Thanks
Below are new knowledge base articles published on the week of April 4-10, 2021.
- KB 9952 - NCC Health Check: list_vms_with_qos_attrs
- KB 10350 - NCC Health Check: rdma_enabled_check
- KB 10482 - Getting Error Message "You've uploaded an invalid cluster summary file" When Licensing a Cluster
- KB 10668 - NCC Health Check: list_containers_being_converted_to_aes
- KB 10955 - Create Category in Prism returns "The name you entered is in use" even though the name does not show up in the list of categories
- KB 11010 - Adding Nutanix Objects as Primary or Secondary Storage with Veritas Enterprise Vault
- KB 11041 - Nutanix Files share not accessible after performing Nutanix Files upgrade or rebooting File Server VMs
- KB 11051 - Message "kvm: already loaded the other module" when booting into Phoenix
- KB 11054 - Expand cluster with node reimaging on AOS 5.19.1 and ESXi 7.0u1 fails
- KB 11057 - Cluster creation fails on HPE nodes when using generic VMWare ESXi image
- KB 11059 - Unable to enable HA or HA in crit