Installation & Configuration
This forum is the best way to get up and running with the Nutanix platform
- 1,181 Topics
- 3,238 Replies
Can someone let me know how I restart the upgrade on the failed host? I cannot seem to find any restart/retry button.The host failed to go into maintenance mode and now upgrade says aborted on this one host.I have fixed the cause of maintenance mode now so it will be fine.
Dears:We have a 3 node pure NVME SSD clusters with RF 2. Now we plan to expand more 5 nodes and change the RF from 2 to 3. We have a little worried about the performance on :(1). During conversion phase from 2 copies to 3 copies.(2). For BAU usage, if RF3 performance will slower than RF2 because every write need do 3 times compared of 2 times. I got the question 1 answer from Nutanix Resiliency – Part 2 – Converting from RF2 to RF3 | CloudXC (joshodgers.com) and want to see any more experience. For question 2, any experiences could be shared? Thank you very much! P.S. We have enabled compression feature.
I have figured out that the version of the kernel is 4.19.100. I have also figured out that this version is not working with my NUC11 and the i225-V NIC. I would like to replace that kernel with the 5.15 kernel as the 5.15 kernel works with the i225-V NIC. Does anyone have a procedure on how to replace the kernel in the ISO? Any help would be greatly appreciated. Thanks,Steve
I have a used NX-3060-G5, with 4 nodesI am trying to install the new CE 2.0 on itEach node is configured with the e5-2650v4 x24x32 sticks of ramBMC version 3.94BIOS Version G4G5t8.0They all have the same 64gb statadoma 512GB SSD and a 1TB HDDI had used nodes a and b as test nodes orginally, then i tried to reinstall the new CE 2.0 on them and i keep getting failures at 2402/2430 hypervisor installation in progress with the read out to please take a look at the installer_vm_*.log inside foundation logs to debug hypervisor installation, then it restarts the install process and repeats. 2 of the nodes this installed on just fine, but they were not a reinstall of CE, they had vmware running on them previously.I have tried installing the latest supermicro bios and get the same issue, i have screenshots of the logs that i could find. Not sure if i can post them on here. Just seeing if anyone may be able to assist or point me in the right direction to find out why this keeps failing when al
Hello everyone,I have to install the first Nutanix cluster from a customer but I can't figure out how to setup the network.All customer switches (Aruba 2930, 2920) have the native vLAN (vLan1) dedicated to workstations and servers, while the management network is vLan ID 10 where the ESX, storage controllers, backup NAS and the Veeam server backing up the current infrastructure.This is the situation of all my customers, vLAN10 is completely isolated and unreachable from the native vLAN1, this was done to prevent any ransomware from affecting the backups as well.Now, Nutanix recommends configuring the CVM and hypervisor host VLAN as the native, or untagged, VLAN on the connected switch ports.I'm sure it would be nice to have all of the management in the native, adding a node would be much easier and I think the simpler things are kept the better.Furthermore, vLAN10 is propagated on many remote racks because obviously the backup NAS do not reside in the same racks as the servers, how can
Hi, I’m new to Nutanix, coming from the vSphere world.I am working on getting our new Nutanix infrastructure set up to run Qualys scans for security. I have been looking at the following document:https://portal.nutanix.com/page/documents/kbs/details?targetId=kA07V000000LXYqSAOI am getting stuck at determining which accounts to use to set up authentication, so that Qualys can run its scans on the Hypervisors, CVMs, and Prism Central appliances. My research shows that adding new accounts via the OS to these components is not supported. Does that mean that Qualys then has to log in as root, nutanix, and admin respectively, to scan these components? Or am I looking in the wrong place for the Qualys account to be set up, to scan for vulnerabilities and compliance? I feel like I'm missing something here. Thanks!
Hi allI have a cluster with vswitch active - activeI have new nodes to expand and I did theses steps below 1 - root@ahv# ./network_configuration to configure ips and vlan tag2 - Configure lacp 2.1 ssh email@example.com "ovs-vsctl set port br0-up other_config:lacp-time=fast" 2.2 ssh ovs-vsctl set port br0-up other_config:lacp-fallback-ab=true 2.3 ssh firstname.lastname@example.org "ovs-vsctl set port br0-up lacp=active" nutanix@CVM$ ssh 2.4 ssh ssh email@example.com "ovs-vsctl set port br0-up bond_mode=balance-tcp"3 - Expand cluster The nodes were joined success in the cluster, but when I checked the vswitch configuration of theses nodes was changed to active - backupDid I do something wrong? What better way? Thank you
I ma installing remotely Nutanix AHV cluster in very isolated environment, where for everything had to request firewall rules. IPMI is in different non-routable subnet, other that AHV and CVM. Firewall team confirmed that they don’t see any blocked traffic, but Foundation fails with error like this:2023-04-07 18:25:48,759Z WARNING Failed to register internal hypervisor: <Left ("Unable to satisfy any required capabilities for '<ExportedObjectDescriptor "command##.BindInternalHypervisor-1.0.0">'") at 0x52d10f8>2023-04-07 18:25:48,759Z INFO Tartarus initialization complete2023-04-07 18:25:48,996Z DEBUG Failed to load all plugins: Unable to satisfy required capability 'OPTIONAL_SMBIOS' for <ExportedObjectDescriptor "command##.LenovoRedfish_Pyghmi-1.0.0"> Skipping command##.GenericRedfish_Pyghmi-1.0.0, entity is disabled Unable to satisfy required capability 'OPTIONAL_SMBIOS' for <ExportedObjectDescriptor "command##.YadroRedfish_Pyghmi-1.0.0"> Unable to satisfy requi
Hello FriendsHow are you? Currently i am trying to foundation A Nutanix 3 node environment but after the foundation process begins it will halt at “waiting for the installer to boot” stage with “fatal” error warning i don’t know what is causing the error can anyone tell me something.I have attached screenshots of errors and process.
The NX G6 demo equipment is currently updated with the latest firmware. If you try to install version 5.10 to test the AOS upgrade with the latest version of the foundation, the 4. foundation version will not mount, and the 5. latest foundation version will not configure CVM after rebooting after installation and installation will fail. Is it like this originally?
We are triying to install a new Nutanix Cluster with 2 nodes Lenovo HS1021. The nodes machine type is 7D20. When we launch the installation with the Foundation, the use ends in failed and we see the following errors:WARNING Failed to register internal hypervisor: <Left ("Unable to satisfy required capability 'HYPERV_VERSION' for <ExportedObjectDescriptor "command##.BindInternalHypervisor-1.0.0">") at 0x7efbc4d82bd8>2023-03-14 17:15:27,733Z WARNING Skipping <ImagingStepPreInstall(<NodeConfig(X.X.X.X) @35d0>) @80d0> because dependencies not met2023-03-14 17:15:27,731Z WARNING Skipping <ImagingStepRAIDCheckPhoenix(<NodeConfig(X.X.X.X) @35d0>) @d150> because dependencies not met, failed tasks: [<ImagingStepInitIPMI(<NodeConfig(X.X.X.X) @35d0>) @d390>]2023-03-14 17:15:27,736Z DEBUG Setting state of <ImagingStepPhoenix(<NodeConfig(X.X.X.X.) @35d0>) @67d0> from PENDING to NR2023-03-14 17:15:27,736Z WARNING Skipping <ImagingStepPho
There are still many places where the old version is being used.Even if you try to install it with the previous version (AOS 5.10 or lower version) of the foundation on the G6 device to test the upgrade, it cannot be installed.Is it because all the firmware is up to date? It was the version that was originally installed. AOS 5.10 versions are not installed now?In foundation the log is "Device 2 :The Length of Name is not correct" The phrase continues to print. Please reply from those with experience.
Hi AllWhen I try to install AOS 6.5.2 I get the following errors. IOError: [Errno 2] No such file or directory: '/etc/nutanix/factory_config.json' 2023-03-07 13:19:54,652Z CRITICAL svm_rescue:926 No suitable SVM boot disk found. 2023-03-07 13:19:54,652Z INFO svm_rescue:114 exec_cmd: sync; sync; sync 2023-03-07 13:19:54,658Z INFO svm_rescue:114 exec_cmd: umount -R /mnt/disk 2023-03-07 13:19:54,663Z INFO svm_rescue:114 exec_cmd: umount -R /mnt/data] 2023-03-07 13:19:52,159Z INFO Imaging thread 'svm' failed with reason [None] 2023-03-07 13:19:52,164Z CRITICAL Imaging thread 'svm' failed with reason [None] 2023-03-07 13:19:52,200Z ERROR Exception in running <InstallHypervisorKVM(<NodeConfig(172.16.150.9) @b5d0>) @ee10> Traceback (most recent call last): File "foundation/imaging_step.py", line 161, in _run File "foundation/imaging_step_hypervisor.py", line 47, in run File "foundation/imaging_step.py", line 353, in wait_for_event StandardError: Received "fatal" in waiting for ev
My 3 node cluster consiting of 1065-G5 nodes is coming EOL this year. I will be replacing the hardware with a NX-3360N-G8 cluster. What is the recommended workflow for replacing the hardware? One thing to note is the current cluster is licensed with Prism Pro and the new one will be licensed with Starter as I’m not using all the features in Prism Pro.Is the replacement as easy as expanding the cluster one node at a time, migrating the workloads and then removing the old cluster? Will there be any hiccups with difference in licensing?
Hey guysI am reinstalling three Lenovo HX5521 nodes and I got stuck on the below errorFoundation IP not set. Try running the “set_foundation_ip_address” script on the desktopI'm running the foundation vm on the same subnet of the IPMI interfaces connected directly to an unmanaged switch, without VLANs or any other configuration. connectivity is perfect. I already reviewed all the settings and tried to reimage foundation using ESXi and AHV. Both result in the same error.Im running;Foundation_VM-5.2.2 AOS euphrates-5.20.3 LTS VMware-ESXi-7.0.1 or AHV-20201105.2244I've been trying to solve this problem for three days now, but so far I haven't found any clues. Any help will be greatly appreciated. Thanks! 👷🏽
Using phoenix to reimage a node after a failed satadom, after phoenix is loaded, I uploaded AHV ISO so it can proceed to install AHV/AOS. It turns out I uploaded the lcm_ahv_el7.nutanix.20201105.2267.tar.gz instead of the AHV-DVD-x86_64-el7.nutanix.20201105.2267.iso. The install is now hung at 75% Node discovery succeeded. I have tried to install the proper iso, but it immediately failed at that same progress. How can I cancel this install so I can try again?I did find this error in the foundation log. It looks like the error happens if you use the html 5 client to mount the phoenix iso, but I am using the java client.
Hello, i have nutanix 3060g8 i have error:Committed memory update intent that is stuck on 50% i tried to reboot cvm and all nodes but i still have same problem.cvm memory are 64gb i tired:ecli task.list include_completed=noTask UUID Parent Task UUID Component Sequence-id Type Statusf0ffcb92-96b1-426f-51d7-204729b269fe kGenesis 1 kCvmreconfig kRunning progress_monitor_cli --entity_id="f0ffcb92-96b1-426f-51d7-204729b269fe" --deleteany suggest ?
Hello eveyone,following my last post I eventually got my C node to be alive. (it was not booting nor being visible whatever I tries, always down).I can ping it with both IPv4 and IPv6 addresses, connect to it in SSH, run commands etc.Now my situation is : while I was struggling with my node down, I removed it from the cluster with Prism. I think I could bring it back after… But it does not work.As you can see here, the C node (#3) is now missing :I use the “Expand cluster” tools in Prism Element, I add the node manually, it is detectedI check the mode, validate and it starts to expand.But after a few minutes I get the error :Failure in pre expand-cluster tests. Errors: Failed to get HCI node info using discovery It seems like the cluster refuse to consider the node as “free”, or the node itself refuse to join because it thinks that it is still in the cluster.Thank you very much for any help you could provide
Hello, after a firmware upgrade, one host is locked DOWN in maintenance mode :CVM: 192.168.131.132 DownI can run a command to exit maintenance mode but it is not working and it is "Removed from metadata store" :nutanix@:~$ ncli host edit id=7 enable-maintenance-mode=falseId : …Hypervisor Address : 192.168.131.122Host Status : NORMALOplog Disk Size : 394 GiB (423,054,278,649 bytes) (3.9%)Under Maintenance Mode : false (ncli_manual)Metadata store status : Node is removed from metadata store...So I tried to recover it but the script fails :nutanix@:~$ python /home/nutanix/cluster/bin/lcm/lcm_node_recovery.py 192.168.131.122Recovering node 192.168.131.122Checking if the node 192.168.131.122 is in phoenixCurrent node status host Node 192.168.131.122 out of phoenix modeBringing host None out of maintenance mode Successfully put host None out of maintenance mode Bringing CVM 192.168.131.122 out of maintenance modeTraceback (most recent call last): File "/home/nutanix/cluster/bin/lcm/lcm_node_
Hi all, I’m wondering if someone can help me out with the first install of a nutanix CE single cluster with esxiThe installation went fine but, and followed this link to make it work. https://vmik.net/2021/01/26/nutanix-ce-install-esxi-2021/I see that only have 1 disk in the storagepool. Is there a way to add a disk into the CVM? or do I have to mount it first. can someone help me out? many thanks.
My hardware - NX-6035-G4 running 126.96.36.199 LTS and is no longer under support. It is one of 8 in the cluster. We use this as a backup target for our two other production clusters using protection domainsI was attempting to replace a satadom because I got this error satadom has worn out - PE cycles above 4500 or PE cycles above 3000 and daily PE cycles above 15.I followed the instructions on this page to get things started, but after 8 hours of trying to clone the satadom disk the process timed out - put the node in maintenance mode and detached it from the Metadata Store. I took the node out of maintenance mode to add it back to the Metadata Store, but the option has not popped up in PE. Here is the output from an NCLI host list. Id : Uuid : Name : IPMI Address : Controller VM Address : Controller VM NAT Address : Controller VM NAT PORT : Hypervisor Address : Host S
Login to the community
Login with your account
Enter your username or e-mail address. We'll send you an e-mail with instructions to reset your password.