Question

One or more cluster services are not healthy.

  • 12 September 2023
  • 24 replies
  • 179 views

Badge +2
Good afternoon, I have all the machines stopped and I cannot enter Nutanix, I have received this error by email, can someone tell me what the problem is and some clues to solve it. thank you

 

 

El servicio 'Cluster Health' ha caído en 192.168.8.99 en el clúster NTNXDEV01.

 

Impacto: Los servicios ofrecidos por el servicio 'Cluster Health' se verán afectados.

Causa: uno o más servicios de clúster no son correctos.

Solución: Póngase en contacto con el soporte de Nutanix.

 

ID de bloque : 676e88ee

ID del nodo: 4RGN3K2

ID del clúster : 548454096355313168

Uuid del clúster : 0005fe14-c435-64ba-079c-801844ddc610

Nombre del clúster : NTNXDEV01

Versión del clúster: nutanix-core-el7.3-release-ce-2020.09.16-stable-d4fc219b73b4181935a3a19465eb922313fc735f

UUID de nodo: 4331634c-64bc-46f4-9bcf-5abdcfc88dda

IP del nodo: 192.168.8.99

Ips del clúster : 192.168.8.99

Marca de tiempo : Tue Sep 12 12:35:04 2023


24 replies

Userlevel 5
Badge +8

Try to start the cluster with: cluster start

and try to access the prism gui again. 

Badge +2

 

Userlevel 5
Badge +8

You nood to ssh into the cvm and not the host. 
 

Check is cvm is running with virsh list —all

if not running start it. 

Badge +2

 

Badge +2
The CVM does not start, it is at 192.168.8.99

 

Userlevel 5
Badge +8

On your screenshot you can see the CVM is running. 

ssh into that cvm and and enter cluster start.

 

Are you running Communiy Edition? Do you have a 1 or =>3 node cluster? 

Badge +2
I can't access the IP of that CVM through ssh, that's my problem =(

Are you running Communiy Edition? yes

Do you have a 1 or =>3 node cluster? 1

Userlevel 5
Badge +8

Ah you are in the wrong subforum. ;) 

 

What you can do is the following (assuming it had worked before and nothing is changed): 

or:

  • shutdown the cvm via the ahov host: virsh shutdown NTNX-676e88ee-A-CVM
  • reboot the host via shutdown -r now

or:

  • reinstall the whole community edition node. 

 

Badge +2

 I have several virtual development machines and I need to get information... I don't understand what happened, suddenly, it is not accessible through AHV, which is IP 192.168.8.99 I only access host 192.168.8.98....and I can't do anything

 

 

After trying this... do you suggest I reinstall CE?
Userlevel 5
Badge +8

I dont get it ;) 

 

Let me summarize:

  • AHV/Host = 192.168.8.98
  • CVM = 192.168.8.99

Correct?

And you cant access the CVM via SSH?

Did you already tried rebooting the box? Is the cvm pingable? 

Badge +2

Let me summarize:

AHV/Host = 192.168.8.98

CVM = 192.168.8.99

Correct?

Correct And you can't access the CVM via SSH?

Yes, I cannot access the CVM via ssh or ping it, I can only access the AHV/host via ssh

Have you already tried restarting the box? Yeah

Can the cvm be pinged? No Let's see if you could help me hehe,

I've been crazy all morning! =)

Userlevel 5
Badge +8

Did it worked before?

Is the node installed bare metal? Or nested? (so is it a virtual machine in esx?)

 

Badge +2

Nothing I've tried has worked. It is installed on premise on a Dell Poweredge R720 server with 4 TB SSD This server is a development server, in addition to this one I have a 3-node claster with its license, but that one works without problems.

Userlevel 5
Badge +8

Oke I think the best is to start again. Looks like the CVM is broken. 

Badge +2

And all my machines? =(

Userlevel 5
Badge +8

And all my machines? =(

You said “nothing you tried has worked”. So the node was working before???? 

 

It is community edition, so you should not run production virtual machines on it. ;)

 

Badge +2

Sorry, maybe I didn't explain myself well before, sorry. I mean everything I've tried to fix it hasn't worked. In summary, in Nutanix CE it worked perfectly and I had 5 machines running without problems, but for some reason that I don't know I can no longer enter and all the machines appear turned off.

Userlevel 5
Badge +8

Oke lets start over ;) 

 

On the ahv host check if the cvm is running: virsh list --all

Then try to ping the cvm: ping 192.168.5.2 (Yes, that specific ip address)

When it replies ssh into it: ssh admin@192.168.5.2

When in: Check ip addresses with ifconfig

Do you see the correct ip-address for the cvm? And is that not available from ahv host? Then I would reinstall the box as troubleshooting would probably takes to much time. 

If that ip-address is accessible from ahv but not from you lan then you should look into firewalls or something.  

if that ip is accessible from ahv and lan start your cluster: cluster start

(I assume you are using the cluster vip and not the cvm ip)

Badge +2

 

Userlevel 5
Badge +8

Yeah so the cvm is broken (192.168.5.2 is cvm. 5.1 is ahv) You did reboot the whole server already you said? 
 

I would suggest reinstall the box. 

Badge +2

Yes, restart as you told me. Now, I don't know how to reinstall the box. would you help me?

Userlevel 5
Badge +8

All info can be found here: https://next.nutanix.com/discussion-forum-14/download-community-edition-38417

Badge +2

Oh you mean complete installation... so I lose the VMs?

Badge +2
Thanks for everything!!! I will reinstall Nutanix ce ! all the best!

Reply