Question

Unable to install Prism Central


Badge
Hi, I am using a Nutanix CE 20180123.170 version, as a single node cluster.
I want to use Prism Central to manage this cluster.
I downloaded 2 files :
ce-2018.01.31-metadata.json and ce-pc-deploy-2018.01.31.tar and choose to install the image deploy.
Files are uploaded, but deployment fails, without any reason.
PRISM VM is created But I can't do anything with it. Even not to be deleted !

This topic has been closed for comments

26 replies

Userlevel 2
Badge +13
I have the same problem.. I can not create any VM, the vm not boot and the Prism Central neither.

Rgds.
Userlevel 5
Badge +9
Hi,
@winsysman is the Prism VM accessible so you can check if any services are running? Or did it fail completely like @leoleonhardt reports?
If you cannot access the VM, can you check if any deployment tasks are running? Either in Prism or login to a CVM and run
acli task.list limit=1000 | grep kPrismCentralDeploymentRequest | grep kRunning
I would also check the logs in the cluster to see if there are any hints in there why the deployment failed.
Userlevel 2
Badge +13
When deploy any VM, this is the error to boot vm, and the prism central same..

Userlevel 5
Badge +9
Hi,
is this maybe a nested installation in VMware?
Badge
Hi, The Prism Central VM has been created but I can't do anything with this VM,


All deployment tast are finished.
Only the Download and deploy prism central task failed
Badge +5
You can remove the failed installation of the Prism Central VM by using ACLI. Putty to the CVM and run the following:

acli vm.list

This should list out the VM's with their UUID.

Then use: acli vm.delete UUID

It will ask, yes/no to delete. Choose yes.
Badge
All, I was trying to use ncli instead acli. Now delet is succesfull removing the bad VM's. Thanks for this.
Now still looking to install the Prism Central :(
Any suggestions / feedback here?
Thanks all participating in my journey to move forward with this 🙂
Badge +5
Try to install the manual method. You should have two files, boot and home. Upload those to image repository, create a VM with 4vcpu and 24Gb RAM - (Full Capability), add the two disks by cloning from image service, add network, etc.

Boot the VM:

  1. Launch the console (click the Launch Console action link) and log into the Prism Central VM (user name "nutanix" and password "nutanix/4u").
  2. Assign a static IP address to the Prism Central VM as follows:
  3. Open the ifcfg-eth0 file for editing.
  4. The following command opens the file using the vi editor:
code:
$ sudo vi /etc/sysconfig/network-scripts/ifcfg-eth0
Update the NETMASK, IPADDR, BOOTPROTO, and GATEWAY entries as needed.NETMASK="xxx.xxx.xxx.xxx"
IPADDR="xxx.xxx.xxx.xxx"
BOOTPROTO="none"
GATEWAY="xxx.xxx.xxx.xxx"
  • Enter the desired netmask value in the NETMASK field. (Replace xxx.xxx.xxx.xxx with the appropriate value.)
  • Enter the appropriate static IP address (usually assigned by your IT department) for the Prism Central VM in the IPADDR field.
  • Enter none as the value in the BOOTPROTO field. (You might need to change the value from dhcp to none if you employ DHCP. Only a static address is allowed; DHCP is not supported.)
  • Enter the IP address for your gateway in the GATEWAY field.
  1. Warning: Carefully check the file to ensure there are no syntax errors, whitespace at the end of lines, or blank lines in the file.
  2. Save the changes.
  3. Remove any existing Nutanix Controller VM entries, that is ones which include "NTNX--CVM", from the /etc/hosts file. (Be careful that you do not remove any other entries from the file.)
  4. To edit the file using vi, enter
code:
$ sudo vi /etc/hosts
  1. When the Prism Central VM starts the first time and DHCP is enabled in the network, an entry similar to the following is added to the /etc/hosts file. This entry (if present) needs to be removed from the /etc/hosts file before restarting the Prism Central VM in the next step, which generates a new entry if DHCP is enabled.
code:
127.0.0.1 NTNX-10-3-190-99-A-CVM
Restart the Prism Central VM.$ sudo reboot
Log in to the Prism Central VM again and then enter the following command to create a cluster:$ cluster --cluster_function_list="multicluster" -s static_ip_address create


The static_ip_address is the Prism Central VM IP address.

Go back to Prism Home Page and pick Register a Prism Central. Pick, I already have a Prism Central Instance, click connect and input the IP username/password to register.
Badge
Problem is that Launching console is not possible and I only have the ce-pc-deploy-2018.01.31.tar and the ce-pc-deploy-2018.01.31-metadata.json files to upload using the Prism Central Registration tool.
Badge +5
Extract the deploy .tar file. Why can't you use the console?
Badge
Now trying to extract and upload files.
in the VM overview the Launch console option is graded out, thats the problem I cant start the VM, or can I start it using an acli command?
Badge +5
It's grayed out until you power the VM on.
Badge
Power on is also graded out, only take snapshot is available
Badge +5
This is the VM you created through the Create VM function or deploy from main page for Prism Central?
Badge
This is still the vm created through deploy from Prism Central page
Badge +5
You need to shutdown that originally deployment of PC and delete it.

acli vm.list
acli vm.delete UUID (from above list)

Then extract the PCboot and PChome from the .tar file and upload them via image service.

Then follow my instructions to deploy PC manually.
Badge
Yes, I will try this comming days, and come back to it. Tanks already.
Userlevel 7
Badge +34
Hi @winsysman

How did things go - is this still an issue for you or did you find a solution. Let us know, Thanks
Badge +3
I have another issue with PC. After deployment and cluster creation it stucks on loadin services

Badge +4
I have another issue with PC. After deployment and cluster creation it stucks on loadin services



I'm having the same issue. Taking a look into the logs I can see the following errors:

code:
nutanix@NTNX-192-168-1-25-A-CVM:~$ tail -f /home/nutanix/data/logs/ssl_terminator.out
INFO 3168 ../../../../../infrastructure/cluster/service_monitor/service_monitor.c:182 StartServiceMonitor: Child 11557 exited with status: 256
INFO 3168 ../../../../../infrastructure/cluster/service_monitor/service_monitor.c:175 StartServiceMonitor: Launched child with pid: 12241
INFO 12241 ../../../../../infrastructure/cluster/service_monitor/service_monitor.c:200 StartService: Starting service with cmd: /home/nutanix/bin/ssl_terminator
INFO 12241 ../../../../../infrastructure/cluster/service_monitor/service_monitor.c:129 RefreshZkHostPortList: Setting ZOOKEEPER_HOST_PORT_LIST=zk1:9876;
2018-04-08 11:24:11 INFO zookeeper_session.py:110 ssl_terminator is attempting to connect to Zookeeper
2018-04-08 11:24:15 WARNING ssl_terminator_server.py:365 Checksum mismatch 5d0906e9ab1a7c3b23081031b383a4c940039e3b61b9144a1e5617804f7a7b61 !=
2018-04-08 11:24:17 WARNING ssl_terminator_server.py:365 Checksum mismatch e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855 !=
2018-04-08 11:24:17 CRITICAL decorators.py:47 Traceback (most recent call last):
File "/home/afg/src/main/builds/build-ce-2018.01.31-stable-release/python-tree/bdist.linux-x86_64/egg/util/misc/decorators.py", line 41, in wrapper
File "/home/afg/src/main/builds/build-ce-2018.01.31-stable-release/python-tree/bdist.linux-x86_64/egg/ssl_terminator/ssl_terminator_server.py", line 877, in _secure_key_repo_watch_thr
File "/home/afg/src/main/builds/build-ce-2018.01.31-stable-release/python-tree/bdist.linux-x86_64/egg/ssl_terminator/ssl_terminator_server.py", line 713, in _configure_pem
File "/home/afg/src/main/builds/build-ce-2018.01.31-stable-release/python-tree/bdist.linux-x86_64/egg/ssl_terminator/ssl_terminator_server.py", line 441, in _download_secure_key
SSLTerminatorServerException: Failed to download private key



code:
nutanix@NTNX-192-168-1-25-A-CVM:~$ tail -f /home/nutanix/data/logs/cassandra.out
2018-04-08 11:25:07,963:16620(0x7fd03a302700):ZOO_INFO@log_env@959: Client environment:os.version=#1 SMP Wed Sep 27 23:13:56 PDT 2017
2018-04-08 11:25:07,963:16620(0x7fd03a302700):ZOO_INFO@zookeeper_init@994: Initiating client connection, host=zk1:9876 sessionTimeout=12000 watcher=0x157c760 sessionId=0x162a65c3f5c02e6 sessionPasswd= context=0x3989560 flags=0
2018-04-08 11:25:07,976:16620(0x7fd03906c700):ZOO_INFO@zookeeper_interest@1915: Connecting to server 192.168.1.25:9876
2018-04-08 11:25:07,977:16620(0x7fd03906c700):ZOO_INFO@zookeeper_interest@1952: Zookeeper handle state changed to ZOO_CONNECTING_STATE for socket [192.168.1.25:9876]
2018-04-08 11:25:07,987:16620(0x7fd03906c700):ZOO_INFO@check_events@2126: initiated connection to server [192.168.1.25:9876]
F0408 11:25:07.977202 16620 cassandra_monitor.cc:4524] No metadata disks found on node: 2
*** Check failure stack trace: ***
2018-04-08 11:25:08,012:16620(0x7fd03906c700):ZOO_INFO@check_events@2173: session establishment complete on server [192.168.1.25:9876], sessionId=0x162a65c3f5c02e6, negotiated timeout=12000
2018-04-08 11:25:08,012:16620(0x7fd03a302700):ZOO_INFO@zookeeper_close@2981: Closing zookeeper sessionId=0x162a65c3f5c02e6 to [192.168.1.25:9876]

Stack traces are generated at /home/nutanix/data/cores/cassandra_monit.16620.20180408-112511.stack_trace.txt
E0408 11:25:11.075800 3270 cassandra_monitor.cc:1256] Child process died with status: 1
kill: usage: kill [-s sigspec | -n signum | -sigspec] pid | jobspec ... or kill -l [sigspec]
kill: usage: kill [-s sigspec | -n signum | -sigspec] pid | jobspec ... or kill -l [sigspec]
2018-04-08 11:25:14,501:16859(0x7fd04299dc40):ZOO_INFO@log_env@946: Client environment:zookeeper.version=zookeeper C client 3.4.3
2018-04-08 11:25:14,501:16859(0x7fd04299dc40):ZOO_INFO@log_env@950: Client environment:host.name=NTNX-192-168-1-25-A-CVM
2018-04-08 11:25:14,501:16859(0x7fd04299dc40):ZOO_INFO@log_env@957: Client environment:os.name=Linux
2018-04-08 11:25:14,501:16859(0x7fd04299dc40):ZOO_INFO@log_env@958: Client environment:os.arch=3.10.0-514.16.1.el7.nutanix.20170927.cvm.x86_64
2018-04-08 11:25:14,501:16859(0x7fd04299dc40):ZOO_INFO@log_env@959: Client environment:os.version=#1 SMP Wed Sep 27 23:13:56 PDT 2017
2018-04-08 11:25:14,501:16859(0x7fd04299dc40):ZOO_INFO@zookeeper_init@994: Initiating client connection, host=zk1:9876 sessionTimeout=12000 watcher=0x1528330 sessionId=0 sessionPasswd= context=0x3b18040 flags=0
2018-04-08 11:25:14,510:16859(0x7fd03a281700):ZOO_INFO@zookeeper_interest@1915: Connecting to server 192.168.1.25:9876
2018-04-08 11:25:14,511:16859(0x7fd03a281700):ZOO_INFO@zookeeper_interest@1952: Zookeeper handle state changed to ZOO_CONNECTING_STATE for socket [192.168.1.25:9876]
2018-04-08 11:25:14,512:16859(0x7fd03a281700):ZOO_INFO@check_events@2126: initiated connection to server [192.168.1.25:9876]
2018-04-08 11:25:14,513:16859(0x7fd03a281700):ZOO_INFO@check_events@2173: session establishment complete on server [192.168.1.25:9876], sessionId=0x162a65c3f5c02e9, negotiated timeout=12000
2018-04-08 11:25:14,515:16859(0x7fd03a302700):ZOO_INFO@log_env@946: Client environment:zookeeper.version=zookeeper C client 3.4.3
2018-04-08 11:25:14,515:16859(0x7fd03a302700):ZOO_INFO@log_env@950: Client environment:host.name=NTNX-192-168-1-25-A-CVM
2018-04-08 11:25:14,515:16859(0x7fd03a302700):ZOO_INFO@log_env@957: Client environment:os.name=Linux
2018-04-08 11:25:14,515:16859(0x7fd03a302700):ZOO_INFO@log_env@958: Client environment:os.arch=3.10.0-514.16.1.el7.nutanix.20170927.cvm.x86_64
2018-04-08 11:25:14,515:16859(0x7fd03a302700):ZOO_INFO@log_env@959: Client environment:os.version=#1 SMP Wed Sep 27 23:13:56 PDT 2017
2018-04-08 11:25:14,515:16859(0x7fd03a302700):ZOO_INFO@zookeeper_init@994: Initiating client connection, host=zk1:9876 sessionTimeout=12000 watcher=0x157c760 sessionId=0x162a65c3f5c02e7 sessionPasswd= context=0x3989560 flags=0
2018-04-08 11:25:14,526:16859(0x7fd03906c700):ZOO_INFO@zookeeper_interest@1915: Connecting to server 192.168.1.25:9876
2018-04-08 11:25:14,527:16859(0x7fd03906c700):ZOO_INFO@zookeeper_interest@1952: Zookeeper handle state changed to ZOO_CONNECTING_STATE for socket [192.168.1.25:9876]
2018-04-08 11:25:14,527:16859(0x7fd03906c700):ZOO_INFO@check_events@2126: initiated connection to server [192.168.1.25:9876]
2018-04-08 11:25:14,528:16859(0x7fd03906c700):ZOO_INFO@check_events@2173: session establishment complete on server [192.168.1.25:9876], sessionId=0x162a65c3f5c02e7, negotiated timeout=12000
F0408 11:25:14.528561 16859 cassandra_monitor.cc:4524] No metadata disks found on node: 2
*** Check failure stack trace: ***
2018-04-08 11:25:14,546:16859(0x7fd03a302700):ZOO_INFO@zookeeper_close@2981: Closing zookeeper sessionId=0x162a65c3f5c02e7 to [192.168.1.25:9876]


code:
nutanix@NTNX-192-168-1-25-A-CVM:~$ tail -f data/logs/genesis.out
2018-04-08 11:29:17 INFO zookeeper_service.py:560 Zookeeper is running as standalone
2018-04-08 11:29:18 INFO zookeeper_service.py:560 Zookeeper is running as standalone
2018-04-08 11:29:20 WARNING cassandra_service.py:690 Unable to get the Cassandra token for 192.168.1.25, ret 3, stdout , stderr Error connection to remote JMX agent!
java.io.IOException: Failed to retrieve RMIServer stub: javax.naming.ServiceUnavailableException [Root exception is java.rmi.ConnectException: Connection refused to host: localhost; nested exception is:
java.net.ConnectException: Connection refused (Connection refused)]
at javax.management.remote.rmi.RMIConnector.connect(RMIConnector.java:369)
at javax.management.remote.JMXConnectorFactory.connect(JMXConnectorFactory.java:270)
at org.apache.cassandra.tools.NodeProbe.connect(NodeProbe.java:152)
at org.apache.cassandra.tools.NodeProbe.(NodeProbe.java:122)
at org.apache.cassandra.tools.NodeCmd.main(NodeCmd.java:773)
Caused by: javax.naming.ServiceUnavailableException [Root exception is java.rmi.ConnectException: Connection refused to host: localhost; nested exception is:
java.net.ConnectException: Connection refused (Connection refused)]
at com.sun.jndi.rmi.registry.RegistryContext.lookup(RegistryContext.java:136)
at com.sun.jndi.toolkit.url.GenericURLContext.lookup(GenericURLContext.java:205)
at javax.naming.InitialContext.lookup(InitialContext.java:417)
at javax.management.remote.rmi.RMIConnector.findRMIServerJNDI(RMIConnector.java:1955)
at javax.management.remote.rmi.RMIConnector.findRMIServer(RMIConnector.java:1922)
at javax.management.remote.rmi.RMIConnector.connect(RMIConnector.java:287)
... 4 more
Caused by: java.rmi.ConnectException: Connection refused to host: localhost; nested exception is:
java.net.ConnectException: Connection refused (Connection refused)
at sun.rmi.transport.tcp.TCPEndpoint.newSocket(TCPEndpoint.java:619)
at sun.rmi.transport.tcp.TCPChannel.createConnection(TCPChannel.java:216)
at sun.rmi.transport.tcp.TCPChannel.newConnection(TCPChannel.java:202)
at sun.rmi.server.UnicastRef.newCall(UnicastRef.java:338)
at sun.rmi.registry.RegistryImpl_Stub.lookup(RegistryImpl_Stub.java:112)
at com.sun.jndi.rmi.registry.RegistryContext.lookup(RegistryContext.java:132)
... 9 more
Caused by: java.net.ConnectException: Connection refused (Connection refused)
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350)
at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206)
at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
at java.net.Socket.connect(Socket.java:589)
at java.net.Socket.connect(Socket.java:538)
at java.net.Socket.(Socket.java:434)
at java.net.Socket.(Socket.java:211)
at sun.rmi.transport.proxy.RMIDirectSocketFactory.createSocket(RMIDirectSocketFactory.java:40)
at sun.rmi.transport.proxy.RMIMasterSocketFactory.createSocket(RMIMasterSocketFactory.java:148)
at sun.rmi.transport.tcp.TCPEndpoint.newSocket(TCPEndpoint.java:613)
... 14 more



I have tested the deployment with nested, native on ESXi and native on AHV.
Userlevel 7
Badge +34
Hi @winsysman

Just checking back to see how things went, hope you can share with the community any solutions found. Thanks
Hello All,

Same thing happens to me. When I try to install the Prism Central, I see that the service "DynamicRingChanger" and beyond is not in UP state and I am not able to install it on our system. Does anyone has an idea on it? I see that the problem is there for the last 9 months and there is no resolution, correct? Can you guide us on what to do?

Thank you very much. Cheers...
HI
I'm follow this instructions to deploy PC manually.
But it' failed

Userlevel 5
Badge +9
Hi,
this looks like a VM configuration issue... while not exactly the same issue, maybe the workaround described for nested installation in ESXi could help. See
https://next.nutanix.com/discussion-forum-14/this-is-all-how-to-install-latest-ce-2018-05-01-on-nested-esxi-28524
the section "for creating VM", below Tip 8.
Hi,
I do not think so this VM configuration issue,I Config other VM is normal
But I Config Prism Center is failed