I have a nutanix cluster which setting RF=3, according to the documents, It can tolerate 2 nodes broken at the same time. but in the data resilience page, it display that the failure tolerable for Oplog and Extent Group is 1 and all the message said it can tolerate 1 node failure maximum.
The node in the cluster are all standalone node, and block is same as node.
The AOS is 5.1.3 and hypervisor is esxi6.5
I checked the url:http://next.nutanix.com/t5/Installation-Configuration/Data-Resiliency-Status-shows-error/m-p/1114
But the full scan will be done every 6 hours automatically, and this tolerance problem has last for almost 1 week.
Do you have any comments for this problem. Does this state can let the cluster tolerate 2 nodes failures?
Best answer by chenzh4
I got where the problem is.
Although the cluster set RF=3, but there is also a container which setting is RF=2 and there is a VM running in this container.
That is why oplog and extent store display 1 in the data resilience page.