How to Handle a DIMM RAS Event on G6, G7, G8 Platforms | Nutanix Community

If you’ve ever encountered an alert like the one referenced below, you might have wondered, “What does this alert mean, and what do I do next?”

Reference Alert:
1 DIMM RAS event found for P1-DIMMA1(Serial:XXXXXXXX) by Samsung on host x.x.x.x in last 24 hours. Installed BIOS version is PB42.602
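
If you collect these alerts in a ticketing or monitoring tool, it can help to pull the individual fields out of the message. The snippet below is only a rough sketch: it assumes the alert text follows the exact wording of the reference alert above, and the function name `parse_ras_alert` and the field names are made up for illustration, not part of any Nutanix tooling. The wording may differ between versions, so treat the pattern as a starting point.

```python
import re
from typing import Optional

# Pattern modeled on the single reference alert above.
# Assumption: "event" may appear as "events" when the count is above 1.
ALERT_RE = re.compile(
    r"(?P<count>\d+) DIMM RAS events? found for "
    r"(?P<slot>[\w-]+)\(Serial:(?P<serial>[^)]+)\) by (?P<vendor>.+?) "
    r"on host (?P<host>\S+) in last (?P<hours>\d+) hours\. "
    r"Installed BIOS version is (?P<bios>\S+)"
)


def parse_ras_alert(message: str) -> Optional[dict]:
    """Return the alert fields as a dict, or None if the text does not match."""
    match = ALERT_RE.search(message)
    if match is None:
        return None
    fields = match.groupdict()
    # Convert the numeric fields from strings to integers.
    fields["count"] = int(fields["count"])
    fields["hours"] = int(fields["hours"])
    return fields


sample = (
    "1 DIMM RAS event found for P1-DIMMA1(Serial:XXXXXXXX) by Samsung "
    "on host x.x.x.x in last 24 hours. Installed BIOS version is PB42.602"
)
print(parse_ras_alert(sample))
```

The DIMM slot, serial number, and installed BIOS version are the details you will most likely want on hand when working through the KB articles mentioned below.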

The answer to the first question is easy. RAS stands for Reliability, Availability, and Serviceability. It is an advanced feature, enabled in the server’s BIOS, that proactively detects failing memory regions and alerts on them.

The answer to the second question is a bit trickier, but don’t worry, we’ve got you covered with KB-11794, which goes into detail about how to diagnose and resolve a RAS event, and KB-7503, which covers DIMM error handling guidance for G6, G7, and G8 platforms.

It is awesome to see options like this that let you repair hardware so that a hardware replacement can be avoided. I can see a day when drives, DIMMs, and other hardware FRUs can repair themselves to a degree before needing to be replaced.