University / Technology Innovation Workshop 2022 / OpenBMC and AI on a Mission to Reduce Memory Failures in Datacenters
OpenBMC and AI on a Mission to Reduce Memory Failures in Datacenters
Mar 18 2023 | 36 mins
- Details
OpenBMC and AI on a Mission to Reduce Memory Failures in Datacenters Reports show that top reliability concern in the datacenters are memory failures which in some cases can contribute to 50% of total server hangs. As memory density is increasing every year this state will not change without modern prediction methodology. In this talk we would like to present OpenBMC’s Intel Memory Resiliency Technology application that can predict memory failures using AI algorithm and warn user before that happens. It’s also capable of triggering self-healing actions like prediction guided runtime Post Package Repair which increases server uptime and minimize service interruptions.
Presenters:
Maciej Lawniczak
Cloud Software Architect
Intel Cloud Engineering
Wojciech Szczerba
Cloud Software Development Engineer
Intel Cloud Engineering
Karol Wojciechowski
Cloud Software Development Engineer
Intel Cloud Engineering