Print | Rate this content

Advisory: VMware - HPE ProLiant Gen8 Servers running VMware ESXi 5.5 Patch 10, VMware ESXi 6.0 Patch 4, Or VMware ESXi 6.5 May Experience Purple Screen Of Death (PSOD): LINT1 Motherboard Interrupt

SUPPORT COMMUNICATION - CUSTOMER ADVISORY

Document ID: c05392947

Version: 4

Advisory: (Revision) VMware - HPE ProLiant Gen8 Servers running VMware ESXi 5.5 Patch 10, VMware ESXi 6.0 Patch 4, Or VMware ESXi 6.5 May Experience Purple Screen Of Death (PSOD): LINT1 Motherboard Interrupt
NOTICE: The information in this document, including products and software versions, is current as of the Release Date. This document is subject to change without notice.

Release Date: 2017-02-17

Last Updated: 2017-05-08


DESCRIPTION

Document Version
Release Date
Details
4
05/08/2017
Description and Resolution sections were updated, and some platforms affected were removed.
3
03/22/2017
Additional platforms were added.
2
02/22/2017
Updated resolution.
1
02/17/2017
Original Document Release.

Any HPE ProLiant Gen8 Servers running VMware ESXi 5.5 patch 10, VMware ESXi 6.0 patch 4, or VMware ESXi 6.5 host may experience either or both of the following:

  • Intermittent purple diagnostic screens citing an NMI, Non-Maskable, or LINT1 interrupt similar to:

LINT1 motherboard interrupt. This is a hardware problem: please contact your hardware vendor.

  • IML entries in the AHS log similar to:

Critical","PCI Bus","Uncorrectable PCI Express Error (Slot X, Bus X, Device 0, Function 0, Error status 0x00100000)

Critical","System Error,"1","Unrecoverable System Error (NMI) has occurred. System Firmware will log additional details in a separate IML entry if possible"

This occurs when the VMware ESXi patch 10 for Vmware ESXi 5.5 is installed, or when the VMware ESXi patch 4 for VMware ESXi 6 is installed, or when VMware ESXi 6.5 is installed, which disables the Intel IOMMU interrupt remapper in the vmkernel.

HPE has identified the cause of the issue on the HPE ProLiant DL560 Gen8 server and HPE ProLiant DL380p Gen8 server as high performing, low-latency PCIe adapters installed in slot 3 and systems under heavy load.

As additional reference of this issue, check the VMware Knowledge Base document:

ESXi host fails with intermittent NMI purple diagnostic screen on HP ProLiant Gen8 servers (KB2149043).

https://kb.vmware.com/selfservice/microsites/search.do?cmd=displayKC&docType=kc&externalId=2149043&sliceId=1&docTypeID=DT_KB_1_1&dialogID=345520259&stateId=1%200%20345524186 Non-HPE site

SCOPE

Any HPE Proliant Gen8 server running VMware ESXi 5.5 patch 10, VMware ESXi 6.0 patch 4 or VMware ESXi 6.5 host.

RESOLUTION

To resolve this issue, on the HPE ProLiant DL560 Gen8 server or the HPE ProLiant DL380p Gen8 Server when the IOMMU remapper is disabled, move the low-latency or high performing PCI-e card to slot 1,2,4,5 or 6 (depending on the type of secondary riser board that might be installed).

As a workaround, enable the Intel IOMMU remapper by typing the following command at the esxcli and reboot the VMware ESXi host:

# esxcli system settings kernel set --setting=iovDisableIR -v FALSE

After rebooting the host, verify the run-time setting for iovDisableIR is set to "FALSE" as follows:

# esxcli system settings kernel list -o iovDisableIR

The output should be similar to:

Name Type Description Configured Runtime Default

------------ ---- --------------------------------------- ---------- ------- -------

iovDisableIR Bool Disable Interrupt Routing in the IOMMU... FALSE FALSE TRUE

This issue is under investigation by HPE and VMware.




RECEIVE PROACTIVE UPDATES : Receive support alerts (such as Customer Advisories), as well as updates on drivers, software, firmware, and customer replaceable components, proactively via e-mail through HPE Subscriber's Choice. Sign up for Subscriber's Choice at the following URL: Proactive Updates Subscription Form .

NAVIGATION TIP : For hints on navigating HPE.com to locate the latest drivers, patches, and other support software downloads for ProLiant servers and Options, refer to the Navigation Tips document .

SEARCH TIP : For hints on locating similar documents on HPE.com, refer to the Search Tips document .


Hardware Platforms Affected: HPE ProLiant BL460c Gen8 Server Blade, HPE ProLiant DL360p Gen8 Server, HPE ProLiant DL380p Gen8 Server, HPE ProLiant DL320e Gen8 Server, HPE ProLiant DL360e Gen8 Server, HPE ProLiant DL380e Gen8 Server, HPE ProLiant DL560 Gen8 Server, HP ConvergedSystem 700 for Virtualization 1.1, HP ConvergedSystem 700x v1.1 VMware Kit
Operating Systems Affected: VMware ESXi 5.5, VMware ESXi 6.0
Software Affected: Not Applicable
Support Communication Cross Reference ID: IA05392947
©Copyright 2017 Hewlett Packard Enterprise Company, L.P.
Hewlett Packard Enterprise Company shall not be liable for technical or editorial errors or omissions contained herein. The information provided is provided "as is" without warranty of any kind. To the extent permitted by law, neither HPE nor its affiliates, subcontractors or suppliers will be liable for incidental, special or consequential damages including downtime cost; lost profits; damages relating to the procurement of substitute products or services; or damages for loss of data, or software restoration. The information in this document is subject to change without notice. Hewlett Packard Enterprise Company and the names of Hewlett Packard Enterprise Company products referenced herein are trademarks of Hewlett Packard Enterprise Company in the United States and other countries. Other product and company names mentioned herein may be trademarks of their respective owners.

Provide feedback

Please rate the information on this page to help us improve our content. Thank you!