A Uncorrectable Bus Error Has Occurred On System X3850
Bus Error" and system hangs after power on - IBM BladeCenter and System x

Source
RETAIN tip: H203828

Symptom
In rare cases, after the system powers on or reboots, the system encounters a fatal error and hangs in the Power On Self Test (POST). The Integrated Management Module (IMM) does not handle this error or try to recover. The system may hang in a remote session with the following recorded in the event log:

A Uncorrectable Bus Error has occurred on system "SN# "

Affected configurations
The system may be any of the following IBM servers:

BladeCenter HX5, type 1909, any model
BladeCenter HX5, type 7872, any model
BladeCenter HX5, type 7873, any model
System x3690 X5, type 7147, any model
System x3690 X5, type 7148, any model
System x3690 X5, type 7149, any model
System x3690 X5, type 7192, any model
System x3850 X5, type 7143, any model
System x3850 X5, type 7145, any model
System x3850 X5, type 7146, any model
System x3850 X5, type 7191, any model
System x3950 X5, type 7143, any model
System x3950 X5, type 7145, any model

This tip is not software specific.
This tip is not option specific.

Solution
This behavior is corrected in IMM firmware Version 1.32 (Build ID: YUOOD4G) and later. The file is available by selecting the appropriate Product Group, Product name, Product machine type, and Operating system on IBM Support's Fix Central web page, at the following URL: http://www.ibm.com/support/fixcentral/

Workaround
Reset the IMM so that it handles the fatal error, using the Web Interface, Command Line Interface (CLI) console, or Advanced Management Module (AMM). After the IMM has restarted, reboot the host system.

Applicable countries and regions
Worldwide

Document id: MIGR-5088751
Last modified: 2012-02-21
Copyright © 2016 IBM Corporation
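The applicability check described in this tip (an affected machine type combined with IMM firmware older than Version 1.32) can be sketched in a few lines of Python. This is only an illustration: the function names `parse_version` and `needs_imm_update` are hypothetical, and the sketch assumes that firmware versions are plain dotted integers such as "1.32" and that the machine type is already known as a four-digit string.

```python
# Machine types listed under "Affected configurations" in RETAIN tip H203828.
AFFECTED_TYPES = {
    "1909": "BladeCenter HX5",
    "7872": "BladeCenter HX5",
    "7873": "BladeCenter HX5",
    "7147": "System x3690 X5",
    "7148": "System x3690 X5",
    "7149": "System x3690 X5",
    "7192": "System x3690 X5",
    "7143": "System x3850 X5 / x3950 X5",
    "7145": "System x3850 X5 / x3950 X5",
    "7146": "System x3850 X5",
    "7191": "System x3850 X5",
}

# First IMM firmware version that contains the fix, per the tip.
FIXED_IMM_VERSION = (1, 32)


def parse_version(text):
    """Turn '1.32' into (1, 32). Assumes dotted decimal integers (an assumption)."""
    return tuple(int(part) for part in text.strip().split("."))


def needs_imm_update(machine_type, imm_version):
    """Return True when the machine type is listed and the IMM firmware predates 1.32."""
    if machine_type not in AFFECTED_TYPES:
        return False
    return parse_version(imm_version) < FIXED_IMM_VERSION


if __name__ == "__main__":
    for mtype, version in [("7145", "1.25"), ("7145", "1.32"), ("7160", "1.10")]:
        print(mtype, version, needs_imm_update(mtype, version))
```

Comparing versions as integer tuples avoids the string-comparison trap where "1.9" would sort after "1.32". It does rely on the two-part numbering shown in the tip.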
Source for MIGR-5088751: https://www.ibm.com/support/entry/portal/docdisplay?lndocid=migr-5088751

memory error and CPU bus error - IBM System x3530 M4 (7160) and System x3630 M4 (7158)

Source
RETAIN tip: H213778

Symptom
A System x3630 M4 or x3530 M4 may report an uncorrectable bus error on the Central Processing Unit (CPU) and an uncorrectable memory error while running a memory-intensive application or a memory diagnostic. The Integrated Management Module (IMM) chassis event log records the following entries, and the system restarts:

CHASSIS:(12/23/2014 02:23:11) An Uncorrectable Bus Error has occurred on bus CPUs.
CHASSIS:(12/23/2014 02:23:01) Uncorrectable error detected for One of the DIMMs on Subsystem System Memory.

The symptom is observed only in configurations with one DIMM per channel: three (3) DIMMs for one (1) CPU, or six (6) DIMMs for two (2) CPUs.

Affected configurations
The system may be any of the following IBM servers:

System x3530 M4, type 7160 E5-xxxxV2, any model
System x3630 M4, type 7158 E5-xxxxV2, any model

This tip is not software specific.
This tip is not option specific.

The following system BIOS/UEFI levels are affected: UEFI versions 1.60, 1.70, and 1.71.

Solution
The issue is addressed by UEFI firmware version 2.12 or later. The file is or will be available by selecting the appropriate Product Group, type of System, Product name, Product machine type, and operating system on IBM Support's Fix Central web page, at the following URL: http://www.ibm.com/support/fixcentral/

Workaround
Lower the memory speed in the UEFI settings from the default 1600 MT/s to 1333 MT/s. In the UEFI setup menu, select 'System Settings' --> 'Operating Modes' --> Choose Operating Mode: 'Custom Mode' --> change Memory Speed to 'Balanced'.

Additional information
The system mistakenly triggers the failure symptom. The UEFI firmware will address this issue in the next release.

Applicable countries and regions
Worldwide

Source: https://www.ibm.com/support/entry/portal/docdisplay?lndocid=migr-5097080
Document id: MIGR-5097080
Last modified: 2015-05-29
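When reviewing an exported IMM chassis event log, the two entries quoted in this tip can be searched for programmatically. The Python sketch below is a minimal illustration, not an IBM tool. It assumes each entry has the form `CHASSIS:(MM/DD/YYYY HH:MM:SS) message` as in the tip, and the 60-second pairing window and the function names `parse_events` and `matches_symptom` are assumptions chosen for the example.

```python
import re
from datetime import datetime, timedelta

# Matches entries such as "CHASSIS:(12/23/2014 02:23:11) An Uncorrectable Bus Error ..."
LINE_RE = re.compile(r"CHASSIS:\((\d{2}/\d{2}/\d{4}) (\d{2}:\d{2}:\d{2})\)\s*(.+)")

# Message texts quoted in RETAIN tip H213778.
BUS_ERROR = "An Uncorrectable Bus Error has occurred on bus CPUs"
MEM_ERROR = "Uncorrectable error detected for One of the DIMMs on Subsystem System Memory"


def parse_events(lines):
    """Return (timestamp, message) tuples for every line that looks like a chassis entry."""
    events = []
    for line in lines:
        match = LINE_RE.search(line)
        if match:
            stamp = datetime.strptime(
                f"{match.group(1)} {match.group(2)}", "%m/%d/%Y %H:%M:%S"
            )
            events.append((stamp, match.group(3).strip()))
    return events


def matches_symptom(lines, window=timedelta(seconds=60)):
    """True when a memory error and a CPU bus error are logged within `window` of each other.

    The window length is an assumption; the tip only shows entries ten seconds apart.
    """
    events = parse_events(lines)
    mem_times = [stamp for stamp, msg in events if MEM_ERROR in msg]
    bus_times = [stamp for stamp, msg in events if BUS_ERROR in msg]
    return any(abs(bus - mem) <= window for mem in mem_times for bus in bus_times)


if __name__ == "__main__":
    sample = [
        "CHASSIS:(12/23/2014 02:23:11) An Uncorrectable Bus Error has occurred on bus CPUs.",
        "CHASSIS:(12/23/2014 02:23:01) Uncorrectable error detected for One of the DIMMs on Subsystem System Memory.",
    ]
    print(matches_symptom(sample))
```

A match only suggests that the log resembles the symptom described here. The DIMM population (one DIMM per channel) and the UEFI level still need to be confirmed before applying the workaround.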
About this task
When you turn on the server, it performs a series of tests to check the operation of the server components and some optional devices in the server. This series of tests is called the power-on self-test, or POST. If a power-on password is set, you must type the password and press Enter when you are prompted, for POST to run. If POST is completed without detecting any problems, the server startup is completed. If POST detects a problem, an error message is sent to the POST event log.

Source: http://publib.boulder.ibm.com/infocenter/systemx/documentation/topic/com.ibm.sysx.7145.doc/bb1py_r_post.html

The following table describes the POST error codes and suggested actions to correct the detected problems. These errors can appear as severe, warning, or informational.

Code        Message
I.11002     A processor mismatch has been detected between one or more processors in the system.
I.2018002   The device found at Bus % Device % Function % could not be configured due to resource constraints. The Vendor ID for the device is % and the Device ID is %.
I.2018003   A bad option ROM checksum was detected for the device found at Bus % Device % Function %. The Vendor ID for the device is % and the Device ID is %.
I.3048005   UEFI has booted from the backup flash bank.
I.3108002
I.3808004   The IMM System Event log (SEL) is full.
I.3818001   The firmware image capsule signature for the currently booted flash bank is invalid.
I.3818002   The firmware image capsule signature for the non-booted flash bank is invalid.
I.3818003   The CRTM flash driver could not lock the secure flash region.
I.580A4     Memory population change detected.
I.580A5     Mirror Fail-over complete. DIMM number % has failed over to the mirrored copy.
I.580A6     Memory spare copy has completed successfully.
S.2011001   An Uncorrected PCIe Error has Occurred at Bus % Device % Function %. The Vendor ID for the device is % and the Device ID is %.
S.2018001   An Uncorrected PCIe Error has Occurred at Bus % Device % Function %. The Vendor ID for the device is % and the Device ID is %.
S.3058004   A Three Strike boot failure has occurred. The system has booted with default UEFI settings.
S.3818004   The CRTM flash driver could not successfully flash the staging area. A failure occurred.
S.3818007   The firmware image capsules for both flash banks could not be verified.
S.3828001
S.3828002
S.51003     An uncorrectable memory error was detected in DIMM slot % on rank %.
S.51006     A memory mismatch has been detected. Please verify that the memory configuration is valid.
S.58008     A DIMM has failed the POST memory test.
S.68005
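The codes listed here start with a letter that appears to track the severity levels named in the text (severe, warning, informational). The Python sketch below sorts a code into one of those levels from its prefix. The mapping of "I" to informational and "S" to severe is inferred from the listed codes, and "W" for warning is an assumption, since no warning code appears in this excerpt. The function name `classify` is hypothetical.

```python
# Prefix-to-severity mapping. "I" and "S" are inferred from the codes listed in
# this document; "W" for warning is an assumption (no W code appears here).
SEVERITY_BY_PREFIX = {
    "I": "informational",
    "W": "warning",
    "S": "severe",
}


def classify(code):
    """Return the severity name for a POST code such as 'S.51003' or '[I.580A4]'."""
    cleaned = code.strip().strip("[]")
    prefix, separator, number = cleaned.partition(".")
    if not separator or not number:
        raise ValueError(f"not a POST code: {code!r}")
    try:
        return SEVERITY_BY_PREFIX[prefix.upper()]
    except KeyError:
        raise ValueError(f"unknown severity prefix in {code!r}") from None


if __name__ == "__main__":
    for code in ["I.11002", "[S.51003]", "S.58008", "I.580A4"]:
        print(code, classify(code))
```

Grouping log entries this way makes it easier to look at severe codes first, for example S.51003 or S.58008 when a memory fault is suspected.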