A Uncorrectable Bus Error Has Occurred On System X3550
Contents |
M5110 SAS/SATA controllers may reset and software NMIs may occur if PI is enabled - Servers Applicable countries and regions Source RETAIN tip: H21712 Symptom Systems configured with the ServeRAID M5110 or ServeRAID M5110e SAS/SATA Controller may exhibit controller resets, fault in slot all pci error on system system x3650 m4 as well as software Non-Maskable Interrupt (NMI) conditions, when Data Protection is enabled on
Sensor Planar Fault Has Transitioned To Critical From A Less Severe State
a Virtual Drive (VD) configured with Protection Information (PI) capable drives. In the ServeRAID Controller Event Log, the following error will be memory logging limit reached for on subsystem system memory reported: Controller encountered a fatal error and was reset One (1) or more of the following events will be reported in the Chassis Event Log at the same time: A software NMI has fault in slot "no op rom space" on system occurred on system IBM x3550 M4 Server. An Uncorrectable Bus Error has occurred on bus PCIs. Fault in slot All PCI Error on system IBM x3550 M4 Server. To verify if Data Protection is enabled, run the following MegaCLI command and look for the 'PI type'. If enabled, it will be anything other then 'None'. MegaCli -CfgDsply -aALL Data Protection can also be verified through MegaRAID Storage Manger (MSM) by selecting
Redundancy Lost For Power Unit Has Asserted
the Virtual Drive(s) under the Logical tab. Look for the 'Data Protection' field. Affected configurations The system may be any of the following IBM servers: IBM MessageSight Appliance, Type 6188, any model System x3300 M4, type 7382, any model System x3500 M4, type 7383, any model System x3530 M4, type 7160, any model System x3550 M4, type 5459, any model System x3550 M4, type 7914, any model System x3630 M4, type 7158, any model System x3650 M4, type 6188, any model System x3650 M4, type 7915, any model System x3750 M4, type 8722, any model System x3750 M4, type 8733, any model The system is configured with one or more of the following IBM Option part numbers: ServeRAID M5110 SAS/SATA Controller Card, Option 81Y4481, any CRU ServeRAID M5110 SAS/SATA Controller for IBM System x (CTO), any FRU ServeRAID M5110e SAS/SATA Controller for IBM System x, onboard, any embedded ServeRAID M5115 SAS/SATA Controller, Option 90Y4390, any CRU ServeRAID M5120 SAS/SATA Controller for IBM System x, Option 81Y4478, any CRU This tip is not software specific. The system has the symptom described above. Solution This behavior has been corrected in ServeRAID M5100/M5110e Series SAS/SATA Controller Firmware Update version 23.22.0-0024. The file is available by selecting the appropriate Product Group, type of System, Product name, Product ma
Uncorrectable Bus Error has occurred on bus CPUs.' seen in Chassis event log - IBM System x3100 sensor sys brd vol fault has transitioned to critical from a less severe state M5 (5457) Applicable countries and regions Source RETAIN tip: critical array has deasserted H213737 Symptom System x3100 M5 running Red Hat Enterprise Linux 6 (RHEL6) x86_64 may encounter
Memory Logging Limit Reached For Memory Device
unexpected system restart. The error is shown in the Chassis Event Log as: 'An Uncorrectable Bus Error has occurred on bus CPUs.' Affected configurations https://www-947.ibm.com/support/entry/portal/docdisplay?lndocid=migr-5093594 The system may be any of the following IBM servers: System x3100 M5, type 5457, any model The system is configured with at least one of the following: Red Hat Enterprise Linux 6, any update This tip is not option specific. Note: This does not imply that the network https://www.ibm.com/support/entry/portal/docdisplay?lndocid=migr-5096868 operating system will work under all combinations of hardware and software. Please see the compatibility page for more information: http://www.ibm.com/systems/info/x86servers/serverproven/compat/us/ Solution This behavior will be corrected in a future release of Network Interface Controller (NIC) firmware. The target date for this release is scheduled for first quarter 2015. Workaround Append the parameters pcie_aspm=off to the vmlinuz line in the /etc/grub.conf file. This will turn off Active State Power Management (ASPM) on the Peripheral Component Interconnect Express (PCIe) bus for the NIC. The attached screenshot is an example of a grub.conf entry from a Red Hat system. The grub.conf file on the users' systems may vary. Applicable countries and regions Worldwide Back to top Document id:MIGR-5096868 Last modified:2015-02-05 Copyright © 2016 IBM Corporation Sign in To access your authorized content and to customize your pages. Footer links Contact Privacy Terms of use Accessibility
Help Receive Real-Time Help Create a Freelance Project Hire for a Full Time Job Ways to Get Help Ask a Question Ask for Help Receive Real-Time Help Create a Freelance Project Hire for a Full Time Job Ways to Get Help Expand Search https://www.experts-exchange.com/questions/28036158/URGENT-ibm-x3650-m3-vmware-hardware-alarm-Bus-Uncorrectable-error.html Submit Close Search Login Join Today Products BackProducts Gigs Live Careers Vendor Services Groups Website Testing Store Headlines Experts Exchange > Questions > URGENT! ibm x3650 m3 - vmware hardware alarm: Bus Uncorrectable error Want to Advertise Here? Solved URGENT! ibm x3650 m3 - vmware hardware alarm: Bus Uncorrectable error Posted on 2013-02-18 VMware Server Hardware 1 Verified Solution 4 Comments 6,305 Views Last Modified: 2013-03-24 Hardware: x3650 m3 (7945ac1) ESXi: 5.1.0 799733 on system (ibm-specific build) The host just rebooted and now shows hardware alarms (after the reboot): Group 2 PCIs: Bus Uncorrectable error (I don't have host logs prior to reboot because the tech who installed esxi 5.1 had not yet set the syslog to persistent storage...) I can't find much info on the alert - is it a concern? I've tried "Reset Sensors" and Refresh, and the alerts are still present. 0 Question by:snowdog_2112 Facebook fault in slot Twitter LinkedIn Google LVL 116 Active today Best Solution byAndrew Hancock (VMware vExpert / EE MVE) Hardware Fault on motherboard or backplane, get escalated to IBM Support for Engineer Repair. reseat any pci devices if applicable Go to Solution 4 Comments Message Active 1 day ago Author Comment by:snowdog_21122013-02-18 more info - the IMM event log shows the following at the time of the reboot: 02/18/2013; 15:05:01 0x816f03131701ffff System "SN# xxxx" has recovered from an NMI 02/18/2013; 15:03:46 0x806f002125820900 Fault in slot "All PCI Error" on system "SN# xxxx" 02/18/2013; 15:03:46 0x806f002130010901 Fault in slot "PCI 1" on system "SN# xxxx" 02/18/2013; 15:03:40 0x806f08132582ffff A Uncorrectable Bus Error has occurred on system "SN# xxxx" 02/18/2013; 15:03:40 0x806f03131701ffff A software NMI has occurred on system "SN# xxxx" Select all Open in new window 0 LVL 116 Overall: Level 116 VMware 108 Server Hardware 28 Message Active today Accepted Solution by:Andrew Hancock (VMware vExpert / EE MVE)2013-02-18 Hardware Fault on motherboard or backplane, get escalated to IBM Support for Engineer Repair. reseat any pci devices if applicable 0 Message Active 1 day ago Author Closing Comment by:snowdog_21122013-02-18 lightpath also indicates a PCI fault. VLP from IMM - Fault: orange PCI: orange PCI1: orange <-- this must be the slot? (pci2 - 4): off 0 Me