Hardware Machine Error Bus And Interconnect Vmware
Contents |
NSXVirtual SAN vCenterFusionWorkstationvExpertVMware {code} CloudCredSubmit a Link Home > VMTN > VMware vSphere™ > VMware ESXi 4 > Discussions Please enter a title. You can not post a blank message. Please type machine check exception fatal (unrecoverable) mce on pcpu your message and try again. 1 Reply Latest reply:
Machine Check Exception Vmware
Oct 6, 2011 7:00 AM by rajvm256 PSOD Error: Bus and Interconnect: OtherTrans Bus
Mca Error Detected Via Polling
Generic MaxRodkin Oct 6, 2011 2:36 AM hi all!i have a server:INTEL Xeon E5410 BX80574E5410PINTEL "StarLake” (8 x FBDIMM, 32Gb max, 8
Machine Check Exception Decoder
x SAS (opt RAID),2 x SATA II) S5000PSLROMBR esxi4.1At every saturday or sunday it has halt with PSOD. Dump is attached.The begin of log:0:00:30:08.155 cpu5:4101)Panic: 612: Panic from another CPU (cpu 5, world 4101): ip=0x418025c58466:0:00:30:08.155 cpu5:4101)Hardware (Machine) Error: Bus and Interconnect: OtherTrans Bus Genericerror. PCPU5 in intel machine check exception decoder world 4101:idle50:00:30:08.155 cpu5:4101)Backtrace for current CPU #5, worldID=4101, ebp=0x417f8002f8b80:00:30:08.156 cpu5:4101)0x417f8002f8b8:[0x418025c57da5]PanicLogBacktrace@vmkernel:nover+0x18stack: 0x5, 0x417f8002f8e8, 0x....What`s problem may be?Respect, Maxim vmkernel-log.1 258.6 K 1049Views Tags: none (add) This content has been marked as final. Show 1 reply 1. Re: PSOD Error: Bus and Interconnect: OtherTrans Bus Generic rajvm256 Oct 6, 2011 7:00 AM (in response to MaxRodkin) Hi,It seems to be a hardware problem as per the logs. 0:00:30:08.155 cpu5:4101)Hardware (Machine) Error: Bus and Interconnect: OtherTrans Bus Generic error. PCPU5 in world 4101:idle5 0:00:30:08.155 cpu5:4101)Backtrace for current CPU #5, worldID=4101, ebp=0x417f8002f8b8One thing might be to follow the below KB and do some test on the machine. Hardware might be faulty.http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=1005184 Like Show 0 Likes (0) Actions Actions Remove from profile Feature on your profile More Like This Retrieving data ... Share This Page Legend Correct Answers - 10 points
Things Small and Medium Business Service Providers All Solutions Services Advise, Transform and Manage Financing and Flexible mca recoverable error ce memory controller error Capacity IT Support Services Education and Training Services All cmci signaling for patrol scrub ucr errors not supported Services Products Integrated Systems Composable Systems Converged Systems Hyper Converged Systems Blade Systems machine check exception error Infrastructure Management Software Application Lifecycle Management Application Delivery Management Big Data Analytics DevOps Enterprise Security Hybrid and Private Cloud Information Governance https://communities.vmware.com/thread/331545?start=0&tstart=0 Information Management IT Service Management Operations Management Server Management Software as a Service (SaaS) Software-Defined Data Center Storage Management All Software Servers Rack Servers Tower Servers Blade Servers Density Optimized Mission Critical Servers Servers for Cloud Server Management All Servers Storage All-flash and https://community.hpe.com/t5/Technical-Support-Services/More-on-debugging-VMware-Crashes/ba-p/6789841 Hybrid Storage Midrange and Enterprise Storage Entry Storage Systems Data Availability, Protection and Retention Software Defined Storage Management and Orchestration Storage Networking All Storage Networking Switches Routers Access Points and Controllers Wireless LAN Campus and Branch Networking Data Center Networking Wide Area Network Software Defined Networking Network Functions Virtualization Network Management All Networking About UsSupportClearType to search2086159Solutions Transform to a Hybrid Infrastructure Protect Your Digital Enterprise Empower the Data-Driven Organization Enable Workplace Productivity Cloud Security Big Data Mobility Infrastructure Internet of Things Small and Medium Business Service Providers All Solutions Services Advise, Transform and Manage Financing and Flexible Capacity IT Support Services Education and Training Services All Services Products Integrated Systems Composable Systems Converged Systems Hyper Converged Systems Blade Systems Infrastructure Management Software Application Lifecycle Management Application Delivery Management Big D
time, we were "lucky" enough to capture its PSOD. In earlier article about Machine Check Errors, I was talking about what exactly do they mean and how to debug them. Also, most of the time, https://vmxp.wordpress.com/2014/11/27/psod-caused-by-a-machine-check-exception/ when these are correctable Machine Check Errors, the host only reboots itself without leaving any trace as of why. That I have investigated by determining faulty memory after running a custom memory stress test on an ESXi host. The Uncorrectable Machine Check Exception presented below is caused by "Other TransBus Generic Error" - this could have been related either to a CPU, or pathways on the motherboard… or both. Most of VMkernel dumps was pointing out to 2nd machine check Physical CPU, but there were some occurrences on 1st CPU as well. Even the AHS log from the HP blade server was corrupted each time I tried to send it to a technician. Therefore they took action and replaced both the motherboard and CPUs. Since then there were no more trouble with this host. Manual Debugging: For those of you who are interested - the MCE codes reported were: In iLO: FA001E8000020E0F in vmkernel.log: c800008000310e0f ; 8800004000310e0f Now, if we machine check exception decode the message we got from iLO manually (so that we have another source of MCE to decode from): 1 1 1 1 1 0 1 0 0 00 0000000011110100 0 0000 0000000000000010 0000 1110 0000 1111 UC 1 PCC 1 S 0 AR 0 Signaling: Uncorrected error (UC). RESET THE SYSTEM Examples? None found. Compound error code found: Bus and Interconnect Errors. BUS LL PP RRRR II T BUS{11}_{11}_{0000}_{11}_{0}_ERR Level: 11, generic Request: 0000, Generic Bus & Interconnect mnemonics: Participation: 11, Generic Therefore: Generic Bus and Interconnect Error Here you see VMkernel is pretty good at decoding the MCEs by itself, but it can also be very useful to see for yourself what the real cause was if your error decode is missing. Share this:TwitterFacebookGoogleLike this:Like Loading... Related This entry was posted in Data Center Hardware, ESXi / vSphere, Servers, Troubleshooting and tagged Debugging, esxi crashing, ESXi troubleshooting, Hardware Failure, Machine Check Error, MCE, PSOD, VMware ESXi on November 27, 2014 by Ali. Post navigation ← My VCP ExamExperience Online ESXi Firmware and Driver Upgrade on HPServers → Share your thoughts Cancel reply Enter your comment here... Fill in your details below or click an icon to log in: Email (required) (Address never made public) Name (required) Website You are commenting using your WordPress.com account. (LogOut/Change) You are commenting using your Twitter account. (LogOut/Change) You are commenting using your Facebook account. (LogOut/Change)