Dell Openmanage Single-bit Failure Error Rate Exceeded
Contents |
exceeded - DIMMA Join Sign in Single-bit failure error rate exceeded - DIMMA Servers Information and ideas on Dell PowerEdge rack, tower and blade server solutions. Get this RSS feed Home single-bit failure error rate exceeded dell Forums Server Media Gallery 3 Replies 0 Subscribers Postedover 12 years ago Single-bit correctable memory error rate exceeded for dimm failure error rate exceeded - DIMMA Posted by MSslave on 4 Aug 2004 15:13 Hi All - I have a
Clear Memory Error Dell Openmanage
Poweredge 6450 that began logging the following error in SA 1.8. Attributes Values Status Critical Device Name BANK_4/DIMM_A Size 128 MB Type SDRAM-SYNCHRONOUS Speed 7 ns Failures Single bit warning error rate exceededSingle-bit
Correctable Memory Error Log Limit Reached
failure error rate exceeded It logs this for Banks 1-4, always DIMMA. We have replaced the motherboard once, the memory riser board twice, replaced a total of 8 of 16DIMMS. We have swapped out the DIMMS in BANK 1-4 From A to B yet the error still comes up DIMMA in all banks even though the DIMMS were previously in DIMMB and didn't show any error there. I'm persistent correctable memory error rate has increased for a memory device at location having difficulty believing that it is a hardware issue (at least not a bad DIMM)and am wondering if this could possibly be a problem with SA. The server is up and running fine. Here are the system details: BIOS Information Manufacturer Dell Inc. Version A10 Release Date 11/06/2001 NOTE: Will be updating the BIOS to A13. Motherboard replacement downgraded it to A10. We were getting the same errors on A13 so I doubt they will stop. Firmware Information Name Embedded System Management Controller Version 5.39 Firmware Information Name Power Supply Paralleling Board Version 2.41 Firmware Information Name Primary Backplane Version 1.30 Firmware Information Name Secondary backplane Version 1.30 Software Profile Operating System Name Microsoft Windows 2000 Server Version 5.0 Service Pack 3 (Build 2195) System Time Wed Aug 04 11:38:26 2004 System Bootup Time Sat Jul 24 09:26:00 2004 System Management Name Server Administrator Version 1.8.0 Description Systems Management Software Contains Instrumentation Service 5.1.0 Storage Management Service 3.5.0 Update Service 1.9.0 Diagnostic Service 3.0.0 Sun JRE - OEM Installed Version 1.4.2 Secure Port Server 1.0.0 Core Service 1.8.0 Instrumentation Service Integration Layer 1.8.0 Storage Management Service Integration Layer 1.2.0 Server Administrator 1.8.0
in Systems Management Forums Single Bit Warning Error Rate Exceeded. Systems Management Dell Systems Management Solutions: Dell OpenManage, iDRAC, Repository Manager, Microsoft SCCM, Chassis Managment Controller, and more Get this RSS
Single Bit Warning Error Rate Exceeded Clear
feed TechCenter Home Topic Home Forums Wikis Twitter Details 3 Replies 1 Subscriber correctable memory error rate exceeded for dimm a1 Postedover 4 years ago Options RSS Share Related Forums Clear Forum Dell OpenManage Essentials Forum to discuss OpenManage Essentials (OME), poweredge diagnostics a systems management console that provides simple, basic Dell hardware management. Dell Repository Manager Dell Repository Manager allows IT admins to more easily manage Dell system updates Dell Systems Management General Forum A http://en.community.dell.com/support-forums/servers/f/177/t/7881805 general forum to discuss Dell Systems Management Enterprise IT solutions such as OMSA, iDRAC, CMC, SUU, SBUU, and more. OpenManage Connections for 3rd Party Console Integration Forum to discuss monitoring Dell servers and storage platforms with HP Operations Manager, IBM Tivoli Netcool / OMNIbus or CA Network and Systems Management (NSM) solutions. OpenManage Integration for VMware vCenter Next > Dell OpenManage Essentials Single Bit Warning Error Rate http://en.community.dell.com/techcenter/systems-management/f/4494/t/19459637 Exceeded. Posted by john.ross on 31 Jul 2012 10:10 Hello, I am using OpenMange Server Essential to monitor my DELL servers and recentlyI have been getting amemory warning on myPowerEdge 2950. Severity:Critical, Message:Memory device status is criticalMemory device location: DIMM3 Possible memory module event cause:Single bitwarning error rate exceeded,Single bit failure error rate exceeded I am just wondering if anybody has seen this error and what the solution could be. Any help is greatly appreciated. Thanks, john.ross Like 0 Reply You have posted to a forum that requires a moderator to approve posts before they are publicly available. Posted by DELL-Abhijit P on 31 Jul 2012 10:17 Hi John.ross, The error looks like an issue with the DIMM in slot 3. You might want to call Tech support and they can help you diagnose it further. Regards Abhijit Like 0 Reply You have posted to a forum that requires a moderator to approve posts before they are publicly available. Posted by john.ross on 31 Jul 2012 10:35 Thanks Abhijit. My issue now is that my server is out of warranty. Do you know any tool - preferably from dell- that I can use to diagnos
sorted by: [ date ] [ thread ] [ subject ] [ author ] On Thu, 2008-10-09 at 11:20 +0000, Arnar Þórarinsson wrote: http://lists.us.dell.com/pipermail/linux-poweredge/2008-October/037484.html > > Hello, > > Could somebody please explain these error messges to me. I've been > trying to find some info on this but have found nothing. > > Severity : Critical > ID : 1404 > Date and Time : Fri Oct 3 19:57:10 2008 > Category : Instrumentation Service > Description : Memory device status is critical Memory device error rate > location: DIMM2_B Possible memory module event cause:Single bit > warning error rate exceeded,Single bit error logging disabled > > Severity : Non-Critical > ID : 1403 > Date and Time : Fri Oct 3 18:01:02 2008 > Category : Instrumentation Service > Description : Memory device status is non-critical Memory device > location: DIMM2_B Possible memory module event cause:Single bit error rate exceeded > warning error rate exceeded > > > /Arnar Thorarinsson Single bit warning errors by them selves mean very little other then the memory found an error and corrected for it. However, IF you see many of these errors, then there is a more serious issue. That would indicate that you have a bad dimm or a bad dimm card. To test, just swap out dimm2-b with another dimm and see if the error follows the dimm or stays with the slot. If it stays with the slot, you need a new dimm card/MB, if it follows the dimm, you need a new dimm. Again, a few of these warnings mean nothing other then the ECC for your memory is working as designed. Many of these warnings means you have bad memory or bad memory riser/MB. -- Damon L. Chesser