From owner-freebsd-questions@freebsd.org Tue Apr 19 08:27:20 2016
Subject: Re: FreeBSD Crashes Intermittently !!
To: shahzaib mushtaq
References: <56E2E9AC.1040902@gmx.de> <33444.128.135.52.6.1457712900.squirrel@cosmo.uchicago.edu> <56E2F586.9000108@gmx.de> <5715DF0F.4090808@gmx.de>
Cc: freebsd-questions@freebsd.org, galtsev@kicp.uchicago.edu
From: "lokadamus@gmx.de"
Message-ID: <5715EBDA.10907@gmx.de>
Date: Tue, 19 Apr 2016 10:27:06 +0200

Hi,

I have been looking at these lines from the crash log:

Hardware event. This is not a software error.
CPU 23 BANK 5
MISC 0 ADDR 805613c60
MCG status:MCIP
STATUS be00000000800400 MCGSTATUS 4
....
Hardware event. This is not a software error.
CPU 22 BANK 5

https://en.wikipedia.org/wiki/Machine-check_exception

This looks like a hardware problem with the second CPU.

Things that can be done:
- Is it possible to read CPU temperature information from the BIOS?
- Disable HTT and see whether the error comes back
- Remove the second CPU and see whether ...
- Install microcode updates and hope that they fix it

Intel offers a microcode update for many CPUs.
https://downloadcenter.intel.com/download/25512/Linux-Processor-Microcode-Data-File?v=t

Can you test a CPU in another system?
https://www.freebsd.org/cgi/ports.cgi?query=cpuburn&stype=all

Regards

On 04/19/16 09:35, shahzaib mushtaq wrote:
> Hi, sorry for the mistake, cpus are :
>
> 2 x Intel(R) Xeon(R) CPU L5640 @ 2.27GHz5640 (12 cores, 24 threads)
>
> On Tue, Apr 19, 2016 at 12:32 PM, lokadamus@gmx.de wrote:
>
>> On 04/18/16 16:28, shahzaib mushtaq wrote:
>>> Hi again, got back after a long time.
>>> So yes, we've moved to new Dell R510
>>> hardware now. Here are the specs:
>>>
>>> DELL R510
>>> 2 x L5520
>>> 64GB RAM
>>> 12x3TB RAID striping+mirroring (HBA LSI-9211-fw version 19.00)
>>> FreeBSD cw009.tunefiles.com 10.2-RELEASE-p14 FreeBSD 10.2-RELEASE-p14 #0:
>>> Wed Mar 16 20:46:12 UTC 2016
>>> root@amd64-builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC
>>> amd64
>>>
>>> After 9 days of uptime, the server crashed again with the following error in
>>> the crash log:
>>>
>>> http://pastebin.com/baShWuMP
>>>
>>> I am very depressed now; there is a lot of pressure on me from my company.
>>> Please help us resolve this crash issue. :(
>> Which CPU model is installed? Is it one or more?
>>
>> There were some microcode updates for some models.
>>
>> Greeting
>>
> _______________________________________________
> freebsd-questions@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-questions
> To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.org"
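
PS: The STATUS value from the crash log lines at the top of this mail can be split into its fields by hand. The following is only a rough sketch, assuming the generic IA32_MCi_STATUS layout from the machine-check chapter of the Intel SDM (flag bits 63 to 57, model-specific code in bits 31:16, MCA error code in bits 15:0). The function name and the dictionary keys are made up for illustration and are not part of any FreeBSD tool.

```python
# Sketch only: decode the architectural fields of an IA32_MCi_STATUS value.
# Assumption: generic Intel SDM layout; bank-specific meanings are not handled.

# Flag bits in the upper part of the register (bit number -> name).
FLAG_BITS = {
    63: "VAL",    # register contents are valid
    62: "OVER",   # an earlier error was overwritten
    61: "UC",     # uncorrected error
    60: "EN",     # error reporting was enabled
    59: "MISCV",  # MISC register is valid
    58: "ADDRV",  # ADDR register is valid
    57: "PCC",    # processor context corrupt
}


def decode_mci_status(status: int) -> dict:
    """Return the set flags and the two 16-bit error code fields."""
    flags = [
        name
        for bit, name in sorted(FLAG_BITS.items(), reverse=True)
        if (status >> bit) & 1
    ]
    return {
        "flags": flags,
        "model_specific_code": (status >> 16) & 0xFFFF,
        "mca_error_code": status & 0xFFFF,
    }


if __name__ == "__main__":
    # STATUS value copied from the crash log quoted in this thread.
    decoded = decode_mci_status(0xBE00000000800400)
    print("flags:", " ".join(decoded["flags"]))
    print("model specific code: 0x%04x" % decoded["model_specific_code"])
    print("MCA error code:      0x%04x" % decoded["mca_error_code"])
```

For be00000000800400 this reports the UC and PCC flags as set, so the error was uncorrected and the processor context is flagged as corrupt, and an MCA error code of 0x0400. If I read the SDM table of simple error codes correctly, 0x0400 is an internal timer error, which would fit a problem inside the CPU rather than in the RAM, but please check that against the SDM yourself.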