From: Arnaud Lacombe
To: freebsd-stable, FreeBSD Current
Date: Wed, 25 Apr 2012 16:07:34 -0400
Subject: Re: Complete hang on 9.0-RELEASE

Hi,

On Sat, Apr 21, 2012 at 4:19 AM, Arnaud Lacombe wrote:
> Hi,
>
> On Wed, Apr 18, 2012 at 2:22 AM, Arnaud Lacombe wrote:
>> Hi,
>>
>> On Mon, Apr 16, 2012 at 5:50 PM, Arnaud Lacombe wrote:
>>> [...]
>>> I reproduced the previous problem on 10-CURRENT from r233917, on the
>>> following platform (here running 8.2-RELEASE):
>>>
>>> FreeBSD is a registered trademark of The FreeBSD Foundation.
>>> FreeBSD 8.2-RELEASE #0: Thu Feb 17 02:41:51 UTC 2011
>>>     root@mason.cse.buffalo.edu:/usr/obj/usr/src/sys/GENERIC amd64
>>> Timecounter "i8254" frequency 1193182 Hz quality 0
>>> CPU: Intel(R) Atom(TM) CPU D525   @ 1.80GHz (1800.01-MHz K8-class CPU)
>>>   Origin = "GenuineIntel"  Id = 0x106ca  Family = 6  Model = 1c  Stepping = 10
>>>   Features=0xbfebfbff
>>>   Features2=0x40e31d
>>>   AMD Features=0x20100800
>>>   AMD Features2=0x1
>>>   TSC: P-state invariant
>>> real memory  = 2136539136 (2037 MB)
>>> avail memory = 2043772928 (1949 MB)
>>> ACPI APIC Table: <010312 APIC0947>
>>> FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
>>> FreeBSD/SMP: 1 package(s) x 2 core(s) x 2 HTT threads
>>>  cpu0 (BSP): APIC ID:  0
>>>  cpu1 (AP/HT): APIC ID:  1
>>>  cpu2 (AP): APIC ID:  2
>>>  cpu3 (AP/HT): APIC ID:  3
>>>
>>> Complete system freeze while running about 2400 threads. I had to
>>> power cycle the system to get it back alive. I discussed a way to
>>> debug this with attilio@ on freebsd-stable@, but still have not had
>>> time to implement it.
>>>
>> 10-CURRENT from r233917 hung again today while running 3600 threads.
>> I enabled WITNESS and INVARIANTS on that specific kernel, secretly
>> hoping that they would trigger some meaningful information, but they
>> did not. I would guess my last attempt is to enable SW_WATCHDOG, and
>> gather some state information out of DDB when the watchdog triggers, if
>> it does...
>>
>> Btw, this issue seems to be specifically happening on the Atom/ICH8M
>> platform running an amd64 kernel, as I've never seen it on other
>> platforms, despite running extensive tests. I am not entirely sure it
>> happens on i386. I would need to check.
>>
> For the record, 9.0-RELEASE i386 has been running the test for about 2
> days on the D510 platform without any hang so far. I'll keep it
> running all weekend to give me a better idea.
>
... or I have been too eager to expect an amd64-only issue. Thanks to
some nasty virus which stuck me in my bed for two days, I finally got
FreeBSD 9.0-RELEASE i386 stuck while running a single 4000-thread
process. I guess it's time to play with SW_WATCHDOG and DDB.

As a side note, the D510 platform seems to be much harder to hang than
the D525...

 - Arnaud
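
For reference, the debugging setup mentioned above (WITNESS, INVARIANTS, SW_WATCHDOG, DDB) could be sketched as the kernel configuration fragment below. This is only an assumption about how such a kernel might be configured, not the configuration actually used in the tests. The option names are standard FreeBSD kernel options; adding them to a copy of GENERIC is hypothetical, and on -CURRENT some of them may already be present.

```
# Hypothetical additions to a copy of the GENERIC kernel config.
options 	SW_WATCHDOG		# software watchdog driven by the clock interrupt
options 	KDB			# kernel debugger framework
options 	DDB			# interactive in-kernel debugger backend
options 	WITNESS			# lock order verification
options 	INVARIANTS		# extra run-time consistency checks
options 	INVARIANT_SUPPORT	# support code required by INVARIANTS
```

The software watchdog only fires if something arms it, so watchdogd(8) would also need to be running (for example with watchdogd_enable="YES" in /etc/rc.conf). If the watchdog does trigger and drops into DDB, commands such as "ps", "alltrace" and "show alllocks" (the last one needs WITNESS) are the usual starting points for gathering state.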