From owner-freebsd-stable@FreeBSD.ORG Sun May 10 00:27:37 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 27AEA106566B; Sun, 10 May 2009 00:27:37 +0000 (UTC) (envelope-from david@usermode.org) Received: from outbound0.mx.meer.net (proxy.meer.net [64.13.141.13]) by mx1.freebsd.org (Postfix) with ESMTP id 088FC8FC0C; Sun, 10 May 2009 00:27:36 +0000 (UTC) (envelope-from david@usermode.org) Received: from mail.meer.net (mail.meer.net [64.13.141.3]) by outbound0.mx.meer.net (8.14.3/8.14.3) with ESMTP id n4A0Rate047475; Sat, 9 May 2009 17:27:36 -0700 (PDT) (envelope-from david@usermode.org) Received: from radagast.usermode.org (netblock-66-245-218-155.dslextreme.com [66.245.218.155]) by mail.meer.net (8.13.3/8.13.3/meer) with ESMTP id n4A0RCrU030822; Sat, 9 May 2009 17:27:12 -0700 (PDT) (envelope-from david@usermode.org) From: David Johnson To: Robert Noland Date: Sat, 9 May 2009 17:27:12 -0700 User-Agent: KMail/1.11.2 (FreeBSD/7.2-RELEASE; KDE/4.2.2; i386; ; ) References: <200905042015.29394.david@usermode.org> <200905081458.53651.david@usermode.org> <1241821864.1733.51.camel@balrog.2hip.net> In-Reply-To: <1241821864.1733.51.camel@balrog.2hip.net> MIME-Version: 1.0 Content-Type: Text/Plain; charset="iso-8859-15" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905091727.12165.david@usermode.org> X-Spam-Score: undef - spam scanning disabled X-CanIt-Geo: ip=64.13.141.3; country=US; region=CA; city=Mountain View; latitude=37.3974; longitude=-122.0732; metrocode=807; areacode=650; http://maps.google.com/maps?q=37.3974,-122.0732&z=6 X-CanItPRO-Stream: default X-Canit-Stats-ID: Bayes signature not available X-Scanned-By: CanIt (www . roaringpenguin . com) on 64.13.141.13 Cc: freebsd-stable@freebsd.org Subject: Re: Xorg hangs with drmwtq in 7.2-RELEASE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 10 May 2009 00:27:37 -0000 On Friday 08 May 2009 03:31:04 pm Robert Noland wrote: > In order to guess what might be causing this, drm debugging needs to be > enabled before the hang, so that we can hopefully figure out what leads > up to the hung GPU. Unfortunately that won't work, because turning on hw.dri.0.debug slows down compositing so much that it won't reproduce. -- David Johnson From owner-freebsd-stable@FreeBSD.ORG Sun May 10 01:41:37 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id A58DA1065670; Sun, 10 May 2009 01:41:37 +0000 (UTC) (envelope-from david@usermode.org) Received: from outbound0.mx.meer.net (proxy.meer.net [64.13.141.13]) by mx1.freebsd.org (Postfix) with ESMTP id 862128FC18; Sun, 10 May 2009 01:41:37 +0000 (UTC) (envelope-from david@usermode.org) Received: from mail.meer.net (mail.meer.net [64.13.141.3]) by outbound0.mx.meer.net (8.14.3/8.14.3) with ESMTP id n4A1fbYt049554; Sat, 9 May 2009 18:41:37 -0700 (PDT) (envelope-from david@usermode.org) Received: from radagast.usermode.org (netblock-66-245-218-155.dslextreme.com [66.245.218.155]) by mail.meer.net (8.13.3/8.13.3/meer) with ESMTP id n4A1fQgD059554; Sat, 9 May 2009 18:41:26 -0700 (PDT) (envelope-from david@usermode.org) From: David Johnson To: Robert Noland Date: Sat, 9 May 2009 18:41:26 -0700 User-Agent: KMail/1.11.2 (FreeBSD/7.2-RELEASE; KDE/4.2.2; i386; ; ) References: <200905042015.29394.david@usermode.org> <200905081458.53651.david@usermode.org> <1241821864.1733.51.camel@balrog.2hip.net> In-Reply-To: <1241821864.1733.51.camel@balrog.2hip.net> MIME-Version: 1.0 Content-Type: Text/Plain; charset="iso-8859-15" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905091841.26274.david@usermode.org> X-Spam-Score: undef - spam scanning disabled X-CanIt-Geo: ip=64.13.141.3; country=US; region=CA; city=Mountain View; latitude=37.3974; longitude=-122.0732; metrocode=807; areacode=650; http://maps.google.com/maps?q=37.3974,-122.0732&z=6 X-CanItPRO-Stream: default X-Canit-Stats-ID: Bayes signature not available X-Scanned-By: CanIt (www . roaringpenguin . com) on 64.13.141.13 Cc: freebsd-stable@freebsd.org Subject: Re: Xorg hangs with drmwtq in 7.2-RELEASE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 10 May 2009 01:41:37 -0000 On Friday 08 May 2009 03:31:04 pm Robert Noland wrote: > In order to guess what might be causing this, drm debugging needs to be > enabled before the hang, so that we can hopefully figure out what leads > up to the hung GPU. I'm not able to do that, but I did manage to get debug turned on and dmesg captured early enough to catch some additional information. I've place the full file online at http://www.usermode.org/misc/dmesg.txt, but am including some snippets here. Hopefully this is enough to move forward. -- David Johnson ... [drm:pid1822:drm_ioctl] pid=1822, cmd=0xc0286429, nr=0x29, dev 0xc615fa00, auth=1 [drm:pid1822:radeon_freelist_get] done_age = 102778 [drm:pid1822:drm_ioctl] pid=1822, cmd=0xc010644d, nr=0x4d, dev 0xc615fa00, auth=1 [drm:pid1822:radeon_cp_indirect] idx=27 s=0 e=88 d=1 [drm:pid1822:radeon_cp_dispatch_indirect] buf=27 s=0x0 e=0x58 [drm:pid1822:drm_close] open_count = 2 [drm:pid1822:drm_close] pid = 1822, device = 0xc615fa00, open_count = 2 [drm:pid1822:drm_ioctl] pid=1822, cmd=0x80086442, nr=0x42, dev 0xc615fa00, auth=1 [drm:pid1822:radeon_cp_stop] [drm:pid1822:radeon_do_cp_flush] [drm:pid1822:radeon_do_cp_idle] [drm:pid1822:radeon_do_cp_stop] [drm:pid1822:radeon_do_engine_reset] info: [drm] Num pipes: 1 [drm:pid1822:radeon_do_cp_reset] [drm:pid1822:drm_ioctl] pid=1822, cmd=0x800c6459, nr=0x59, dev 0xc615fa00, auth=1 [drm:pid1822:drm_ioctl] pid=1822, cmd=0x80086414, nr=0x14, dev 0xc615fa00, auth=1 [drm:pid1822:drm_irq_uninstall] irq=16 [drm:pid1822:drm_ioctl] pid=1822, cmd=0x80546440, nr=0x40, dev 0xc615fa00, auth=1 [drm:pid1822:radeon_do_cleanup_cp] [drm:pid1822:drm_ioctl] pid=1822, cmd=0x80086439, nr=0x39, dev 0xc615fa00, auth=1 [drm:pid1822:drm_sg_free] sg free virtual = 0xe8a64000 [drm:pid1822:drm_ioctl] pid=1822, cmd=0x8004667e, nr=0x7e, dev 0xc615fa00, auth=1 [drm:pid1822:drm_ioctl] pid=1822, cmd=0x8004667d, nr=0x7d, dev 0xc615fa00, auth=1 [drm:pid1822:drm_ioctl] pid=1822, cmd=0xc0086421, nr=0x21, dev 0xc615fa00, auth=1 [drm:pid1822:drm_rmctx] 2 [drm:pid1822:drm_ioctl] pid=1822, cmd=0xc0086421, nr=0x21, dev 0xc615fa00, auth=1 [drm:pid1822:drm_rmctx] 1 [drm:pid1822:drm_ioctl] pid=1822, cmd=0xc0086426, nr=0x26, dev 0xc615fa00, auth=1 [drm:pid1822:drm_ioctl] pid=1822, cmd=0xc0086426, nr=0x26, dev 0xc615fa00, auth=1 [drm:pid1822:drm_ioctl] pid=1822, cmd=0x8008642b, nr=0x2b, dev 0xc615fa00, auth=1 [drm:pid1822:drm_unlock] 1 (pid 1822) requests unlock (0x80000001), flags = 0x00000000 [drm:pid1822:drm_close] open_count = 1 [drm:pid1822:drm_close] pid = 1822, device = 0xc615fa00, open_count = 1 [drm:pid1822:drm_lastclose] [drm:pid1822:radeon_do_cleanup_cp] info: [drm] Setting GART location based on new memory map info: [drm] Loading R500 Microcode info: [drm] Num pipes: 1 info: [drm] writeback test succeeded in 1 usecs drm0: [ITHREAD] info: [drm] Num pipes: 1 info: [drm] Setting GART location based on new memory map info: [drm] Loading R500 Microcode info: [drm] Num pipes: 1 info: [drm] writeback test succeeded in 1 usecs drm0: [ITHREAD] info: [drm] Num pipes: 1 info: [drm] Setting GART location based on new memory map info: [drm] Loading R500 Microcode info: [drm] Num pipes: 1 info: [drm] writeback test succeeded in 1 usecs drm0: [ITHREAD] info: [drm] Num pipes: 1 info: [drm] Setting GART location based on new memory map info: [drm] Loading R500 Microcode info: [drm] Num pipes: 1 info: [drm] writeback test succeeded in 1 usecs drm0: [ITHREAD] info: [drm] Num pipes: 1 info: [drm] Setting GART location based on new memory map info: [drm] Loading R500 Microcode info: [drm] Num pipes: 1 info: [drm] writeback test succeeded in 1 usecs drm0: [ITHREAD] info: [drm] Num pipes: 1 info: [drm] Setting GART location based on new memory map info: [drm] Loading R500 Microcode info: [drm] Num pipes: 1 info: [drm] writeback test succeeded in 1 usecs drm0: [ITHREAD] info: [drm] Num pipes: 1 info: [drm] Setting GART location based on new memory map info: [drm] Loading R500 Microcode info: [drm] Num pipes: 1 info: [drm] writeback test succeeded in 1 usecs drm0: [ITHREAD] [drm:pid6216:drm_ioctl] returning 4 [drm:pid6216:drm_ioctl] pid=6216, cmd=0x80046457, nr=0x57, dev 0xc615fa00, auth=1 [drm:pid6216:drm_ioctl] returning 4 [drm:pid6216:drm_ioctl] pid=6216, cmd=0x80046457, nr=0x57, dev 0xc615fa00, auth=1 [drm:pid6216:drm_ioctl] returning 4 [drm:pid6216:drm_ioctl] pid=6216, cmd=0x80046457, nr=0x57, dev 0xc615fa00, auth=1 [drm:pid6216:drm_ioctl] returning 4 From owner-freebsd-stable@FreeBSD.ORG Sun May 10 13:35:22 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id BCE8C106564A; Sun, 10 May 2009 13:35:22 +0000 (UTC) (envelope-from villa.alberto@gmail.com) Received: from mail-bw0-f165.google.com (mail-bw0-f165.google.com [209.85.218.165]) by mx1.freebsd.org (Postfix) with ESMTP id 0CC2F8FC12; Sun, 10 May 2009 13:35:21 +0000 (UTC) (envelope-from villa.alberto@gmail.com) Received: by bwz9 with SMTP id 9so2173930bwz.43 for ; Sun, 10 May 2009 06:35:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=viJQOQEUGtWW+CVbNX+GWg1ZkKntGoIUHtzPWhHXgvI=; b=KNoXYTHkShpjIUNNLiMP0FvWc6FrA3ajHlb3CICaoljwQZco40iaOMc3C8O0F1fp20 SooyRSVZ402JHxJCt4t388qc4AXVEOKbOaxSfCxPw3hGrp9FLwR2hhRj4PTA7kit5pFH EldVeziGVv2cVFzZP+FH47Rxs2HpKijev3S+k= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=vRjzYUj46wFU7habh1Nw8go7YS35be4cfGly3/5ZJGAyya7GDvMcyF3/WX9xVqQPR9 Idx9XlNsMHLxFSe9WxKnJbxni+Q1t2JrIBHN6I5vKdfXSCt1DDjAkOdHC/XxHP0SCTM7 vEwi6uqpDc63KIHs5/1MoDLGX5LwC8EButQT4= MIME-Version: 1.0 Received: by 10.204.116.8 with SMTP id k8mr5768695bkq.110.1241960645767; Sun, 10 May 2009 06:04:05 -0700 (PDT) In-Reply-To: <1241870806.1733.61.camel@balrog.2hip.net> References: <1238293386.00093672.1238281804@10.7.7.3> <49CF6899.2060002@bsdforen.de> <49CF8E8D.1080604@bsdforen.de> <49CF9C19.3020509@FreeBSD.org> <49D5DA33.4010800@bsdforen.de> <1238778004.65025.30.camel@balrog.2hip.net> <49DF5D60.9010803@bsdforen.de> <1239384104.1922.70.camel@balrog.2hip.net> <4A053E52.5030602@bsdforen.de> <1241870806.1733.61.camel@balrog.2hip.net> Date: Sun, 10 May 2009 15:04:05 +0200 Message-ID: From: Alberto Villa To: Robert Noland Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Cc: Dominic Fandrey , Alexander Motin , freebsd-stable@freebsd.org Subject: Re: powerd broken X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 10 May 2009 13:35:23 -0000 On Sat, May 9, 2009 at 2:06 PM, Robert Noland wrote: > Which update, what? =A0I haven't touched the kernel tree in a while, just > trying to sort it all out with patches here and there. =A0Are you saying > the the 2.7.0 intel driver helped? =A0Or maybe the Xserver or mesa > updates? updating intel driver, xserver and drm helped a lot! i've finally deinstalled intel 2.5.*, and started using (happily) exa instead of xaa in xorg --=20 Alberto Villa From owner-freebsd-stable@FreeBSD.ORG Sun May 10 15:51:26 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C7F3310656FA for ; Sun, 10 May 2009 15:51:26 +0000 (UTC) (envelope-from info@lottery.co.uk) Received: from hm995.locaweb.com.br (shared-1.locaweb.com.br [200.234.214.132]) by mx1.freebsd.org (Postfix) with ESMTP id 52B4F8FC23 for ; Sun, 10 May 2009 15:51:25 +0000 (UTC) (envelope-from info@lottery.co.uk) Received: from hm1207.locaweb.com.br (hm1207.locaweb.com.br [200.234.200.152]) by hm995.locaweb.com.br (Postfix) with ESMTP id AABEDA2E2828B for ; Sun, 10 May 2009 12:34:12 -0300 (BRT) Received: by hm1207.locaweb.com.br (Postfix, from userid 50714) id 9C6BC3C18A; Sun, 10 May 2009 12:31:39 -0300 (BRT) X-Locaweb-ID: 63325679646D56794F69426F625445794D4463734948567A5A584A755957316C4F694232595735705957526C5932467A64484A76 To: freebsd-stable@freebsd.org X-PHP-Script: www.vaniadecastro.com.br/zero.php for 196.3.182.250 From: UK NATIONAL LOTTERY MIME-Version: 1.0 Content-Type: text/plain Content-Transfer-Encoding: 8bit Message-Id: <20090510153412.9C6BC3C18A@hm1207.locaweb.com.br> Date: Sun, 10 May 2009 12:31:39 -0300 (BRT) Subject: National Lottery: Your Email Won X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: zonal.anderson-spencer@msn.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 10 May 2009 15:51:31 -0000 United Kingdom National Lottery 101 Bovill Road, London SE23 1EL United Kingdom File #: EGS/2251256003/02 Congratulations, we are pleased to inform you of the result of the United Kingdom National Lottery Award Winners. Your email address have been randomly selected as a winner in the ongoing United Kingdom National Lottery Online program, the draw was held on 30th April, 2009 using a computerized balloting system of selection. The United Kingdom National Lottery is aimed and focused at global development and improvement of living standard across the world. Free £77 Million Pounds won including *four* Ten Million Pounds Winners and *fourteen* Millionaires plus thousands of other cash prizes. Winner from all over the world, India, France, Singapore, USA, United Kingdom, Spain, South America, Malaysia, Indonesia, South Africa, Belgium, Denmark, Ireland and many more. We wish to express our sincere apologies for the late notification, this free award online program is been conducted bi-quarterly. United Kingdom National Lottery Free Award draw was conducted at the Europe Issuing Centre, you were selected from an exclusive list of 1,000,000,000 e-mail addresses of internet users from the following categories; consumers, professionals and corporate bodies picked by an advanced automated random computer ballot search from the internet 'NO TICKETS OR DRAFTS WERE SOLD'. Your email address attached to Security File #: EGS/2251256003/02 with Serial number No: 002839 emerged as a winner of Six Hundred Thousand Pounds (£600.000.00 GBP), therefore you are eligible to file claim for your prize as one of our lucky winners for the payout of your total sum after a thorough verification that will be conducted by our various credible financial institutions. This online program is precisely aimed at enabling all internet users across the world benefit from the United Kingdom National Lottery, your email address falls within the First Category Winner as such your file has been designated to our European Centre, where the complete verification and payout will be conducted only if there are no exceptions during the claims process, to file your claim immediately please contact our International Programs Director Anderson Spencer with the following information: 1. Name in full----------------------------------------- 2. Phone/Fax------------------------------------------- 3. Occupation------------------------------------------ TO: Contact Person: Anderson Spencer European Payment Issuing Office Tel: +447024065192 (8am - 5pm GMT) Fax: +447092894160 Email: zonal.anderson-spencer@msn.com NOTE: In order to benefit from this program, you are advised in your own best interest to file your claim not later than 7days days from the date of this notification to avoid disqualification; anybody under the age of 18 is automatically disqualified. Please include this File #: EGS/2251256003/02 in every of your correspondence with our Foreign Service Director Anderson Spencer. IMPORTANT: Solemn confidentiality should be ensured until successful remittance of your prize to you to avoid undue taking of advantage, unwarranted claim and abuse of program, any breach of confidentiality on the part of the winner will result to automatic disqualification. Sincerely Yours, Mrs. Julie Van Hans, Executive Director. United Kingdom National Lottery. From owner-freebsd-stable@FreeBSD.ORG Sun May 10 19:46:25 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id D7F7C1065674 for ; Sun, 10 May 2009 19:46:25 +0000 (UTC) (envelope-from fbsd-ml@scrapper.ca) Received: from idcmail-mo2no.shaw.ca (idcmail-mo2no.shaw.ca [64.59.134.9]) by mx1.freebsd.org (Postfix) with ESMTP id A7C388FC12 for ; Sun, 10 May 2009 19:46:25 +0000 (UTC) (envelope-from fbsd-ml@scrapper.ca) Received: from pd7ml2no-ssvc.prod.shaw.ca ([10.0.153.162]) by pd5mo1no-svcs.prod.shaw.ca with ESMTP; 10 May 2009 13:17:40 -0600 X-Cloudmark-SP-Filtered: true X-Cloudmark-SP-Result: v=1.0 c=0 a=ep_KMAzDAAAA:8 a=l7WYoCnWvgNR4_T2h1AA:9 a=6BPJljR1zNXoPgkAihAA:7 a=hS5IZTnYiVPMlb0gRFd-aTOPqWMA:4 a=7mil6v5nu_kA:10 a=9_24lj8EJv0A:10 Received: from s010600121729c74c.vc.shawcable.net (HELO proven.lan) ([24.85.241.34]) by pd7ml2no-dmz.prod.shaw.ca with ESMTP; 10 May 2009 13:17:40 -0600 Received: from proven.lan (localhost [127.0.0.1]) by proven.lan (8.14.3/8.14.3) with ESMTP id n4AJHeVs003487 for ; Sun, 10 May 2009 12:17:40 -0700 (PDT) (envelope-from fbsd-ml@scrapper.ca) Received: from localhost (localhost [[UNIX: localhost]]) by proven.lan (8.14.3/8.14.3/Submit) id n4AJHeqX003486 for freebsd-stable@freebsd.org; Sun, 10 May 2009 12:17:40 -0700 (PDT) (envelope-from fbsd-ml@scrapper.ca) X-Authentication-Warning: proven.lan: npapke set sender to fbsd-ml@scrapper.ca using -f From: Norbert Papke Organization: Archaeological Filing To: freebsd-stable@freebsd.org Date: Sun, 10 May 2009 12:17:39 -0700 User-Agent: KMail/1.9.10 MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905101217.39920.fbsd-ml@scrapper.ca> Subject: 7.2-STABLE: Inserting USB device causes Fatal Trap 12 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 10 May 2009 19:46:26 -0000 Inserting a USB thumb drive into a running sytem result in a "Fatal trap 12: page fault while in kernel mode". Unfortunately, I was not able to save a core (not entirely sure why, I'll investigate separately). I have manually copied the backtrace: usb_transfer_complete bus_dmamap_load usbd_transfer usbd_do_request_flags_pipe usbd_do_request_flags usbd_get_string_desc usbd_get_string usbd_devinfo_vp usbd_devinfo usbd_new_device uhub_explore usb_event_thread fork_exit for_trampine The problem is repeatable. It only happens when I insert the thumb drive into a running system. If I boot with the thumb drive present, everything is fine. Any help is greatly appreciated. Cheers, -- Norbert Papke. ================================================= # uname -a FreeBSD proven.lan 7.2-STABLE FreeBSD 7.2-STABLE #0 r191841: Tue May 5 21:13:21 PDT 2009 npapke@proven.lan:/usr/obj/red/public/freebsd/sources/stable/sys/PROVEN amd64 ================================================= Kernel config: include GENERIC ident PROVEN options KDB # kernel debugger (just in case) options KDB_TRACE options DDB # kernel debugger (just in case) options WITNESS options WITNESS_SKIPSPIN options IPSEC device crypto device stf # for IPv6 tunneling # keep kernel messages from different cpus separate options PRINTF_BUFR_SIZE=64 option SC_HISTORY_SIZE=2000 options SC_NORM_ATTR=(FG_GREEN|BG_BLACK) options SC_NORM_REV_ATTR=(FG_YELLOW|BG_GREEN) options SC_KERNEL_CONS_ATTR=(FG_LIGHTRED|BG_BLACK) options SC_KERNEL_CONS_REV_ATTR=(FG_BLACK|BG_RED) # Alternate Queuing of network packets options ALTQ options ALTQ_CBQ # Class Bases Queuing (CBQ) options ALTQ_RED # Random Early Detection (RED) options ALTQ_RIO # RED In/Out options ALTQ_HFSC # Hierarchical Packet Scheduler (HFSC) options ALTQ_PRIQ # Priority Queuing (PRIQ) options ALTQ_NOPCC # Required for SMP build # load as module for debugging nodevice re # RealTek 8139C+/8169/8169S/8110S ================================================= Copyright (c) 1992-2009 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 7.2-STABLE #0 r191841: Tue May 5 21:13:21 PDT 2009 npapke@proven.lan:/usr/obj/red/public/freebsd/sources/stable/sys/PROVEN WARNING: WITNESS option enabled, expect reduced performance. Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel(R) Core(TM)2 Duo CPU E8500 @ 3.16GHz (3155.59-MHz K8-class CPU) Origin = "GenuineIntel" Id = 0x1067a Stepping = 10 Features=0xbfebfbff Features2=0x408e3fd,XSAVE> AMD Features=0x20100800 AMD Features2=0x1 Cores per package: 2 usable memory = 4279189504 (4080 MB) avail memory = 4097724416 (3907 MB) ACPI APIC Table: <100808 APIC1053> FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 ioapic0 irqs 0-23 on motherboard kbd1 at kbdmux0 cryptosoft0: on motherboard acpi0: <100808 XSDT1053> on motherboard acpi0: [ITHREAD] acpi0: Power Button (fixed) acpi0: reservation of ffc00000, 300000 (3) failed acpi0: reservation of fee00000, 1000 (3) failed acpi0: reservation of 0, a0000 (3) failed acpi0: reservation of 100000, bff00000 (3) failed Timecounter "ACPI-safe" frequency 3579545 Hz quality 850 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0 acpi_hpet0: iomem 0xfed00000-0xfed003ff on acpi0 Timecounter "HPET" frequency 14318180 Hz quality 900 pcib0: port 0xcf8-0xcff on acpi0 pci0: on pcib0 pcib1: irq 16 at device 1.0 on pci0 pci1: on pcib1 vgapci0: port 0xc000-0xc0ff mem 0xd0000000-0xdfffffff,0xfe9f0000-0xfe9fffff irq 16 at device 0.0 on pci1 drm0: on vgapci0 info: [drm] MSI enabled 1 message(s) vgapci0: child drm0 requested pci_enable_busmaster info: [drm] Initialized radeon 1.29.0 20080528 hdac0: mem 0xfe9ec000-0xfe9effff irq 17 at device 0.1 on pci1 hdac0: HDA Driver Revision: 20090329_0131 hdac0: [ITHREAD] uhci0: port 0xbc00-0xbc1f irq 16 at device 26.0 on pci0 uhci0: [GIANT-LOCKED] uhci0: [ITHREAD] usb0: on uhci0 usb0: USB revision 1.0 uhub0: on usb0 uhub0: 2 ports with 2 removable, self powered uhci1: port 0xb880-0xb89f irq 21 at device 26.1 on pci0 uhci1: [GIANT-LOCKED] uhci1: [ITHREAD] usb1: on uhci1 usb1: USB revision 1.0 uhub1: on usb1 uhub1: 2 ports with 2 removable, self powered uhci2: port 0xb800-0xb81f irq 19 at device 26.2 on pci0 uhci2: [GIANT-LOCKED] uhci2: [ITHREAD] usb2: on uhci2 usb2: USB revision 1.0 uhub2: on usb2 uhub2: 2 ports with 2 removable, self powered ehci0: mem 0xfe8fe000-0xfe8fe3ff irq 18 at device 26.7 on pci0 ehci0: [GIANT-LOCKED] ehci0: [ITHREAD] usb3: EHCI version 1.0 usb3: companion controllers, 2 ports each: usb0 usb1 usb2 usb3: on ehci0 usb3: USB revision 2.0 uhub3: on usb3 uhub3: 6 ports with 6 removable, self powered hdac1: mem 0xfe8f8000-0xfe8fbfff irq 22 at device 27.0 on pci0 hdac1: HDA Driver Revision: 20090329_0131 hdac1: [ITHREAD] pcib2: irq 17 at device 28.0 on pci0 pci2: on pcib2 pcib3: irq 16 at device 28.5 on pci0 pci3: on pcib3 re0: port 0xd800-0xd8ff mem 0xfeaff000-0xfeafffff,0xfdff0000-0xfdffffff irq 17 at device 0.0 on pci3 re0: Chip rev. 0x3c000000 re0: MAC rev. 0x00400000 miibus0: on re0 rgephy0: PHY 1 on miibus0 rgephy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto re0: Ethernet address: 00:30:48:b0:6a:1f re0: [FILTER] uhci3: port 0xb480-0xb49f irq 23 at device 29.0 on pci0 uhci3: [GIANT-LOCKED] uhci3: [ITHREAD] usb4: on uhci3 usb4: USB revision 1.0 uhub4: on usb4 uhub4: 2 ports with 2 removable, self powered uhci4: port 0xb400-0xb41f irq 19 at device 29.1 on pci0 uhci4: [GIANT-LOCKED] uhci4: [ITHREAD] usb5: on uhci4 usb5: USB revision 1.0 uhub5: on usb5 uhub5: 2 ports with 2 removable, self powered uhci5: port 0xb080-0xb09f irq 18 at device 29.2 on pci0 uhci5: [GIANT-LOCKED] uhci5: [ITHREAD] usb6: on uhci5 usb6: USB revision 1.0 uhub6: on usb6 uhub6: 2 ports with 2 removable, self powered ehci1: mem 0xfe8fc000-0xfe8fc3ff irq 23 at device 29.7 on pci0 ehci1: [GIANT-LOCKED] ehci1: [ITHREAD] usb7: EHCI version 1.0 usb7: companion controllers, 2 ports each: usb4 usb5 usb6 usb7: on ehci1 usb7: USB revision 2.0 uhub7: on usb7 uhub7: 6 ports with 6 removable, self powered umass0: on uhub7 pcib4: at device 30.0 on pci0 pci4: on pcib4 dc0: port 0xe800-0xe8ff mem 0xfebffc00-0xfebfffff irq 21 at device 1.0 on pci4 miibus1: on dc0 acphy0: PHY 1 on miibus1 acphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto dc0: Ethernet address: 00:20:78:10:3e:98 dc0: [ITHREAD] isab0: at device 31.0 on pci0 isa0: on isab0 atapci0: port 0xb000-0xb007,0xac00-0xac03,0xa880-0xa887,0xa800-0xa803,0xa480-0xa48f,0xa400-0xa40f irq 19 at device 31.2 on pci0 atapci0: [ITHREAD] ata2: on atapci0 ata2: [ITHREAD] ata3: on atapci0 ata3: [ITHREAD] ichsmb0: port 0x400-0x41f mem 0xfe8f7c00-0xfe8f7cff irq 18 at device 31.3 on pci0 ichsmb0: [GIANT-LOCKED] ichsmb0: [ITHREAD] smbus0: on ichsmb0 smb0: on smbus0 atapci1: port 0xa000-0xa007,0x9c00-0x9c03,0x9880-0x9887,0x9800-0x9803,0x9480-0x948f,0x9400-0x940f irq 19 at device 31.5 on pci0 atapci1: [ITHREAD] ata4: on atapci1 ata4: [ITHREAD] ata5: on atapci1 ata5: [ITHREAD] acpi_button0: on acpi0 sio0: configured irq 4 not in bitmap of probed irqs 0 sio0: port may not be enabled sio0: configured irq 4 not in bitmap of probed irqs 0 sio0: port may not be enabled sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A sio0: [FILTER] ppc0: port 0x378-0x37f irq 7 on acpi0 ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode ppbus0: on ppc0 ppbus0: [ITHREAD] lpt0: on ppbus0 lpt0: Interrupt-driven port ppi0: on ppbus0 plip0: on ppbus0 plip0: WARNING: using obsoleted IFF_NEEDSGIANT flag ppc0: [GIANT-LOCKED] ppc0: [ITHREAD] atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] atkbd0: [ITHREAD] psm0: irq 12 on atkbdc0 psm0: [GIANT-LOCKED] psm0: [ITHREAD] psm0: model IntelliMouse, device ID 3 cpu0: on acpi0 ACPI Warning (tbutils-0243): Incorrect checksum in table [OEMB] - 45, should be 40 [20070320] coretemp0: on cpu0 est0: on cpu0 p4tcc0: on cpu0 cpu1: on acpi0 coretemp1: on cpu1 est1: on cpu1 est: CPU supports Enhanced Speedstep, but is not recognized. est: cpu_vendor GenuineIntel, msr 616492206004922 device_attach: est1 attach returned 6 p4tcc1: on cpu1 orm0: at iomem 0xc0000-0xcffff on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 Timecounters tick every 1.000 msec IPsec: Initialized Security Association Processing. ad4: 239372MB at ata2-master SATA150 ad7: 305245MB at ata3-slave SATA300 ad8: 610480MB at ata4-master SATA300 GEOM_LABEL: Label for provider ad4s1a is ufsid/497cecd46b0e22e5. acd0: DVDR at ata5-master SATA150 hdac0: HDA Codec #0: ATI R6xx HDMI pcm0: at cad 0 nid 1 on hdac0 hdac1: HDA Codec #2: Realtek ALC888 hdac1: hdac_command_send_internal: TIMEOUT numcmd=1, sent=1, received=0 hdac1: hdac_command_send_internal: TIMEOUT numcmd=1, sent=1, received=0 hdac1: Codec #3 is not responding! Probing aborted. pcm1: at cad 2 nid 1 on hdac1 pcm2: at cad 2 nid 1 on hdac1 pcm3: at cad 2 nid 1 on hdac1 acd0: FAILURE - INQUIRY ILLEGAL REQUEST asc=0x24 ascq=0x00 (probe1:umass-sim0:0:0:0): TEST UNIT READY. CDB: 0 0 0 0 0 0 (probe1:umass-sim0:0:0:0): CAM Status: SCSI Status Error (probe1:umass-sim0:0:0:0): SCSI Status: Check Condition (probe1:umass-sim0:0:0:0): UNIT ATTENTION asc:28,0 (probe1:umass-sim0:0:0:0): Not ready to ready change, medium may have changed (probe1:umass-sim0:0:0:0): Retrying Command (per Sense Data) acd0: FAILURE - INQUIRY ILLEGAL REQUEST asc=0x24 ascq=0x00 SMP: AP CPU #1 Launched! WARNING: WITNESS option enabled, expect reduced performance. da0 at umass-sim0 bus 0 target 0 lun 0 da0: Removable Direct Access SCSI-2 device da0: 40.000MB/s transfers da0: 3830MB (7843840 512 byte sectors: 255H 63S/T 488C) cd0 at ata3 bus 0 target 0 lun 0 cd0: Removable CD-ROM SCSI-0 device cd0: 3.300MB/s transfers cd0: cd present [4098336 x 2048 byte records] GEOM_LABEL: Label for provider acd0 is iso9660/THE_MATRIX_16X9LB_N_AMERICA. Trying to mount root from ufs:/dev/ad4s1a WARNING: / was not properly dismounted WARNING: reducing size to maximum of 67108864 blocks per swap unit GEOM_LABEL: Label ufsid/497cecd46b0e22e5 removed. GEOM_LABEL: Label for provider ad4s1a is ufsid/497cecd46b0e22e5. GEOM_LABEL: Label ufsid/497cecd46b0e22e5 removed. This module (opensolaris) contains code covered by the Common Development and Distribution License (CDDL) see http://opensolaris.org/os/licensing/opensolaris_license/ WARNING: ZFS is considered to be an experimental feature in FreeBSD. ZFS filesystem version 6 ZFS storage pool version 6 lock order reversal: 1st 0xffffffff80e49de0 pf task mtx (pf task mtx) @ /red/public/freebsd/sources/stable/sys/modules/pf/../../contrib/pf/net/pf_ioctl.c:1394 2nd 0xffffffff80ba94c0 ifnet (ifnet) @ /red/public/freebsd/sources/stable/sys/net/if.c:1623 KDB: stack backtrace: db_trace_self_wrapper() at db_trace_self_wrapper+0x2a witness_checkorder() at witness_checkorder+0x543 _mtx_lock_flags() at _mtx_lock_flags+0x1f ifunit() at ifunit+0x24 pfioctl() at pfioctl+0x2531 devfs_ioctl_f() at devfs_ioctl_f+0x71 kern_ioctl() at kern_ioctl+0x91 ioctl() at ioctl+0xeb syscall() at syscall+0x1a5 Xfast_syscall() at Xfast_syscall+0xab --- syscall (54, FreeBSD ELF64, ioctl), rip = 0x80096296c, rsp = 0x7fffffffdc18, rbp = 0x7fffffffdca0 --- kqemu version 0x00010400 kqemu: KQEMU installed, max_locked_mem=2089448kB. acd0: FAILURE - READ_BIG timed out acd0: FAILURE - READ_BIG timed out acd0: FAILURE - READ_BIG timed out info: [drm] Setting GART location based on new memory map info: [drm] Loading RV635 CP Microcode info: [drm] Loading RV635 PFP Microcode info: [drm] Resetting GPU info: [drm] writeback test succeeded in 1 usecs drm0: [ITHREAD] acd0: FAILURE - READ_BIG timed out acd0: FAILURE - READ_BIG timed out (cd0:ata3:0:0:0): cddone: got error 0x5 back tap0: Ethernet address: 00:bd:d8:9b:04:00 bridge0: Ethernet address: fa:cc:68:2e:a4:8e tap0: promiscuous mode enabled dc0: promiscuous mode enabled From owner-freebsd-stable@FreeBSD.ORG Sun May 10 21:26:10 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0DCC3106564A for ; Sun, 10 May 2009 21:26:10 +0000 (UTC) (envelope-from npapke@acm.org) Received: from idcmail-mo2no.shaw.ca (idcmail-mo2no.shaw.ca [64.59.134.9]) by mx1.freebsd.org (Postfix) with ESMTP id C724F8FC14 for ; Sun, 10 May 2009 21:26:09 +0000 (UTC) (envelope-from npapke@acm.org) Received: from pd7ml2no-ssvc.prod.shaw.ca ([10.0.153.162]) by pd7mo1no-svcs.prod.shaw.ca with ESMTP; 10 May 2009 15:26:09 -0600 X-Cloudmark-SP-Filtered: true X-Cloudmark-SP-Result: v=1.0 c=0 a=ep_KMAzDAAAA:8 a=6I5d2MoRAAAA:8 a=wC-krTQd3l4TNOsoq4IA:9 a=aH5vXGl-Mr3nYQcszIsA:7 a=6DEoCEKRnpwcYwFrCuF6mXlI43EA:4 a=SV7veod9ZcQA:10 a=nAPXUAfsBmEA:10 a=avX_41wpOqIA:10 a=macy1kFFMuwA:10 Received: from s010600121729c74c.vc.shawcable.net (HELO proven.lan) ([24.85.241.34]) by pd7ml2no-dmz.prod.shaw.ca with ESMTP; 10 May 2009 15:26:08 -0600 Received: from proven.lan (localhost [127.0.0.1]) by proven.lan (8.14.3/8.14.3) with ESMTP id n4ALQ82U003368 for ; Sun, 10 May 2009 14:26:08 -0700 (PDT) (envelope-from npapke@acm.org) Received: from localhost (localhost [[UNIX: localhost]]) by proven.lan (8.14.3/8.14.3/Submit) id n4ALQ85S003367 for freebsd-stable@freebsd.org; Sun, 10 May 2009 14:26:08 -0700 (PDT) (envelope-from npapke@acm.org) X-Authentication-Warning: proven.lan: npapke set sender to npapke@acm.org using -f From: Norbert Papke Organization: Archaeological Filing To: freebsd-stable@freebsd.org Date: Sun, 10 May 2009 14:26:08 -0700 User-Agent: KMail/1.9.10 References: <200905101217.39920.fbsd-ml@scrapper.ca> In-Reply-To: <200905101217.39920.fbsd-ml@scrapper.ca> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Message-Id: <200905101426.08256.npapke@acm.org> Subject: Re: 7.2-STABLE: Inserting USB device causes Fatal Trap 12 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 10 May 2009 21:26:10 -0000 On May 10, 2009, Norbert Papke wrote: > Inserting a USB thumb drive into a running sytem result in a "Fatal trap > 12: page fault while in kernel mode". > > Unfortunately, I was not able to save a core (not entirely sure why, I'll > investigate separately). I have manually copied the backtrace: I now have a kernel dump and backtrace with symbols: #0 doadump () at pcpu.h:195 #1 0xffffffff801d239c in db_fncall (dummy1=3DVariable "dummy1" is not=20 available. ) at /red/public/freebsd/sources/stable/sys/ddb/db_command.c:516 #2 0xffffffff801d28a9 in db_command (last_cmdp=3D0xffffffff80adc648,=20 cmd_table=3D0x0, dopager=3D1) at /red/public/freebsd/sources/stable/sys/ddb/db_command.c:413 #3 0xffffffff801d2aab in db_command_loop ()=20 at /red/public/freebsd/sources/stable/sys/ddb/db_command.c:466 #4 0xffffffff801d42f7 in db_trap (type=3DVariable "type" is not available. ) at /red/public/freebsd/sources/stable/sys/ddb/db_main.c:228 #5 0xffffffff805159e5 in kdb_trap (type=3D12, code=3D0, tf=3D0xfffffffef5b= 69d10) at /red/public/freebsd/sources/stable/sys/kern/subr_kdb.c:524 #6 0xffffffff80798143 in trap_fatal (frame=3D0xfffffffef5b69d10,=20 eva=3DVariable "eva" is not available. ) at /red/public/freebsd/sources/stable/sys/amd64/amd64/trap.c:752 #7 0xffffffff80798498 in trap_pfault (frame=3D0xfffffffef5b69d10, usermode= =3D0) at /red/public/freebsd/sources/stable/sys/amd64/amd64/trap.c:673 #8 0xffffffff80798bcf in trap (frame=3D0xfffffffef5b69d10) at /red/public/freebsd/sources/stable/sys/amd64/amd64/trap.c:444 #9 0xffffffff8077edae in calltrap ()=20 at /red/public/freebsd/sources/stable/sys/amd64/amd64/exception.S:209 #10 0xffffffff80473265 in usb_transfer_complete (xfer=3D0xffffff00045cbc00) at /red/public/freebsd/sources/stable/sys/dev/usb/usbdi.c:949 #11 0xffffffff8077af55 in bus_dmamap_load (dmat=3D0xffffff0004598580,=20 map=3D0xffffff000cbf5e00, buf=3D0xfffffffef5b69ff0, buflen=3DVariable "buflen" is not available. ) at /red/public/freebsd/sources/stable/sys/amd64/amd64/busdma_machdep.c:739 #12 0xffffffff80473955 in usbd_transfer (xfer=3D0xffffff00045cbc00) at /red/public/freebsd/sources/stable/sys/dev/usb/usbdi.c:312 #13 0xffffffff80473b36 in usbd_do_request_flags_pipe (dev=3D0xffffff009c1e4= a00,=20 pipe=3D0xffffff000c857680, req=3D0xfffffffef5b69f90, data=3D0xfffffffef5b69ff0, flags=3DVariable "= flags" is=20 not available. ) at /red/public/freebsd/sources/stable/sys/dev/usb/usbdi.c:1100 #14 0xffffffff80473c60 in usbd_do_request_flags (dev=3DVariable "dev" is no= t=20 available. ) at /red/public/freebsd/sources/stable/sys/dev/usb/usbdi.c:1070 #15 0xffffffff80471d1a in usbd_get_string_desc (dev=3D0xffffff009c1e4a00,=20 sindex=3DVariable "sindex" is not available. ) at /red/public/freebsd/sources/stable/sys/dev/usb/usb_subr.c:171 #16 0xffffffff80472f1d in usbd_get_string (dev=3D0xffffff009c1e4a00, si=3D1= ,=20 buf=3D0xfffffffef5b6a200 "", len=3D128) =2D--Type to continue, or q to quit--- at /red/public/freebsd/sources/stable/sys/dev/usb/usbdi.c:1353 #17 0xffffffff80470fca in usbd_devinfo_vp (dev=3D0xffffff009c1e4a00,=20 v=3D0xfffffffef5b6a200 "", p=3D0xfffffffef5b6a180 "=EF=BF=BDz=EF=BF=BD\200=EF=BF=BD=EF=BF=BD=EF=BF= =BD=EF=BF=BD`=EF=BF=BD=EF=BF=BD\200=EF=BF=BD=EF=BF=BD=EF=BF=BD=EF=BF=BD", u= sedev=3DVariable "usedev" is=20 not available. ) at /red/public/freebsd/sources/stable/sys/dev/usb/usb_subr.c:216 #18 0xffffffff80471b76 in usbd_devinfo (dev=3D0xffffff009c1e4a00, showclass= =3D1,=20 cp=3D0xffffff0122986000 "\001") at /red/public/freebsd/sources/stable/sys/dev/usb/usb_subr.c:281 #19 0xffffffff8047243e in usbd_new_device (parent=3D0xffffff0004591900,=20 bus=3D0xffffff000440a000, depth=3DVariable "depth" is not available. ) at /red/public/freebsd/sources/stable/sys/dev/usb/usb_subr.c:861 #20 0xffffffff80467b5b in uhub_explore (dev=3D0xffffff0004591400) at /red/public/freebsd/sources/stable/sys/dev/usb/uhub.c:523 #21 0xffffffff8046f391 in usb_discover (v=3DVariable "v" is not available. ) at /red/public/freebsd/sources/stable/sys/dev/usb/usb.c:724 #22 0xffffffff8046fc61 in usb_event_thread (arg=3DVariable "arg" is not=20 available. ) at /red/public/freebsd/sources/stable/sys/dev/usb/usb.c:440 #23 0xffffffff804d05bd in fork_exit (callout=3D0xffffffff8046fbe5=20 , arg=3D0xffffff0004598d00, frame=3D0xfffffffef5b6ac80)=20 at /red/public/freebsd/sources/stable/sys/kern/kern_fork.c:810 #24 0xffffffff8077f16e in fork_trampoline () at /red/public/freebsd/sources/stable/sys/amd64/amd64/exception.S:455 > The problem is repeatable. It only happens when I insert the thumb drive > into a running system. If I boot with the thumb drive present, everything > is fine. > > Any help is greatly appreciated. > > Cheers, > > -- Norbert Papke. > > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > > # uname -a > FreeBSD proven.lan 7.2-STABLE FreeBSD 7.2-STABLE #0 r191841: Tue May 5 > 21:13:21 PDT 2009 > npapke@proven.lan:/usr/obj/red/public/freebsd/sources/stable/sys/PROVEN > amd64 > > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > > Kernel config: > > include GENERIC > ident PROVEN > > options KDB # kernel debugger (just in case) > options KDB_TRACE > options DDB # kernel debugger (just in case) > options WITNESS > options WITNESS_SKIPSPIN > > options IPSEC > device crypto > device stf # for IPv6 tunneling > > # keep kernel messages from different cpus separate > options PRINTF_BUFR_SIZE=3D64 > > option SC_HISTORY_SIZE=3D2000 > options SC_NORM_ATTR=3D(FG_GREEN|BG_BLACK) > options SC_NORM_REV_ATTR=3D(FG_YELLOW|BG_GREEN) > options SC_KERNEL_CONS_ATTR=3D(FG_LIGHTRED|BG_BLACK) > options SC_KERNEL_CONS_REV_ATTR=3D(FG_BLACK|BG_RED) > > # Alternate Queuing of network packets > options ALTQ > options ALTQ_CBQ # Class Bases Queuing (CBQ) > options ALTQ_RED # Random Early Detection (RED) > options ALTQ_RIO # RED In/Out > options ALTQ_HFSC # Hierarchical Packet Scheduler (HFSC) > options ALTQ_PRIQ # Priority Queuing (PRIQ) > options ALTQ_NOPCC # Required for SMP build > > # load as module for debugging > nodevice re # RealTek 8139C+/8169/8169S/8110S > > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > > Copyright (c) 1992-2009 The FreeBSD Project. > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 > The Regents of the University of California. All rights reserved. > FreeBSD is a registered trademark of The FreeBSD Foundation. > FreeBSD 7.2-STABLE #0 r191841: Tue May 5 21:13:21 PDT 2009 > npapke@proven.lan:/usr/obj/red/public/freebsd/sources/stable/sys/PROV= EN > WARNING: WITNESS option enabled, expect reduced performance. > Timecounter "i8254" frequency 1193182 Hz quality 0 > CPU: Intel(R) Core(TM)2 Duo CPU E8500 @ 3.16GHz (3155.59-MHz K8-class > CPU) > Origin =3D "GenuineIntel" Id =3D 0x1067a Stepping =3D 10 > > Features=3D0xbfebfbffA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE> > > Features2=3D0x408e3fdDCM,,XSAVE> AMD Features=3D0x20100800 > AMD Features2=3D0x1 > Cores per package: 2 > usable memory =3D 4279189504 (4080 MB) > avail memory =3D 4097724416 (3907 MB) > ACPI APIC Table: <100808 APIC1053> > FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs > cpu0 (BSP): APIC ID: 0 > cpu1 (AP): APIC ID: 1 > ioapic0 irqs 0-23 on motherboard > kbd1 at kbdmux0 > cryptosoft0: on motherboard > acpi0: <100808 XSDT1053> on motherboard > acpi0: [ITHREAD] > acpi0: Power Button (fixed) > acpi0: reservation of ffc00000, 300000 (3) failed > acpi0: reservation of fee00000, 1000 (3) failed > acpi0: reservation of 0, a0000 (3) failed > acpi0: reservation of 100000, bff00000 (3) failed > Timecounter "ACPI-safe" frequency 3579545 Hz quality 850 > acpi_timer0: <24-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0 > acpi_hpet0: iomem 0xfed00000-0xfed003ff on > acpi0 Timecounter "HPET" frequency 14318180 Hz quality 900 > pcib0: port 0xcf8-0xcff on acpi0 > pci0: on pcib0 > pcib1: irq 16 at device 1.0 on pci0 > pci1: on pcib1 > vgapci0: port 0xc000-0xc0ff mem > 0xd0000000-0xdfffffff,0xfe9f0000-0xfe9fffff irq 16 at device 0.0 on pci1 > drm0: on vgapci0 > info: [drm] MSI enabled 1 message(s) > vgapci0: child drm0 requested pci_enable_busmaster > info: [drm] Initialized radeon 1.29.0 20080528 > hdac0: mem > 0xfe9ec000-0xfe9effff irq 17 at device 0.1 on pci1 > hdac0: HDA Driver Revision: 20090329_0131 > hdac0: [ITHREAD] > uhci0: port 0xbc00-0xbc1f irq 16 at device > 26.0 on pci0 > uhci0: [GIANT-LOCKED] > uhci0: [ITHREAD] > usb0: on uhci0 > usb0: USB revision 1.0 > uhub0: on usb0 > uhub0: 2 ports with 2 removable, self powered > uhci1: port 0xb880-0xb89f irq 21 at device > 26.1 on pci0 > uhci1: [GIANT-LOCKED] > uhci1: [ITHREAD] > usb1: on uhci1 > usb1: USB revision 1.0 > uhub1: on usb1 > uhub1: 2 ports with 2 removable, self powered > uhci2: port 0xb800-0xb81f irq 19 at device > 26.2 on pci0 > uhci2: [GIANT-LOCKED] > uhci2: [ITHREAD] > usb2: on uhci2 > usb2: USB revision 1.0 > uhub2: on usb2 > uhub2: 2 ports with 2 removable, self powered > ehci0: mem 0xfe8fe000-0xfe8fe3ff irq = 18 > at device 26.7 on pci0 > ehci0: [GIANT-LOCKED] > ehci0: [ITHREAD] > usb3: EHCI version 1.0 > usb3: companion controllers, 2 ports each: usb0 usb1 usb2 > usb3: on ehci0 > usb3: USB revision 2.0 > uhub3: on usb3 > uhub3: 6 ports with 6 removable, self powered > hdac1: mem > 0xfe8f8000-0xfe8fbfff irq 22 at device 27.0 on pci0 > hdac1: HDA Driver Revision: 20090329_0131 > hdac1: [ITHREAD] > pcib2: irq 17 at device 28.0 on pci0 > pci2: on pcib2 > pcib3: irq 16 at device 28.5 on pci0 > pci3: on pcib3 > re0: Ethernet> port 0xd800-0xd8ff mem > 0xfeaff000-0xfeafffff,0xfdff0000-0xfdffffff irq 17 at device 0.0 on pci3 > re0: Chip rev. 0x3c000000 > re0: MAC rev. 0x00400000 > miibus0: on re0 > rgephy0: PHY 1 on miibus0 > rgephy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, > 1000baseT-FDX, auto > re0: Ethernet address: 00:30:48:b0:6a:1f > re0: [FILTER] > uhci3: port 0xb480-0xb49f irq 23 at device > 29.0 on pci0 > uhci3: [GIANT-LOCKED] > uhci3: [ITHREAD] > usb4: on uhci3 > usb4: USB revision 1.0 > uhub4: on usb4 > uhub4: 2 ports with 2 removable, self powered > uhci4: port 0xb400-0xb41f irq 19 at device > 29.1 on pci0 > uhci4: [GIANT-LOCKED] > uhci4: [ITHREAD] > usb5: on uhci4 > usb5: USB revision 1.0 > uhub5: on usb5 > uhub5: 2 ports with 2 removable, self powered > uhci5: port 0xb080-0xb09f irq 18 at device > 29.2 on pci0 > uhci5: [GIANT-LOCKED] > uhci5: [ITHREAD] > usb6: on uhci5 > usb6: USB revision 1.0 > uhub6: on usb6 > uhub6: 2 ports with 2 removable, self powered > ehci1: mem 0xfe8fc000-0xfe8fc3ff irq = 23 > at device 29.7 on pci0 > ehci1: [GIANT-LOCKED] > ehci1: [ITHREAD] > usb7: EHCI version 1.0 > usb7: companion controllers, 2 ports each: usb4 usb5 usb6 > usb7: on ehci1 > usb7: USB revision 2.0 > uhub7: on usb7 > uhub7: 6 ports with 6 removable, self powered > umass0: = on > uhub7 > pcib4: at device 30.0 on pci0 > pci4: on pcib4 > dc0: port 0xe800-0xe8ff mem > 0xfebffc00-0xfebfffff irq 21 at device 1.0 on pci4 > miibus1: on dc0 > acphy0: PHY 1 on miibus1 > acphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto > dc0: Ethernet address: 00:20:78:10:3e:98 > dc0: [ITHREAD] > isab0: at device 31.0 on pci0 > isa0: on isab0 > atapci0: port > 0xb000-0xb007,0xac00-0xac03,0xa880-0xa887,0xa800-0xa803,0xa480-0xa48f,0xa= 40 >0-0xa40f irq 19 at device 31.2 on pci0 > atapci0: [ITHREAD] > ata2: on atapci0 > ata2: [ITHREAD] > ata3: on atapci0 > ata3: [ITHREAD] > ichsmb0: port 0x400-0x41f mem 0xfe8f7c00-0xfe8f7cff irq > 18 at device 31.3 on pci0 > ichsmb0: [GIANT-LOCKED] > ichsmb0: [ITHREAD] > smbus0: on ichsmb0 > smb0: on smbus0 > atapci1: port > 0xa000-0xa007,0x9c00-0x9c03,0x9880-0x9887,0x9800-0x9803,0x9480-0x948f,0x9= 40 >0-0x940f irq 19 at device 31.5 on pci0 > atapci1: [ITHREAD] > ata4: on atapci1 > ata4: [ITHREAD] > ata5: on atapci1 > ata5: [ITHREAD] > acpi_button0: on acpi0 > sio0: configured irq 4 not in bitmap of probed irqs 0 > sio0: port may not be enabled > sio0: configured irq 4 not in bitmap of probed irqs 0 > sio0: port may not be enabled > sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on > acpi0 sio0: type 16550A > sio0: [FILTER] > ppc0: port 0x378-0x37f irq 7 on acpi0 > ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode > ppbus0: on ppc0 > ppbus0: [ITHREAD] > lpt0: on ppbus0 > lpt0: Interrupt-driven port > ppi0: on ppbus0 > plip0: on ppbus0 > plip0: WARNING: using obsoleted IFF_NEEDSGIANT flag > ppc0: [GIANT-LOCKED] > ppc0: [ITHREAD] > atkbdc0: port 0x60,0x64 irq 1 on acpi0 > atkbd0: irq 1 on atkbdc0 > kbd0 at atkbd0 > atkbd0: [GIANT-LOCKED] > atkbd0: [ITHREAD] > psm0: irq 12 on atkbdc0 > psm0: [GIANT-LOCKED] > psm0: [ITHREAD] > psm0: model IntelliMouse, device ID 3 > cpu0: on acpi0 > ACPI Warning (tbutils-0243): Incorrect checksum in table [OEMB] - 45, > should be 40 [20070320] > coretemp0: on cpu0 > est0: on cpu0 > p4tcc0: on cpu0 > cpu1: on acpi0 > coretemp1: on cpu1 > est1: on cpu1 > est: CPU supports Enhanced Speedstep, but is not recognized. > est: cpu_vendor GenuineIntel, msr 616492206004922 > device_attach: est1 attach returned 6 > p4tcc1: on cpu1 > orm0: at iomem 0xc0000-0xcffff on isa0 > sc0: at flags 0x100 on isa0 > sc0: VGA <16 virtual consoles, flags=3D0x300> > sio1: configured irq 3 not in bitmap of probed irqs 0 > sio1: port may not be enabled > vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 > Timecounters tick every 1.000 msec > IPsec: Initialized Security Association Processing. > ad4: 239372MB at ata2-master SATA150 > ad7: 305245MB at ata3-slave SATA300 > ad8: 610480MB at ata4-master SATA300 > GEOM_LABEL: Label for provider ad4s1a is ufsid/497cecd46b0e22e5. > acd0: DVDR at ata5-master SATA150 > hdac0: HDA Codec #0: ATI R6xx HDMI > pcm0: at cad 0 nid 1 on hdac0 > hdac1: HDA Codec #2: Realtek ALC888 > hdac1: hdac_command_send_internal: TIMEOUT numcmd=3D1, sent=3D1, received= =3D0 > hdac1: hdac_command_send_internal: TIMEOUT numcmd=3D1, sent=3D1, received= =3D0 > hdac1: Codec #3 is not responding! Probing aborted. > pcm1: at cad 2 nid 1 on hdac1 > pcm2: at cad 2 nid 1 on hdac1 > pcm3: at cad 2 nid 1 on hdac1 > acd0: FAILURE - INQUIRY ILLEGAL REQUEST asc=3D0x24 ascq=3D0x00 > (probe1:umass-sim0:0:0:0): TEST UNIT READY. CDB: 0 0 0 0 0 0 > (probe1:umass-sim0:0:0:0): CAM Status: SCSI Status Error > (probe1:umass-sim0:0:0:0): SCSI Status: Check Condition > (probe1:umass-sim0:0:0:0): UNIT ATTENTION asc:28,0 > (probe1:umass-sim0:0:0:0): Not ready to ready change, medium may have > changed (probe1:umass-sim0:0:0:0): Retrying Command (per Sense Data) > acd0: FAILURE - INQUIRY ILLEGAL REQUEST asc=3D0x24 ascq=3D0x00 > SMP: AP CPU #1 Launched! > WARNING: WITNESS option enabled, expect reduced performance. > da0 at umass-sim0 bus 0 target 0 lun 0 > da0: Removable Direct Access SCSI-2 device > da0: 40.000MB/s transfers > da0: 3830MB (7843840 512 byte sectors: 255H 63S/T 488C) > cd0 at ata3 bus 0 target 0 lun 0 > cd0: Removable CD-ROM SCSI-0 device > cd0: 3.300MB/s transfers > cd0: cd present [4098336 x 2048 byte records] > GEOM_LABEL: Label for provider acd0 is iso9660/THE_MATRIX_16X9LB_N_AMERIC= A. > Trying to mount root from ufs:/dev/ad4s1a > WARNING: / was not properly dismounted > WARNING: reducing size to maximum of 67108864 blocks per swap unit > GEOM_LABEL: Label ufsid/497cecd46b0e22e5 removed. > GEOM_LABEL: Label for provider ad4s1a is ufsid/497cecd46b0e22e5. > GEOM_LABEL: Label ufsid/497cecd46b0e22e5 removed. > This module (opensolaris) contains code covered by the > Common Development and Distribution License (CDDL) > see http://opensolaris.org/os/licensing/opensolaris_license/ > WARNING: ZFS is considered to be an experimental feature in FreeBSD. > ZFS filesystem version 6 > ZFS storage pool version 6 > lock order reversal: > 1st 0xffffffff80e49de0 pf task mtx (pf task mtx) > @ > /red/public/freebsd/sources/stable/sys/modules/pf/../../contrib/pf/net/pf= _i >octl.c:1394 2nd 0xffffffff80ba94c0 ifnet (ifnet) > @ /red/public/freebsd/sources/stable/sys/net/if.c:1623 > KDB: stack backtrace: > db_trace_self_wrapper() at db_trace_self_wrapper+0x2a > witness_checkorder() at witness_checkorder+0x543 > _mtx_lock_flags() at _mtx_lock_flags+0x1f > ifunit() at ifunit+0x24 > pfioctl() at pfioctl+0x2531 > devfs_ioctl_f() at devfs_ioctl_f+0x71 > kern_ioctl() at kern_ioctl+0x91 > ioctl() at ioctl+0xeb > syscall() at syscall+0x1a5 > Xfast_syscall() at Xfast_syscall+0xab > --- syscall (54, FreeBSD ELF64, ioctl), rip =3D 0x80096296c, rsp =3D > 0x7fffffffdc18, rbp =3D 0x7fffffffdca0 --- > kqemu version 0x00010400 > kqemu: KQEMU installed, max_locked_mem=3D2089448kB. > acd0: FAILURE - READ_BIG timed out > acd0: FAILURE - READ_BIG timed out > acd0: FAILURE - READ_BIG timed out > info: [drm] Setting GART location based on new memory map > info: [drm] Loading RV635 CP Microcode > info: [drm] Loading RV635 PFP Microcode > info: [drm] Resetting GPU > info: [drm] writeback test succeeded in 1 usecs > drm0: [ITHREAD] > acd0: FAILURE - READ_BIG timed out > acd0: FAILURE - READ_BIG timed out > (cd0:ata3:0:0:0): cddone: got error 0x5 back > tap0: Ethernet address: 00:bd:d8:9b:04:00 > bridge0: Ethernet address: fa:cc:68:2e:a4:8e > tap0: promiscuous mode enabled > dc0: promiscuous mode enabled > > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" =2D-=20 =2D- Norbert Papke. npapke@acm.org From owner-freebsd-stable@FreeBSD.ORG Sun May 10 23:46:25 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 64A321065670 for ; Sun, 10 May 2009 23:46:25 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from hub.org (hub.org [200.46.204.220]) by mx1.freebsd.org (Postfix) with ESMTP id 3128A8FC08 for ; Sun, 10 May 2009 23:46:25 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from localhost (maia-1.hub.org [200.46.208.211]) by hub.org (Postfix) with ESMTP id D431853BC91 for ; Sun, 10 May 2009 20:46:24 -0300 (ADT) Received: from hub.org ([200.46.204.220]) by localhost (mx1.hub.org [200.46.208.211]) (amavisd-maia, port 10024) with ESMTP id 37684-04 for ; Sun, 10 May 2009 20:46:17 -0300 (ADT) Received: by hub.org (Postfix, from userid 1002) id 6C21F53BC90; Sun, 10 May 2009 20:46:24 -0300 (ADT) Received: from localhost (localhost [127.0.0.1]) by hub.org (Postfix) with ESMTP id 6AC1953BC8C for ; Sun, 10 May 2009 20:46:24 -0300 (ADT) Date: Sun, 10 May 2009 20:46:24 -0300 (ADT) From: "Marc G. Fournier" To: freebsd-stable@freebsd.org Message-ID: <20090510203457.W17646@hub.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Subject: Debugging server hangs in 7.2-RELEASE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 10 May 2009 23:46:25 -0000 I am so completely running out of ideas on how to debug this, maybe someone else has some ideas? The problem appears to be that very suddenly, the disk busy (according to vmstat) skyrockets to >100 (from 0) and then the 'runnable but swapped' column slowly rises ... One person suggested that for them, they saw similar when msi/msi-x was enabled ... after searching the source code, I found that msi was used in the bge driver, but I couldn't find msix used anywhere else on that machine, so disabled msi ... its still exhibiting the issue ... I get no errors on the serial console to indicate any problems, and until a relatively recent upgrade of the kernel ( (I can't give an exact date), this server was one of my most solid ... I figure there is a single process that is starting up on the machine that is causing this, but no matter what I try, it is eluding me. I have KDB enabled in the kernel, and the serial console setup so that I can break to it ... but when this problem happens, doing 'cr ~ ^b' through the serial console doesn't do anything, or, it just prints the message about breaking to the debugger and then hangs there ... My next option is to start time travelling backwards to see if I can find a 'stable kernel' again, but if it is just one process causing this, then going back to older kernels isn't necessarily going to accomplish anything ... Is there something else I can do here to debug this? Its hard to believe we are such an advance OS, but debugging issues like this is so elusive :( ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email . scrappy@hub.org MSN . scrappy@hub.org Yahoo . yscrappy Skype: hub.org ICQ . 7615664 From owner-freebsd-stable@FreeBSD.ORG Mon May 11 04:47:07 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id DED171065672 for ; Mon, 11 May 2009 04:47:07 +0000 (UTC) (envelope-from dougb@FreeBSD.org) Received: from mail2.fluidhosting.com (mx24.fluidhosting.com [204.14.89.7]) by mx1.freebsd.org (Postfix) with ESMTP id 79B998FC14 for ; Mon, 11 May 2009 04:47:07 +0000 (UTC) (envelope-from dougb@FreeBSD.org) Received: (qmail 3582 invoked by uid 399); 11 May 2009 04:47:03 -0000 Received: from localhost (HELO ?192.168.0.103?) (dougb@dougbarton.us@127.0.0.1) by localhost with ESMTPAM; 11 May 2009 04:47:03 -0000 X-Originating-IP: 127.0.0.1 X-Sender: dougb@dougbarton.us Message-ID: <4A07ADC4.6060209@FreeBSD.org> Date: Sun, 10 May 2009 21:47:00 -0700 From: Doug Barton Organization: http://www.FreeBSD.org/ User-Agent: Thunderbird 2.0.0.21 (Windows/20090302) MIME-Version: 1.0 To: Doug Hardie References: <1BA7DBA9-3C49-490A-B97C-DEB08DF2F696@lafn.org> In-Reply-To: <1BA7DBA9-3C49-490A-B97C-DEB08DF2F696@lafn.org> X-Enigmail-Version: 0.95.7 OpenPGP: id=D5B2F0FB Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable Stable Subject: Re: Mergemaster X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 04:47:08 -0000 Doug Hardie wrote: > I have been following the discussion on mergemaster and one item is a > bit annoying. You can use -U in the command args which sets > "AUTO_UPGRADE=yes". So far so good. > That flag is not in mergemaster.rc. I'm not sure what that is supposed to mean. There is no rc file by default, you have to create it. If what you mean is that it wasn't mentioned in the man page, that has been fixed for a while now. > It could be > easily added to the rc file, but I suspect it would conflict with -p. It would not conflict with it, in fact if everything is working as it should it should be totally safe. > Hence it seems like if "unset AUTO_UPGRADE" were added to the -p section > then it would work. I try hard not to outthink what the user is trying to do, which of course works both ways. > It would be helpful to be able to include it in the > rc file so I don't have to remember the options each time. [ -z "$PRE_WORLD" ] && AUTO_UPGRADE=yes hth, Doug From owner-freebsd-stable@FreeBSD.ORG Mon May 11 04:49:28 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 8D3F1106568B for ; Mon, 11 May 2009 04:49:28 +0000 (UTC) (envelope-from dougb@FreeBSD.org) Received: from mail2.fluidhosting.com (mx24.fluidhosting.com [204.14.89.7]) by mx1.freebsd.org (Postfix) with ESMTP id 246DB8FC0A for ; Mon, 11 May 2009 04:49:28 +0000 (UTC) (envelope-from dougb@FreeBSD.org) Received: (qmail 6208 invoked by uid 399); 11 May 2009 04:49:26 -0000 Received: from localhost (HELO ?192.168.0.103?) (dougb@dougbarton.us@127.0.0.1) by localhost with ESMTPAM; 11 May 2009 04:49:26 -0000 X-Originating-IP: 127.0.0.1 X-Sender: dougb@dougbarton.us Message-ID: <4A07AE53.4080104@FreeBSD.org> Date: Sun, 10 May 2009 21:49:23 -0700 From: Doug Barton Organization: http://www.FreeBSD.org/ User-Agent: Thunderbird 2.0.0.21 (Windows/20090302) MIME-Version: 1.0 To: Torfinn Ingolfsen References: <20090504225012.392fa49f.torfinn.ingolfsen@broadpark.no> <4A00AF76.8010604@FreeBSD.org> <20090506184500.70b2c6f2.torfinn.ingolfsen@broadpark.no> In-Reply-To: <20090506184500.70b2c6f2.torfinn.ingolfsen@broadpark.no> X-Enigmail-Version: 0.95.7 OpenPGP: id=D5B2F0FB Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: RELENG_7 - has mergemaster changed logic since 7.2-RELEASE? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 04:49:29 -0000 Torfinn Ingolfsen wrote: > To be clear, I follow this procedure: > 1. make buildworld > 2. make kernel > 3. shutdown now > 4. mergemaster -p > 5. make installworld > 6. mergemaster -iU > 7. fastboot By any chance is any of this happening in a jail? Or by any chance is /etc a symlink? A user sent me a very interesting patch related to the use of -U in a jail that might be relevant here. Doug From owner-freebsd-stable@FreeBSD.ORG Mon May 11 06:20:38 2009 Return-Path: Delivered-To: freebsd-stable@FreeBSD.ORG Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 16BD01065673 for ; Mon, 11 May 2009 06:20:38 +0000 (UTC) (envelope-from graham@menhennitt.com.au) Received: from hapkido.dreamhost.com (hapkido.dreamhost.com [66.33.216.122]) by mx1.freebsd.org (Postfix) with ESMTP id F0A018FC13 for ; Mon, 11 May 2009 06:20:37 +0000 (UTC) (envelope-from graham@menhennitt.com.au) Received: from friskymail-a5.g.dreamhost.com (caibbdcaaaaf.dreamhost.com [208.113.200.5]) by hapkido.dreamhost.com (Postfix) with ESMTP id 8EC4817D375 for ; Sun, 10 May 2009 22:57:03 -0700 (PDT) Received: from [127.0.0.1] (c58-109-90-141.mckinn2.vic.optusnet.com.au [58.109.90.141]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by friskymail-a5.g.dreamhost.com (Postfix) with ESMTP id A3218FA1B1 for ; Sun, 10 May 2009 22:57:10 -0700 (PDT) Message-ID: <4A07BDFB.1000609@menhennitt.com.au> Date: Mon, 11 May 2009 15:56:11 +1000 From: Graham Menhennitt User-Agent: Thunderbird 2.0.0.21 (Windows/20090302) MIME-Version: 1.0 To: freebsd-stable@FreeBSD.ORG Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: Subject: failure building nanobsd with FreeBSD Stable X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 06:20:38 -0000 Hello all, I've been building nanobsd to run on my Soekris net4801 for some time now. I recently csupped to FreeBSD Stable and I can no longer build it. It gives an error early in "installworld" (see below). Does anybody please have any clues? Thanks, Graham mkdir -p /tmp/install.1JDmZzZe for prog in [ awk cap_mkdb cat chflags chmod chown date echo egrep find grep ln lockf make mkdir mtree mv pwd_mkdb rm sed sh sysctl test true uname wc zic; do cp `which $prog` /tmp/install.1JDmZzZe; done cd /usr/src; MAKEOBJDIRPREFIX=/usr/obj/nanobsd.maxwell/ MACHINE_ARCH=i386 MACH INE=i386 CPUTYPE= GROFF_BIN_PATH=/usr/obj/nanobsd.maxwell//usr/src/tmp/legacy/ usr/bin GROFF_FONT_PATH=/usr/obj/nanobsd.maxwell//usr/src/tmp/legacy/usr/share/ groff_font GROFF_TMAC_PATH=/usr/obj/nanobsd.maxwell//usr/src/tmp/legacy/usr/sha re/tmac PATH=/usr/obj/nanobsd.maxwell//usr/src/tmp/legacy/usr/sbin:/usr/obj/nan obsd.maxwell//usr/src/tmp/legacy/usr/bin:/usr/obj/nanobsd.maxwell//usr/src/tmp/l egacy/usr/games:/usr/obj/nanobsd.maxwell//usr/src/tmp/usr/sbin:/usr/obj/nanobsd. maxwell//usr/src/tmp/usr/bin:/usr/obj/nanobsd.maxwell//usr/src/tmp/usr/games:/tm p/install.1JDmZzZe make -f Makefile.inc1 reinstall -------------------------------------------------------------- >>> Making hierarchy -------------------------------------------------------------- cd /usr/src; make -f Makefile.inc1 hierarchy cd /usr/src/etc; make distrib-dirs mtree -eU -f /usr/src/etc/mtree/BSD.root.dist -p /usr/obj/nanobsd.maxwell//_.w/ ./bin missing (created) ./boot missing (created) ./boot/defaults missing (created) ./boot/firmware missing (created) ...skipping... rm -f /usr/obj/nanobsd.maxwell//_.w/usr/include/security/mac_partition; fi if [ -L /usr/obj/nanobsd.maxwell//_.w/usr/include/ufs/ffs ]; then rm -f /usr/ob j/nanobsd.maxwell//_.w/usr/include/ufs/ffs; fi if [ -L /usr/obj/nanobsd.maxwell//_.w/usr/include/ufs/ufs ]; then rm -f /usr/ob j/nanobsd.maxwell//_.w/usr/include/ufs/ufs; fi if [ -L /usr/obj/nanobsd.maxwell//_.w/usr/include/machine ]; then rm -f /usr/ob j/nanobsd.maxwell//_.w/usr/include/machine; fi if [ -L /usr/obj/nanobsd.maxwell//_.w/usr/include/crypto ]; then rm -f /usr/obj /nanobsd.maxwell//_.w/usr/include/crypto; fi mtree -deU -f /usr/src/include/../etc/mtree/BSD.include.dist -p /usr/obj/nano bsd.maxwell//_.w/usr/include creating osreldate.h from newvers.sh touch: not found *** Error code 127 1 error *** Error code 2 1 error *** Error code 2 1 error *** Error code 2 1 error *** Error code 2 1 error From owner-freebsd-stable@FreeBSD.ORG Mon May 11 07:15:22 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 9BC741065670 for ; Mon, 11 May 2009 07:15:22 +0000 (UTC) (envelope-from torfinn.ingolfsen@broadpark.no) Received: from osl1smout1.broadpark.no (osl1smout1.broadpark.no [80.202.4.58]) by mx1.freebsd.org (Postfix) with ESMTP id 5A6C78FC17 for ; Mon, 11 May 2009 07:15:22 +0000 (UTC) (envelope-from torfinn.ingolfsen@broadpark.no) MIME-version: 1.0 Content-transfer-encoding: 7BIT Content-type: text/plain; charset=US-ASCII Received: from osl1sminn1.broadpark.no ([80.202.4.59]) by osl1smout1.broadpark.no (Sun Java(tm) System Messaging Server 6.3-3.01 (built Jul 12 2007; 32bit)) with ESMTP id <0KJG00FP4XHK3TD0@osl1smout1.broadpark.no> for freebsd-stable@freebsd.org; Mon, 11 May 2009 09:15:20 +0200 (CEST) Received: from kg-v2.kg4.no ([80.202.83.38]) by osl1sminn1.broadpark.no (Sun Java(tm) System Messaging Server 6.3-3.01 (built Jul 12 2007; 32bit)) with SMTP id <0KJG00E13XHKW080@osl1sminn1.broadpark.no> for freebsd-stable@freebsd.org; Mon, 11 May 2009 09:15:20 +0200 (CEST) Date: Mon, 11 May 2009 09:15:20 +0200 From: Torfinn Ingolfsen To: freebsd-stable@freebsd.org Message-id: <20090511091520.37733043.torfinn.ingolfsen@broadpark.no> In-reply-to: <4A07AE53.4080104@FreeBSD.org> References: <20090504225012.392fa49f.torfinn.ingolfsen@broadpark.no> <4A00AF76.8010604@FreeBSD.org> <20090506184500.70b2c6f2.torfinn.ingolfsen@broadpark.no> <4A07AE53.4080104@FreeBSD.org> X-Mailer: Sylpheed 2.6.0 (GTK+ 2.16.1; amd64-portbld-freebsd7.2) X-Face: "t9w2,-X@O^I`jVW\sonI3.,36KBLZE*AL[y9lL[PyFD*r_S:dIL9c[8Y>V42R0"!"yb_zN,f#%.[PYYNq; m"_0v; ~rUM2Yy!zmkh)3&U|u!=T(zyv,MHJv"nDH>OJ`t(@mil461d_B'Uo|'nMwlKe0Mv=kvV?Nh@>Hb<3s_z2jYgZhPb@?Wi^x1a~Hplz1.zH Subject: Re: RELENG_7 - has mergemaster changed logic since 7.2-RELEASE? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 07:15:22 -0000 On Sun, 10 May 2009 21:49:23 -0700 Doug Barton wrote: > Torfinn Ingolfsen wrote: > > To be clear, I follow this procedure: > > 1. make buildworld > > 2. make kernel > > 3. shutdown now > > 4. mergemaster -p > > 5. make installworld > > 6. mergemaster -iU > > 7. fastboot > > By any chance is any of this happening in a jail? No - no jails. > Or by any chance is /etc a symlink? No, /etc is a regular directory (I just checked the machines). -- Regards, Torfinn Ingolfsen From owner-freebsd-stable@FreeBSD.ORG Mon May 11 10:53:02 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C20FB1065670 for ; Mon, 11 May 2009 10:53:02 +0000 (UTC) (envelope-from gerrit@pmp.uni-hannover.de) Received: from mrelay1.uni-hannover.de (mrelay1.uni-hannover.de [130.75.2.106]) by mx1.freebsd.org (Postfix) with ESMTP id 374978FC0C for ; Mon, 11 May 2009 10:53:01 +0000 (UTC) (envelope-from gerrit@pmp.uni-hannover.de) Received: from www.pmp.uni-hannover.de (www.pmp.uni-hannover.de [130.75.117.2]) by mrelay1.uni-hannover.de (8.14.2/8.14.2) with ESMTP id n4BAWm8V001396 for ; Mon, 11 May 2009 12:32:49 +0200 Received: from pmp.uni-hannover.de (arc.pmp.uni-hannover.de [130.75.117.1]) by www.pmp.uni-hannover.de (Postfix) with SMTP id 0D45F4F for ; Mon, 11 May 2009 12:32:48 +0200 (CEST) Date: Mon, 11 May 2009 12:32:47 +0200 From: Gerrit =?ISO-8859-1?Q?K=FChn?= To: freebsd-stable@freebsd.org Message-Id: <20090511123247.b22c0127.gerrit@pmp.uni-hannover.de> Organization: Albert-Einstein-Institut (MPI =?ISO-8859-1?Q?f=FCr?= Gravitationsphysik & IGP =?ISO-8859-1?Q?Universit=E4t?= Hannover) X-Mailer: Sylpheed 2.4.8 (GTK+ 2.12.11; i386-portbld-freebsd7.0) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-PMX-Version: 5.5.2.363555, Antispam-Engine: 2.6.1.350677, Antispam-Data: 2009.5.11.101926 Subject: zfs: dataset is busy X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 10:53:03 -0000 Hi all, I have several machines here that do automatic snapshotting via RSEs snapshot-tool (sysutils/freebsd-snapshot). I use a homemade script to incrementally transfer daily snapshots to a backup server. This runs fine with machines running 7.0-STABLE of Jun 08. However, I have one box with 7.1-PRERELEASE from Sep 08 showing the following error after some time (when trying to rotate the snapshots via the freebsd-snapshot tool): cannot destroy 'tank/www@daily.6': dataset is busy I fact I cannot do anything to the snapshot, neither destroy nor export (not even with "-f"), it is always "busy". The only chance to get away from this I found is to reboot the machine. After a reboot it is fine for some weeks, but then comes back with the same problem. Has anyone seen this before? Is there a fix or workaround available? Are there any improvements I could take profit of by upgrading to 7.2-stable? Right now the machine is in this state, so I could get some more information from it (if anyone tells me what to do :-). cu Gerrit From owner-freebsd-stable@FreeBSD.ORG Mon May 11 10:57:46 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 95A29106566B for ; Mon, 11 May 2009 10:57:46 +0000 (UTC) (envelope-from hpcharles@gmail.com) Received: from an-out-0708.google.com (an-out-0708.google.com [209.85.132.251]) by mx1.freebsd.org (Postfix) with ESMTP id 4DD928FC16 for ; Mon, 11 May 2009 10:57:46 +0000 (UTC) (envelope-from hpcharles@gmail.com) Received: by an-out-0708.google.com with SMTP id c3so1603443ana.13 for ; Mon, 11 May 2009 03:57:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:reply-to:in-reply-to :references:date:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=wlEiqC5E6GUQ5LdenBW54VrtxKDCOuDNUdntzMdzgxE=; b=bBfKnfaNPPeSCQq8xNB0We86UijiTNITE5xtjAim8HTsB92oUzSz0w/Fnje9Hqb5w8 HeRsPJ8vYkP5AxOLt1Eo7V+6rzu8YJAWse+TokrkbvqSh6gNpiyJS2eAIKzYV7yYwMMF 83M5VNwO/WcbzfoN5DRMeAThTi6vKrFR4MA3U= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:reply-to:in-reply-to:references:date:message-id :subject:from:to:cc:content-type:content-transfer-encoding; b=Un+1TN10o+cKziCuR2y74qrBNa+O8G+PI7epIXxRCyrIWzR5iceUzGKcPg1+YLytnl HEBEK/vBHeDK+F0kpUR2619HzE64mSDPgAyNvVZdA3XRh1yb32/5bwg5O0/JSrj8vC2y lUy0xBWxraFlakBZfin04plDzrIELA65clBXU= MIME-Version: 1.0 Received: by 10.100.153.6 with SMTP id a6mr16923104ane.66.1242038113791; Mon, 11 May 2009 03:35:13 -0700 (PDT) In-Reply-To: <200905081147.46736.david.marec@davenulle.org> References: <200905081147.46736.david.marec@davenulle.org> Date: Mon, 11 May 2009 12:35:13 +0200 Message-ID: <4734a3ed0905110335k763bd7cbh7af572b358ce69ab@mail.gmail.com> From: Henri-Pierre Charles To: David Marec Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable Subject: Re: EEEPC and FreeBSD 7.2 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: hpcharles@gmail.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 10:57:46 -0000 Hello, On Fri, May 8, 2009 at 11:47 AM, David Marec wrote: > I am trying to use an EEEPC 701 as a diskless station, running FreeBSD 7. Great mini machine ! > The EEPC station boots well with PXE then runs the kernel, but the only > network card that is recognized is the wireless one (ath0). > > I read that the wired NIC ( ae? ) has been committed to HEAD; is there any way > to make it work on 7-STABLE ? It works out of the box with 7.2 You hust have to add if_ae_load="YES" in your /boot/loader.conf H-P -- HPC From owner-freebsd-stable@FreeBSD.ORG Mon May 11 12:07:30 2009 Return-Path: Delivered-To: freebsd-stable@FreeBSD.ORG Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 9240210659B5 for ; Mon, 11 May 2009 12:07:30 +0000 (UTC) (envelope-from bsam@ipt.ru) Received: from services.ipt.ru (services.ipt.ru [194.62.233.110]) by mx1.freebsd.org (Postfix) with ESMTP id 2F2598FC1C for ; Mon, 11 May 2009 12:07:29 +0000 (UTC) (envelope-from bsam@ipt.ru) Received: from bb.ipt.ru ([194.62.233.89]) by services.ipt.ru with esmtp (Exim 4.54 (FreeBSD)) id 1M3TrX-0009qS-H7; Mon, 11 May 2009 15:40:03 +0400 To: Graham Menhennitt References: <4A07BDFB.1000609@menhennitt.com.au> From: Boris Samorodov Date: Mon, 11 May 2009 15:40:04 +0400 In-Reply-To: <4A07BDFB.1000609@menhennitt.com.au> (Graham Menhennitt's message of "Mon\, 11 May 2009 15\:56\:11 +1000") Message-ID: <03279835@bb.ipt.ru> User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.3 (berkeley-unix) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Cc: freebsd-stable@FreeBSD.ORG Subject: Re: failure building nanobsd with FreeBSD Stable X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 12:07:44 -0000 On Mon, 11 May 2009 15:56:11 +1000 Graham Menhennitt wrote: > touch: not found Please check it the system time was changed between c(v)sup -> buildworld. I case yes, just redo the process. WBR -- Boris Samorodov (bsam) Research Engineer, http://www.ipt.ru Telephone & Internet SP FreeBSD Committer, http://www.FreeBSD.org The Power To Serve From owner-freebsd-stable@FreeBSD.ORG Mon May 11 16:49:37 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C98A61065679 for ; Mon, 11 May 2009 16:49:37 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 96AAF8FC19 for ; Mon, 11 May 2009 16:49:37 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 46F6846B46; Mon, 11 May 2009 12:49:37 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 264B98A028; Mon, 11 May 2009 12:49:36 -0400 (EDT) From: John Baldwin To: pluknet Date: Mon, 11 May 2009 09:49:30 -0400 User-Agent: KMail/1.9.7 References: <200905010949.45927.jhb@freebsd.org> In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905110949.31142.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Mon, 11 May 2009 12:49:36 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=BAYES_00, DATE_IN_PAST_03_06, RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-stable@freebsd.org Subject: Re: lock up in 6.2 (procs massively stuck in Giant) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 16:49:38 -0000 On Monday 04 May 2009 11:41:35 pm pluknet wrote: > 2009/5/1 John Baldwin : > > On Thursday 30 April 2009 2:36:34 am pluknet wrote: > >> Hi folks. > >> > >> Today I got a new locking issue. > >> This is the first time I got it, and it's merely reproduced. > >> > >> The box has lost both remote connection and local access. > >> No SIGINFO output on the local console even. > >> Jumping in ddb> shows the next: > >> > >> 1) first, this is a 8-way web server. No processes on runqueue except one > > httpd > >> (i.e. ps shows R in its state): > > > > You need to find who owns Giant and what that thread is doing. You can try > > using 'show lock Giant' as well as 'show lockchain 11568'. > > > > Hi, John! > > Just reproduced now on another box. > Hmm.. stack of the process owing Giant looks garbled. > > db> show lock Giant > class: sleep mutex > name: Giant > flags: {DEF, RECURSE} > state: {OWNED, CONTESTED} > owner: 0xd0d79320 (tid 102754, pid 34594, "httpd") > > db> show lockchain 34594 > thread 102754 (pid 34594, httpd) running on CPU 7 > db> show lockchain 102754 > thread 102754 (pid 34594, httpd) running on CPU 7 The thread is running, so we don't know what it's top of stack is and you can't a good stack trace in that case. None of your CPUs are idle, so I don't think you have any sort of deadlock. You might have a livelock. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Mon May 11 16:49:38 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id BE2671065670; Mon, 11 May 2009 16:49:38 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 8F4FC8FC1A; Mon, 11 May 2009 16:49:38 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 421B546B2E; Mon, 11 May 2009 12:49:38 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 01E8E8A025; Mon, 11 May 2009 12:49:37 -0400 (EDT) From: John Baldwin To: Jung-uk Kim Date: Mon, 11 May 2009 09:52:01 -0400 User-Agent: KMail/1.9.7 References: <49F8B859.7060908@umn.edu> <200905051609.38689.jkim@FreeBSD.org> <200905051743.03520.jkim@FreeBSD.org> In-Reply-To: <200905051743.03520.jkim@FreeBSD.org> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905110952.01736.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Mon, 11 May 2009 12:49:37 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: Alan Amesbury , freebsd-acpi@freebsd.org, freebsd-stable@freebsd.org, Andriy Gapon Subject: Re: Garbled output from kgdb? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 16:49:39 -0000 On Tuesday 05 May 2009 5:43:01 pm Jung-uk Kim wrote: > On Tuesday 05 May 2009 04:09 pm, Jung-uk Kim wrote: > > On Tuesday 05 May 2009 12:51 pm, Andriy Gapon wrote: > > > BTW, this issue seems to be fixed in Jung-uk's acpi patches for > > > newer acpica imports, but it is not fixed both in stable/7 and > > > head. > > > > Yes, it was fixed in my patchsets long ago, which uses spin lock > > for AcpiOsAcquireLock(). :-) > > The attached patch is for -STABLE. Note that it is only compile > tested on amd64. This looks fine to test. The patch has gratuitous style changes I wouldn't include in a commit though. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Mon May 11 16:49:39 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 9B6BD1065676; Mon, 11 May 2009 16:49:39 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 6D90E8FC0A; Mon, 11 May 2009 16:49:39 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 1D9DF46B82; Mon, 11 May 2009 12:49:39 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id E9AA08A026; Mon, 11 May 2009 12:49:37 -0400 (EDT) From: John Baldwin To: Riccardo Torrini Date: Mon, 11 May 2009 09:53:21 -0400 User-Agent: KMail/1.9.7 References: <20090507155012.GW21112@tiger.fi.esaote.it> In-Reply-To: <20090507155012.GW21112@tiger.fi.esaote.it> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905110953.21686.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Mon, 11 May 2009 12:49:38 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: scottl@freebsd.org, siedar@nplay.pl, freebsd-stable@freebsd.org Subject: Re: kern/130330: [mpt] [panic] Panic and reboot machine MPT ... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 16:49:39 -0000 On Thursday 07 May 2009 11:50:12 am Riccardo Torrini wrote: > I just submitted a follow-up to PR kern/130330 with the same > info. Maybe I found the committed lines doing the crash. > > Please see PR for more detailed info (and cc: this thread to me). > > I restricted the time window of the problem doing (a lot of) > build&install world from 2008.07 up to now (read last week). > > With 2008.07.28.17.00.00 (7.0-STABLE) works fine but > with 2008.07.28.18.00.00 start crashing removing the > the second disk of a mirror (when the mirror is ok) > or adding the second disk of a degraded ones. > > Also note that the same crash happens with all 7.1 > stable or release and even all 7.2-PRE I tested. > > (wrapping long lines) > # cd /home/ncvs/src/sys/ > # grep -R "date.*2008\.07\.28\.17" ./ | grep -v /Attic > > ./dev/wi/if_wi.c,v: > date 2008.07.28.17.00.37; author imp; state Exp; > ./dev/wi/if_wivar.h,v: > date 2008.07.28.17.00.37; author imp; state Exp; > ./dev/mpt/mpt_raid.c,v: > date 2008.07.28.17.10.09; author jhb; state Exp; > ./dev/mpt/mpt_raid.c,v: > date 2008.07.28.17.05.09; author jhb; state Exp; > ./kern/sched_4bsd.c,v: > date 2008.07.28.17.25.24; author jhb; state Exp; > ./modules/et/Makefile,v: > date 2008.07.28.17.56.37; author antoine; state Exp; > > In that time window there are only 4 file changed in > src/sys/dev, and I bet to mpt_raid.c :-) > > This is the commit log extracted from cvsweb > -----8<----- > Revision 1.15.2.1: > Mon Jul 28 17:05:09 2008 UTC (9 months, 1 week ago) by jhb > Branches: RELENG_7 > CVS tags: RELENG_7_1_BP > Branch point for: RELENG_7_1 > Diff to: previous 1.15: preferred, colored > Changes since revision 1.15: +4 -4 lines > > SVN rev 180920 on 2008-07-28 17:05:09Z by jhb > > MFC: Allocate a single CCB at the start of the main loop of the RAID > monitoring kthread of the mpt(4) driver. > -----8<----- > > Here are the diff: > http://www.freebsd.org/cgi/cvsweb.cgi/src/sys/dev/mpt/mpt_raid.c.diff?r1=1.15;r2=1.15.2.1 > > > What can I do now? Can you get more details on the crash, perhaps a crash dump? -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Mon May 11 16:49:43 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 97277106566C; Mon, 11 May 2009 16:49:43 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 67B948FC20; Mon, 11 May 2009 16:49:43 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 1AF8446B82; Mon, 11 May 2009 12:49:43 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id EC27E8A026; Mon, 11 May 2009 12:49:41 -0400 (EDT) From: John Baldwin To: freebsd-stable@freebsd.org Date: Mon, 11 May 2009 12:42:44 -0400 User-Agent: KMail/1.9.7 References: <1240920035.85945.7.camel@buffy.york.ac.uk> <20090509193937.T10574@hub.org> In-Reply-To: <20090509193937.T10574@hub.org> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905111242.44566.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Mon, 11 May 2009 12:49:42 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: "Marc G. Fournier" , Martin Schmidt , Gavin Atkinson Subject: Re: 7.1-STABLE Sun Mar 29 01:06:46 ADT 2009 Locks up ... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 16:49:43 -0000 On Saturday 09 May 2009 6:43:16 pm Marc G. Fournier wrote: > On Tue, 28 Apr 2009, Gavin Atkinson wrote: > > > On Fri, 2009-04-24 at 20:39 +0200, Martin Schmidt wrote: > >> Hi Marc and List, > >> > >> i had similar issues with FreeBSD 7.2-PRERELEASE. Server (zfs,nfs) > >> seems to hang in intervals of about 8 hours. > >> kernel is still there but no connections can be made to nfs/ssh and > >> login on local console doesn't seem to > >> work due to incredible slowness. breaking to the debugger takes a > >> moment but works. > >> (compiling kernel with WITNESS didnt help) > >> > >> the server had been solid before with 7 stable kernel from around 19 > >> October 2008. > >> > >> I now added these lines to /boot/loader.conf > >> > >> hw.pci.enable_msi=0 > >> hw.pci.enable_msix=0 > >> > >> to disable Message Signaled Interrupts. Which are used by the 3ware > >> twa driver and igb network driver on our server. > > > > If you are willing to test further on your server, it may be helpful if > > you could determine which of those two lines in loader.conf fixes the > > problem for you. It would also be useful to provide a dmesg from the > > machine when both msi and msix are enabled. > > > > FWIW, looking at the "vmstat -i" output it appears that only the igb > > driver that are using MSI/MSIX, unless you have a reason to suspect > > otherwise? > > How do you tell that, about igb? looking at the server I have the igb > device on, it doesn't seem to say anything about that ... IRQs > 256 are MSI/MSI-X. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Mon May 11 16:49:44 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id CBBD7106564A for ; Mon, 11 May 2009 16:49:44 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 9E90D8FC15 for ; Mon, 11 May 2009 16:49:44 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 50D4446B8F; Mon, 11 May 2009 12:49:44 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 17B388A027; Mon, 11 May 2009 12:49:43 -0400 (EDT) From: John Baldwin To: freebsd-stable@freebsd.org Date: Mon, 11 May 2009 12:48:02 -0400 User-Agent: KMail/1.9.7 References: In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905111248.02724.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Mon, 11 May 2009 12:49:43 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: Ronald Klop Subject: Re: devd doesn't fire event on boot X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 16:49:45 -0000 On Wednesday 06 May 2009 6:03:14 am Ronald Klop wrote: > Hello, > > Running 7.2-STABLE/amd64. I have a USB-disk and added stuff to devd to > mount it readonly on attach. This does work if I attach it after booting > up, but not if it is attached before booting. devd throws away all events that happen prior to devd starting up I think. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Mon May 11 16:49:45 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id D56741065670 for ; Mon, 11 May 2009 16:49:45 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id A9CBC8FC24 for ; Mon, 11 May 2009 16:49:45 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 5D0B846B8E; Mon, 11 May 2009 12:49:45 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 461358A028; Mon, 11 May 2009 12:49:44 -0400 (EDT) From: John Baldwin To: freebsd-stable@freebsd.org, avk@vl.ru Date: Mon, 11 May 2009 12:48:41 -0400 User-Agent: KMail/1.9.7 References: <4A02D708.3040805@vl.ru> In-Reply-To: <4A02D708.3040805@vl.ru> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905111248.41619.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Mon, 11 May 2009 12:49:44 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: Subject: Re: RELENG_7 fatal trap X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 16:49:46 -0000 On Thursday 07 May 2009 8:41:44 am Alexander Kriventsov wrote: > Hi, > Sorry for my english. > I have fatal trap on my box. > Kernel compiled with debug options. System is RELENG_7 amd64 dated > 2009-04-14. I think this is fixed in the latest RELENG_7. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Mon May 11 16:55:26 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 2BD111065670; Mon, 11 May 2009 16:55:26 +0000 (UTC) (envelope-from riccardo.torrini@esaote.com) Received: from gw-fi.esaote.com (gw-fi.esaote.com [85.18.189.242]) by mx1.freebsd.org (Postfix) with ESMTP id A0ED98FC17; Mon, 11 May 2009 16:55:25 +0000 (UTC) (envelope-from riccardo.torrini@esaote.com) Received: from tiger.fi.esaote.it (tiger.fi.esaote.it [192.168.6.66]) by gw-fi.esaote.com (8.14.3/8.14.3) with ESMTP id n4BGtN2n020589; Mon, 11 May 2009 18:55:23 +0200 (CEST) (envelope-from riccardo.torrini@esaote.com) Received: from tiger.fi.esaote.it (localhost [127.0.0.1]) by tiger.fi.esaote.it (Postfix) with ESMTP id E20A31CC9A; Mon, 11 May 2009 18:55:22 +0200 (CEST) Received: by tiger.fi.esaote.it (Postfix, from userid 201) id C581E1CC99; Mon, 11 May 2009 18:55:22 +0200 (CEST) Date: Mon, 11 May 2009 18:55:22 +0200 From: Riccardo Torrini To: John Baldwin Message-ID: <20090511165522.GG21112@tiger.fi.esaote.it> References: <20090507155012.GW21112@tiger.fi.esaote.it> <200905110953.21686.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <200905110953.21686.jhb@freebsd.org> User-Agent: Mutt/1.5.19 (2009-01-05) X-AV-Checked: ClamAV using ClamSMTP Cc: scottl@freebsd.org, siedar@nplay.pl, freebsd-stable@freebsd.org, Riccardo Torrini Subject: Re: kern/130330: [mpt] [panic] Panic and reboot machine MPT ... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 16:55:26 -0000 On Mon, May 11, 2009 at 09:53:21AM -0400, John Baldwin wrote: >> What can I do now? > Can you get more details on the crash, perhaps a crash dump? All what you want, but you need to drive me, I was unable to setup serial/debug console so I must wrote down by hand (followed handbook, tryed all speed/duplex pairs, still having "graphics" strange chars, maybe the cable or setup). Using a kernel with all know (to me) debug knobs enabled :-) -- Riccardo. From owner-freebsd-stable@FreeBSD.ORG Mon May 11 17:45:41 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from [127.0.0.1] (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by hub.freebsd.org (Postfix) with ESMTP id 5EB5D1065673; Mon, 11 May 2009 17:45:41 +0000 (UTC) (envelope-from jkim@FreeBSD.org) From: Jung-uk Kim To: John Baldwin Date: Mon, 11 May 2009 13:44:59 -0400 User-Agent: KMail/1.6.2 References: <49F8B859.7060908@umn.edu> <200905051743.03520.jkim@FreeBSD.org> <200905110952.01736.jhb@freebsd.org> In-Reply-To: <200905110952.01736.jhb@freebsd.org> MIME-Version: 1.0 Content-Disposition: inline Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Message-Id: <200905111345.29761.jkim@FreeBSD.org> Cc: Alan Amesbury , freebsd-acpi@freebsd.org, freebsd-stable@freebsd.org, Andriy Gapon Subject: Re: Garbled output from kgdb? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 17:45:42 -0000 On Monday 11 May 2009 09:52 am, John Baldwin wrote: > On Tuesday 05 May 2009 5:43:01 pm Jung-uk Kim wrote: > > On Tuesday 05 May 2009 04:09 pm, Jung-uk Kim wrote: > > > On Tuesday 05 May 2009 12:51 pm, Andriy Gapon wrote: > > > > BTW, this issue seems to be fixed in Jung-uk's acpi patches > > > > for newer acpica imports, but it is not fixed both in > > > > stable/7 and head. > > > > > > Yes, it was fixed in my patchsets long ago, which uses spin > > > lock for AcpiOsAcquireLock(). :-) > > > > The attached patch is for -STABLE. Note that it is only compile > > tested on amd64. > > This looks fine to test. The patch has gratuitous style changes I > wouldn't include in a commit though. It should work but I don't plan to commit it any time soon. :-) In fact, the patch was meant to be a rewrite for new ACPI-CA, which actually has a real mutex. Currently, mutex is emulated with semaphore. The problem is semaphore has no concept of ownership while mutex does, i.e., any thread can acquire/release it without checking its ownership or order. FYI, the OSL API (ACPI_MUTEX_TYPE) is finalized in ACPI-CA 20081204. Jung-uk Kim From owner-freebsd-stable@FreeBSD.ORG Mon May 11 18:25:09 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 8FD711065670; Mon, 11 May 2009 18:25:09 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 615508FC0A; Mon, 11 May 2009 18:25:09 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 147FC46B3B; Mon, 11 May 2009 14:25:09 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id C3E328A026; Mon, 11 May 2009 14:25:07 -0400 (EDT) From: John Baldwin To: Riccardo Torrini Date: Mon, 11 May 2009 14:07:19 -0400 User-Agent: KMail/1.9.7 References: <20090507155012.GW21112@tiger.fi.esaote.it> <200905110953.21686.jhb@freebsd.org> <20090511165522.GG21112@tiger.fi.esaote.it> In-Reply-To: <20090511165522.GG21112@tiger.fi.esaote.it> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905111407.20195.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Mon, 11 May 2009 14:25:07 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: scottl@freebsd.org, siedar@nplay.pl, freebsd-stable@freebsd.org Subject: Re: kern/130330: [mpt] [panic] Panic and reboot machine MPT ... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 18:25:09 -0000 On Monday 11 May 2009 12:55:22 pm Riccardo Torrini wrote: > On Mon, May 11, 2009 at 09:53:21AM -0400, John Baldwin wrote: > > >> What can I do now? > > > Can you get more details on the crash, perhaps a crash dump? > > All what you want, but you need to drive me, I was unable > to setup serial/debug console so I must wrote down by hand > (followed handbook, tryed all speed/duplex pairs, still > having "graphics" strange chars, maybe the cable or setup). > > Using a kernel with all know (to me) debug knobs enabled :-) Do you have kernel crashdumps enabled and a swap partition? If so, do you happen to have any files in /var/crash? -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Mon May 11 20:08:14 2009 Return-Path: Delivered-To: FreeBSD-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 94591106566C for ; Mon, 11 May 2009 20:08:14 +0000 (UTC) (envelope-from jchambers@ucla.edu) Received: from out-61.smtp.ucla.edu (smtp-12.smtp.ucla.edu [IPv6:2607:f010:3fe:102:101c:23ff:febe:116e]) by mx1.freebsd.org (Postfix) with ESMTP id 7557F8FC18 for ; Mon, 11 May 2009 20:08:14 +0000 (UTC) (envelope-from jchambers@ucla.edu) Received: from mail.ucla.edu (mail.ucla.edu [169.232.46.158]) by smtp-12.smtp.ucla.edu (8.14.3/8.14.3) with ESMTP id n4BK7lh7016263; Mon, 11 May 2009 13:07:47 -0700 Received: from computer-2.local ([149.142.36.207]) (authenticated bits=0) by mail.ucla.edu (8.14.3/8.14.3) with ESMTP id n4BK7k85022839 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NOT); Mon, 11 May 2009 13:07:47 -0700 Message-ID: <4A088592.9070305@ucla.edu> Date: Mon, 11 May 2009 13:07:46 -0700 From: Jason Chambers Organization: UCLA User-Agent: Thunderbird 2.0.0.21 (Macintosh/20090302) MIME-Version: 1.0 To: =?ISO-8859-1?Q?Jonas_B=FClow?= References: <196E4005-25E9-4C46-99BD-8F717849703F@jongel.net> In-Reply-To: <196E4005-25E9-4C46-99BD-8F717849703F@jongel.net> X-Enigmail-Version: 0.95.7 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8bit X-Probable-Spam: no X-Scanned-By: smtp.ucla.edu on 169.232.46.248 Cc: FreeBSD-stable@freebsd.org Subject: Re: ipfilter seems to be broken on 7.2-PRERELEASE as of April 25:th 2009. X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 20:08:14 -0000 Jonas Bülow wrote: > > After reboot it was not reachable from the network. After some > troubleshooting I found that ipfilter seems to be the problem. Returning > traffic originating from my host (XXX) is blocked: > (... snip ...) > > Anyone seen this behaviour? > Yes. This appears to have made it to the RELEASE as well. I believe it is due to updates to the FXP driver that allow checksumming for tx/rx. My guess is checksumming is enabled by default and you (and I) happen to have the cards recognized by FXP that do not support it. (The BAD in the ipf log represents bad checksum) If you do "ifconfig fxp0 -txcsum -rxcsum" your problem should go away. For /etc/rc.conf, just add -txcsum -rxcsum to the interface definition. Regards, --Jason From owner-freebsd-stable@FreeBSD.ORG Mon May 11 20:29:38 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 2CF5D106567D for ; Mon, 11 May 2009 20:29:38 +0000 (UTC) (envelope-from david.marec@davenulle.org) Received: from smtp.lamaiziere.net (net.lamaiziere.net [91.121.44.19]) by mx1.freebsd.org (Postfix) with ESMTP id E773D8FC0A for ; Mon, 11 May 2009 20:29:37 +0000 (UTC) (envelope-from david.marec@davenulle.org) Received: from david (244.56.92-79.rev.gaoland.net [79.92.56.244]) by smtp.lamaiziere.net (Postfix) with ESMTPA id 86A7C63352C for ; Mon, 11 May 2009 22:29:36 +0200 (CEST) From: David Marec Organization: LaMienne To: "freebsd-stable" Date: Mon, 11 May 2009 22:29:35 +0200 User-Agent: KMail/1.9.10 References: <200905081147.46736.david.marec@davenulle.org> <4734a3ed0905110335k763bd7cbh7af572b358ce69ab@mail.gmail.com> In-Reply-To: <4734a3ed0905110335k763bd7cbh7af572b358ce69ab@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 8bit Content-Disposition: inline Message-Id: <200905112229.35432.david.marec@davenulle.org> Subject: Re: EEEPC and FreeBSD 7.2 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 20:29:38 -0000 Le Monday 11 May 2009 12:35:13 Henri-Pierre Charles, vous avez écrit : > Hello, > > On Fri, May 8, 2009 at 11:47 AM, David Marec wrote: > > I am trying to use an EEEPC 701 as a diskless station, running FreeBSD 7. > > Great mini machine ! I agree. This one is owned by my wife, who does not want to change the system that was shipped with it ( Xandros). > > The EEPC station boots well with PXE then runs the kernel, but the only > > network card that is recognized is the wireless one (ath0). > > > > I read that the wired NIC ( ae? ) has been committed to HEAD; is there > > any way to make it work on 7-STABLE ? > > It works out of the box with 7.2 > > You hust have to add if_ae_load="YES" in your /boot/loader.conf I built a kernel that included this driver and the EEEPC is now working quite well as a diskless station. But i have to launch Xorg locally. It doesn't start on XDMCP mode and i didn't found yet what is wrong with this. /I think i will start another thread on a Xorg dedicated list about this point and the touchpad configuration./ -- http://david.marec.free.fr/ http://www.freebsd.org/fr/ http://www.diablotins.org/ From owner-freebsd-stable@FreeBSD.ORG Mon May 11 22:05:56 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 20A50106566C for ; Mon, 11 May 2009 22:05:56 +0000 (UTC) (envelope-from freebsd-stable@m.gmane.org) Received: from ciao.gmane.org (main.gmane.org [80.91.229.2]) by mx1.freebsd.org (Postfix) with ESMTP id D092D8FC12 for ; Mon, 11 May 2009 22:05:55 +0000 (UTC) (envelope-from freebsd-stable@m.gmane.org) Received: from list by ciao.gmane.org with local (Exim 4.43) id 1M3ddA-0005au-8a for freebsd-stable@freebsd.org; Mon, 11 May 2009 22:05:52 +0000 Received: from 200.41.broadband11.iol.cz ([90.178.41.200]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 11 May 2009 22:05:52 +0000 Received: from gamato by 200.41.broadband11.iol.cz with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 11 May 2009 22:05:52 +0000 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-stable@freebsd.org From: martinko Date: Tue, 12 May 2009 00:05:38 +0200 Lines: 16 Message-ID: <4A08A132.3070503@users.sf.net> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-2; format=flowed Content-Transfer-Encoding: 7bit X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: 200.41.broadband11.iol.cz User-Agent: Mozilla/5.0 (X11; U; FreeBSD i386; en-US; rv:1.8.1.18) Gecko/20081125 SeaMonkey/1.1.13 Sender: news Cc: scottl@freebsd.org Subject: run_interrupt_driven_hooks: still waiting after 300 seconds for xpt_config X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 22:05:56 -0000 Hallo, I've just tried 7.2-RELEASE (amd64) on Asus M3A78-EM motherboard and booting got stuck with the following: run_interrupt_driven_hooks: still waiting after 300 seconds for xpt_config From what I've found via Google it should be fixed already but apparently it is not. :-( Is there a way to work around this issue and successfully boot and install FreeBSD, please ? Thanks, Martin From owner-freebsd-stable@FreeBSD.ORG Mon May 11 23:14:25 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 7772110656F1 for ; Mon, 11 May 2009 23:14:25 +0000 (UTC) (envelope-from dan@dan.emsphone.com) Received: from email1.allantgroup.com (email1.emsphone.com [199.67.51.115]) by mx1.freebsd.org (Postfix) with ESMTP id 2F8AE8FC19 for ; Mon, 11 May 2009 23:14:25 +0000 (UTC) (envelope-from dan@dan.emsphone.com) Received: from dan.emsphone.com (dan.emsphone.com [199.67.51.101]) by email1.allantgroup.com (8.14.0/8.14.0) with ESMTP id n4BMtN08026646 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Mon, 11 May 2009 17:55:23 -0500 (CDT) (envelope-from dan@dan.emsphone.com) Received: from dan.emsphone.com (smmsp@localhost [127.0.0.1]) by dan.emsphone.com (8.14.3/8.14.3) with ESMTP id n4BMtNTP093898 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Mon, 11 May 2009 17:55:23 -0500 (CDT) (envelope-from dan@dan.emsphone.com) Received: (from dan@localhost) by dan.emsphone.com (8.14.3/8.14.3/Submit) id n4BMnwlf082830; Mon, 11 May 2009 17:49:58 -0500 (CDT) (envelope-from dan) Date: Mon, 11 May 2009 17:49:58 -0500 From: Dan Nelson To: martinko Message-ID: <20090511224957.GB52703@dan.emsphone.com> References: <4A08A132.3070503@users.sf.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4A08A132.3070503@users.sf.net> X-OS: FreeBSD 7.2-STABLE User-Agent: Mutt/1.5.19 (2009-01-05) X-Virus-Scanned: ClamAV version 0.94.1, clamav-milter version 0.94.1 on email1.allantgroup.com X-Virus-Status: Clean X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-2.0.2 (email1.allantgroup.com [199.67.51.78]); Mon, 11 May 2009 17:55:23 -0500 (CDT) X-Scanned-By: MIMEDefang 2.45 Cc: scottl@freebsd.org, freebsd-stable@freebsd.org Subject: Re: run_interrupt_driven_hooks: still waiting after 300 seconds for xpt_config X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 11 May 2009 23:14:25 -0000 In the last episode (May 12), martinko said: > I've just tried 7.2-RELEASE (amd64) on Asus M3A78-EM motherboard and > booting got stuck with the following: > > run_interrupt_driven_hooks: still waiting after 300 seconds for xpt_config > > From what I've found via Google it should be fixed already but apparently > it is not. :-( > > Is there a way to work around this issue and successfully boot and install > FreeBSD, please ? Do you have a connected firewire device? Try unplugging it during bootup, or kldload the sbp module after bootup instead of via loader.conf. -- Dan Nelson dnelson@allantgroup.com From owner-freebsd-stable@FreeBSD.ORG Tue May 12 00:49:00 2009 Return-Path: Delivered-To: FreeBSD-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0A4DE106564A for ; Tue, 12 May 2009 00:49:00 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: from rv-out-0506.google.com (rv-out-0506.google.com [209.85.198.230]) by mx1.freebsd.org (Postfix) with ESMTP id CA6358FC15 for ; Tue, 12 May 2009 00:48:59 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: by rv-out-0506.google.com with SMTP id k40so2561775rvb.43 for ; Mon, 11 May 2009 17:48:59 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:received:from:date:to:cc :subject:message-id:reply-to:references:mime-version:content-type :content-disposition:in-reply-to:user-agent; bh=jBtZtGAiD4yFb8LDBigU6ldNQ7jtmRBERH1JwiUI0jQ=; b=nC6eTOUnsJcNN5auuTgg3mBXQqnd9l7L8vAGv4mYARlMRrM1xfo/7iVLCmUFcwAB4l NuOsj9d6ZufC8B0oZ2jBy2kgQgnDI3X1hlg/m7XHrHPzUJeGy0nVrKx1Q5fVoNJYbfxe ywcKYpM9ySRFR3EoRaBeZVLIc8jwhrMW7WEGc= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:date:to:cc:subject:message-id:reply-to:references:mime-version :content-type:content-disposition:in-reply-to:user-agent; b=qtULoPW130SFgqm0EhG+fHfDJ2L7lHfNggqf1zYKumYbo8mh9KreGxupLqoWj+B0M3 sm9yUTAWaoaSP9OR2YM7Ac1RSIT6QA/PNnEtNnvgqJGDlPjyV1Si7lOuI/xmjcV3r8rB jMQ1hLh4qH42WP2uUus5dNKSnltwGFC2A3iyA= Received: by 10.141.100.15 with SMTP id c15mr3019220rvm.79.1242089339436; Mon, 11 May 2009 17:48:59 -0700 (PDT) Received: from michelle.cdnetworks.co.kr ([114.111.62.249]) by mx.google.com with ESMTPS id b8sm12367787rvf.44.2009.05.11.17.48.56 (version=SSLv3 cipher=RC4-MD5); Mon, 11 May 2009 17:48:58 -0700 (PDT) Received: by michelle.cdnetworks.co.kr (sSMTP sendmail emulation); Tue, 12 May 2009 09:57:07 +0900 From: Pyun YongHyeon Date: Tue, 12 May 2009 09:57:07 +0900 To: Jason Chambers Message-ID: <20090512005707.GI65350@michelle.cdnetworks.co.kr> References: <196E4005-25E9-4C46-99BD-8F717849703F@jongel.net> <4A088592.9070305@ucla.edu> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4A088592.9070305@ucla.edu> User-Agent: Mutt/1.4.2.3i Cc: FreeBSD-stable@freebsd.org, Jonas B?low Subject: Re: ipfilter seems to be broken on 7.2-PRERELEASE as of April 25:th 2009. X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: pyunyh@gmail.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 00:49:00 -0000 On Mon, May 11, 2009 at 01:07:46PM -0700, Jason Chambers wrote: > Jonas B?low wrote: > > > > After reboot it was not reachable from the network. After some > > troubleshooting I found that ipfilter seems to be the problem. Returning > > traffic originating from my host (XXX) is blocked: > > > (... snip ...) > > > > Anyone seen this behaviour? > > > > Yes. This appears to have made it to the RELEASE as well. > > I believe it is due to updates to the FXP driver that allow checksumming > for tx/rx. My guess is checksumming is enabled by default and you (and > I) happen to have the cards recognized by FXP that do not support it. I guess your controller is 82559 or compatibles. If you can receive packets without problems after disabling ipfilter it's not fault of fxp(4). You have a good controller that do support Rx checksum offloading. > (The BAD in the ipf log represents bad checksum) > No, ipfilter's notion of Rx checksum offloading was broken. ipfilter simply does not understand partial checksummed frame(e.g. checksummed frame without pseudo header) so driver that supports this type of checksum offloading(gem(4), hme(4), sk(4) and fxp(4)) wouldn't work on ipfilter. > If you do "ifconfig fxp0 -txcsum -rxcsum" your problem should go away. > For /etc/rc.conf, just add -txcsum -rxcsum to the interface definition. > Yeah, that would fix it or you can switch to pf(4). From owner-freebsd-stable@FreeBSD.ORG Tue May 12 02:02:57 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 80B281065670 for ; Tue, 12 May 2009 02:02:57 +0000 (UTC) (envelope-from fj@panix.com) Received: from l2mail1.panix.com (l2mail1.panix.com [166.84.1.75]) by mx1.freebsd.org (Postfix) with ESMTP id 2A2478FC0C for ; Tue, 12 May 2009 02:02:57 +0000 (UTC) (envelope-from fj@panix.com) Received: from mail2.panix.com (mail2.panix.com [166.84.1.73]) by l2mail1.panix.com (Postfix) with ESMTP id 2FBFA265 for ; Mon, 11 May 2009 21:47:34 -0400 (EDT) Received: from panix5.panix.com (panix5.panix.com [166.84.1.5]) by mail2.panix.com (Postfix) with ESMTP id 6095E3487E for ; Mon, 11 May 2009 21:47:33 -0400 (EDT) Received: by panix5.panix.com (Postfix, from userid 16484) id 4C51A242D8; Mon, 11 May 2009 21:47:33 -0400 (EDT) Date: Mon, 11 May 2009 21:47:33 -0400 From: "Joe A." To: freebsd-stable@freebsd.org Message-ID: <20090512014733.GA12271@panix.com> MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="mYCpIKhGyMATD0i+" Content-Disposition: inline User-Agent: Mutt/1.5.18 (2008-05-17) Subject: Error message: run_interrupt_driven_hooks:... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 02:02:57 -0000 --mYCpIKhGyMATD0i+ Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Greetings... Basic data on my experience with the xpt_config hang; I have more detail if needed, but I doubt anyone will believe it. I'm not even sure I do. Some other reports: http://lists.freebsd.org/pipermail/freebsd-questions/2009-April/196116.html Seur Bors Thu Apr 9 14:43:34 UTC 2009. http://lists.freebsd.org/pipermail/freebsd-stable/2009-May/049901.html martinko gamato Mon May 11 22:05:56 UTC 2009 http://www.nabble.com/Freebsd-7.2-RC-boot-problem-tt23257632.html#a23257632 http://forums.pcbsd.org/viewtopic.php?f=1&t=13312 Here is the entire error for me during boot: run_interrupt_driven_hooks: still waiting after BIGNUM seconds for xpt_config It hangs after this point in the boot process: pcm0: pcm0: the boot process does not continue, so the next normal thing does not appear on the console: SMP: AP CPU #1 Launched! but during the hang, this scrolls past (punctuated by the BIGNUM seconds wait) over and over on the console: acpi_tz0: _TMP value is absurd, ignored (-269.4C) Normally, that message is suppressed by this /etc/sysctl.conf entry: hw.acpi.thermal.polling_rate=0 I suppose this means that /etc/sysctl.conf is not parsed and the second CPU is not launched. Hardware in question, as seen by dmesg, is attached; the vendor's specs are: Core 2 Duo (C) E6400 2.13 GHz 1066 MHz front side bus Socket 775 Chipset P965 Motherboard: Asus P5BW-LA HP/Compaq motherboard name: Basswood-UL8E There is RAID on the motherboard; I don't use it. I do use AHCI. BIOS is current; there are no available updates. The onboard firewire is disabled, since it began (prior to 7.1) causing unresolvable panics. CAM is in my kernel: # SCSI peripherals #Added atapicam; apparently, cdparanoia requires it. device atapicam device scbus # SCSI bus (required for SCSI) device da # Direct Access (disks) device sa # Sequential Access (tape etc) device cd # CD device pass # Passthrough device (direct SCSI access) As of 9:30 PM EDT May 11, the issue has de-Heisenberged from my PC. I'm not subscribed to the list; so you'll need to Cc: me if you think I can help. --mYCpIKhGyMATD0i+ Content-Type: text/plain; charset=us-ascii Content-Disposition: attachment; filename="dmesg.May.11.2009" Copyright (c) 1992-2009 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 7.1-RELEASE-p5 #0: Sun May 3 06:43:50 EDT 2009 root@whisperer.chthonixia.net:/usr/obj/usr/src/sys/WHISPERER Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel(R) Core(TM)2 CPU 6400 @ 2.13GHz (2135.55-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x6f6 Stepping = 6 Features=0xbfebfbff Features2=0xe3bd AMD Features=0x20000000 AMD Features2=0x1 Cores per package: 2 real memory = 2146299904 (2046 MB) avail memory = 2094936064 (1997 MB) ACPI APIC Table: FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 ioapic0: Changing APIC ID to 4 ioapic0 irqs 0-23 on motherboard kbd1 at kbdmux0 ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413) acpi0: on motherboard acpi0: [ITHREAD] acpi0: Power Button (fixed) acpi0: reservation of 0, a0000 (3) failed acpi0: reservation of 100000, 7fde0000 (3) failed Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0 acpi_hpet0: iomem 0xfed00000-0xfed003ff on acpi0 device_attach: acpi_hpet0 attach returned 12 acpi_button0: on acpi0 pcib0: port 0xcf8-0xcff on acpi0 pci0: on pcib0 pcib1: irq 16 at device 1.0 on pci0 pci1: on pcib1 vgapci0: port 0xde00-0xdeff mem 0xe0000000-0xefffffff,0xfddf0000-0xfddfffff irq 16 at device 0.0 on pci1 vgapci1: mem 0xfdde0000-0xfddeffff at device 0.1 on pci1 uhci0: port 0xff00-0xff1f irq 21 at device 26.0 on pci0 uhci0: [GIANT-LOCKED] uhci0: [ITHREAD] usb0: on uhci0 usb0: USB revision 1.0 uhub0: on usb0 uhub0: 2 ports with 2 removable, self powered uhci1: port 0xfe00-0xfe1f irq 18 at device 26.1 on pci0 uhci1: [GIANT-LOCKED] uhci1: [ITHREAD] usb1: on uhci1 usb1: USB revision 1.0 uhub1: on usb1 uhub1: 2 ports with 2 removable, self powered ehci0: mem 0xfdfff000-0xfdfff3ff irq 21 at device 26.7 on pci0 ehci0: [GIANT-LOCKED] ehci0: [ITHREAD] usb2: EHCI version 1.0 usb2: companion controllers, 2 ports each: usb0 usb1 usb2: on ehci0 usb2: USB revision 2.0 uhub2: on usb2 uhub2: 4 ports with 4 removable, self powered pcm0: mem 0xfdff4000-0xfdff7fff irq 22 at device 27.0 on pci0 pcm0: [ITHREAD] uhci2: port 0xfd00-0xfd1f irq 23 at device 29.0 on pci0 uhci2: [GIANT-LOCKED] uhci2: [ITHREAD] usb3: on uhci2 usb3: USB revision 1.0 uhub3: on usb3 uhub3: 2 ports with 2 removable, self powered uhci3: port 0xfc00-0xfc1f irq 17 at device 29.1 on pci0 uhci3: [GIANT-LOCKED] uhci3: [ITHREAD] usb4: on uhci3 usb4: USB revision 1.0 uhub4: on usb4 uhub4: 2 ports with 2 removable, self powered uhci4: port 0xfb00-0xfb1f irq 18 at device 29.2 on pci0 uhci4: [GIANT-LOCKED] uhci4: [ITHREAD] usb5: on uhci4 usb5: USB revision 1.0 uhub5: on usb5 uhub5: 2 ports with 2 removable, self powered ehci1: mem 0xfdffe000-0xfdffe3ff irq 23 at device 29.7 on pci0 ehci1: [GIANT-LOCKED] ehci1: [ITHREAD] usb6: EHCI version 1.0 usb6: companion controllers, 2 ports each: usb3 usb4 usb5 usb6: on ehci1 usb6: USB revision 2.0 uhub6: on usb6 uhub6: 6 ports with 6 removable, self powered pcib2: at device 30.0 on pci0 pci2: on pcib2 ath0: mem 0xfdee0000-0xfdeeffff irq 17 at device 0.0 on pci2 ath0: [ITHREAD] ath0: WARNING: using obsoleted if_watchdog interface ath0: Ethernet address: 00:14:6c:89:30:1c ath0: mac 7.9 phy 4.5 radio 5.6 fwohci0: port 0xcf00-0xcf7f mem 0xfdeff000-0xfdeff7ff irq 16 at device 4.0 on pci2 fwohci0: [FILTER] fwohci0: OHCI version 1.10 (ROM=1) fwohci0: No. of Isochronous channels is 4. fwohci0: EUI64 00:11:06:66:00:00:03:12 fwohci0: Phy 1394a available S400, 3 ports. fwohci0: Link S400, max_rec 2048 bytes. firewire0: on fwohci0 sbp0: on firewire0 dcons_crom0: on firewire0 dcons_crom0: bus_addr 0x1264000 fwohci0: Initiate bus reset fwohci0: BUS reset fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode isab0: at device 31.0 on pci0 isa0: on isab0 atapci0: port 0xfa00-0xfa07,0xf900-0xf903,0xf800-0xf807,0xf700-0xf703,0xf600-0xf61f mem 0xfdffd000-0xfdffd7ff irq 19 at device 31.2 on pci0 atapci0: [ITHREAD] atapci0: AHCI Version 01.10 controller with 6 ports detected ata2: on atapci0 ata2: [ITHREAD] ata3: on atapci0 ata3: [ITHREAD] ata4: on atapci0 ata4: [ITHREAD] ata5: on atapci0 ata5: [ITHREAD] ata6: on atapci0 ata6: [ITHREAD] ata7: on atapci0 ata7: [ITHREAD] pci0: at device 31.3 (no driver attached) acpi_tz0: on acpi0 cpu0: on acpi0 est0: on cpu0 p4tcc0: on cpu0 cpu1: on acpi0 est1: on cpu1 est: CPU supports Enhanced Speedstep, but is not recognized. est: cpu_vendor GenuineIntel, msr 823082306000823 device_attach: est1 attach returned 6 p4tcc1: on cpu1 acpi_hpet0: iomem 0xfed00000-0xfed003ff on acpi0 device_attach: acpi_hpet0 attach returned 12 orm0: at iomem 0xd0000-0xd1fff pnpid ORM0000 on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 ata0 at port 0x1f0-0x1f7,0x3f6 irq 14 on isa0 ata0: [ITHREAD] ata1 at port 0x170-0x177,0x376 irq 15 on isa0 ata1: [ITHREAD] atkbdc0: at port 0x60,0x64 on isa0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] atkbd0: [ITHREAD] ums0: on uhub3 ums0: 7 buttons and Z dir. ukbd0: on uhub3 kbd2 at ukbd0 uhid0: on uhub3 Timecounters tick every 1.000 msec firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me) firewire0: bus manager 0 (me) acpi_tz0: _TMP value is absurd, ignored (-268.9C) acpi_tz0: _TMP value is absurd, ignored (-268.9C) acpi_tz0: _TMP value is absurd, ignored (-268.9C) acpi_tz0: _TMP value is absurd, ignored (-268.9C) acpi_tz0: _TMP value is absurd, ignored (-268.9C) ad4: 238475MB at ata2-master SATA300 ad10: 305245MB at ata5-master SATA150 acd0: DVDR at ata6-master SATA150 pcm0: pcm0: SMP: AP CPU #1 Launched! acpi_tz0: _TMP value is absurd, ignored (-269.4C) cd0 at ata6 bus 0 target 0 lun 0 cd0: Removable CD-ROM SCSI-0 device cd0: 3.300MB/s transfers cd0: Attempt to query device size failed: NOT READY, Medium not present Trying to mount root from ufs:/dev/ad4s1a ath0: link state changed to UP acpi_tz0: _TMP value is absurd, ignored (-269.5C) ath0: device timeout ath0: link state changed to DOWN ath0: link state changed to UP ath0: link state changed to DOWN ath0: link state changed to UP ath0: link state changed to DOWN ath0: link state changed to UP ath0: link state changed to DOWN ath0: link state changed to UP --mYCpIKhGyMATD0i+-- From owner-freebsd-stable@FreeBSD.ORG Tue May 12 06:12:29 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 4B8311065674; Tue, 12 May 2009 06:12:29 +0000 (UTC) (envelope-from pluknet@gmail.com) Received: from mail-bw0-f213.google.com (mail-bw0-f213.google.com [209.85.218.213]) by mx1.freebsd.org (Postfix) with ESMTP id 9C9B48FC15; Tue, 12 May 2009 06:12:28 +0000 (UTC) (envelope-from pluknet@gmail.com) Received: by bwz9 with SMTP id 9so3116654bwz.43 for ; Mon, 11 May 2009 23:12:27 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=7+L83x8Aj8bwZ1QBdH4FNwdxvFazpxd4pT37v0ULZbI=; b=gSzZtmD+jhey/BqIagZnP8VJL+2wcRP0IIzga+15JH5ia8UHLwMc/tos64+rW20nw/ pN65mhRFxQ/nuyESP7xCzJxrdOJ3wIEP4DykXlEA6t6SZRrgphESv5tQV9+tQXK2Bfc9 Jj5GUquWmss6cIQ2nm6oibhMLxaGTvzHWeOSc= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=A3Eit8n9nDusEO52scOI5Dh6TXVRrIJMGqbbQXHJDtdAI4UFAeXWZDrKxHifjHNF7/ zM8EoD5hJNLYZhXHhfs/1zHNL8hQUOOIvLDuw4hE1ACuyiPhMjwEkiSMMGDDwU7f6gPZ ROMFmiuP/zYyTkdg1IHTcfn9hk46ixyoHJ1Fw= MIME-Version: 1.0 Received: by 10.102.247.10 with SMTP id u10mr1728635muh.76.1242108747035; Mon, 11 May 2009 23:12:27 -0700 (PDT) In-Reply-To: <200905110949.31142.jhb@freebsd.org> References: <200905010949.45927.jhb@freebsd.org> <200905110949.31142.jhb@freebsd.org> Date: Tue, 12 May 2009 10:12:27 +0400 Message-ID: From: pluknet To: John Baldwin Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: lock up in 6.2 (procs massively stuck in Giant) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 06:12:29 -0000 2009/5/11 John Baldwin : > On Monday 04 May 2009 11:41:35 pm pluknet wrote: >> 2009/5/1 John Baldwin : >> > On Thursday 30 April 2009 2:36:34 am pluknet wrote: >> >> Hi folks. >> >> >> >> Today I got a new locking issue. >> >> This is the first time I got it, and it's merely reproduced. >> >> >> >> The box has lost both remote connection and local access. >> >> No SIGINFO output on the local console even. >> >> Jumping in ddb> shows the next: >> >> >> >> 1) first, this is a 8-way web server. No processes on runqueue except one >> > httpd >> >> (i.e. ps shows R in its state): >> > >> > You need to find who owns Giant and what that thread is doing. You can > try >> > using 'show lock Giant' as well as 'show lockchain 11568'. >> > >> >> Hi, John! >> >> Just reproduced now on another box. >> Hmm.. stack of the process owing Giant looks garbled. >> >> db> show lock Giant >> class: sleep mutex >> name: Giant >> flags: {DEF, RECURSE} >> state: {OWNED, CONTESTED} >> owner: 0xd0d79320 (tid 102754, pid 34594, "httpd") >> >> db> show lockchain 34594 >> thread 102754 (pid 34594, httpd) running on CPU 7 >> db> show lockchain 102754 >> thread 102754 (pid 34594, httpd) running on CPU 7 > > The thread is running, so we don't know what it's top of stack is and you > can't a good stack trace in that case. > > None of your CPUs are idle, so I don't think you have any sort of deadlock. > You might have a livelock. > > -- > John Baldwin > I'm curious if it could be caused by heavy load. I don't know what it might be definitely, as it's non-trivial for me to determine the reason of a livelock, and to debug it. So I think it may have sense to try 7.x, as there has been done much locking work. Thank you. -- wbr, pluknet From owner-freebsd-stable@FreeBSD.ORG Tue May 12 07:02:20 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 371A2106566B for ; Tue, 12 May 2009 07:02:20 +0000 (UTC) (envelope-from npapke@acm.org) Received: from idcmail-mo1so.shaw.ca (idcmail-mo1so.shaw.ca [24.71.223.10]) by mx1.freebsd.org (Postfix) with ESMTP id BE0EB8FC0C for ; Tue, 12 May 2009 07:02:19 +0000 (UTC) (envelope-from npapke@acm.org) Received: from pd2ml2so-ssvc.prod.shaw.ca ([10.0.141.134]) by pd3mo1so-svcs.prod.shaw.ca with ESMTP; 12 May 2009 01:02:19 -0600 X-Cloudmark-SP-Filtered: true X-Cloudmark-SP-Result: v=1.0 c=0 a=ep_KMAzDAAAA:8 a=6I5d2MoRAAAA:8 a=yL31_EPIs6ODTkBMJSwA:9 a=tszI5hqtosJfRXJTH1AA:7 a=B3JS_BEeaGvaBLS6F7kHsMS4Gj8A:4 a=SV7veod9ZcQA:10 a=nAPXUAfsBmEA:10 a=avX_41wpOqIA:10 a=macy1kFFMuwA:10 Received: from s010600121729c74c.vc.shawcable.net (HELO proven.lan) ([24.85.241.34]) by pd2ml2so-dmz.prod.shaw.ca with ESMTP; 12 May 2009 01:02:18 -0600 Received: from proven.lan (localhost [127.0.0.1]) by proven.lan (8.14.3/8.14.3) with ESMTP id n4C72IZE004398 for ; Tue, 12 May 2009 00:02:18 -0700 (PDT) (envelope-from npapke@acm.org) Received: from localhost (localhost [[UNIX: localhost]]) by proven.lan (8.14.3/8.14.3/Submit) id n4C72IEH004397 for freebsd-stable@freebsd.org; Tue, 12 May 2009 00:02:18 -0700 (PDT) (envelope-from npapke@acm.org) X-Authentication-Warning: proven.lan: npapke set sender to npapke@acm.org using -f From: Norbert Papke Organization: Archaeological Filing To: freebsd-stable@freebsd.org Date: Tue, 12 May 2009 00:02:17 -0700 User-Agent: KMail/1.9.10 References: <200905101217.39920.fbsd-ml@scrapper.ca> <200905101426.08256.npapke@acm.org> In-Reply-To: <200905101426.08256.npapke@acm.org> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Message-Id: <200905120002.18130.npapke@acm.org> Subject: Re: 7.2-STABLE: Inserting USB device causes Fatal Trap 12 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 07:02:20 -0000 I have been trying to understand the failure better: (kgdb) frame 10 #10 0xffffffff80473265 in usb_transfer_complete (xfer=3D0xffffff00045cbc00) at /red/public/freebsd/sources/stable/sys/dev/usb/usbdi.c:949 949 STAILQ_REMOVE_HEAD(&pipe->queue, next); (kgdb) list 944 #ifdef DIAGNOSTIC 945 xfer->busy_free =3D XFER_BUSY; 946 #endif 947 KASSERT(STAILQ_FIRST(&pipe->queue) =3D=3D xfer, 948 ("usb_transfer_complete: bad dequeue")); 949 STAILQ_REMOVE_HEAD(&pipe->queue, next); 950 } 951 DPRINTFN(5,("usb_transfer_complete: repeat=3D%d new head=3D= %p\n", 952 repeat, STAILQ_FIRST(&pipe->queue))); =46or reference: #define STAILQ_NEXT(elm, field) ((elm)->field.stqe_next) #define STAILQ_FIRST(head) ((head)->stqh_first) #define STAILQ_REMOVE_HEAD(head, field) do { \ if ((STAILQ_FIRST((head)) =3D \ STAILQ_NEXT(STAILQ_FIRST((head)), field)) =3D=3D NULL) = \ (head)->stqh_last =3D &STAILQ_FIRST((head)); \ } while (0) Looking at the data: (kgdb) p *pipe $15 =3D {iface =3D 0x0, device =3D 0xffffff009c1e4a00, endpoint =3D=20 0xffffff009c1e4a38, refcnt =3D 1, running =3D 0 '\0', aborting =3D 0 '\0', queue =3D {stqh_first =3D 0x0, stqh_last =3D 0xffffff000c8576a0}, next = =3D {le_next=20 =3D 0x806b560b4, le_prev =3D 0x0}, intrxfer =3D 0x0, repeat =3D 0 '\0', interval =3D -1, methods =3D 0xffffffff80a6e340} (kgdb) p pipe->queue $16 =3D {stqh_first =3D 0x0, stqh_last =3D 0xffffff000c8576a0} (kgdb) p pipe->queue->stqh_first $17 =3D (struct usbd_xfer *) 0x0 And, of course, (kgdb) p pipe->queue->stqh_first.next Cannot access memory at address 0x290 If the kernel had been built with INVARIANTS, presumably the prior assertio= n=20 would have been triggered. It is not clear to me how the transfer ended up= =20 in this bad state. Could the "USBD_NOMEM" status be a clue? (kgdb) p *xfer $6 =3D {pipe =3D 0xffffff000c857680, priv =3D 0x0, buffer =3D 0xfffffffef5b= 69ff0,=20 length =3D 18, actlen =3D 0, flags =3D 6, timeout =3D 5000, status =3D USBD_NOMEM, callback =3D 0, done =3D 1 '\001', request =3D {bm= RequestType=20 =3D 128 '\200', bRequest =3D 6 '\006', wValue =3D "\001\003", wIndex =3D "\t\004", wLength =3D "\022"}, frleng= ths =3D=20 0x0, nframes =3D 0, device =3D 0xffffff009c1e4a00, dmamap =3D { segs =3D {{ds_addr =3D 23408640, ds_len =3D 16}, {ds_addr =3D 23355392,= ds_len =3D=20 2}, {ds_addr =3D 0, ds_len =3D 0} }, nsegs =3D 2, map =3D 0xffffff000cbf5e00}, allocbuf =3D 0x0, rqflags =3D= 1, next =3D=20 {stqe_next =3D 0x0}, hcpriv =3D 0x0, timeout_handle =3D { c_links =3D {sle =3D {sle_next =3D 0x0}, tqe =3D {tqe_next =3D 0x0, tqe= _prev =3D=20 0x0}}, c_time =3D 0, c_arg =3D 0x0, c_func =3D 0, c_mtx =3D 0xffffffff80b12600, c_flags =3D 0}} I am getting out of my depth. I will spend some more time trying to learn= =20 this code but would appreciate pointers. Cheers, =2D- Norbert Papke. On May 10, 2009, Norbert Papke wrote: > On May 10, 2009, Norbert Papke wrote: > > Inserting a USB thumb drive into a running sytem result in a "Fatal trap > > 12: page fault while in kernel mode". > > > > Unfortunately, I was not able to save a core (not entirely sure why, I'= ll > > investigate separately). I have manually copied the backtrace: > > I now have a kernel dump and backtrace with symbols: > > #0 doadump () at pcpu.h:195 > #1 0xffffffff801d239c in db_fncall (dummy1=3DVariable "dummy1" is not > available. > ) at /red/public/freebsd/sources/stable/sys/ddb/db_command.c:516 > #2 0xffffffff801d28a9 in db_command (last_cmdp=3D0xffffffff80adc648, > cmd_table=3D0x0, dopager=3D1) > at /red/public/freebsd/sources/stable/sys/ddb/db_command.c:413 > #3 0xffffffff801d2aab in db_command_loop () > at /red/public/freebsd/sources/stable/sys/ddb/db_command.c:466 > #4 0xffffffff801d42f7 in db_trap (type=3DVariable "type" is not availabl= e. > ) at /red/public/freebsd/sources/stable/sys/ddb/db_main.c:228 > #5 0xffffffff805159e5 in kdb_trap (type=3D12, code=3D0, tf=3D0xfffffffef= 5b69d10) > at /red/public/freebsd/sources/stable/sys/kern/subr_kdb.c:524 > #6 0xffffffff80798143 in trap_fatal (frame=3D0xfffffffef5b69d10, > eva=3DVariable "eva" is not available. > ) > at /red/public/freebsd/sources/stable/sys/amd64/amd64/trap.c:752 > #7 0xffffffff80798498 in trap_pfault (frame=3D0xfffffffef5b69d10, > usermode=3D0) at > /red/public/freebsd/sources/stable/sys/amd64/amd64/trap.c:673 #8=20 > 0xffffffff80798bcf in trap (frame=3D0xfffffffef5b69d10) > at /red/public/freebsd/sources/stable/sys/amd64/amd64/trap.c:444 > #9 0xffffffff8077edae in calltrap () > at /red/public/freebsd/sources/stable/sys/amd64/amd64/exception.S:209 > #10 0xffffffff80473265 in usb_transfer_complete (xfer=3D0xffffff00045cbc0= 0) > at /red/public/freebsd/sources/stable/sys/dev/usb/usbdi.c:949 > #11 0xffffffff8077af55 in bus_dmamap_load (dmat=3D0xffffff0004598580, > map=3D0xffffff000cbf5e00, > buf=3D0xfffffffef5b69ff0, buflen=3DVariable "buflen" is not available. > ) at > /red/public/freebsd/sources/stable/sys/amd64/amd64/busdma_machdep.c:739 #= 12 > 0xffffffff80473955 in usbd_transfer (xfer=3D0xffffff00045cbc00) > at /red/public/freebsd/sources/stable/sys/dev/usb/usbdi.c:312 > #13 0xffffffff80473b36 in usbd_do_request_flags_pipe > (dev=3D0xffffff009c1e4a00, pipe=3D0xffffff000c857680, > req=3D0xfffffffef5b69f90, data=3D0xfffffffef5b69ff0, flags=3DVariable= "flags" > is not available. > ) > at /red/public/freebsd/sources/stable/sys/dev/usb/usbdi.c:1100 > #14 0xffffffff80473c60 in usbd_do_request_flags (dev=3DVariable "dev" is = not > available. > ) > at /red/public/freebsd/sources/stable/sys/dev/usb/usbdi.c:1070 > #15 0xffffffff80471d1a in usbd_get_string_desc (dev=3D0xffffff009c1e4a00, > sindex=3DVariable "sindex" is not available. > ) > at /red/public/freebsd/sources/stable/sys/dev/usb/usb_subr.c:171 > #16 0xffffffff80472f1d in usbd_get_string (dev=3D0xffffff009c1e4a00, si= =3D1, > buf=3D0xfffffffef5b6a200 "", len=3D128) > ---Type to continue, or q to quit--- > at /red/public/freebsd/sources/stable/sys/dev/usb/usbdi.c:1353 > #17 0xffffffff80470fca in usbd_devinfo_vp (dev=3D0xffffff009c1e4a00, > v=3D0xfffffffef5b6a200 "", > p=3D0xfffffffef5b6a180 "=EF=BF=BDz=EF=BF=BD\200=EF=BF=BD=EF=BF=BD=EF= =BF=BD=EF=BF=BD`=EF=BF=BD=EF=BF=BD\200=EF=BF=BD=EF=BF=BD=EF=BF=BD=EF=BF=BD"= , usedev=3DVariable "usedev" > is not available. > ) > at /red/public/freebsd/sources/stable/sys/dev/usb/usb_subr.c:216 > #18 0xffffffff80471b76 in usbd_devinfo (dev=3D0xffffff009c1e4a00, > showclass=3D1, cp=3D0xffffff0122986000 "\001") > at /red/public/freebsd/sources/stable/sys/dev/usb/usb_subr.c:281 > #19 0xffffffff8047243e in usbd_new_device (parent=3D0xffffff0004591900, > bus=3D0xffffff000440a000, depth=3DVariable "depth" is not available. > ) > at /red/public/freebsd/sources/stable/sys/dev/usb/usb_subr.c:861 > #20 0xffffffff80467b5b in uhub_explore (dev=3D0xffffff0004591400) > at /red/public/freebsd/sources/stable/sys/dev/usb/uhub.c:523 > #21 0xffffffff8046f391 in usb_discover (v=3DVariable "v" is not available. > ) at /red/public/freebsd/sources/stable/sys/dev/usb/usb.c:724 > #22 0xffffffff8046fc61 in usb_event_thread (arg=3DVariable "arg" is not > available. > ) at /red/public/freebsd/sources/stable/sys/dev/usb/usb.c:440 > #23 0xffffffff804d05bd in fork_exit (callout=3D0xffffffff8046fbe5 > , arg=3D0xffffff0004598d00, > frame=3D0xfffffffef5b6ac80) > at /red/public/freebsd/sources/stable/sys/kern/kern_fork.c:810 > #24 0xffffffff8077f16e in fork_trampoline () > at /red/public/freebsd/sources/stable/sys/amd64/amd64/exception.S:455 > > > The problem is repeatable. It only happens when I insert the thumb dri= ve > > into a running system. If I boot with the thumb drive present, > > everything is fine. > > > > Any help is greatly appreciated. > > > > Cheers, > > > > -- Norbert Papke. > > > > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D > > > > # uname -a > > FreeBSD proven.lan 7.2-STABLE FreeBSD 7.2-STABLE #0 r191841: Tue May 5 > > 21:13:21 PDT 2009 > > npapke@proven.lan:/usr/obj/red/public/freebsd/sources/stable/sys/PROVEN > > amd64 > > > > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D > > > > Kernel config: > > > > include GENERIC > > ident PROVEN > > > > options KDB # kernel debugger (just in case) > > options KDB_TRACE > > options DDB # kernel debugger (just in case) > > options WITNESS > > options WITNESS_SKIPSPIN > > > > options IPSEC > > device crypto > > device stf # for IPv6 tunneling > > > > # keep kernel messages from different cpus separate > > options PRINTF_BUFR_SIZE=3D64 > > > > option SC_HISTORY_SIZE=3D2000 > > options SC_NORM_ATTR=3D(FG_GREEN|BG_BLACK) > > options SC_NORM_REV_ATTR=3D(FG_YELLOW|BG_GREEN) > > options SC_KERNEL_CONS_ATTR=3D(FG_LIGHTRED|BG_BLACK) > > options SC_KERNEL_CONS_REV_ATTR=3D(FG_BLACK|BG_RED) > > > > # Alternate Queuing of network packets > > options ALTQ > > options ALTQ_CBQ # Class Bases Queuing (CBQ) > > options ALTQ_RED # Random Early Detection (RED) > > options ALTQ_RIO # RED In/Out > > options ALTQ_HFSC # Hierarchical Packet Scheduler (HFSC) > > options ALTQ_PRIQ # Priority Queuing (PRIQ) > > options ALTQ_NOPCC # Required for SMP build > > > > # load as module for debugging > > nodevice re # RealTek 8139C+/8169/8169S/8110S > > > > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D > > > > Copyright (c) 1992-2009 The FreeBSD Project. > > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 > > The Regents of the University of California. All rights reserve= d. > > FreeBSD is a registered trademark of The FreeBSD Foundation. > > FreeBSD 7.2-STABLE #0 r191841: Tue May 5 21:13:21 PDT 2009 > > =20 > > npapke@proven.lan:/usr/obj/red/public/freebsd/sources/stable/sys/PROVEN > > WARNING: WITNESS option enabled, expect reduced performance. > > Timecounter "i8254" frequency 1193182 Hz quality 0 > > CPU: Intel(R) Core(TM)2 Duo CPU E8500 @ 3.16GHz (3155.59-MHz > > K8-class CPU) > > Origin =3D "GenuineIntel" Id =3D 0x1067a Stepping =3D 10 > > > > Features=3D0xbfebfbff >MC A,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE> > > > > Features2=3D0x408e3fd >,P DCM,,XSAVE> AMD Features=3D0x20100800 > > AMD Features2=3D0x1 > > Cores per package: 2 > > usable memory =3D 4279189504 (4080 MB) > > avail memory =3D 4097724416 (3907 MB) > > ACPI APIC Table: <100808 APIC1053> > > FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs > > cpu0 (BSP): APIC ID: 0 > > cpu1 (AP): APIC ID: 1 > > ioapic0 irqs 0-23 on motherboard > > kbd1 at kbdmux0 > > cryptosoft0: on motherboard > > acpi0: <100808 XSDT1053> on motherboard > > acpi0: [ITHREAD] > > acpi0: Power Button (fixed) > > acpi0: reservation of ffc00000, 300000 (3) failed > > acpi0: reservation of fee00000, 1000 (3) failed > > acpi0: reservation of 0, a0000 (3) failed > > acpi0: reservation of 100000, bff00000 (3) failed > > Timecounter "ACPI-safe" frequency 3579545 Hz quality 850 > > acpi_timer0: <24-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0 > > acpi_hpet0: iomem 0xfed00000-0xfed003ff on > > acpi0 Timecounter "HPET" frequency 14318180 Hz quality 900 > > pcib0: port 0xcf8-0xcff on acpi0 > > pci0: on pcib0 > > pcib1: irq 16 at device 1.0 on pci0 > > pci1: on pcib1 > > vgapci0: port 0xc000-0xc0ff mem > > 0xd0000000-0xdfffffff,0xfe9f0000-0xfe9fffff irq 16 at device 0.0 on pci1 > > drm0: on vgapci0 > > info: [drm] MSI enabled 1 message(s) > > vgapci0: child drm0 requested pci_enable_busmaster > > info: [drm] Initialized radeon 1.29.0 20080528 > > hdac0: mem > > 0xfe9ec000-0xfe9effff irq 17 at device 0.1 on pci1 > > hdac0: HDA Driver Revision: 20090329_0131 > > hdac0: [ITHREAD] > > uhci0: port 0xbc00-0xbc1f irq 16 at > > device 26.0 on pci0 > > uhci0: [GIANT-LOCKED] > > uhci0: [ITHREAD] > > usb0: on uhci0 > > usb0: USB revision 1.0 > > uhub0: on usb0 > > uhub0: 2 ports with 2 removable, self powered > > uhci1: port 0xb880-0xb89f irq 21 at > > device 26.1 on pci0 > > uhci1: [GIANT-LOCKED] > > uhci1: [ITHREAD] > > usb1: on uhci1 > > usb1: USB revision 1.0 > > uhub1: on usb1 > > uhub1: 2 ports with 2 removable, self powered > > uhci2: port 0xb800-0xb81f irq 19 at > > device 26.2 on pci0 > > uhci2: [GIANT-LOCKED] > > uhci2: [ITHREAD] > > usb2: on uhci2 > > usb2: USB revision 1.0 > > uhub2: on usb2 > > uhub2: 2 ports with 2 removable, self powered > > ehci0: mem 0xfe8fe000-0xfe8fe3ff irq > > 18 at device 26.7 on pci0 > > ehci0: [GIANT-LOCKED] > > ehci0: [ITHREAD] > > usb3: EHCI version 1.0 > > usb3: companion controllers, 2 ports each: usb0 usb1 usb2 > > usb3: on ehci0 > > usb3: USB revision 2.0 > > uhub3: on usb3 > > uhub3: 6 ports with 6 removable, self powered > > hdac1: mem > > 0xfe8f8000-0xfe8fbfff irq 22 at device 27.0 on pci0 > > hdac1: HDA Driver Revision: 20090329_0131 > > hdac1: [ITHREAD] > > pcib2: irq 17 at device 28.0 on pci0 > > pci2: on pcib2 > > pcib3: irq 16 at device 28.5 on pci0 > > pci3: on pcib3 > > re0: > Gigabit Ethernet> port 0xd800-0xd8ff mem > > 0xfeaff000-0xfeafffff,0xfdff0000-0xfdffffff irq 17 at device 0.0 on pci3 > > re0: Chip rev. 0x3c000000 > > re0: MAC rev. 0x00400000 > > miibus0: on re0 > > rgephy0: PHY 1 on miibus0 > > rgephy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, > > 1000baseT-FDX, auto > > re0: Ethernet address: 00:30:48:b0:6a:1f > > re0: [FILTER] > > uhci3: port 0xb480-0xb49f irq 23 at > > device 29.0 on pci0 > > uhci3: [GIANT-LOCKED] > > uhci3: [ITHREAD] > > usb4: on uhci3 > > usb4: USB revision 1.0 > > uhub4: on usb4 > > uhub4: 2 ports with 2 removable, self powered > > uhci4: port 0xb400-0xb41f irq 19 at > > device 29.1 on pci0 > > uhci4: [GIANT-LOCKED] > > uhci4: [ITHREAD] > > usb5: on uhci4 > > usb5: USB revision 1.0 > > uhub5: on usb5 > > uhub5: 2 ports with 2 removable, self powered > > uhci5: port 0xb080-0xb09f irq 18 at > > device 29.2 on pci0 > > uhci5: [GIANT-LOCKED] > > uhci5: [ITHREAD] > > usb6: on uhci5 > > usb6: USB revision 1.0 > > uhub6: on usb6 > > uhub6: 2 ports with 2 removable, self powered > > ehci1: mem 0xfe8fc000-0xfe8fc3ff irq > > 23 at device 29.7 on pci0 > > ehci1: [GIANT-LOCKED] > > ehci1: [ITHREAD] > > usb7: EHCI version 1.0 > > usb7: companion controllers, 2 ports each: usb4 usb5 usb6 > > usb7: on ehci1 > > usb7: USB revision 2.0 > > uhub7: on usb7 > > uhub7: 6 ports with 6 removable, self powered > > umass0: > > on uhub7 > > pcib4: at device 30.0 on pci0 > > pci4: on pcib4 > > dc0: port 0xe800-0xe8ff mem > > 0xfebffc00-0xfebfffff irq 21 at device 1.0 on pci4 > > miibus1: on dc0 > > acphy0: PHY 1 on miibus1 > > acphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto > > dc0: Ethernet address: 00:20:78:10:3e:98 > > dc0: [ITHREAD] > > isab0: at device 31.0 on pci0 > > isa0: on isab0 > > atapci0: port > > 0xb000-0xb007,0xac00-0xac03,0xa880-0xa887,0xa800-0xa803,0xa480-0xa48f,0= xa > >40 0-0xa40f irq 19 at device 31.2 on pci0 > > atapci0: [ITHREAD] > > ata2: on atapci0 > > ata2: [ITHREAD] > > ata3: on atapci0 > > ata3: [ITHREAD] > > ichsmb0: port 0x400-0x41f mem 0xfe8f7c00-0xfe8f7cff > > irq 18 at device 31.3 on pci0 > > ichsmb0: [GIANT-LOCKED] > > ichsmb0: [ITHREAD] > > smbus0: on ichsmb0 > > smb0: on smbus0 > > atapci1: port > > 0xa000-0xa007,0x9c00-0x9c03,0x9880-0x9887,0x9800-0x9803,0x9480-0x948f,0= x9 > >40 0-0x940f irq 19 at device 31.5 on pci0 > > atapci1: [ITHREAD] > > ata4: on atapci1 > > ata4: [ITHREAD] > > ata5: on atapci1 > > ata5: [ITHREAD] > > acpi_button0: on acpi0 > > sio0: configured irq 4 not in bitmap of probed irqs 0 > > sio0: port may not be enabled > > sio0: configured irq 4 not in bitmap of probed irqs 0 > > sio0: port may not be enabled > > sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on > > acpi0 sio0: type 16550A > > sio0: [FILTER] > > ppc0: port 0x378-0x37f irq 7 on acpi0 > > ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode > > ppbus0: on ppc0 > > ppbus0: [ITHREAD] > > lpt0: on ppbus0 > > lpt0: Interrupt-driven port > > ppi0: on ppbus0 > > plip0: on ppbus0 > > plip0: WARNING: using obsoleted IFF_NEEDSGIANT flag > > ppc0: [GIANT-LOCKED] > > ppc0: [ITHREAD] > > atkbdc0: port 0x60,0x64 irq 1 on acpi0 > > atkbd0: irq 1 on atkbdc0 > > kbd0 at atkbd0 > > atkbd0: [GIANT-LOCKED] > > atkbd0: [ITHREAD] > > psm0: irq 12 on atkbdc0 > > psm0: [GIANT-LOCKED] > > psm0: [ITHREAD] > > psm0: model IntelliMouse, device ID 3 > > cpu0: on acpi0 > > ACPI Warning (tbutils-0243): Incorrect checksum in table [OEMB] - 45, > > should be 40 [20070320] > > coretemp0: on cpu0 > > est0: on cpu0 > > p4tcc0: on cpu0 > > cpu1: on acpi0 > > coretemp1: on cpu1 > > est1: on cpu1 > > est: CPU supports Enhanced Speedstep, but is not recognized. > > est: cpu_vendor GenuineIntel, msr 616492206004922 > > device_attach: est1 attach returned 6 > > p4tcc1: on cpu1 > > orm0: at iomem 0xc0000-0xcffff on isa0 > > sc0: at flags 0x100 on isa0 > > sc0: VGA <16 virtual consoles, flags=3D0x300> > > sio1: configured irq 3 not in bitmap of probed irqs 0 > > sio1: port may not be enabled > > vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on is= a0 > > Timecounters tick every 1.000 msec > > IPsec: Initialized Security Association Processing. > > ad4: 239372MB at ata2-master SATA150 > > ad7: 305245MB at ata3-slave SATA300 > > ad8: 610480MB at ata4-master SATA300 > > GEOM_LABEL: Label for provider ad4s1a is ufsid/497cecd46b0e22e5. > > acd0: DVDR at ata5-master SATA150 > > hdac0: HDA Codec #0: ATI R6xx HDMI > > pcm0: at cad 0 nid 1 on hdac0 > > hdac1: HDA Codec #2: Realtek ALC888 > > hdac1: hdac_command_send_internal: TIMEOUT numcmd=3D1, sent=3D1, receiv= ed=3D0 > > hdac1: hdac_command_send_internal: TIMEOUT numcmd=3D1, sent=3D1, receiv= ed=3D0 > > hdac1: Codec #3 is not responding! Probing aborted. > > pcm1: at cad 2 nid 1 on hdac1 > > pcm2: at cad 2 nid 1 on hdac1 > > pcm3: at cad 2 nid 1 on hdac1 > > acd0: FAILURE - INQUIRY ILLEGAL REQUEST asc=3D0x24 ascq=3D0x00 > > (probe1:umass-sim0:0:0:0): TEST UNIT READY. CDB: 0 0 0 0 0 0 > > (probe1:umass-sim0:0:0:0): CAM Status: SCSI Status Error > > (probe1:umass-sim0:0:0:0): SCSI Status: Check Condition > > (probe1:umass-sim0:0:0:0): UNIT ATTENTION asc:28,0 > > (probe1:umass-sim0:0:0:0): Not ready to ready change, medium may have > > changed (probe1:umass-sim0:0:0:0): Retrying Command (per Sense Data) > > acd0: FAILURE - INQUIRY ILLEGAL REQUEST asc=3D0x24 ascq=3D0x00 > > SMP: AP CPU #1 Launched! > > WARNING: WITNESS option enabled, expect reduced performance. > > da0 at umass-sim0 bus 0 target 0 lun 0 > > da0: Removable Direct Access SCSI-2 device > > da0: 40.000MB/s transfers > > da0: 3830MB (7843840 512 byte sectors: 255H 63S/T 488C) > > cd0 at ata3 bus 0 target 0 lun 0 > > cd0: Removable CD-ROM SCSI-0 device > > cd0: 3.300MB/s transfers > > cd0: cd present [4098336 x 2048 byte records] > > GEOM_LABEL: Label for provider acd0 is > > iso9660/THE_MATRIX_16X9LB_N_AMERICA. Trying to mount root from > > ufs:/dev/ad4s1a > > WARNING: / was not properly dismounted > > WARNING: reducing size to maximum of 67108864 blocks per swap unit > > GEOM_LABEL: Label ufsid/497cecd46b0e22e5 removed. > > GEOM_LABEL: Label for provider ad4s1a is ufsid/497cecd46b0e22e5. > > GEOM_LABEL: Label ufsid/497cecd46b0e22e5 removed. > > This module (opensolaris) contains code covered by the > > Common Development and Distribution License (CDDL) > > see http://opensolaris.org/os/licensing/opensolaris_license/ > > WARNING: ZFS is considered to be an experimental feature in FreeBSD. > > ZFS filesystem version 6 > > ZFS storage pool version 6 > > lock order reversal: > > 1st 0xffffffff80e49de0 pf task mtx (pf task mtx) > > @ > > /red/public/freebsd/sources/stable/sys/modules/pf/../../contrib/pf/net/= pf > >_i octl.c:1394 2nd 0xffffffff80ba94c0 ifnet (ifnet) > > @ /red/public/freebsd/sources/stable/sys/net/if.c:1623 > > KDB: stack backtrace: > > db_trace_self_wrapper() at db_trace_self_wrapper+0x2a > > witness_checkorder() at witness_checkorder+0x543 > > _mtx_lock_flags() at _mtx_lock_flags+0x1f > > ifunit() at ifunit+0x24 > > pfioctl() at pfioctl+0x2531 > > devfs_ioctl_f() at devfs_ioctl_f+0x71 > > kern_ioctl() at kern_ioctl+0x91 > > ioctl() at ioctl+0xeb > > syscall() at syscall+0x1a5 > > Xfast_syscall() at Xfast_syscall+0xab > > --- syscall (54, FreeBSD ELF64, ioctl), rip =3D 0x80096296c, rsp =3D > > 0x7fffffffdc18, rbp =3D 0x7fffffffdca0 --- > > kqemu version 0x00010400 > > kqemu: KQEMU installed, max_locked_mem=3D2089448kB. > > acd0: FAILURE - READ_BIG timed out > > acd0: FAILURE - READ_BIG timed out > > acd0: FAILURE - READ_BIG timed out > > info: [drm] Setting GART location based on new memory map > > info: [drm] Loading RV635 CP Microcode > > info: [drm] Loading RV635 PFP Microcode > > info: [drm] Resetting GPU > > info: [drm] writeback test succeeded in 1 usecs > > drm0: [ITHREAD] > > acd0: FAILURE - READ_BIG timed out > > acd0: FAILURE - READ_BIG timed out > > (cd0:ata3:0:0:0): cddone: got error 0x5 back > > tap0: Ethernet address: 00:bd:d8:9b:04:00 > > bridge0: Ethernet address: fa:cc:68:2e:a4:8e > > tap0: promiscuous mode enabled > > dc0: promiscuous mode enabled > > > > _______________________________________________ > > freebsd-stable@freebsd.org mailing list > > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.or= g" =2D-=20 =2D- Norbert Papke. npapke@acm.org From owner-freebsd-stable@FreeBSD.ORG Tue May 12 07:25:41 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5B64A106564A for ; Tue, 12 May 2009 07:25:41 +0000 (UTC) (envelope-from petefrench@ticketswitch.com) Received: from constantine.ticketswitch.com (constantine.ticketswitch.com [IPv6:2002:57e0:1d4e:1::3]) by mx1.freebsd.org (Postfix) with ESMTP id 21C958FC14 for ; Tue, 12 May 2009 07:25:40 +0000 (UTC) (envelope-from petefrench@ticketswitch.com) Received: from dilbert.rattatosk ([10.64.50.6] helo=dilbert.ticketswitch.com) by constantine.ticketswitch.com with esmtps (TLSv1:AES256-SHA:256) (Exim 4.69 (FreeBSD)) (envelope-from ) id 1M3mMs-000GV9-Tb; Tue, 12 May 2009 08:25:38 +0100 Received: from petefrench by dilbert.ticketswitch.com with local (Exim 4.69 (FreeBSD)) (envelope-from ) id 1M3mMs-0006yd-S7; Tue, 12 May 2009 08:25:38 +0100 To: fj@panix.com, freebsd-stable@freebsd.org In-Reply-To: <20090512014733.GA12271@panix.com> Message-Id: From: Pete French Date: Tue, 12 May 2009 08:25:38 +0100 Cc: Subject: Re: Error message: run_interrupt_driven_hooks:... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 07:25:41 -0000 > Basic data on my experience with the xpt_config hang; I have more > detail if needed, but I doubt anyone will believe it. I'm not even > sure I do. I am not sure what you mean by that ... something odd about the hang ? For what it's worth, I have also seen this - I get (or got) precisely the same error when trying to boot FreeBSD on an MSI 790GX motherboard. As on last weekend I have replaced it with a 790FX and now everything works fine - but ti means I can't do anymore bug hunting. From a quick glance at these reports I think all the people with this problem are using AMD AM2 motherboards aren't they ? -pete. From owner-freebsd-stable@FreeBSD.ORG Tue May 12 09:06:22 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B96901065675 for ; Tue, 12 May 2009 09:06:22 +0000 (UTC) (envelope-from bra@fsn.hu) Received: from people.fsn.hu (people.fsn.hu [195.228.252.137]) by mx1.freebsd.org (Postfix) with ESMTP id 16C828FC29 for ; Tue, 12 May 2009 09:06:21 +0000 (UTC) (envelope-from bra@fsn.hu) Message-ID: <4A09382F.5010109@fsn.hu> Date: Tue, 12 May 2009 10:49:51 +0200 From: Attila Nagy User-Agent: Thunderbird 2.0.0.21 (X11/20090318) MIME-Version: 1.0 To: freebsd-stable@freebsd.org X-Stationery: 0.4.8.14 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (people.fsn.hu [0.0.0.0]); Tue, 12 May 2009 10:49:55 +0200 (CEST) Subject: stat() takes 54 msec in a directory with 94k files (even with a big dirhash) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 09:06:23 -0000 Hello, I have a strange error on FreeBSD 7-STABLE (compiled on 7th May, just few commits after the release, but an earlier kernel did the same). I'm doing several parallel rsyncs from a machine to another (let's call them source and destination). The source contains maildirs, so there are some directories with a (relatively) lot of files. The source runs an earlier (around 6.2) FreeBSD and plain softupdates mounted UFS2 file systems. The destination has a bigger (UFS2) filesystem, on top of gjournal, mounted as async. I've noticed that rsync sometimes stops moving data and the destination machine gets sluggish. After some testing, I could catch the effect in action (was not that hard, because it persists even for hours sometimes). top shows around 20% system activity (there are two quad core CPUs) and 0% user. The WCPU field at rsync shows 100%. ktrace-ing the rsync process I can see this: 31639 rsync 0.000004 CALL lstat(0x7fffffffab70,0x7fffffffaf70) 31639 rsync 0.000004 NAMI "hm33/00/16/uid/Maildir/new/1212536121.54673,S=3128" 31639 rsync 0.054226 STRU struct stat {dev=100, ino=136943662, mode=-rw------- , nlink=1, uid=999, gid=999, rdev=546942760, atime=1241807071, stime=1212536121, ctime=1241807071, birthtime=1212536121, size=3128, blksize=4096, blocks=8, flags=0x0 } 31639 rsync 0.000013 RET lstat 0 31639 rsync 0.000018 CALL lstat(0x7fffffffab70,0x7fffffffaf70) 31639 rsync 0.000004 NAMI "hm33/00/16/uid/Maildir/new/1212537276.69702,S=4634" 31639 rsync 0.054409 STRU struct stat {dev=100, ino=136943663, mode=-rw------- , nlink=1, uid=999, gid=999, rdev=546942762, atime=1241807071, stime=1212537276, ctime=1241807071, birthtime=1212537276, size=4634, blksize=4096, blocks=12, flags=0x0 } 31639 rsync 0.000013 RET lstat 0 31639 rsync 0.000020 CALL lstat(0x7fffffffab70,0x7fffffffaf70) 31639 rsync 0.000005 NAMI "hm33/00/16/uid/Maildir/new/1212537689.74390,S=3172" 31639 rsync 0.054230 STRU struct stat {dev=100, ino=136943664, mode=-rw------- , nlink=1, uid=999, gid=999, rdev=546942765, atime=1241807071, stime=1212537689, ctime=1241807071, birthtime=1212537689, size=3172, blksize=4096, blocks=8, flags=0x0 } 31639 rsync 0.000013 RET lstat 0 So according to ktrace, the stat call takes 54 milliseconds to return for each of the files. I have tried with the default and a pretty much raised dirhash maxmem value, but I can still get these. Currently I have: vfs.ufs.dirhash_docheck: 0 vfs.ufs.dirhash_mem: 18589428 vfs.ufs.dirhash_maxmem: 209715200 vfs.ufs.dirhash_minsize: 2560 So dirhash has space to expand. The directory in question contains 94493 files. The source machine doesn't show this behaviour. top's output on the destination machine: CPU: 0.0% user, 0.0% nice, 22.7% system, 0.0% interrupt, 77.3% idle Mem: 159M Active, 3032M Inact, 599M Wired, 47M Cache, 399M Buf, 102M Free Swap: 4096M Total, 4096M Free PID USERNAME THR PRI NICE SIZE RES STATE C TIME WCPU COMMAND 31639 root 1 118 0 50648K 10512K CPU0 0 2:01 100.00% rsync 634 root 1 -4 0 2536K 628K vlruwk 1 0:20 0.00% supervise 26760 root 1 44 0 25940K 3316K select 1 0:10 0.00% sshd 31640 root 1 75 0 87512K 8324K suspfs 4 0:10 0.00% rsync 31641 root 1 75 0 18904K 7124K suspfs 6 0:10 0.00% rsync 31637 root 1 75 0 40408K 7744K suspfs 4 0:09 0.00% rsync 31636 root 1 44 0 20952K 6288K select 2 0:09 0.00% rsync 31638 root 1 44 0 104M 8912K select 3 0:09 0.00% rsync 31635 root 1 75 0 80344K 7812K suspfs 4 0:09 0.00% rsync 31642 root 1 44 0 17940K 7624K select 1 0:04 0.00% ssh 31646 root 1 45 0 17940K 7656K select 1 0:03 0.00% ssh All of the rsyncs use the same file system, but with different top level directories. During this, neither of the other rsyncs can run. Any ideas about what could be done to work around this? Thanks, From owner-freebsd-stable@FreeBSD.ORG Tue May 12 10:26:50 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E29C5106566B for ; Tue, 12 May 2009 10:26:49 +0000 (UTC) (envelope-from onemda@gmail.com) Received: from fg-out-1718.google.com (fg-out-1718.google.com [72.14.220.156]) by mx1.freebsd.org (Postfix) with ESMTP id 796C48FC08 for ; Tue, 12 May 2009 10:26:46 +0000 (UTC) (envelope-from onemda@gmail.com) Received: by fg-out-1718.google.com with SMTP id e12so701142fga.12 for ; Tue, 12 May 2009 03:26:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=REYZtF1dgvwTfZ+WWnmg2J58b/yRwfxCqsYDR2nL7cI=; b=dGGkSITVoG6t6dPI12YALGg++PU8/whKAMrWD/uSwgr00WA4GfG4o1YqBaSiqq4yDu gfLfyrad79FPLQ1fm/4o5VPE+Ux8NIpuwuvhV1ol3yDbmfZDkqswLxFEAY6yc4fFtV4R TMmYozAPzyVCgYGudmpBAaq0kkSbC0hhhuTHo= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=MKbC7BNqXYqz/+rVEMX7UpZogsMQhLMqhCqoL2QHeOKBuK59yW7jmzh2CQWHm1iY6B x5CsU9DyNSV7YIYRWWfDTe+bd9R4PVbqAmlAhAlwhTEC8B43g8iyXgyt3JSuKzhvDnOk /gR78TUE/HIjwlFD+3VObrkumAZF9R9eZaY5g= MIME-Version: 1.0 Received: by 10.239.154.83 with SMTP id d19mr645719hbc.33.1242124005288; Tue, 12 May 2009 03:26:45 -0700 (PDT) In-Reply-To: <4A09382F.5010109@fsn.hu> References: <4A09382F.5010109@fsn.hu> Date: Tue, 12 May 2009 12:26:45 +0200 Message-ID: <3a142e750905120326m165f4fdeld8d01eb305a1771c@mail.gmail.com> From: "Paul B. Mahol" To: Attila Nagy Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: stat() takes 54 msec in a directory with 94k files (even with a big dirhash) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 10:26:50 -0000 On 5/12/09, Attila Nagy wrote: > Hello, > > I have a strange error on FreeBSD 7-STABLE (compiled on 7th May, just > few commits after the release, but an earlier kernel did the same). > > I'm doing several parallel rsyncs from a machine to another (let's call > them source and destination). The source contains maildirs, so there are > some directories with a (relatively) lot of files. > The source runs an earlier (around 6.2) FreeBSD and plain softupdates > mounted UFS2 file systems. > The destination has a bigger (UFS2) filesystem, on top of gjournal, > mounted as async. > > I've noticed that rsync sometimes stops moving data and the destination > machine gets sluggish. After some testing, I could catch the effect in > action (was not that hard, because it persists even for hours sometimes). > > top shows around 20% system activity (there are two quad core CPUs) and > 0% user. The WCPU field at rsync shows 100%. > > ktrace-ing the rsync process I can see this: > 31639 rsync 0.000004 CALL lstat(0x7fffffffab70,0x7fffffffaf70) > 31639 rsync 0.000004 NAMI > "hm33/00/16/uid/Maildir/new/1212536121.54673,S=3128" > 31639 rsync 0.054226 STRU struct stat {dev=100, ino=136943662, > mode=-rw------- , nlink=1, uid=999, gid=999, rdev=546942760, > atime=1241807071, stime=1212536121, ctime=1241807071, > birthtime=1212536121, size=3128, blksize=4096, blocks=8, flags=0x0 } > 31639 rsync 0.000013 RET lstat 0 > 31639 rsync 0.000018 CALL lstat(0x7fffffffab70,0x7fffffffaf70) > 31639 rsync 0.000004 NAMI > "hm33/00/16/uid/Maildir/new/1212537276.69702,S=4634" > 31639 rsync 0.054409 STRU struct stat {dev=100, ino=136943663, > mode=-rw------- , nlink=1, uid=999, gid=999, rdev=546942762, > atime=1241807071, stime=1212537276, ctime=1241807071, > birthtime=1212537276, size=4634, blksize=4096, blocks=12, flags=0x0 } > 31639 rsync 0.000013 RET lstat 0 > 31639 rsync 0.000020 CALL lstat(0x7fffffffab70,0x7fffffffaf70) > 31639 rsync 0.000005 NAMI > "hm33/00/16/uid/Maildir/new/1212537689.74390,S=3172" > 31639 rsync 0.054230 STRU struct stat {dev=100, ino=136943664, > mode=-rw------- , nlink=1, uid=999, gid=999, rdev=546942765, > atime=1241807071, stime=1212537689, ctime=1241807071, > birthtime=1212537689, size=3172, blksize=4096, blocks=8, flags=0x0 } > 31639 rsync 0.000013 RET lstat 0 > > So according to ktrace, the stat call takes 54 milliseconds to return > for each of the files. > I have tried with the default and a pretty much raised dirhash maxmem > value, but I can still get these. > Currently I have: > vfs.ufs.dirhash_docheck: 0 > vfs.ufs.dirhash_mem: 18589428 > vfs.ufs.dirhash_maxmem: 209715200 > vfs.ufs.dirhash_minsize: 2560 > So dirhash has space to expand. > > The directory in question contains 94493 files. > > The source machine doesn't show this behaviour. > > top's output on the destination machine: > CPU: 0.0% user, 0.0% nice, 22.7% system, 0.0% interrupt, 77.3% idle > Mem: 159M Active, 3032M Inact, 599M Wired, 47M Cache, 399M Buf, 102M Free > Swap: 4096M Total, 4096M Free > > PID USERNAME THR PRI NICE SIZE RES STATE C TIME WCPU COMMAND > 31639 root 1 118 0 50648K 10512K CPU0 0 2:01 100.00% rsync > 634 root 1 -4 0 2536K 628K vlruwk 1 0:20 0.00% supervise > 26760 root 1 44 0 25940K 3316K select 1 0:10 0.00% sshd > 31640 root 1 75 0 87512K 8324K suspfs 4 0:10 0.00% rsync > 31641 root 1 75 0 18904K 7124K suspfs 6 0:10 0.00% rsync > 31637 root 1 75 0 40408K 7744K suspfs 4 0:09 0.00% rsync > 31636 root 1 44 0 20952K 6288K select 2 0:09 0.00% rsync > 31638 root 1 44 0 104M 8912K select 3 0:09 0.00% rsync > 31635 root 1 75 0 80344K 7812K suspfs 4 0:09 0.00% rsync > 31642 root 1 44 0 17940K 7624K select 1 0:04 0.00% ssh > 31646 root 1 45 0 17940K 7656K select 1 0:03 0.00% ssh > > All of the rsyncs use the same file system, but with different top level > directories. During this, neither of the other rsyncs can run. > > Any ideas about what could be done to work around this? Big guess, maybe it updates atime? Try with noatime mount option. -- Paul From owner-freebsd-stable@FreeBSD.ORG Tue May 12 11:52:07 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 050821065670 for ; Tue, 12 May 2009 11:52:07 +0000 (UTC) (envelope-from db@danielbond.org) Received: from mail.nsn.no (mailtwo.nsn.no [62.89.38.161]) by mx1.freebsd.org (Postfix) with SMTP id 668E18FC19 for ; Tue, 12 May 2009 11:52:05 +0000 (UTC) (envelope-from db@danielbond.org) Received: (qmail 89268 invoked by uid 0); 12 May 2009 11:52:04 -0000 Received: from unknown (HELO ?172.16.3.90?) (85.95.44.187) by mail.nsn.no with SMTP; 12 May 2009 11:52:04 -0000 Message-Id: From: Daniel Bond To: freebsd-stable@freebsd.org In-Reply-To: <49E8D18C.4070603@comcast.net> Content-Type: multipart/signed; protocol="application/pgp-signature"; micalg=pgp-sha1; boundary="Apple-Mail-11--318569631" Content-Transfer-Encoding: 7bit Mime-Version: 1.0 (Apple Message framework v930.3) Date: Tue, 12 May 2009 13:51:58 +0200 References: <34B37CEC-AF7A-48EE-81F5-7B19291F99EF@danielbond.org> <49E8D18C.4070603@comcast.net> X-Pgp-Agent: GPGMail 1.2.0 (v56) X-Mailer: Apple Mail (2.930.3) Cc: des@des.no, "O. Hartmann" , Steve Polyack Subject: Re: PAM completeness and standardization [PR:bin/71290] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 11:52:07 -0000 This is an OpenPGP/MIME signed message (RFC 2440 and 3156) --Apple-Mail-11--318569631 Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Hi Steve and Oliver, thanks for your replies. Sorry it has taken me some time to reply. I'm willing to put in some time into this issue too, maybe we could do a joint effort on this? The problem report with the most information in is http://www.freebsd.org/cgi/query-pr.cgi?pr=bin/71290 - DES has some good reasons, for why the patch has not been included in FreeBSD. Here are some of my viewpoints about the comments in the ticket. - I think it is really important we preserve all command-line options, and do not break any existing functionality what so ever. - I also think exposing PAM code for changing passwords is a good thing. Either we want PAM support in FreeBSD, or we don't. If we do, we need to support the PAM core features - exposing this code is necessary, and the code needs to be polished accordingly. - The documentation changes is nice to have, let's think about this when we are happy with the other stuff. I have a NetBSD 5.0 installation on my private server, I'll start looking at how they have implemented PAM. Any comments? Pointers to code that would need cleanup? Anything we need to be extra careful with? Best regards, Daniel. -- GPG public key: EDE9C925 On Apr 17, 2009, at 8:59 PM, Steve Polyack wrote: > Daniel Bond wrote: >> FreeBSD has excellent PAM-support, except for the passwd-command. >> The passwd-command gained PAM support quite a while ago, but there >> is a test preventing it from working with PAM. >> There have been outstanding PR's for this minor issue for years >> now, I think it's time this one got fixed. People find it >> frustrating that they can't change their passwords (LDAP etc), like >> they can in a normal PAM-based system. >> >> >> I'd be happy to fix whatever needs to be done, but I need to know >> why it's not been fixed / what needs to be done for it to be >> accepted by the community. > > I've looked at this recently and came to a roadblock after > sufficiently modifying passwd code (removing the test and an > additional few lines) as well as including the proper lines in /etc/ > pam.d/sshd. I can't recally the exact problem I had. I will > probably give this another go in the future, so I am willing to put > in some time on this issue. > > Anyways, I don't have a reason for you as to why it hasn't been > fixed or accepted yet. It is a long-standing issue from what I > understand. > --Apple-Mail-11--318569631 content-type: application/pgp-signature; x-mac-type=70674453; name=PGP.sig content-description: This is a digitally signed message part content-disposition: inline; filename=PGP.sig content-transfer-encoding: 7bit -----BEGIN PGP SIGNATURE----- Version: GnuPG/MacGPG2 v2.0.11 (Darwin) iEYEARECAAYFAkoJYuQACgkQF4Ca8+3pySWClQCgm1lXy3ag5P9bGssztKc4ahMJ gb0AoJIqXnzx0+0bf1zxExT+/lr+GPDo =C7AN -----END PGP SIGNATURE----- --Apple-Mail-11--318569631-- From owner-freebsd-stable@FreeBSD.ORG Tue May 12 13:06:36 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5E70E1065675 for ; Tue, 12 May 2009 13:06:36 +0000 (UTC) (envelope-from des@des.no) Received: from tim.des.no (tim.des.no [194.63.250.121]) by mx1.freebsd.org (Postfix) with ESMTP id 202AC8FC15 for ; Tue, 12 May 2009 13:06:35 +0000 (UTC) (envelope-from des@des.no) Received: from ds4.des.no (des.no [84.49.246.2]) by smtp.des.no (Postfix) with ESMTP id 9AA236D449; Tue, 12 May 2009 14:48:08 +0200 (CEST) Received: by ds4.des.no (Postfix, from userid 1001) id 81303844BD; Tue, 12 May 2009 14:48:08 +0200 (CEST) From: =?utf-8?Q?Dag-Erling_Sm=C3=B8rgrav?= To: Daniel Bond References: <34B37CEC-AF7A-48EE-81F5-7B19291F99EF@danielbond.org> <49E8D18C.4070603@comcast.net> Date: Tue, 12 May 2009 14:48:08 +0200 In-Reply-To: (Daniel Bond's message of "Tue, 12 May 2009 13:51:58 +0200") Message-ID: <867i0m5vxz.fsf@ds4.des.no> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.0.92 (berkeley-unix) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Cc: "O. Hartmann" , freebsd-stable@freebsd.org, Steve Polyack Subject: Re: PAM completeness and standardization [PR:bin/71290] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 13:06:36 -0000 Daniel Bond writes: > I have a NetBSD 5.0 installation on my private server, I'll start > looking at how they have implemented PAM. Specifically, you should look at how they've adapted their passwd(1) and what pam_sm_chauthtok() looks like in their PAM modules. DES --=20 Dag-Erling Sm=C3=B8rgrav - des@des.no From owner-freebsd-stable@FreeBSD.ORG Tue May 12 14:23:17 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 967B5106564A for ; Tue, 12 May 2009 14:23:17 +0000 (UTC) (envelope-from yani@pi-greece.eu) Received: from rosebud.otenet.gr (rosebud.otenet.gr [83.235.67.32]) by mx1.freebsd.org (Postfix) with ESMTP id EEBD38FC21 for ; Tue, 12 May 2009 14:23:16 +0000 (UTC) (envelope-from yani@pi-greece.eu) Received: from techmx01.pi-greece.eu (athedsl-150222.home.otenet.gr [85.75.130.108]) by rosebud.otenet.gr (8.13.8/8.13.8/Debian-3) with SMTP id n4CE1793001023 for ; Tue, 12 May 2009 17:01:07 +0300 Received: (qmail 3105 invoked by uid 0); 12 May 2009 17:01:07 +0300 Received: from 192.168.1.16 by techserver.pi-greece.eu (envelope-from , uid 0) with qmail-scanner-1.25 (clamdscan: 0.95.1/9270. spamassassin: 3.2.5. Clear:RC:0(192.168.1.16):SA:0(-4.4/5.0):. Processed in 1.756373 secs); 12 May 2009 14:01:07 -0000 X-Spam-Status: No, hits=-4.4 required=5.0 X-Qmail-Scanner-Mail-From: yani@pi-greece.eu via techserver.pi-greece.eu X-Qmail-Scanner: 1.25 (Clear:RC:0(192.168.1.16):SA:0(-4.4/5.0):. Processed in 1.756373 secs) Received: from quadmatrix.pi-office (HELO ?127.0.0.1?) (yani@pi-greece.eu@192.168.1.16) by techserver.pi-greece.eu with SMTP; 12 May 2009 17:01:05 +0300 Message-ID: <4A098115.3040605@pi-greece.eu> Date: Tue, 12 May 2009 17:00:53 +0300 From: Yani Karydis User-Agent: Thunderbird 2.0.0.21 (Windows/20090302) MIME-Version: 1.0 To: freebsd-stable@freebsd.org Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Antivirus: avast! (VPS 090511-0, 11/05/2009), Outbound message X-Antivirus-Status: Clean Subject: CAM Status: SCSI Status Error on 7.2-RELEASE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 14:23:17 -0000 Hello, Since upgrading to 7.2-RELEASE, dmesg displays the following after booting the system. (probe3:ahc0:0:3:0): TEST UNIT READY. CDB: 0 0 0 0 0 0 (probe3:ahc0:0:3:0): CAM Status: SCSI Status Error (probe3:ahc0:0:3:0): SCSI Status: Check Condition (probe3:ahc0:0:3:0): UNIT ATTENTION asc:29,0 (probe3:ahc0:0:3:0): Power on, reset, or bus device reset occurred (probe3:ahc0:0:3:0): Retrying Command (per Sense Data) (probe3:ahc0:0:3:0): TEST UNIT READY. CDB: 0 0 0 0 0 0 (probe3:ahc0:0:3:0): CAM Status: SCSI Status Error (probe3:ahc0:0:3:0): SCSI Status: Check Condition (probe3:ahc0:0:3:0): NOT READY asc:3a,0 (probe3:ahc0:0:3:0): Medium not present (probe3:ahc0:0:3:0): Unretryable error sa0 at ahc0 bus 0 target 3 lun 0 sa0: Removable Sequential Access SCSI-3 device sa0: 20.000MB/s transfers (10.000MHz, offset 8, 16bit) Is CAM trying to read the capabilities of the Ultrium tape drive? I've never seen these messages before, the system was upgraded straight from 7.1-RELEASE to 7.2-RELEASE. Full dmesg below: Copyright (c) 1992-2009 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 7.2-RELEASE #61: Tue May 12 00:23:20 EEST 2009 root@techserver.pi-greece.eu:/usr/obj/usr/src/sys/TECHSERVER7 Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: AMD Sempron(tm) 2300+ (1585.75-MHz 686-class CPU) Origin = "AuthenticAMD" Id = 0x681 Stepping = 1 Features=0x383fbff AMD Features=0xc0480800 real memory = 1073676288 (1023 MB) avail memory = 1041469440 (993 MB) ACPI APIC Table: ioapic0 irqs 0-23 on motherboard kbd1 at kbdmux0 acpi0: on motherboard acpi0: [ITHREAD] acpi0: Power Button (fixed) acpi0: reservation of 0, a0000 (3) failed acpi0: reservation of 100000, 3fef0000 (3) failed acpi_button0: on acpi0 pcib0: port 0xcf8-0xcff on acpi0 pci0: on pcib0 agp0: on hostb0 agp0: aperture size is 32M pcib1: at device 1.0 on pci0 pci1: on pcib1 vgapci0: port 0xa000-0xa0ff mem 0xd0000000-0xd7ffffff,0xe0000000-0xe000ffff at device 0.0 on pci1 em0: port 0xb000-0xb03f mem 0xe1000000-0xe101ffff,0xe1020000-0xe103ffff irq 17 at device 9.0 on pci0 em0: [FILTER] em0: Ethernet address: 00:07:e9:49:18:b3 aac0: mem 0xd8000000-0xdbffffff irq 18 at device 10.0 on pci0 aac0: Enable Raw I/O aac0: New comm. interface enabled aac0: [ITHREAD] aac0: Adaptec 2610SA, aac driver 2.0.0-1 ahc0: port 0xb400-0xb4ff mem 0xe1040000-0xe1040fff irq 19 at device 11.0 on pci0 ahc0: [ITHREAD] aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs atapci0: port 0xb800-0xb807,0xbc00-0xbc03,0xc000-0xc007,0xc400-0xc403,0xc800-0xc80f,0xcc00-0xccff irq 20 at device 15.0 on pci0 atapci0: [ITHREAD] ata2: on atapci0 ata2: [ITHREAD] ata3: on atapci0 ata3: [ITHREAD] atapci1: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xd000-0xd00f at device 15.1 on pci0 ata0: on atapci1 ata0: [ITHREAD] ata1: on atapci1 ata1: [ITHREAD] uhci0: port 0xd400-0xd41f irq 21 at device 16.0 on pci0 uhci0: [GIANT-LOCKED] uhci0: [ITHREAD] usb0: on uhci0 usb0: USB revision 1.0 uhub0: on usb0 uhub0: 2 ports with 2 removable, self powered uhci1: port 0xd800-0xd81f irq 21 at device 16.1 on pci0 uhci1: [GIANT-LOCKED] uhci1: [ITHREAD] usb1: on uhci1 usb1: USB revision 1.0 uhub1: on usb1 uhub1: 2 ports with 2 removable, self powered uhci2: port 0xdc00-0xdc1f irq 21 at device 16.2 on pci0 uhci2: [GIANT-LOCKED] uhci2: [ITHREAD] usb2: on uhci2 usb2: USB revision 1.0 uhub2: on usb2 uhub2: 2 ports with 2 removable, self powered uhci3: port 0xe000-0xe01f irq 21 at device 16.3 on pci0 uhci3: [GIANT-LOCKED] uhci3: [ITHREAD] usb3: on uhci3 usb3: USB revision 1.0 uhub3: on usb3 uhub3: 2 ports with 2 removable, self powered ehci0: mem 0xe1041000-0xe10410ff irq 21 at device 16.4 on pci0 ehci0: [GIANT-LOCKED] ehci0: [ITHREAD] usb4: EHCI version 1.0 usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3 usb4: on ehci0 usb4: USB revision 2.0 uhub4: on usb4 uhub4: 8 ports with 8 removable, self powered isab0: at device 17.0 on pci0 isa0: on isab0 vr0: port 0xe400-0xe4ff mem 0xe1042000-0xe10420ff irq 23 at device 18.0 on pci0 vr0: Quirks: 0x0 vr0: Revision: 0x78 miibus0: on vr0 ukphy0: PHY 1 on miibus0 ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto vr0: Ethernet address: 00:0f:ea:e0:eb:cc vr0: [ITHREAD] fdc0: port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0 fdc0: [FILTER] fd0: <1440-KB 3.5" drive> on fdc0 drive 0 sio0: configured irq 4 not in bitmap of probed irqs 0 sio0: port may not be enabled sio0: configured irq 4 not in bitmap of probed irqs 0 sio0: port may not be enabled sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A sio0: [FILTER] sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 sio1: type 16550A sio1: [FILTER] atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] atkbd0: [ITHREAD] psm0: irq 12 on atkbdc0 psm0: [GIANT-LOCKED] psm0: [ITHREAD] psm0: model IntelliMouse Explorer, device ID 4 cpu0: on acpi0 pmtimer0 on isa0 orm0: at iomem 0xc0000-0xcafff,0xcc000-0xccfff,0xcd000-0xd0fff,0xd1000-0xd17ff pnpid ORM0000 on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 ppc0: at port 0x378-0x37f irq 7 on isa0 ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode ppbus0: on ppc0 ppbus0: [ITHREAD] ppi0: on ppbus0 lpt0: on ppbus0 lpt0: Interrupt-driven port ppc0: [GIANT-LOCKED] ppc0: [ITHREAD] ucom0: on uhub1 Timecounter "TSC" frequency 1585747295 Hz quality 800 Timecounters tick every 1.000 msec ad0: 305244MB at ata0-master UDMA100 acd0: DVDR at ata1-master UDMA33 Waiting 5 seconds for SCSI devices to settle aacd0: on aac0 aacd0: 953812MB (1953406976 sectors) GEOM_LABEL: Label for provider fd0 is ufsid/4832e0ff7a2f794e. GEOM_LABEL: Label for provider fd0 is ufs/SYSBACKUP. GEOM_LABEL: Label for provider aacd0s1 is ufsid/47e2ca3bf4f15248. GEOM_LABEL: Label for provider aacd0s1 is ufs/STORAGE. GEOM_LABEL: Label for provider ad0s1a is ufsid/47f7e9dbde92bc03. GEOM_LABEL: Label for provider ad0s1a is ufs/TECHROOT. GEOM_LABEL: Label for provider ad0s1d is ufsid/47f7e7298a3ee160. GEOM_LABEL: Label for provider ad0s1d is ufs/TECHVAR. GEOM_LABEL: Label for provider ad0s1e is ufsid/47f7e718650b6911. GEOM_LABEL: Label for provider ad0s1e is ufs/TECHTMP. GEOM_LABEL: Label for provider ad0s1f is ufsid/47f8be74c998ed29. GEOM_LABEL: Label for provider ad0s1f is ufs/TECHUSR. (probe3:ahc0:0:3:0): TEST UNIT READY. CDB: 0 0 0 0 0 0 (probe3:ahc0:0:3:0): CAM Status: SCSI Status Error (probe3:ahc0:0:3:0): SCSI Status: Check Condition (probe3:ahc0:0:3:0): UNIT ATTENTION asc:29,0 (probe3:ahc0:0:3:0): Power on, reset, or bus device reset occurred (probe3:ahc0:0:3:0): Retrying Command (per Sense Data) (probe3:ahc0:0:3:0): TEST UNIT READY. CDB: 0 0 0 0 0 0 (probe3:ahc0:0:3:0): CAM Status: SCSI Status Error (probe3:ahc0:0:3:0): SCSI Status: Check Condition (probe3:ahc0:0:3:0): NOT READY asc:3a,0 (probe3:ahc0:0:3:0): Medium not present (probe3:ahc0:0:3:0): Unretryable error sa0 at ahc0 bus 0 target 3 lun 0 sa0: Removable Sequential Access SCSI-3 device sa0: 20.000MB/s transfers (10.000MHz, offset 8, 16bit) Trying to mount root from ufs:/dev/ufs/TECHROOT GEOM_LABEL: Label ufsid/47f7e9dbde92bc03 removed. GEOM_LABEL: Label ufsid/47f7e718650b6911 removed. GEOM_LABEL: Label ufsid/47f8be74c998ed29 removed. GEOM_LABEL: Label ufsid/47e2ca3bf4f15248 removed. GEOM_LABEL: Label ufsid/47f7e7298a3ee160 removed. GEOM_LABEL: Label for provider aacd0s1c is ufsid/47e2ca3bf4f15248. GEOM_LABEL: Label ufsid/47e2ca3bf4f15248 removed. em0: link state changed to UP lagg0: link state changed to UP vr0: link state changed to UP Thanks and regards, Yani Karydis From owner-freebsd-stable@FreeBSD.ORG Tue May 12 15:18:24 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 69173106566B for ; Tue, 12 May 2009 15:18:24 +0000 (UTC) (envelope-from rnoland@FreeBSD.org) Received: from gizmo.2hip.net (gizmo.2hip.net [64.74.207.195]) by mx1.freebsd.org (Postfix) with ESMTP id 39A188FC12 for ; Tue, 12 May 2009 15:18:24 +0000 (UTC) (envelope-from rnoland@FreeBSD.org) Received: from [192.168.1.4] (adsl-19-244-249.bna.bellsouth.net [68.19.244.249]) (authenticated bits=0) by gizmo.2hip.net (8.14.3/8.14.3) with ESMTP id n4CFIIwX025781 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Tue, 12 May 2009 11:18:19 -0400 (EDT) (envelope-from rnoland@FreeBSD.org) From: Robert Noland To: David Johnson In-Reply-To: <200905091841.26274.david@usermode.org> References: <200905042015.29394.david@usermode.org> <200905081458.53651.david@usermode.org> <1241821864.1733.51.camel@balrog.2hip.net> <200905091841.26274.david@usermode.org> Content-Type: multipart/signed; micalg="pgp-sha1"; protocol="application/pgp-signature"; boundary="=-6z9lvAbgfNdiPT5GLBTo" Organization: FreeBSD Date: Tue, 12 May 2009 10:17:51 -0500 Message-Id: <1242141471.1755.11.camel@balrog.2hip.net> Mime-Version: 1.0 X-Mailer: Evolution 2.26.1.1 FreeBSD GNOME Team Port X-Spam-Status: No, score=-3.2 required=5.0 tests=AWL,BAYES_00,RDNS_DYNAMIC autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on gizmo.2hip.net Cc: freebsd-stable@freebsd.org Subject: Re: Xorg hangs with drmwtq in 7.2-RELEASE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 15:18:24 -0000 --=-6z9lvAbgfNdiPT5GLBTo Content-Type: text/plain Content-Transfer-Encoding: quoted-printable On Sat, 2009-05-09 at 18:41 -0700, David Johnson wrote: > On Friday 08 May 2009 03:31:04 pm Robert Noland wrote: > > In order to guess what might be causing this, drm debugging needs to be > > enabled before the hang, so that we can hopefully figure out what leads > > up to the hung GPU. >=20 > I'm not able to do that, but I did manage to get debug turned on and dmes= g > captured early enough to catch some additional information. I've place th= e > full file online at http://www.usermode.org/misc/dmesg.txt, but am includ= ing > some snippets here. Hopefully this is enough to move forward. >=20 > --=20 > David Johnson This trace still looks odd... > ... > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0xc0286429, nr=3D0x29, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:radeon_freelist_get] done_age =3D 102778 Things appear to be working at this point. > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0xc010644d, nr=3D0x4d, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:radeon_cp_indirect] idx=3D27 s=3D0 e=3D88 d=3D1 > [drm:pid1822:radeon_cp_dispatch_indirect] buf=3D27 s=3D0x0 e=3D0x58 Now, open count is 2 and something is calling close. > [drm:pid1822:drm_close] open_count =3D 2 > [drm:pid1822:drm_close] pid =3D 1822, device =3D 0xc615fa00, open_count = =3D 2 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0x80086442, nr=3D0x42, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:radeon_cp_stop]=20 > [drm:pid1822:radeon_do_cp_flush]=20 > [drm:pid1822:radeon_do_cp_idle]=20 > [drm:pid1822:radeon_do_cp_stop]=20 > [drm:pid1822:radeon_do_engine_reset]=20 > info: [drm] Num pipes: 1 > [drm:pid1822:radeon_do_cp_reset]=20 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0x800c6459, nr=3D0x59, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0x80086414, nr=3D0x14, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:drm_irq_uninstall] irq=3D16 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0x80546440, nr=3D0x40, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:radeon_do_cleanup_cp]=20 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0x80086439, nr=3D0x39, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:drm_sg_free] sg free virtual =3D 0xe8a64000 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0x8004667e, nr=3D0x7e, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0x8004667d, nr=3D0x7d, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0xc0086421, nr=3D0x21, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:drm_rmctx] 2 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0xc0086421, nr=3D0x21, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:drm_rmctx] 1 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0xc0086426, nr=3D0x26, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0xc0086426, nr=3D0x26, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:drm_ioctl] pid=3D1822, cmd=3D0x8008642b, nr=3D0x2b, dev 0xc6= 15fa00, auth=3D1 > [drm:pid1822:drm_unlock] 1 (pid 1822) requests unlock (0x80000001), flags= =3D 0x00000000 Another close, followed by lastclose, so drm is fully shutdown. > [drm:pid1822:drm_close] open_count =3D 1 > [drm:pid1822:drm_close] pid =3D 1822, device =3D 0xc615fa00, open_count = =3D 1 > [drm:pid1822:drm_lastclose]=20 > [drm:pid1822:radeon_do_cleanup_cp]=20 Now, this looks like several vt switches... We don't see the open sequence here, so I assume that debugging was disabled at this point. > info: [drm] Setting GART location based on new memory map > info: [drm] Loading R500 Microcode > info: [drm] Num pipes: 1 > info: [drm] writeback test succeeded in 1 usecs > drm0: [ITHREAD] > info: [drm] Num pipes: 1 > info: [drm] Setting GART location based on new memory map > info: [drm] Loading R500 Microcode > info: [drm] Num pipes: 1 > info: [drm] writeback test succeeded in 1 usecs > drm0: [ITHREAD] > info: [drm] Num pipes: 1 > info: [drm] Setting GART location based on new memory map > info: [drm] Loading R500 Microcode > info: [drm] Num pipes: 1 > info: [drm] writeback test succeeded in 1 usecs > drm0: [ITHREAD] > info: [drm] Num pipes: 1 > info: [drm] Setting GART location based on new memory map > info: [drm] Loading R500 Microcode > info: [drm] Num pipes: 1 > info: [drm] writeback test succeeded in 1 usecs > drm0: [ITHREAD] > info: [drm] Num pipes: 1 > info: [drm] Setting GART location based on new memory map > info: [drm] Loading R500 Microcode > info: [drm] Num pipes: 1 > info: [drm] writeback test succeeded in 1 usecs > drm0: [ITHREAD] > info: [drm] Num pipes: 1 > info: [drm] Setting GART location based on new memory map > info: [drm] Loading R500 Microcode > info: [drm] Num pipes: 1 > info: [drm] writeback test succeeded in 1 usecs > drm0: [ITHREAD] > info: [drm] Num pipes: 1 > info: [drm] Setting GART location based on new memory map > info: [drm] Loading R500 Microcode > info: [drm] Num pipes: 1 > info: [drm] writeback test succeeded in 1 usecs > drm0: [ITHREAD] and here debugging was re-enabled after the problem has occurred. > [drm:pid6216:drm_ioctl] returning 4 > [drm:pid6216:drm_ioctl] pid=3D6216, cmd=3D0x80046457, nr=3D0x57, dev 0xc6= 15fa00, auth=3D1 > [drm:pid6216:drm_ioctl] returning 4 > [drm:pid6216:drm_ioctl] pid=3D6216, cmd=3D0x80046457, nr=3D0x57, dev 0xc6= 15fa00, auth=3D1 > [drm:pid6216:drm_ioctl] returning 4 > [drm:pid6216:drm_ioctl] pid=3D6216, cmd=3D0x80046457, nr=3D0x57, dev 0xc6= 15fa00, auth=3D1 > [drm:pid6216:drm_ioctl] returning 4 robert. > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" --=20 Robert Noland FreeBSD --=-6z9lvAbgfNdiPT5GLBTo Content-Type: application/pgp-signature; name="signature.asc" Content-Description: This is a digitally signed message part -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.11 (FreeBSD) iEYEABECAAYFAkoJkx8ACgkQM4TrQ4qfRONigACeOD1lijq1WRN8PGkOVd2+SGEt Hd4AnAk0KnLjTJeNbBSxMZWIbwueUyAs =HUq7 -----END PGP SIGNATURE----- --=-6z9lvAbgfNdiPT5GLBTo-- From owner-freebsd-stable@FreeBSD.ORG Tue May 12 15:20:17 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 122A310656C4; Tue, 12 May 2009 15:20:17 +0000 (UTC) (envelope-from riccardo.torrini@esaote.com) Received: from gw-fi.esaote.com (gw-fi.esaote.com [85.18.189.242]) by mx1.freebsd.org (Postfix) with ESMTP id 8B8F88FC08; Tue, 12 May 2009 15:20:16 +0000 (UTC) (envelope-from riccardo.torrini@esaote.com) Received: from tiger.fi.esaote.it (tiger.fi.esaote.it [192.168.6.66]) by gw-fi.esaote.com (8.14.3/8.14.3) with ESMTP id n4CFKFqU010459; Tue, 12 May 2009 17:20:15 +0200 (CEST) (envelope-from riccardo.torrini@esaote.com) Received: from tiger.fi.esaote.it (localhost [127.0.0.1]) by tiger.fi.esaote.it (Postfix) with ESMTP id DB5181CC9A; Tue, 12 May 2009 17:20:14 +0200 (CEST) Received: by tiger.fi.esaote.it (Postfix, from userid 201) id BBDCE1CC99; Tue, 12 May 2009 17:20:14 +0200 (CEST) Date: Tue, 12 May 2009 17:20:14 +0200 From: Riccardo Torrini To: John Baldwin Message-ID: <20090512152014.GN21112@tiger.fi.esaote.it> References: <20090507155012.GW21112@tiger.fi.esaote.it> <200905110953.21686.jhb@freebsd.org> <20090511165522.GG21112@tiger.fi.esaote.it> <200905111407.20195.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <200905111407.20195.jhb@freebsd.org> User-Agent: Mutt/1.5.19 (2009-01-05) X-AV-Checked: ClamAV using ClamSMTP Cc: scottl@freebsd.org, siedar@nplay.pl, freebsd-stable@freebsd.org, Riccardo Torrini Subject: Re: kern/130330: [mpt] [panic] Panic and reboot machine MPT ... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 15:20:17 -0000 On Mon, May 11, 2009 at 02:07:19PM -0400, John Baldwin wrote: > Do you have kernel crashdumps enabled and a swap partition? > If so, do you happen to have any files in /var/crash? Yes, but I'm unable to produce a crash dump :-( Tryed even with voodoo, added and removed options to kernel (kdb, gdb, ddb, invariants, ...). Instead of going to db> now it panic-and-freeze with: cpuid = 0 Uptime: 2m16s panic: _mtx_lock_sleep: recursed on non-recursive mutex \ mpt @ /usr/src/sys/cam/cam_periph.h:182 (above lines get repeated a lot with same uptime, then freeze) Still trying other combinations... -- Riccardo. From owner-freebsd-stable@FreeBSD.ORG Tue May 12 15:48:12 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 4D5C3106564A for ; Tue, 12 May 2009 15:48:12 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 202FF8FC19 for ; Tue, 12 May 2009 15:48:12 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id C953946B35; Tue, 12 May 2009 11:48:11 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 862598A026; Tue, 12 May 2009 11:48:10 -0400 (EDT) From: John Baldwin To: pluknet Date: Tue, 12 May 2009 10:14:55 -0400 User-Agent: KMail/1.9.7 References: <200905110949.31142.jhb@freebsd.org> In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905121014.55450.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Tue, 12 May 2009 11:48:10 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-stable@freebsd.org Subject: Re: lock up in 6.2 (procs massively stuck in Giant) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 15:48:12 -0000 On Tuesday 12 May 2009 2:12:27 am pluknet wrote: > 2009/5/11 John Baldwin : > > On Monday 04 May 2009 11:41:35 pm pluknet wrote: > >> 2009/5/1 John Baldwin : > >> > On Thursday 30 April 2009 2:36:34 am pluknet wrote: > >> >> Hi folks. > >> >> > >> >> Today I got a new locking issue. > >> >> This is the first time I got it, and it's merely reproduced. > >> >> > >> >> The box has lost both remote connection and local access. > >> >> No SIGINFO output on the local console even. > >> >> Jumping in ddb> shows the next: > >> >> > >> >> 1) first, this is a 8-way web server. No processes on runqueue except one > >> > httpd > >> >> (i.e. ps shows R in its state): > >> > > >> > You need to find who owns Giant and what that thread is doing. You can > > try > >> > using 'show lock Giant' as well as 'show lockchain 11568'. > >> > > >> > >> Hi, John! > >> > >> Just reproduced now on another box. > >> Hmm.. stack of the process owing Giant looks garbled. > >> > >> db> show lock Giant > >> class: sleep mutex > >> name: Giant > >> flags: {DEF, RECURSE} > >> state: {OWNED, CONTESTED} > >> owner: 0xd0d79320 (tid 102754, pid 34594, "httpd") > >> > >> db> show lockchain 34594 > >> thread 102754 (pid 34594, httpd) running on CPU 7 > >> db> show lockchain 102754 > >> thread 102754 (pid 34594, httpd) running on CPU 7 > > > > The thread is running, so we don't know what it's top of stack is and you > > can't a good stack trace in that case. > > > > None of your CPUs are idle, so I don't think you have any sort of deadlock. > > You might have a livelock. > > > > -- > > John Baldwin > > > > I'm curious if it could be caused by heavy load. > I don't know what it might be definitely, > as it's non-trivial for me to determine the reason > of a livelock, and to debug it. > > So I think it may have sense to try 7.x, as there > has been done much locking work. It may be worth trying 7. Also, what is the state of the 'swi7: clock' process? -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Tue May 12 15:48:18 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 539341065698; Tue, 12 May 2009 15:48:18 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 268298FC0A; Tue, 12 May 2009 15:48:18 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id CE2BB46B03; Tue, 12 May 2009 11:48:17 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 94E098A028; Tue, 12 May 2009 11:48:16 -0400 (EDT) From: John Baldwin To: Riccardo Torrini Date: Tue, 12 May 2009 11:44:20 -0400 User-Agent: KMail/1.9.7 References: <20090507155012.GW21112@tiger.fi.esaote.it> <200905111407.20195.jhb@freebsd.org> <20090512152014.GN21112@tiger.fi.esaote.it> In-Reply-To: <20090512152014.GN21112@tiger.fi.esaote.it> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905121144.21406.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Tue, 12 May 2009 11:48:16 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: scottl@freebsd.org, siedar@nplay.pl, freebsd-stable@freebsd.org Subject: Re: kern/130330: [mpt] [panic] Panic and reboot machine MPT ... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 15:48:18 -0000 On Tuesday 12 May 2009 11:20:14 am Riccardo Torrini wrote: > On Mon, May 11, 2009 at 02:07:19PM -0400, John Baldwin wrote: > > > Do you have kernel crashdumps enabled and a swap partition? > > If so, do you happen to have any files in /var/crash? > > Yes, but I'm unable to produce a crash dump :-( > Tryed even with voodoo, added and removed options to > kernel (kdb, gdb, ddb, invariants, ...). Instead of > going to db> now it panic-and-freeze with: > > cpuid = 0 > Uptime: 2m16s > panic: _mtx_lock_sleep: recursed on non-recursive mutex \ > mpt @ /usr/src/sys/cam/cam_periph.h:182 > > (above lines get repeated a lot with same uptime, then freeze) > > > Still trying other combinations... If you can get a stack trace, that would be most helpful. My guess is that the recovery thread is holding the mpt lock and calling some CAM routine which attempts to relock it via cam_periph_lock(). A stack trace would be most telling in that case. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Tue May 12 16:02:00 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 598891065673 for ; Tue, 12 May 2009 16:02:00 +0000 (UTC) (envelope-from dudu.meyer@gmail.com) Received: from qw-out-2122.google.com (qw-out-2122.google.com [74.125.92.25]) by mx1.freebsd.org (Postfix) with ESMTP id 176788FC17 for ; Tue, 12 May 2009 16:01:59 +0000 (UTC) (envelope-from dudu.meyer@gmail.com) Received: by qw-out-2122.google.com with SMTP id 3so44907qwe.7 for ; Tue, 12 May 2009 09:01:59 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=QD3VgrH2z+9qvZd93ixPgr1Ui3b2cwXdsJWXUfWGpwE=; b=QkR2Zh1alN1EE5YjSAmPxC7F/o77EpUQAXuwksT3x4mc2GmePY2AqwnuxoVSUNJYVU oBba/DchG4NOkJyKS8FB16peg9k928/j7mrl+xUUksAGacTOSU8B7LcVycj/URu1mRVT ny1NgTVapRAYwlTQdTf4p+IqIf+ayCXCvRZx0= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=mOfz4/AFf6C1SyLxVoTG2WoUGKNITrNjEApBcyaNQC7hnMXRoHvCPf7LJMjJMlY7vs Ml2z59jTHJ+ZnqsBm2dJhj9x8gNH9U/Wv9Phkpj1uXgS0VoILqsf2v9xKSEXMInXbmsV KA1s9sDvSjGqufFljMLmHRVX6XLIwtclF0VkA= MIME-Version: 1.0 Received: by 10.229.110.20 with SMTP id l20mr1657044qcp.60.1242144119390; Tue, 12 May 2009 09:01:59 -0700 (PDT) In-Reply-To: <20090509061229.GA63615@walton.maths.tcd.ie> References: <20090509061229.GA63615@walton.maths.tcd.ie> Date: Tue, 12 May 2009 13:01:55 -0300 Message-ID: From: Eduardo Meyer To: David Malone Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: stable@freebsd.org Subject: Re: "maxproc limit exceeded" making no sense X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 16:02:00 -0000 On Sat, May 9, 2009 at 3:12 AM, David Malone wrote: > On Fri, May 08, 2009 at 10:51:02AM -0300, Eduardo Meyer wrote: >> However what I see regarding proc usage is by uid 82 is: >> >> # ps -U 82 | wc -l >> 723 >> >> Proccess count for UID 82 is never highter than 913 (monitored the >> last whole hour, while log messages were still showing, complaining >> about maxproc limit beeing exceeded). > > I guess user 82 is exceeding their per-user process limit. This is set > (traditionally) using the limit or ulimit shell builtins, but can also > be configured in /etc/login.conf or by certain pam modules. I'd start > with login.conf. Hello, This user is classess, therefore its on default class on login.conf, and all limits there are "unlimited". > > David. > -- =========== Eduardo Meyer pessoal: dudu.meyer@gmail.com profissional: ddm.farmaciap@saude.gov.br From owner-freebsd-stable@FreeBSD.ORG Tue May 12 16:04:23 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 2CF1E106564A for ; Tue, 12 May 2009 16:04:23 +0000 (UTC) (envelope-from dudu.meyer@gmail.com) Received: from mail-qy0-f173.google.com (mail-qy0-f173.google.com [209.85.221.173]) by mx1.freebsd.org (Postfix) with ESMTP id 93C2D8FC18 for ; Tue, 12 May 2009 16:04:21 +0000 (UTC) (envelope-from dudu.meyer@gmail.com) Received: by qyk3 with SMTP id 3so123151qyk.3 for ; Tue, 12 May 2009 09:04:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=QD3VgrH2z+9qvZd93ixPgr1Ui3b2cwXdsJWXUfWGpwE=; b=qdbnjs2gouWvDDeiKbYS3jrc9n0KyHQn/2bpNxzPHo/mSt7ddMseaLfgcCwYvnl/3w kJegipF4KGqszYb0gl2rk6FKFlMIs+LuILIyZQWRJJ1NGyBQA7e5dZjg6gev7WNC0D4V /IG2arwyy+3DL+4VI1l3RIPEZOjSvVhKplxk0= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=KkfTrzROxz9tPNXZKSQr0QVI41d+cmfzK2BwKhPiJACkDBmM/X2PA3R2RNoFVsjSMe z27xwnm2bbL7ZqsY5xKxPgWUhN1WyTR14F8Ljcb10YWvTZyC1nF/UNpmYVT/Ao6d1aa6 A5edie+U6JcBBg3DtHdgMCIFClk448qSWHY0c= MIME-Version: 1.0 Received: by 10.229.110.20 with SMTP id l20mr1660583qcp.60.1242144260896; Tue, 12 May 2009 09:04:20 -0700 (PDT) In-Reply-To: <20090509061229.GA63615@walton.maths.tcd.ie> References: <20090509061229.GA63615@walton.maths.tcd.ie> Date: Tue, 12 May 2009 13:04:19 -0300 Message-ID: From: Eduardo Meyer To: David Malone Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: stable@freebsd.org Subject: Re: "maxproc limit exceeded" making no sense X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 16:04:23 -0000 On Sat, May 9, 2009 at 3:12 AM, David Malone wrote: > On Fri, May 08, 2009 at 10:51:02AM -0300, Eduardo Meyer wrote: >> However what I see regarding proc usage is by uid 82 is: >> >> # ps -U 82 | wc -l >> 723 >> >> Proccess count for UID 82 is never highter than 913 (monitored the >> last whole hour, while log messages were still showing, complaining >> about maxproc limit beeing exceeded). > > I guess user 82 is exceeding their per-user process limit. This is set > (traditionally) using the limit or ulimit shell builtins, but can also > be configured in /etc/login.conf or by certain pam modules. I'd start > with login.conf. Hello, This user is classess, therefore its on default class on login.conf, and all limits there are "unlimited". > > David. > -- =========== Eduardo Meyer pessoal: dudu.meyer@gmail.com profissional: ddm.farmaciap@saude.gov.br From owner-freebsd-stable@FreeBSD.ORG Tue May 12 16:10:27 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id AB1941065676; Tue, 12 May 2009 16:10:27 +0000 (UTC) (envelope-from riccardo.torrini@esaote.com) Received: from gw-fi.esaote.com (gw-fi.esaote.com [85.18.189.242]) by mx1.freebsd.org (Postfix) with ESMTP id 2C0D48FC12; Tue, 12 May 2009 16:10:26 +0000 (UTC) (envelope-from riccardo.torrini@esaote.com) Received: from tiger.fi.esaote.it (tiger.fi.esaote.it [192.168.6.66]) by gw-fi.esaote.com (8.14.3/8.14.3) with ESMTP id n4CGAP71014070; Tue, 12 May 2009 18:10:25 +0200 (CEST) (envelope-from riccardo.torrini@esaote.com) Received: from tiger.fi.esaote.it (localhost [127.0.0.1]) by tiger.fi.esaote.it (Postfix) with ESMTP id 822CC1CC9A; Tue, 12 May 2009 18:10:25 +0200 (CEST) Received: by tiger.fi.esaote.it (Postfix, from userid 201) id 65D3D1CC99; Tue, 12 May 2009 18:10:25 +0200 (CEST) Date: Tue, 12 May 2009 18:10:25 +0200 From: Riccardo Torrini To: John Baldwin Message-ID: <20090512161025.GO21112@tiger.fi.esaote.it> References: <20090507155012.GW21112@tiger.fi.esaote.it> <200905111407.20195.jhb@freebsd.org> <20090512152014.GN21112@tiger.fi.esaote.it> <200905121144.21406.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <200905121144.21406.jhb@freebsd.org> User-Agent: Mutt/1.5.19 (2009-01-05) X-AV-Checked: ClamAV using ClamSMTP Cc: scottl@freebsd.org, siedar@nplay.pl, freebsd-stable@freebsd.org, Riccardo Torrini Subject: Re: kern/130330: [mpt] [panic] Panic and reboot machine MPT ... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 16:10:28 -0000 On Tue, May 12, 2009 at 11:44:20AM -0400, John Baldwin wrote: > If you can get a stack trace, that would be most helpful. > My guess is that the recovery thread is holding the mpt lock > and calling some CAM routine which attempts to relock it via > cam_periph_lock(). A stack trace would be most telling in > that case. Rebooted, inserted 2nd disk (copied by hand, sorry for delay) mpt0: External Bus Reset Detected mpt0:vol0(mpt:0:0:0): Phisycal Disk Status Changed mpt0:vol0(mpt:0:0:0): Phisycal Disk Status Changed (yes, two times) Kernel page fault with the following non-sleepable lock held: exclusive sleep mutex mpt r = 0 (0xc4001004) locked @ \ /usr/src/sys/cam/cam_xpt.c:7153 KBD: enter: witness_warn [ thread pid 19 tid 100018 ] Stopped at kdb_enter_why+0x3a: movl $0,kbd_why db> bt Tracing pid 19 tid 100018 td 0xc3fb8880 [...] --- trap 0xc, eip = 0xc0438f4e, esp = 0xc43b2b98, ebp = 0xc43b2bb0 --- xpt_done(c404f400,c0719000,5,5,0,...) at xpt_done+0x1b xpt_scan_bus(c3f39a80,c4045400,c06cfa7a,c072f824,c4011914,...) \ at xpt_scan_bus+0x39f camisr_runqueue(c4001004,0,c06cfa7a,1bf1,0,...) \ at camisr_runqueue+0x38a camisr(0,0,c06e99fb,4b6,c3f39a68,...) at camisr+0x10d ithread_loop() fork_exit() fork_trampoline() Still at db> prompt =) -- Riccardo. From owner-freebsd-stable@FreeBSD.ORG Tue May 12 18:26:50 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 60F361065670; Tue, 12 May 2009 18:26:50 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 317068FC29; Tue, 12 May 2009 18:26:49 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id AC3F646B5C; Tue, 12 May 2009 14:26:49 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 766B08A026; Tue, 12 May 2009 14:26:48 -0400 (EDT) From: John Baldwin To: Riccardo Torrini Date: Tue, 12 May 2009 14:26:43 -0400 User-Agent: KMail/1.9.7 References: <20090507155012.GW21112@tiger.fi.esaote.it> <200905121144.21406.jhb@freebsd.org> <20090512161025.GO21112@tiger.fi.esaote.it> In-Reply-To: <20090512161025.GO21112@tiger.fi.esaote.it> MIME-Version: 1.0 Content-Disposition: inline Message-Id: <200905121426.43467.jhb@freebsd.org> Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Tue, 12 May 2009 14:26:48 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: scottl@freebsd.org, siedar@nplay.pl, freebsd-stable@freebsd.org Subject: Re: kern/130330: [mpt] [panic] Panic and reboot machine MPT ... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 18:26:51 -0000 On Tuesday 12 May 2009 12:10:25 pm Riccardo Torrini wrote: > On Tue, May 12, 2009 at 11:44:20AM -0400, John Baldwin wrote: > > > If you can get a stack trace, that would be most helpful. > > My guess is that the recovery thread is holding the mpt lock > > and calling some CAM routine which attempts to relock it via > > cam_periph_lock(). A stack trace would be most telling in > > that case. > > Rebooted, inserted 2nd disk (copied by hand, sorry for delay) > > mpt0: External Bus Reset Detected > mpt0:vol0(mpt:0:0:0): Phisycal Disk Status Changed > mpt0:vol0(mpt:0:0:0): Phisycal Disk Status Changed (yes, two times) > Kernel page fault with the following non-sleepable lock held: > exclusive sleep mutex mpt r = 0 (0xc4001004) locked @ \ > /usr/src/sys/cam/cam_xpt.c:7153 > KBD: enter: witness_warn > [ thread pid 19 tid 100018 ] > Stopped at kdb_enter_why+0x3a: movl $0,kbd_why > > db> bt > Tracing pid 19 tid 100018 td 0xc3fb8880 > [...] > --- trap 0xc, eip = 0xc0438f4e, esp = 0xc43b2b98, ebp = 0xc43b2bb0 --- > xpt_done(c404f400,c0719000,5,5,0,...) at xpt_done+0x1b > xpt_scan_bus(c3f39a80,c4045400,c06cfa7a,c072f824,c4011914,...) \ > at xpt_scan_bus+0x39f > camisr_runqueue(c4001004,0,c06cfa7a,1bf1,0,...) \ > at camisr_runqueue+0x38a > camisr(0,0,c06e99fb,4b6,c3f39a68,...) at camisr+0x10d > ithread_loop() > fork_exit() > fork_trampoline() > > > Still at db> prompt =) Hmm, this is a different panic. :( You could perhaps try bzero()'ing the ccb before calling xpt_setup_ccb() in mpt_raid_thread() but the old code didn't do that either (it just used M_WAITOK w/o M_ZERO). -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Tue May 12 18:35:03 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id A7B5A106566B for ; Tue, 12 May 2009 18:35:03 +0000 (UTC) (envelope-from freebsd-stable@m.gmane.org) Received: from ciao.gmane.org (main.gmane.org [80.91.229.2]) by mx1.freebsd.org (Postfix) with ESMTP id 63A2C8FC17 for ; Tue, 12 May 2009 18:35:02 +0000 (UTC) (envelope-from freebsd-stable@m.gmane.org) Received: from root by ciao.gmane.org with local (Exim 4.43) id 1M3wog-00059E-HC for freebsd-stable@freebsd.org; Tue, 12 May 2009 18:35:02 +0000 Received: from cpe-65-189-186-49.columbus.res.rr.com ([65.189.186.49]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 12 May 2009 18:35:02 +0000 Received: from dsamms by cpe-65-189-186-49.columbus.res.rr.com with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 12 May 2009 18:35:02 +0000 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-stable@freebsd.org From: David Samms Date: Tue, 12 May 2009 13:41:22 -0400 Lines: 30 Message-ID: Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: cpe-65-189-186-49.columbus.res.rr.com User-Agent: Thunderbird 2.0.0.21 (X11/20090429) Sender: news Subject: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 18:35:03 -0000 After upgrading to 7.2 (amd64) some customers complained of very poor bandwidth. Upon investigation all the effected customers were ATT DSL clients located all over the USA, not in a single city, nor were other ISPs effected. The server is a Supermicro with dual (quad core) processors with a single Intel fxp network card on a 100mbit connection. Kernel is GENERIC for both 7.1_release and 7.2_release. Normally a client can max out their download connection, but for ATT DSL customers the transfer rate would be about 5-10KB/s even though the server and client where both idle. Repeated tests were done, from multiple clients in different geographical locations. The problem manifested itself regardless of whether ftp, http, smtp, pop, or scp was used, and regardless of the OS of the client. Believing it to be a routing issue we changed the route and even changed the local router the server is connected to so that a different NIC port would be used to talk to ATT DSL customers, but no change in performance. Turns out it is somehow related to differences in FreeBSD 7.1 and 7.2. If I boot the same server with 7.1, all clients work as you would expect. But, if 7.2 is used all clients with the exception of ATT DSL clients would work normally, ATT customers would be limited to 5-10KB/s. I have no reason to believe there is anything wrong with the ATT DSL network, it just happen to be effected by whatever causes the problem. Any theories? A special thanks to cybercon.com tech support for being so helpful. If you need a data center, they have good tech support. From owner-freebsd-stable@FreeBSD.ORG Tue May 12 18:40:05 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 60E561065672 for ; Tue, 12 May 2009 18:40:05 +0000 (UTC) (envelope-from dungeons@gmail.com) Received: from rv-out-0506.google.com (rv-out-0506.google.com [209.85.198.226]) by mx1.freebsd.org (Postfix) with ESMTP id 36BF18FC1D for ; Tue, 12 May 2009 18:40:05 +0000 (UTC) (envelope-from dungeons@gmail.com) Received: by rv-out-0506.google.com with SMTP id k40so118088rvb.43 for ; Tue, 12 May 2009 11:40:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:date:message-id:subject :from:to:content-type; bh=OP56wEOFDjzh+GCE9PlEAf1M1ytsl5z3J7EIlQc+vSo=; b=oCzai+5mVE2ywze+aqsT2Jp+obvByGcptsVmMgC1gOujgVwNETYfzAq3FyW5l7J5jn AxKLBsU9yZI0R78+hIiMaE1fAARga/n+C2qPG64rqW3DnnEN5uZrgcuF5dVD3bpd9bkp VuEuEwE8LlkeC19Mw9gfQUxhDXHHCszu1ZYGY= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=HQ9zoOj8JJ9P4sVu/kS97+RxemeCaFxcMEhw8JHgddU4AsImdGMSP3JE0OLCdfsB+u ohIO5L1poWlu049Cc9ESlSWDpsZ04qt3V92SbzWQszrPHZFaGHrrUSM7NlaLRzxSCTO+ G/U6fh9z4p5dcou0iB3b0WYX5H25CG8tjCpDE= MIME-Version: 1.0 Received: by 10.140.132.4 with SMTP id f4mr1466894rvd.118.1242151857714; Tue, 12 May 2009 11:10:57 -0700 (PDT) Date: Tue, 12 May 2009 14:10:57 -0400 Message-ID: <2c2c47aa0905121110i6355930bwce3a9c6afb117d4d@mail.gmail.com> From: Pat Wendorf To: freebsd-stable@freebsd.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: File system corruption X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 18:40:06 -0000 I have a co-lo server I've been maintaining for a few years now running IDE drives on a mostly terrible UPS. A few months ago, when it returned from a power outage (running 6.2-R) I started noticing the following in my daily security email: Checking setuid files and devices: find: /var/db/portsnap/files/2dc95ddff37a8091239e83bf7e3ce5a2c285b027891ced1919d76c9947c5b7db.gz: Bad file descriptor find: /var/db/portsnap/files/52abe8c91385b12272f13f4d20896067d9ba70bdec1fa2575025858bd3e93718.gz: Bad file descriptor find: /var/lost+found/#238237: Bad file descriptor I verified that these files return the same result when trying to do any operation on them (including ls in the directory). I've managed to ignore the problem for a while now, and even upgraded to 7.2, but I'm not sure if it will cause problems later on. So the question is, without access to the console, how would I fix this? - Pat From owner-freebsd-stable@FreeBSD.ORG Tue May 12 18:52:38 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id A2E24106564A; Tue, 12 May 2009 18:52:38 +0000 (UTC) (envelope-from david@usermode.org) Received: from proxy.meer.net (proxy.meer.net [64.13.141.13]) by mx1.freebsd.org (Postfix) with ESMTP id 67BC38FC19; Tue, 12 May 2009 18:52:38 +0000 (UTC) (envelope-from david@usermode.org) Received: from mail.meer.net (mail.meer.net [64.13.141.3]) by proxy.meer.net (8.14.3/8.14.3) with ESMTP id n4CIqXWh021110; Tue, 12 May 2009 11:52:38 -0700 (PDT) (envelope-from david@usermode.org) Received: from pippin.localnet (netblock-66-245-217-53.dslextreme.com [66.245.217.53]) by mail.meer.net (8.13.3/8.13.3/meer) with ESMTP id n4CIqRPI050189; Tue, 12 May 2009 11:52:27 -0700 (PDT) (envelope-from david@usermode.org) From: David Johnson To: Robert Noland Date: Tue, 12 May 2009 11:52:29 -0700 User-Agent: KMail/1.11.3 (Linux/2.6.29-ARCH; KDE/4.2.3; i686; ; ) References: <200905042015.29394.david@usermode.org> <200905091841.26274.david@usermode.org> <1242141471.1755.11.camel@balrog.2hip.net> In-Reply-To: <1242141471.1755.11.camel@balrog.2hip.net> MIME-Version: 1.0 Content-Type: Text/Plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905121152.29572.david@usermode.org> X-Spam-Score: undef - spam scanning disabled X-CanIt-Geo: ip=64.13.141.3; country=US; region=CA; city=Mountain View; latitude=37.3974; longitude=-122.0732; metrocode=807; areacode=650; http://maps.google.com/maps?q=37.3974,-122.0732&z=6 X-CanItPRO-Stream: default X-Canit-Stats-ID: Bayes signature not available X-Scanned-By: CanIt (www . roaringpenguin . com) on 64.13.141.13 Cc: freebsd-stable@freebsd.org Subject: Re: Xorg hangs with drmwtq in 7.2-RELEASE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 18:52:38 -0000 On Tuesday 12 May 2009 08:17:51 am Robert Noland wrote: > On Sat, 2009-05-09 at 18:41 -0700, David Johnson wrote: > > On Friday 08 May 2009 03:31:04 pm Robert Noland wrote: > > > In order to guess what might be causing this, drm debugging needs to be > > > enabled before the hang, so that we can hopefully figure out what leads > > > up to the hung GPU. > > > > I'm not able to do that, but I did manage to get debug turned on and > > dmesg captured early enough to catch some additional information. I've > > place the full file online at http://www.usermode.org/misc/dmesg.txt, but > > am including some snippets here. Hopefully this is enough to move > > forward. > > > > -- > > David Johnson > > This trace still looks odd... This should have been a single trace, with debugging turned after X was hung. I turned debug on once, grabbed output of dmesg, then rebooted. 1) Run script to launch four windows in rapid succession. 2) Only two windows manage to make it up before X hangs. 3) Switch over to laptop, which is ssh'd into system 4) sysctl hw.drm.0.debug=1 5) dmesg > dmesg.txt 6) Done I may have made a mistake though, and briefly turned on debugging earlier in the session. I'll get another trace this evening when I have time, to double check. p.s. I've put 7.1-STABLE from March 13th on a different partition. I will add in commits until it breaks, to help narrow it down. I'm fairly sure it was something on the 15th or 16th. -- David Johnson From owner-freebsd-stable@FreeBSD.ORG Tue May 12 20:33:09 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id A53C3106564A for ; Tue, 12 May 2009 20:33:09 +0000 (UTC) (envelope-from freebsd-stable@m.gmane.org) Received: from ciao.gmane.org (main.gmane.org [80.91.229.2]) by mx1.freebsd.org (Postfix) with ESMTP id 4CDF38FC0A for ; Tue, 12 May 2009 20:33:08 +0000 (UTC) (envelope-from freebsd-stable@m.gmane.org) Received: from list by ciao.gmane.org with local (Exim 4.43) id 1M3yew-0002Kf-PD for freebsd-stable@freebsd.org; Tue, 12 May 2009 20:33:06 +0000 Received: from 200.41.broadband11.iol.cz ([90.178.41.200]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 12 May 2009 20:33:06 +0000 Received: from gamato by 200.41.broadband11.iol.cz with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 12 May 2009 20:33:06 +0000 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-stable@freebsd.org From: martinko Date: Tue, 12 May 2009 22:32:51 +0200 Lines: 28 Message-ID: <4A09DCF3.3010600@users.sf.net> References: <4A08A132.3070503@users.sf.net> <20090511224957.GB52703@dan.emsphone.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: 200.41.broadband11.iol.cz User-Agent: Mozilla/5.0 (X11; U; FreeBSD i386; en-US; rv:1.8.1.18) Gecko/20081125 SeaMonkey/1.1.13 In-Reply-To: <20090511224957.GB52703@dan.emsphone.com> Sender: news Cc: scottl@freebsd.org Subject: Re: run_interrupt_driven_hooks: still waiting after 300 seconds for xpt_config X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 20:33:09 -0000 Dan Nelson wrote: > In the last episode (May 12), martinko said: >> I've just tried 7.2-RELEASE (amd64) on Asus M3A78-EM motherboard and >> booting got stuck with the following: >> >> run_interrupt_driven_hooks: still waiting after 300 seconds for xpt_config >> >> From what I've found via Google it should be fixed already but apparently >> it is not. :-( >> >> Is there a way to work around this issue and successfully boot and install >> FreeBSD, please ? > > Do you have a connected firewire device? Try unplugging it during bootup, > or kldload the sbp module after bootup instead of via loader.conf. > This is booting on fresh new computer from DVD installation media. And nothing is attached (except USB keyboard and VGA monitor;)). Btw, I've just tried booting recent PC-BSD (7.1) from USB drive but it failed the same way, unfortunately. Any other ideas how to install FreeBSD on this system, please ? Cheers, Martin From owner-freebsd-stable@FreeBSD.ORG Tue May 12 20:42:55 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id DAE321065675 for ; Tue, 12 May 2009 20:42:55 +0000 (UTC) (envelope-from delphij@delphij.net) Received: from tarsier.delphij.net (delphij-pt.tunnel.tserv2.fmt.ipv6.he.net [IPv6:2001:470:1f03:2c9::2]) by mx1.freebsd.org (Postfix) with ESMTP id 7F0C68FC2A for ; Tue, 12 May 2009 20:42:55 +0000 (UTC) (envelope-from delphij@delphij.net) Received: from tarsier.geekcn.org (tarsier.geekcn.org [211.166.10.233]) (using TLSv1 with cipher ADH-CAMELLIA256-SHA (256/256 bits)) (No client certificate requested) by tarsier.delphij.net (Postfix) with ESMTPS id 7AB695C06E for ; Wed, 13 May 2009 04:42:54 +0800 (CST) Received: from localhost (tarsier.geekcn.org [211.166.10.233]) by tarsier.geekcn.org (Postfix) with ESMTP id 43DF255D166D; Wed, 13 May 2009 04:42:54 +0800 (CST) X-Virus-Scanned: amavisd-new at geekcn.org Received: from tarsier.geekcn.org ([211.166.10.233]) by localhost (mail.geekcn.org [211.166.10.233]) (amavisd-new, port 10024) with ESMTP id CYFSXK7mao3z; Wed, 13 May 2009 04:41:54 +0800 (CST) Received: from charlie.delphij.net (adsl-76-237-33-62.dsl.pltn13.sbcglobal.net [76.237.33.62]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by tarsier.geekcn.org (Postfix) with ESMTPSA id 6281F55D166B; Wed, 13 May 2009 04:41:42 +0800 (CST) DomainKey-Signature: a=rsa-sha1; s=default; d=delphij.net; c=nofws; q=dns; h=message-id:date:from:reply-to:organization:user-agent: mime-version:to:cc:subject:references:in-reply-to: x-enigmail-version:openpgp:content-type:content-transfer-encoding; b=uVxYgqdGXDyYcKgE+zq6OUFftE5+XEGuXLY7llFNKj1tTD1FObh3SQbg7YOLeHtiK e0w52qQfFU4GUYSJsM2tg== Message-ID: <4A09DEF1.2010202@delphij.net> Date: Tue, 12 May 2009 13:41:21 -0700 From: Xin LI Organization: The FreeBSD Project User-Agent: Thunderbird 2.0.0.21 (X11/20090408) MIME-Version: 1.0 To: David Samms References: In-Reply-To: X-Enigmail-Version: 0.95.7 OpenPGP: id=18EDEBA0; url=http://www.delphij.net/delphij.asc Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: d@delphij.net List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 20:42:56 -0000 -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hi David, David Samms wrote: > After upgrading to 7.2 (amd64) some customers complained of very poor > bandwidth. Upon investigation all the effected customers were ATT DSL > clients located all over the USA, not in a single city, nor were other > ISPs effected. The server is a Supermicro with dual (quad core) > processors with a single Intel fxp network card on a 100mbit connection. Could you please try if this would help: sysctl net.inet.tcp.tso=0 Cheers, - -- Xin LI http://www.delphij.net/ FreeBSD - The Power to Serve! -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.11 (FreeBSD) iEYEARECAAYFAkoJ3vAACgkQi+vbBBjt66CKqQCgwPkg1IZnI61Q1+PWfr5sOvVm n5IAnAzbI5HQXQqyPg+DmzHvCNhzhelI =oHGO -----END PGP SIGNATURE----- From owner-freebsd-stable@FreeBSD.ORG Tue May 12 20:59:21 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id AE0B71065677; Tue, 12 May 2009 20:59:21 +0000 (UTC) (envelope-from pluknet@gmail.com) Received: from mail-bw0-f213.google.com (mail-bw0-f213.google.com [209.85.218.213]) by mx1.freebsd.org (Postfix) with ESMTP id EC4A18FC13; Tue, 12 May 2009 20:59:20 +0000 (UTC) (envelope-from pluknet@gmail.com) Received: by bwz9 with SMTP id 9so231322bwz.43 for ; Tue, 12 May 2009 13:59:20 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=bCuZ4DDsgm3JNf7gzyYUackLDJRRYK4qr/d+JCVSnOU=; b=rsQnBhD8PStqUHaiSvCgRTfrpa7nmmosbZC6dKjBXR4cfmQgiDR3VAfzvUkR6lneHI 6YwLDMyZ6JNP24g/XeUhRgFbYeIFuCLhWjUJAztRp8wi9LnS70jRDO7QZtHoWzXQqpw0 TbTHcu7rYB02+vqZjiVNUu9UtAYWSvA7uCUro= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=JDcIlr9a/b6Rrg6IMcoQzMz1zO9rtjLIzdRIVbL2bspb5LsZOScXS283OPTf1CXYKt SChw3F2LwTbmpE2D7QiyE44pd9pjEF6o/E76/hLuBeqx2I/VtNYc2HAD7te5zXhkGR52 q0h1Y0PFB2Kba23Pwd8SMT7DFIKQX7a/JkbrU= MIME-Version: 1.0 Received: by 10.103.219.17 with SMTP id w17mr71985muq.122.1242161959838; Tue, 12 May 2009 13:59:19 -0700 (PDT) In-Reply-To: <200905121014.55450.jhb@freebsd.org> References: <200905110949.31142.jhb@freebsd.org> <200905121014.55450.jhb@freebsd.org> Date: Wed, 13 May 2009 00:59:19 +0400 Message-ID: From: pluknet To: John Baldwin Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: lock up in 6.2 (procs massively stuck in Giant) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 20:59:22 -0000 2009/5/12 John Baldwin : > On Tuesday 12 May 2009 2:12:27 am pluknet wrote: >> 2009/5/11 John Baldwin : >> > On Monday 04 May 2009 11:41:35 pm pluknet wrote: >> >> 2009/5/1 John Baldwin : >> >> > On Thursday 30 April 2009 2:36:34 am pluknet wrote: >> >> >> Hi folks. >> >> >> >> >> >> Today I got a new locking issue. >> >> >> This is the first time I got it, and it's merely reproduced. >> >> >> >> >> >> The box has lost both remote connection and local access. >> >> >> No SIGINFO output on the local console even. >> >> >> Jumping in ddb> shows the next: >> >> >> >> >> >> 1) first, this is a 8-way web server. No processes on runqueue except > one >> >> > httpd >> >> >> (i.e. ps shows R in its state): >> >> > >> >> > You need to find who owns Giant and what that thread is doing. You can >> > try >> >> > using 'show lock Giant' as well as 'show lockchain 11568'. >> >> > >> >> >> >> Hi, John! >> >> >> >> Just reproduced now on another box. >> >> Hmm.. stack of the process owing Giant looks garbled. >> >> >> >> db> show lock Giant >> >> class: sleep mutex >> >> name: Giant >> >> flags: {DEF, RECURSE} >> >> state: {OWNED, CONTESTED} >> >> owner: 0xd0d79320 (tid 102754, pid 34594, "httpd") >> >> >> >> db> show lockchain 34594 >> >> thread 102754 (pid 34594, httpd) running on CPU 7 >> >> db> show lockchain 102754 >> >> thread 102754 (pid 34594, httpd) running on CPU 7 >> > >> > The thread is running, so we don't know what it's top of stack is and you >> > can't a good stack trace in that case. >> > >> > None of your CPUs are idle, so I don't think you have any sort of > deadlock. >> > You might have a livelock. >> > >> > -- >> > John Baldwin >> > >> >> I'm curious if it could be caused by heavy load. >> I don't know what it might be definitely, >> as it's non-trivial for me to determine the reason >> of a livelock, and to debug it. >> >> So I think it may have sense to try 7.x, as there >> has been done much locking work. > > It may be worth trying 7. Also, what is the state of the 'swi7: clock' > process? > > -- > John Baldwin > Hi. >From just another box (not from the first two mentioned earlier) with a similar locking issue. If it would make sense, since there are possibly a bit different conditions. clock proc here is on swi4, I hope it's a non-important difference. 18 0 0 0 LL *Giant 0xd0a6b140 [swi4: clock sio] db> bt 18 Tracing pid 18 tid 100015 td 0xc7cfec80 sched_switch(c7cfec80,0,1) at sched_switch+0x143 mi_switch(1,0) at mi_switch+0x1ba turnstile_wait(c0a06c60,cb77ee10) at turnstile_wait+0x2f7 _mtx_lock_sleep(c0a06c60,c7cfec80,0,0,0) at _mtx_lock_sleep+0xfc softclock(0) at softclock+0x231 ithread_execute_handlers(c7d07218,c7d4a100) at ithread_execute_handlers+0x125 ithread_loop(c7cb69f0,e6892d38) at ithread_loop+0x55 fork_exit(c066d3e4,c7cb69f0,e6892d38) at fork_exit+0x71 fork_trampoline() at fork_trampoline+0x8 --- trap 0x1, eip = 0, esp = 0xe6892d6c, ebp = 0 --- db> show lock Giant class: sleep mutex name: Giant flags: {DEF, RECURSE} state: {OWNED, CONTESTED} owner: 0xcb77ee10 (tid 101174, pid 8611, "httpd") db> show lockchain 101174 thread 101174 (pid 8611, httpd) running on CPU 4 db> bt 101174 Tracing pid 8611 tid 101174 td 0xcb77ee10 sched_switch(cb77ee10,c7f3de10,6) at sched_switch+0x143 mi_switch(ca6d82e8,6,c0a0baf0,ca6d82e8,c0a0a0b0,...) at mi_switch kseq_move(c0a0baf0,6) at kseq_move+0xc1 sched_balance_pair(ef879bb0,ef879bb0,c08a2adf,cb77ef68,cb77b360,. lance_pair+0x91 sched_lock(0,cbd1f658,0,cb77b36c,0,...) at sched_lock _end(cb77b360,cb77b364,cb77ee10,cb77ee18,0,...) at 0xcb77b360 _end(d0a49a80,d0a49a84,c84cf7d0,c84cf7d8,0,...) at 0xc7f97648 _end(ca6dbcc0,ca6dbcc4,ca6d54b0,ca6d54b8,0,...) at 0xcbd1f648 _end(cbcad780,cbcad784,cc8a2190,cc8a2198,0,...) at 0xc8514430 _end(cab883c0,cab883c4,ca9417d0,ca9417d8,0,...) at 0xca6dc000 _end(cc67c4e0,cc67c4e4,cd6fd000,cd6fd008,0,...) at 0xcc8abc90 _end(cd3a9120,cd3a9124,cd3b1320,cd3b1328,0,...) at 0xcad68218 _end(cd130c60,cd130c64,d00ca320,d00ca328,0,...) at 0xca71e860 _end(cbcac240,cbcac244,cbf6e4b0,cbf6e4b8,0,...) at 0xcd472a78 _end(cb73c960,cb73c964,cb4f44b0,cb4f44b8,0,...) at 0xd00cfa78 _end(ca348b40,ca348b44,ca420af0,ca420af8,0,...) at 0xcc0e9c90 _end(d0310ea0,d0310ea4,cd3ad4b0,cd3ad4b8,0,...) at 0xcc7ec218 _end(ca5ddd20,ca5ddd24,ca6d8c80,ca6d8c88,0,...) at 0xca426c90 _end(c998aa20,c998aa24,ca2bb320,ca2bb328,0,...) at 0xd030fc90 [...] oh, i saw that earlier somewhere.. don't remember where. db> c and waiting some moments shows a little different picture: db> bt 101174 Tracing pid 8611 tid 101174 td 0xcb77ee10 sched_switch(cb77ee10,c7f3de10,6) at sched_switch+0x143 mi_switch(cf177608,7,c0a0b460,cf177608,c0a0a0b0,...) at mi_switch+0x1ba kseq_move(c0a0b460,7) at kseq_move+0xc1 sched_balance_pair(cb77ef68,ef879bb8,c0694edf,cb77ef68,cb77b360,...) at sched_balance_pair+0x91 _end(cbd1f650,cb77ee10,cb77ee20,0,cb77b374,...) at 0xcb77b360 MAXCPU(cb77b360,cb77b364,cb77ee10,cb77ee18,0,...) at 0 _end(d0a49a80,d0a49a84,c84cf7d0,c84cf7d8,0,...) at 0xc7f97648 _end(ca6dbcc0,ca6dbcc4,ca6d54b0,ca6d54b8,0,...) at 0xcbd1f648 _end(cbcad780,cbcad784,cc8a2190,cc8a2198,0,...) at 0xc8514430 _end(cab883c0,cab883c4,ca9417d0,ca9417d8,0,...) at 0xca6dc000 _end(cc67c4e0,cc67c4e4,cd6fd000,cd6fd008,0,...) at 0xcc8abc90 _end(cd3a9120,cd3a9124,cd3b1320,cd3b1328,0,...) at 0xcad68218 _end(cd130c60,cd130c64,d00ca320,d00ca328,0,...) at 0xca71e860 _end(cbcac240,cbcac244,cbf6e4b0,cbf6e4b8,0,...) at 0xcd472a78 [...] -- wbr, pluknet From owner-freebsd-stable@FreeBSD.ORG Tue May 12 21:31:13 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id CEB6A1065698 for ; Tue, 12 May 2009 21:31:13 +0000 (UTC) (envelope-from freebsd-stable@m.gmane.org) Received: from ciao.gmane.org (main.gmane.org [80.91.229.2]) by mx1.freebsd.org (Postfix) with ESMTP id 88E608FC08 for ; Tue, 12 May 2009 21:31:13 +0000 (UTC) (envelope-from freebsd-stable@m.gmane.org) Received: from list by ciao.gmane.org with local (Exim 4.43) id 1M3zZA-0005K1-Gj for freebsd-stable@freebsd.org; Tue, 12 May 2009 21:31:12 +0000 Received: from cpe-65-189-186-49.columbus.res.rr.com ([65.189.186.49]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 12 May 2009 21:31:12 +0000 Received: from dsamms by cpe-65-189-186-49.columbus.res.rr.com with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 12 May 2009 21:31:12 +0000 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-stable@freebsd.org From: David Samms Date: Tue, 12 May 2009 17:31:01 -0400 Lines: 30 Message-ID: <4A09EA95.7050800@nw-ds.com> References: <4A09DEF1.2010202@delphij.net> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: cpe-65-189-186-49.columbus.res.rr.com User-Agent: Thunderbird 2.0.0.21 (X11/20090429) In-Reply-To: <4A09DEF1.2010202@delphij.net> Sender: news Subject: Re: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 21:31:14 -0000 Xin LI wrote: > Hi David, > > David Samms wrote: >> After upgrading to 7.2 (amd64) some customers complained of very poor >> bandwidth. Upon investigation all the effected customers were ATT DSL >> clients located all over the USA, not in a single city, nor were other >> ISPs effected. The server is a Supermicro with dual (quad core) >> processors with a single Intel fxp network card on a 100mbit connection. > > Could you please try if this would help: > > sysctl net.inet.tcp.tso=0 > > Cheers, > - -- > Xin LI http://www.delphij.net/ > FreeBSD - The Power to Serve! Xin LI, Thank you for your help. Setting sysctl net.inet.tcp.tso=0 resolved the issue completely. What does sysctl net.inet.tcp.tso=0 do? Where can I read more about the option? I captured tcpdumps of a single file transfer to 7.1, 7.2 and 7.2 with sysctl net.inet.tcp.tso=0, but they are to large to attach to this list. Let me know if you are interested in viewing the dump files. Thanks again for your assistance! From owner-freebsd-stable@FreeBSD.ORG Tue May 12 21:54:58 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id ACBE61065670 for ; Tue, 12 May 2009 21:54:58 +0000 (UTC) (envelope-from peo@intersonic.se) Received: from neonpark.inter-sonic.com (neonpark.inter-sonic.com [212.247.8.98]) by mx1.freebsd.org (Postfix) with ESMTP id 72EF78FC1A for ; Tue, 12 May 2009 21:54:58 +0000 (UTC) (envelope-from peo@intersonic.se) X-Virus-Scanned: amavisd-new at BSDLabs AB Message-ID: <4A09EC89.5090400@intersonic.se> Date: Tue, 12 May 2009 23:39:21 +0200 From: Per olof Ljungmark Organization: Intersonic AB User-Agent: Thunderbird 2.0.0.21 (X11/20090502) MIME-Version: 1.0 To: Yani Karydis References: <4A098115.3040605@pi-greece.eu> In-Reply-To: <4A098115.3040605@pi-greece.eu> X-Enigmail-Version: 0.95.6 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: CAM Status: SCSI Status Error on 7.2-RELEASE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 21:54:59 -0000 Yani Karydis wrote: > Hello, > > Since upgrading to 7.2-RELEASE, dmesg displays the following after > booting the system. > > (probe3:ahc0:0:3:0): TEST UNIT READY. CDB: 0 0 0 0 0 0 > (probe3:ahc0:0:3:0): CAM Status: SCSI Status Error > (probe3:ahc0:0:3:0): SCSI Status: Check Condition > (probe3:ahc0:0:3:0): UNIT ATTENTION asc:29,0 > (probe3:ahc0:0:3:0): Power on, reset, or bus device reset occurred > (probe3:ahc0:0:3:0): Retrying Command (per Sense Data) > (probe3:ahc0:0:3:0): TEST UNIT READY. CDB: 0 0 0 0 0 0 > (probe3:ahc0:0:3:0): CAM Status: SCSI Status Error > (probe3:ahc0:0:3:0): SCSI Status: Check Condition > (probe3:ahc0:0:3:0): NOT READY asc:3a,0 > (probe3:ahc0:0:3:0): Medium not present > (probe3:ahc0:0:3:0): Unretryable error > sa0 at ahc0 bus 0 target 3 lun 0 > sa0: Removable Sequential Access SCSI-3 device > sa0: 20.000MB/s transfers (10.000MHz, offset 8, 16bit) > FWIW, same here, 7.2-PRERELEASE #0: Thu Apr 16 08:42:45 CEST 2009 amd64 and HP Ultrium drive. Did not notice any harm though. -- per From owner-freebsd-stable@FreeBSD.ORG Tue May 12 22:03:16 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id F18261065673 for ; Tue, 12 May 2009 22:03:16 +0000 (UTC) (envelope-from rick@kiwi-computer.com) Received: from kiwi-computer.com (keira.kiwi-computer.com [63.224.10.3]) by mx1.freebsd.org (Postfix) with SMTP id 886388FC13 for ; Tue, 12 May 2009 22:03:16 +0000 (UTC) (envelope-from rick@kiwi-computer.com) Received: (qmail 28362 invoked by uid 2001); 12 May 2009 21:36:35 -0000 Date: Tue, 12 May 2009 16:36:35 -0500 From: "Rick C. Petty" To: David Samms Message-ID: <20090512213635.GA27579@keira.kiwi-computer.com> References: <4A09DEF1.2010202@delphij.net> <4A09EA95.7050800@nw-ds.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4A09EA95.7050800@nw-ds.com> User-Agent: Mutt/1.4.2.3i Cc: freebsd-stable@freebsd.org Subject: Re: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: rick-freebsd2008@kiwi-computer.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 22:03:17 -0000 On Tue, May 12, 2009 at 05:31:01PM -0400, David Samms wrote: > > Setting sysctl net.inet.tcp.tso=0 resolved the issue completely. What > does sysctl net.inet.tcp.tso=0 do? # sysctl -d net.inet.tcp.tso net.inet.tcp.tso: Enable TCP Segmentation Offload I had a similar problem with a different NIC. This option controls whether we offload segmenting to the NIC. My NIC seemed to be limited by the number of interrupts which could be delivered. You can also do this on a card-by-card basis using "ifconfig -tso". -- Rick C. Petty From owner-freebsd-stable@FreeBSD.ORG Tue May 12 22:11:58 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id DFA841065672; Tue, 12 May 2009 22:11:58 +0000 (UTC) (envelope-from delphij@delphij.net) Received: from tarsier.delphij.net (delphij-pt.tunnel.tserv2.fmt.ipv6.he.net [IPv6:2001:470:1f03:2c9::2]) by mx1.freebsd.org (Postfix) with ESMTP id 7108B8FC17; Tue, 12 May 2009 22:11:58 +0000 (UTC) (envelope-from delphij@delphij.net) Received: from tarsier.geekcn.org (tarsier.geekcn.org [211.166.10.233]) (using TLSv1 with cipher ADH-CAMELLIA256-SHA (256/256 bits)) (No client certificate requested) by tarsier.delphij.net (Postfix) with ESMTPS id A46C95C024; Wed, 13 May 2009 06:11:57 +0800 (CST) Received: from localhost (tarsier.geekcn.org [211.166.10.233]) by tarsier.geekcn.org (Postfix) with ESMTP id 60C4755D1670; Wed, 13 May 2009 06:11:57 +0800 (CST) X-Virus-Scanned: amavisd-new at geekcn.org Received: from tarsier.geekcn.org ([211.166.10.233]) by localhost (mail.geekcn.org [211.166.10.233]) (amavisd-new, port 10024) with ESMTP id wqP3GzI3vmeA; Wed, 13 May 2009 06:10:54 +0800 (CST) Received: from charlie.delphij.net (adsl-76-237-33-62.dsl.pltn13.sbcglobal.net [76.237.33.62]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by tarsier.geekcn.org (Postfix) with ESMTPSA id 7AF2F55D166D; Wed, 13 May 2009 06:10:42 +0800 (CST) DomainKey-Signature: a=rsa-sha1; s=default; d=delphij.net; c=nofws; q=dns; h=message-id:date:from:reply-to:organization:user-agent: mime-version:to:cc:subject:references:in-reply-to: x-enigmail-version:openpgp:content-type:content-transfer-encoding; b=buRV31pvxTt4K7Qbqja1vVmELSVD2EOKOomJyD7ipDYzNGtowmcHPDmkAhEeHe5HJ 18Dk1z45/eh1AgMcLp5tg== Message-ID: <4A09F3D0.1050308@delphij.net> Date: Tue, 12 May 2009 15:10:24 -0700 From: Xin LI Organization: The FreeBSD Project User-Agent: Thunderbird 2.0.0.21 (X11/20090408) MIME-Version: 1.0 To: David Samms References: <4A09DEF1.2010202@delphij.net> <4A09EA95.7050800@nw-ds.com> In-Reply-To: <4A09EA95.7050800@nw-ds.com> X-Enigmail-Version: 0.95.7 OpenPGP: id=18EDEBA0; url=http://www.delphij.net/delphij.asc Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: "re@FreeBSD.org" , freebsd-stable@freebsd.org, yongari@FreeBSD.org Subject: Re: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: d@delphij.net List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 22:11:59 -0000 -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hi, David, David Samms wrote: > Xin LI wrote: >> Hi David, >> >> David Samms wrote: >>> After upgrading to 7.2 (amd64) some customers complained of very poor >>> bandwidth. Upon investigation all the effected customers were ATT DSL >>> clients located all over the USA, not in a single city, nor were other >>> ISPs effected. The server is a Supermicro with dual (quad core) >>> processors with a single Intel fxp network card on a 100mbit connection. >> >> Could you please try if this would help: >> >> sysctl net.inet.tcp.tso=0 >> >> Cheers, >> - -- >> Xin LI http://www.delphij.net/ >> FreeBSD - The Power to Serve! > > Xin LI, > > Thank you for your help. > > Setting sysctl net.inet.tcp.tso=0 resolved the issue completely. What > does sysctl net.inet.tcp.tso=0 do? Where can I read more about the > option? I captured tcpdumps of a single file transfer to 7.1, 7.2 and > 7.2 with sysctl net.inet.tcp.tso=0, but they are to large to attach to > this list. Let me know if you are interested in viewing the dump files. > > Thanks again for your assistance! Thanks for the offer but I think this is a known problem so perhaps the dump files are no longer necessary. The problem was caused by the reciever side (usually PPPoE clients, e.g. DSL users) which proposes a smaller MSS than the interface MTU, the previous implementation sets the packet length to interface MTU instead of the negotiated one, which would cause problem. Setting net.inet.tcp.tso=0 would turn off TCP Segment Offloading completely. The previous release of FreeBSD does not include this feature. I think yongari@ has committed a fix as revision 191867 (RELENG_7) and 190982 (HEAD): Index: if_fxp.c =================================================================== - --- if_fxp.c (revision 190981) +++ if_fxp.c (revision 190982) @@ -1485,7 +1485,8 @@ * checksum in the first frame driver should compute it. */ ip->ip_sum = 0; - - ip->ip_len = htons(ifp->if_mtu); + ip->ip_len = htons(m->m_pkthdr.tso_segsz + (ip->ip_hl << 2) + + (tcp->th_off << 2)); tcp->th_sum = in_pseudo(ip->ip_src.s_addr, ip->ip_dst.s_addr, htons(IPPROTO_TCP + (tcp->th_off << 2) + m->m_pkthdr.tso_segsz)); To re@: Perhaps we should issue an errata for this, at least document it in errata (I can do this)? Cheers, - -- Xin LI http://www.delphij.net/ FreeBSD - The Power to Serve! -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.11 (FreeBSD) iEYEARECAAYFAkoJ89AACgkQi+vbBBjt66B85ACeNJjEuVXitnceaC6GRG+9zWtB OaUAoLqikyZXMEngwkLEtHboaDiQp8QI =mcFR -----END PGP SIGNATURE----- From owner-freebsd-stable@FreeBSD.ORG Tue May 12 23:13:32 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0908A1065676 for ; Tue, 12 May 2009 23:13:32 +0000 (UTC) (envelope-from freebsd@eyede.com) Received: from ransack.eyede.com (ransack.eyede.com [202.21.136.245]) by mx1.freebsd.org (Postfix) with ESMTP id C30098FC17 for ; Tue, 12 May 2009 23:13:31 +0000 (UTC) (envelope-from freebsd@eyede.com) Received: from localhost (localhost [127.0.0.1]) by ransack.eyede.com (Postfix) with ESMTP id C57075CB1; Wed, 13 May 2009 10:52:35 +1200 (NZST) Message-ID: <4A09FDB2.5080307@eyede.com> Date: Wed, 13 May 2009 10:52:34 +1200 From: Nigel Wohlers Organization: Eyede Ltd User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9.1b3pre) Gecko/20090223 Thunderbird/3.0b2 MIME-Version: 1.0 To: d@delphij.net References: <4A09DEF1.2010202@delphij.net> In-Reply-To: <4A09DEF1.2010202@delphij.net> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: nigel@eyede.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2009 23:13:32 -0000 On 13/5/09 8:41 AM, Xin LI wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > Hi David, > > David Samms wrote: >> After upgrading to 7.2 (amd64) some customers complained of very poor >> bandwidth. Upon investigation all the effected customers were ATT DSL >> clients located all over the USA, not in a single city, nor were other >> ISPs effected. The server is a Supermicro with dual (quad core) >> processors with a single Intel fxp network card on a 100mbit connection. > > Could you please try if this would help: > > sysctl net.inet.tcp.tso=0 > > Cheers, > - -- > Xin LI http://www.delphij.net/ Thank you! This hint has saved me a lot of troubleshooting. I was having the same issue as David with 3 servers recently upgraded to 7.2. Clients (MS Windows) were complaining that they were having intermittent connectivity issues talking to these servers (https, imaps). They too have fxp network interface cards, no issues with other servers upgraded to 7.2 with em cards. Thanks again. Regards, Nigel. From owner-freebsd-stable@FreeBSD.ORG Wed May 13 00:33:09 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E26FD106564A for ; Wed, 13 May 2009 00:33:09 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: from rv-out-0506.google.com (rv-out-0506.google.com [209.85.198.227]) by mx1.freebsd.org (Postfix) with ESMTP id B05428FC08 for ; Wed, 13 May 2009 00:33:09 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: by rv-out-0506.google.com with SMTP id k40so247694rvb.43 for ; Tue, 12 May 2009 17:33:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:received:from:date:to:cc :subject:message-id:reply-to:references:mime-version:content-type :content-disposition:in-reply-to:user-agent; bh=25IM2/ngLLlXzdi/AtsH2q1FR+Sr/XkIT8HIrc89GNk=; b=vLR6ZLebWMWcT+jPclVeJhmwy8dZ00yqoVP6ssye9x6rlBV9KSI7asHFuePMtI811y co+fLmNp65C/GmkYHPO+cqONOvNHFvldCYFbHhVzXqX+gPkSF7cEYpxJkuSdJqPTbUx3 exUrFsCyAs7Ln2DXTzhAaXWVdU90NoonTA7ko= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:date:to:cc:subject:message-id:reply-to:references:mime-version :content-type:content-disposition:in-reply-to:user-agent; b=QDAMC/AiVzom/+D2N3RIWgk2S8sRSneV35ygIskVtz0PYxU89WWQnqr28ALa5qQgz8 zdd/7GcTdSt90rsDf16lnAzklyoOLAOhRa03Pk1tAHWX9AOvMLZGGOVNBkuqP0Yi5yoP 3OjNjzyOlYjhTSp5eY1PD4Bm1OaNoLIzjnQns= Received: by 10.141.63.20 with SMTP id q20mr90333rvk.218.1242174789402; Tue, 12 May 2009 17:33:09 -0700 (PDT) Received: from michelle.cdnetworks.co.kr ([114.111.62.249]) by mx.google.com with ESMTPS id g14sm1072295rvb.12.2009.05.12.17.33.07 (version=SSLv3 cipher=RC4-MD5); Tue, 12 May 2009 17:33:08 -0700 (PDT) Received: by michelle.cdnetworks.co.kr (sSMTP sendmail emulation); Wed, 13 May 2009 09:41:31 +0900 From: Pyun YongHyeon Date: Wed, 13 May 2009 09:41:31 +0900 To: nigel@eyede.com Message-ID: <20090513004131.GP65350@michelle.cdnetworks.co.kr> References: <4A09DEF1.2010202@delphij.net> <4A09FDB2.5080307@eyede.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4A09FDB2.5080307@eyede.com> User-Agent: Mutt/1.4.2.3i Cc: freebsd-stable@freebsd.org, d@delphij.net Subject: Re: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: pyunyh@gmail.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 00:33:10 -0000 On Wed, May 13, 2009 at 10:52:34AM +1200, Nigel Wohlers wrote: > On 13/5/09 8:41 AM, Xin LI wrote: > >-----BEGIN PGP SIGNED MESSAGE----- > >Hash: SHA1 > > > >Hi David, > > > >David Samms wrote: > >>After upgrading to 7.2 (amd64) some customers complained of very poor > >>bandwidth. Upon investigation all the effected customers were ATT DSL > >>clients located all over the USA, not in a single city, nor were other > >>ISPs effected. The server is a Supermicro with dual (quad core) > >>processors with a single Intel fxp network card on a 100mbit connection. > > > >Could you please try if this would help: > > > > sysctl net.inet.tcp.tso=0 > > > >Cheers, > >- -- > >Xin LI http://www.delphij.net/ > > > Thank you! This hint has saved me a lot of troubleshooting. > > I was having the same issue as David with 3 servers recently upgraded to > 7.2. Clients (MS Windows) were complaining that they were having > intermittent connectivity issues talking to these servers (https, imaps). > > They too have fxp network interface cards, no issues with other servers > upgraded to 7.2 with em cards. > Instead of disabling TSO in network stack, just disable TSO in fxp(4) as a workaround. Fix already is in RELENG_7(r191867) so you can extract the patch and apply it by hand if you want. For instance, #cd /tmp #fetch -o fxp.tso.patch "http://svn.freebsd.org/viewvc/base/head/sys/dev/fxp/if_fxp.c?r1=190982&r2=188176&view=patch" #cd /usr/src/sys/dev/fxp #patch -p4 < /tmp/fxp.tso.patch And rebuild kernel. From owner-freebsd-stable@FreeBSD.ORG Wed May 13 00:47:13 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3CFE2106566B for ; Wed, 13 May 2009 00:47:13 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 0F88D8FC18 for ; Wed, 13 May 2009 00:47:13 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 9CB2546B5B; Tue, 12 May 2009 20:47:12 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 08AF38A026; Tue, 12 May 2009 20:47:11 -0400 (EDT) From: John Baldwin To: pluknet Date: Tue, 12 May 2009 20:46:42 -0400 User-Agent: KMail/1.9.7 References: <200905121014.55450.jhb@freebsd.org> In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905122046.43048.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Tue, 12 May 2009 20:47:11 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-stable@freebsd.org Subject: Re: lock up in 6.2 (procs massively stuck in Giant) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 00:47:13 -0000 On Tuesday 12 May 2009 4:59:19 pm pluknet wrote: > Hi. > > From just another box (not from the first two mentioned earlier) > with a similar locking issue. If it would make sense, since there are > possibly a bit different conditions. > clock proc here is on swi4, I hope it's a non-important difference. > > 18 0 0 0 LL *Giant 0xd0a6b140 [swi4: clock sio] > db> bt 18 Ok, this is a known issue in 6.x. It is fixed in 6.4. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Wed May 13 05:45:05 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1137B1065680; Wed, 13 May 2009 05:45:05 +0000 (UTC) (envelope-from pluknet@gmail.com) Received: from mail-fx0-f168.google.com (mail-fx0-f168.google.com [209.85.220.168]) by mx1.freebsd.org (Postfix) with ESMTP id 69CD58FC12; Wed, 13 May 2009 05:45:04 +0000 (UTC) (envelope-from pluknet@gmail.com) Received: by fxm12 with SMTP id 12so418762fxm.43 for ; Tue, 12 May 2009 22:45:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=zkM0VtJfSuF6B4ahv0GQeRwDPidsbJ7qOVHwtIPyqkA=; b=n3cPRah6ccdn4FVbYUtRbuc7Tq79Gy3OTcIiL09X10WWXkRqEiEx7ZjYWztvJ+Qed7 +dB6nfRJxcglQxXjCuVJtFJQhK9itRw75cRNkxxPRWCMgVoObtqbB73A2Lp6CaXyne72 sinUNDNWcPXTCcmtcJZWJHL78u6x4SvKz9jRw= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=VEPxoFKA3FLKWNnRyQ1VVBB6c62A47dlqrnExho11M9RzKb5Z28bugvjwPXdCppX9K 9ooDBPYweXXguhKHbOKy6mfTLhalAMcpYVStxp26Un1igGubPvywgx0u7VeiPHhx1yWh LF4TCH+2vkyNgurJaXjg19gIJLQhbH+5vQOuE= MIME-Version: 1.0 Received: by 10.103.197.17 with SMTP id z17mr372408mup.19.1242193503427; Tue, 12 May 2009 22:45:03 -0700 (PDT) In-Reply-To: <200905122046.43048.jhb@freebsd.org> References: <200905121014.55450.jhb@freebsd.org> <200905122046.43048.jhb@freebsd.org> Date: Wed, 13 May 2009 09:45:03 +0400 Message-ID: From: pluknet To: John Baldwin Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: lock up in 6.2 (procs massively stuck in Giant) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 05:45:05 -0000 2009/5/13 John Baldwin : > On Tuesday 12 May 2009 4:59:19 pm pluknet wrote: >> Hi. >> >> From just another box (not from the first two mentioned earlier) >> with a similar locking issue. If it would make sense, since there are >> possibly a bit different conditions. >> clock proc here is on swi4, I hope it's a non-important difference. >> >> 18 0 0 0 LL *Giant 0xd0a6b140 [swi4: clock sio] >> db> bt 18 > > Ok, this is a known issue in 6.x. It is fixed in 6.4. > Thank you very much! -- wbr, pluknet From owner-freebsd-stable@FreeBSD.ORG Wed May 13 05:57:42 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 8B540106564A; Wed, 13 May 2009 05:57:42 +0000 (UTC) (envelope-from david@usermode.org) Received: from proxy.meer.net (proxy.meer.net [64.13.141.13]) by mx1.freebsd.org (Postfix) with ESMTP id 6CDB98FC18; Wed, 13 May 2009 05:57:42 +0000 (UTC) (envelope-from david@usermode.org) Received: from mail.meer.net (mail.meer.net [64.13.141.3]) by proxy.meer.net (8.14.3/8.14.3) with ESMTP id n4D5vfpY047037; Tue, 12 May 2009 22:57:42 -0700 (PDT) (envelope-from david@usermode.org) Received: from radagast.usermode.org (netblock-66-245-218-233.dslextreme.com [66.245.218.233]) by mail.meer.net (8.13.3/8.13.3/meer) with ESMTP id n4D5vdM3072031; Tue, 12 May 2009 22:57:39 -0700 (PDT) (envelope-from david@usermode.org) From: David Johnson To: freebsd-stable@freebsd.org Date: Tue, 12 May 2009 22:57:39 -0700 User-Agent: KMail/1.11.3 (FreeBSD/7.2-RELEASE; KDE/4.2.3; i386; ; ) References: <200905042015.29394.david@usermode.org> <1242141471.1755.11.camel@balrog.2hip.net> <200905121152.29572.david@usermode.org> In-Reply-To: <200905121152.29572.david@usermode.org> MIME-Version: 1.0 Content-Type: Text/Plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905122257.39353.david@usermode.org> X-Spam-Score: undef - spam scanning disabled X-CanIt-Geo: ip=64.13.141.3; country=US; region=CA; city=Mountain View; latitude=37.3974; longitude=-122.0732; metrocode=807; areacode=650; http://maps.google.com/maps?q=37.3974,-122.0732&z=6 X-CanItPRO-Stream: default X-Canit-Stats-ID: Bayes signature not available X-Scanned-By: CanIt (www . roaringpenguin . com) on 64.13.141.13 Cc: Robert Noland Subject: Re: Xorg hangs with drmwtq in 7.2-RELEASE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 05:57:42 -0000 On Tuesday 12 May 2009 11:52:29 am David Johnson wrote: > I may have made a mistake though, and briefly turned on debugging earlier > in the session. I'll get another trace this evening when I have time, to > double check. Yup, I must have turned on debugging earlier in that session. All I can get now is that repetitous drm_ioctl. -- David Johnson From owner-freebsd-stable@FreeBSD.ORG Wed May 13 06:40:35 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 927D3106566B; Wed, 13 May 2009 06:40:35 +0000 (UTC) (envelope-from pluknet@gmail.com) Received: from mail-bw0-f213.google.com (mail-bw0-f213.google.com [209.85.218.213]) by mx1.freebsd.org (Postfix) with ESMTP id AA3658FC0C; Wed, 13 May 2009 06:40:34 +0000 (UTC) (envelope-from pluknet@gmail.com) Received: by bwz9 with SMTP id 9so435532bwz.43 for ; Tue, 12 May 2009 23:40:33 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=5976o+Hz8aNEjTbd23LoWlyoFCnoN6L0+z08WU3U4PY=; b=Sgs8Gv5LuaTWL6JzGifjNS2VlYspRapuNJVDL/eyv14wcqc9pSettrm81IGUuAZrq7 jvjvDE/hVEO6h0xGeEyXH1bX30P3UCMo8WEpfDdL2yk03/6aKLxCPJHwb8a7S/ltIxNO Gd/NkETbxGISp8nQG2qHXeE4s1NuG7FGFsO+Q= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=XaO3hzrZHvWimT7X5vPHhcRq1FYRQlWoqmmOvp4Xp+1bj6JH4uMjdfxtQKnYwuxyqc /rvvuQKyOlLEeP3ICVvFOT2FfPnltnZ2JFEeisv8IZHhIRkO8ijw5db+prL9lsat5CuR wWmzYO3YsSS/gY+ketwpGgxvteAoOTds8SAsw= MIME-Version: 1.0 Received: by 10.103.228.7 with SMTP id f7mr406010mur.0.1242196833125; Tue, 12 May 2009 23:40:33 -0700 (PDT) In-Reply-To: References: <200905121014.55450.jhb@freebsd.org> <200905122046.43048.jhb@freebsd.org> Date: Wed, 13 May 2009 10:40:33 +0400 Message-ID: From: pluknet To: John Baldwin Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: lock up in 6.2 (procs massively stuck in Giant) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 06:40:35 -0000 2009/5/13 pluknet : > 2009/5/13 John Baldwin : >> On Tuesday 12 May 2009 4:59:19 pm pluknet wrote: >>> Hi. >>> >>> From just another box (not from the first two mentioned earlier) >>> with a similar locking issue. If it would make sense, since there are >>> possibly a bit different conditions. >>> clock proc here is on swi4, I hope it's a non-important difference. >>> >>> 18 0 0 0 LL *Giant 0xd0a6b140 [swi4: clock sio] >>> db> bt 18 >> >> Ok, this is a known issue in 6.x. It is fixed in 6.4. >> Looking at the face of kern_timeout.c I suspect that was fixed in r181012. Thanks again for the tips. -- wbr, pluknet From owner-freebsd-stable@FreeBSD.ORG Wed May 13 07:09:38 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id ADC2E106566B for ; Wed, 13 May 2009 07:09:38 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from hub.org (hub.org [200.46.204.220]) by mx1.freebsd.org (Postfix) with ESMTP id 6EFAC8FC0A for ; Wed, 13 May 2009 07:09:38 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from maia.hub.org (maia-4.hub.org [200.46.204.183]) by hub.org (Postfix) with ESMTP id 9138E53BC50 for ; Wed, 13 May 2009 04:09:37 -0300 (ADT) Received: from hub.org ([200.46.204.220]) by maia.hub.org (mx1.hub.org [200.46.204.183]) (amavisd-maia, port 10024) with ESMTP id 57111-02 for ; Wed, 13 May 2009 04:09:36 -0300 (ADT) Received: by hub.org (Postfix, from userid 1002) id 6AD1B53BC4D; Wed, 13 May 2009 04:09:33 -0300 (ADT) Received: from localhost (localhost [127.0.0.1]) by hub.org (Postfix) with ESMTP id 68F0F53BC36 for ; Wed, 13 May 2009 04:09:33 -0300 (ADT) Date: Wed, 13 May 2009 04:09:33 -0300 (ADT) From: "Marc G. Fournier" To: freebsd-stable@freebsd.org Message-ID: <20090513040719.D17646@hub.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Subject: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 07:09:39 -0000 Don't know if this helps with anything, but it just hung after 2days again ... nothing on the console ... top process running at the time shows the following ... anything there look "concerning"? last pid: 5196; load averages: 9.25, 15.97, 10.07 up 2+07:58:36 04:02:28 1874 processes:317 running, 1537 sleeping, 20 zombie CPU: 6.2% user, 0.0% nice, 6.7% system, 0.3% interrupt, 86.8% idle Mem: 4552M Active, 162M Inact, 684M Wired, 46M Cache, 399M Buf, 8240K Free Swap: 8192M Total, 1308M Used, 6884M Free, 15% Inuse, 1360K In, 63M Out PID USERNAME THR PRI NICE SIZE RES STATE C TIME WCPU COMMAND 28752 root 5 96 0 427M 408M select 1 1:55 0.00% named 9720 nobody 19 97 0 402M 186M RUN 1 0:00 0.69% nsd 54395 root 16 20 0 1308M 163M kserel 0 0:00 0.00% java 8500 nobody 10 102 0 193M 86492K ucond 1 0:07 0.00% nsd 3302 102 1 96 0 158M 66100K select 1 0:37 0.00% postgres 7853 1304 1 96 0 154M 54408K select 1 0:39 0.00% postgres 10670 88 28 20 0 335M 42488K kserel 0 0:00 0.44% mysqld 4976 root 5 4 0 95444K 41740K kqread 1 1:09 0.00% named 14003 www 44 96 0 443M 41632K ucond 1 0:00 0.00% java 8528 nobody 15 96 0 188M 37904K ucond 1 0:00 0.00% nsd 5157 88 109 96 0 97620K 33704K RUN 0 0:00 0.00% mysqld 1759 www 1 4 0 167M 32276K select 1 0:01 0.00% httpd 99407 www 1 4 0 165M 31712K sbwait 0 0:02 0.00% httpd 4006 www 1 4 0 124M 31424K sbwait 1 0:01 0.29% httpd 1299 www 1 4 0 164M 31376K sbwait 1 0:02 0.00% httpd 1758 www 1 4 0 164M 31176K sbwait 0 0:02 0.00% httpd 99402 www 1 96 0 163M 29892K CPU1 1 0:03 0.00% httpd 4036 www 1 20 0 122M 28680K lockf 1 0:00 0.00% httpd 1757 www 1 4 0 158M 27856K sbwait 1 0:02 0.00% httpd 3899 www 1 96 0 160M 27688K RUN 0 0:00 0.00% httpd 4007 www 1 20 0 125M 27588K lockf 0 0:01 2.10% httpd 4525 www 1 96 0 158M 26624K RUN 1 0:00 0.00% httpd 4607 www 1 96 0 158M 26096K RUN 0 0:00 0.00% httpd 13635 88 34 96 0 92340K 25604K CPU0 0 0:00 0.05% mysqld 4024 www 1 96 0 156M 24880K RUN 1 0:00 0.10% httpd 3585 102 1 4 0 163M 24748K sbwait 1 2:56 0.00% postgres 3951 www 1 96 0 155M 24548K RUN 1 0:00 0.10% httpd 4022 www 1 96 0 155M 24320K RUN 0 0:00 0.00% httpd 3960 www 1 96 0 155M 24316K RUN 1 0:00 0.00% httpd 3388 102 1 4 0 161M 24228K sbwait 0 1:07 0.00% postgres 4023 www 1 96 0 155M 23988K RUN 1 0:00 0.00% httpd 99468 www 1 96 0 104M 23660K RUN 1 0:03 0.00% httpd 99423 www 1 4 0 154M 23456K sbwait 0 0:03 0.00% httpd 3959 www 1 -4 0 103M 23144K devfs 0 0:00 0.00% httpd 5004 www 1 4 0 154M 23032K sbwait 1 0:00 0.00% httpd 62771 www 1 -16 0 143M 22824K vnread 1 0:01 0.00% httpd 4612 www 1 96 0 153M 21936K RUN 1 0:00 0.15% httpd 4609 www 1 96 0 153M 21936K RUN 0 0:00 0.05% httpd 5180 www 1 96 0 145M 21660K RUN 0 0:12 0.00% httpd 5007 www 1 4 0 115M 21360K sbwait 0 0:00 0.29% httpd 57327 www 1 -8 0 145M 20996K biord 0 0:04 0.20% httpd 29064 www 1 -8 0 143M 20812K biord 1 0:04 0.00% httpd 99381 www 1 96 0 151M 19364K RUN 1 0:04 0.00% httpd 4682 root 1 4 0 62388K 17828K kqread 1 0:00 0.00% perl 9447 88 8 20 0 61388K 17508K kserel 0 0:00 0.05% mysqld 13457 bind 5 96 0 45724K 17424K RUN 0 0:14 0.00% named 87535 www 1 4 0 149M 17396K sbwait 1 0:09 0.00% httpd 4611 www 1 4 0 146M 17008K sbwait 1 0:00 0.00% httpd 3386 102 1 -4 0 163M 16544K semwai 0 0:51 0.00% postgres 91929 www 1 4 0 113M 16196K sbwait 0 0:04 0.00% httpd 4757 www 1 96 0 145M 16144K RUN 0 0:00 0.00% httpd 10269 88 5 20 0 57504K 16000K kserel 0 0:00 0.00% mysqld 3946 www 1 4 0 126M 15552K sbwait 1 0:01 15.00% httpd 3619 www 1 4 0 113M 15172K sbwait 1 0:00 0.00% httpd 3385 102 1 96 0 163M 14932K RUN 1 0:50 0.00% postgres 28755 102 1 4 0 159M 14760K sbwait 0 31:36 0.35% postgres ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email . scrappy@hub.org MSN . scrappy@hub.org Yahoo . yscrappy Skype: hub.org ICQ . 7615664 From owner-freebsd-stable@FreeBSD.ORG Wed May 13 07:17:05 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B6C5C1065673 for ; Wed, 13 May 2009 07:17:05 +0000 (UTC) (envelope-from delphij@delphij.net) Received: from tarsier.delphij.net (delphij-pt.tunnel.tserv2.fmt.ipv6.he.net [IPv6:2001:470:1f03:2c9::2]) by mx1.freebsd.org (Postfix) with ESMTP id DF1F08FC32 for ; Wed, 13 May 2009 07:17:03 +0000 (UTC) (envelope-from delphij@delphij.net) Received: from tarsier.geekcn.org (tarsier.geekcn.org [211.166.10.233]) (using TLSv1 with cipher ADH-CAMELLIA256-SHA (256/256 bits)) (No client certificate requested) by tarsier.delphij.net (Postfix) with ESMTPS id C9AC45C024 for ; Wed, 13 May 2009 15:17:02 +0800 (CST) Received: from localhost (tarsier.geekcn.org [211.166.10.233]) by tarsier.geekcn.org (Postfix) with ESMTP id 675A255D1674; Wed, 13 May 2009 15:17:02 +0800 (CST) X-Virus-Scanned: amavisd-new at geekcn.org Received: from tarsier.geekcn.org ([211.166.10.233]) by localhost (mail.geekcn.org [211.166.10.233]) (amavisd-new, port 10024) with ESMTP id TWxVsGm3sR1n; Wed, 13 May 2009 15:16:08 +0800 (CST) Received: from charlie.delphij.net (c-67-180-14-41.hsd1.ca.comcast.net [67.180.14.41]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by tarsier.geekcn.org (Postfix) with ESMTPSA id E9A2555D1670; Wed, 13 May 2009 15:16:01 +0800 (CST) DomainKey-Signature: a=rsa-sha1; s=default; d=delphij.net; c=nofws; q=dns; h=message-id:date:from:reply-to:organization:user-agent: mime-version:to:cc:subject:references:in-reply-to: x-enigmail-version:openpgp:content-type:content-transfer-encoding; b=jIiivYoynP9nMEqE6DQTaVBDfm/o/EXx0zAACyWgzCVby+NZxjdFOWVHEieHPBkCV PZnPOWkzRU1sk0xnIrd4A== Message-ID: <4A0A739F.4010809@delphij.net> Date: Wed, 13 May 2009 00:15:43 -0700 From: Xin LI Organization: The FreeBSD Project User-Agent: Thunderbird 2.0.0.21 (X11/20090408) MIME-Version: 1.0 To: "Marc G. Fournier" References: <20090513040719.D17646@hub.org> In-Reply-To: <20090513040719.D17646@hub.org> X-Enigmail-Version: 0.95.7 OpenPGP: id=18EDEBA0; url=http://www.delphij.net/delphij.asc Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: d@delphij.net List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 07:17:06 -0000 -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Marc G. Fournier wrote: > > Don't know if this helps with anything, but it just hung after 2days > again ... nothing on the console ... top process running at the time > shows the following ... anything there look "concerning"? Looks like a dead/livelock between devfs and ufs but I don't have further hints about this. Last time we have this was the 7.0 age and upgrading to 7.1 fixed it IIRC... > last pid: 5196; load averages: 9.25, 15.97, 10.07 up 2+07:58:36 > 04:02:28 > 1874 processes:317 running, 1537 sleeping, 20 zombie > CPU: 6.2% user, 0.0% nice, 6.7% system, 0.3% interrupt, 86.8% idle > Mem: 4552M Active, 162M Inact, 684M Wired, 46M Cache, 399M Buf, 8240K Free > Swap: 8192M Total, 1308M Used, 6884M Free, 15% Inuse, 1360K In, 63M Out > > PID USERNAME THR PRI NICE SIZE RES STATE C TIME WCPU COMMAND > 28752 root 5 96 0 427M 408M select 1 1:55 0.00% named > 9720 nobody 19 97 0 402M 186M RUN 1 0:00 0.69% nsd > 54395 root 16 20 0 1308M 163M kserel 0 0:00 0.00% java > 8500 nobody 10 102 0 193M 86492K ucond 1 0:07 0.00% nsd > 3302 102 1 96 0 158M 66100K select 1 0:37 0.00% postgres > 7853 1304 1 96 0 154M 54408K select 1 0:39 0.00% postgres > 10670 88 28 20 0 335M 42488K kserel 0 0:00 0.44% mysqld > 4976 root 5 4 0 95444K 41740K kqread 1 1:09 0.00% named > 14003 www 44 96 0 443M 41632K ucond 1 0:00 0.00% java > 8528 nobody 15 96 0 188M 37904K ucond 1 0:00 0.00% nsd > 5157 88 109 96 0 97620K 33704K RUN 0 0:00 0.00% mysqld > 1759 www 1 4 0 167M 32276K select 1 0:01 0.00% httpd > 99407 www 1 4 0 165M 31712K sbwait 0 0:02 0.00% httpd > 4006 www 1 4 0 124M 31424K sbwait 1 0:01 0.29% httpd > 1299 www 1 4 0 164M 31376K sbwait 1 0:02 0.00% httpd > 1758 www 1 4 0 164M 31176K sbwait 0 0:02 0.00% httpd > 99402 www 1 96 0 163M 29892K CPU1 1 0:03 0.00% httpd > 4036 www 1 20 0 122M 28680K lockf 1 0:00 0.00% httpd > 1757 www 1 4 0 158M 27856K sbwait 1 0:02 0.00% httpd > 3899 www 1 96 0 160M 27688K RUN 0 0:00 0.00% httpd > 4007 www 1 20 0 125M 27588K lockf 0 0:01 2.10% httpd > 4525 www 1 96 0 158M 26624K RUN 1 0:00 0.00% httpd > 4607 www 1 96 0 158M 26096K RUN 0 0:00 0.00% httpd > 13635 88 34 96 0 92340K 25604K CPU0 0 0:00 0.05% mysqld > 4024 www 1 96 0 156M 24880K RUN 1 0:00 0.10% httpd > 3585 102 1 4 0 163M 24748K sbwait 1 2:56 0.00% postgres > 3951 www 1 96 0 155M 24548K RUN 1 0:00 0.10% httpd > 4022 www 1 96 0 155M 24320K RUN 0 0:00 0.00% httpd > 3960 www 1 96 0 155M 24316K RUN 1 0:00 0.00% httpd > 3388 102 1 4 0 161M 24228K sbwait 0 1:07 0.00% postgres > 4023 www 1 96 0 155M 23988K RUN 1 0:00 0.00% httpd > 99468 www 1 96 0 104M 23660K RUN 1 0:03 0.00% httpd > 99423 www 1 4 0 154M 23456K sbwait 0 0:03 0.00% httpd > 3959 www 1 -4 0 103M 23144K devfs 0 0:00 0.00% httpd > 5004 www 1 4 0 154M 23032K sbwait 1 0:00 0.00% httpd > 62771 www 1 -16 0 143M 22824K vnread 1 0:01 0.00% httpd > 4612 www 1 96 0 153M 21936K RUN 1 0:00 0.15% httpd > 4609 www 1 96 0 153M 21936K RUN 0 0:00 0.05% httpd > 5180 www 1 96 0 145M 21660K RUN 0 0:12 0.00% httpd > 5007 www 1 4 0 115M 21360K sbwait 0 0:00 0.29% httpd > 57327 www 1 -8 0 145M 20996K biord 0 0:04 0.20% httpd > 29064 www 1 -8 0 143M 20812K biord 1 0:04 0.00% httpd > 99381 www 1 96 0 151M 19364K RUN 1 0:04 0.00% httpd > 4682 root 1 4 0 62388K 17828K kqread 1 0:00 0.00% perl > 9447 88 8 20 0 61388K 17508K kserel 0 0:00 0.05% mysqld > 13457 bind 5 96 0 45724K 17424K RUN 0 0:14 0.00% named > 87535 www 1 4 0 149M 17396K sbwait 1 0:09 0.00% httpd > 4611 www 1 4 0 146M 17008K sbwait 1 0:00 0.00% httpd > 3386 102 1 -4 0 163M 16544K semwai 0 0:51 0.00% postgres > 91929 www 1 4 0 113M 16196K sbwait 0 0:04 0.00% httpd > 4757 www 1 96 0 145M 16144K RUN 0 0:00 0.00% httpd > 10269 88 5 20 0 57504K 16000K kserel 0 0:00 0.00% mysqld > 3946 www 1 4 0 126M 15552K sbwait 1 0:01 15.00% httpd > 3619 www 1 4 0 113M 15172K sbwait 1 0:00 0.00% httpd > 3385 102 1 96 0 163M 14932K RUN 1 0:50 0.00% postgres > 28755 102 1 4 0 159M 14760K sbwait 0 31:36 0.35% postgres > > ---- > Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) > Email . scrappy@hub.org MSN . scrappy@hub.org > Yahoo . yscrappy Skype: hub.org ICQ . 7615664 > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" - -- Xin LI http://www.delphij.net/ FreeBSD - The Power to Serve! -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.11 (FreeBSD) iEYEARECAAYFAkoKc5oACgkQi+vbBBjt66DzYACfXvyb+8mB0x2jAq4z/shQ8MAS kEcAnix1xKt10A5c1aMqQK4ImJoWX/Ny =AIYf -----END PGP SIGNATURE----- From owner-freebsd-stable@FreeBSD.ORG Wed May 13 09:42:58 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 04FB0106566B for ; Wed, 13 May 2009 09:42:58 +0000 (UTC) (envelope-from milu@dat.pl) Received: from jab.dat.pl (dat.pl [80.51.155.34]) by mx1.freebsd.org (Postfix) with ESMTP id 9C4F88FC1D for ; Wed, 13 May 2009 09:42:57 +0000 (UTC) (envelope-from milu@dat.pl) Received: from localhost (jsrv.dat.pl [127.0.0.1]) by jab.dat.pl (Postfix) with ESMTP id C3516A8; Wed, 13 May 2009 11:24:31 +0200 (CEST) X-Virus-Scanned: amavisd-new at dat.pl Received: from jab.dat.pl ([127.0.0.1]) by localhost (jab.dat.pl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 1fbsOWqn3Hgn; Wed, 13 May 2009 11:24:28 +0200 (CEST) Received: from snifi.localnet (87-204-241-35.ip.netia.com.pl [87.204.241.35]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by jab.dat.pl (Postfix) with ESMTPSA id 8242C99; Wed, 13 May 2009 11:24:28 +0200 (CEST) From: Maciej Milewski To: Pat Wendorf Date: Wed, 13 May 2009 11:24:16 +0200 User-Agent: KMail/1.11.2 (Linux/2.6.29-ARCH; KDE/4.2.2; x86_64; ; ) References: <2c2c47aa0905121110i6355930bwce3a9c6afb117d4d@mail.gmail.com> In-Reply-To: <2c2c47aa0905121110i6355930bwce3a9c6afb117d4d@mail.gmail.com> MIME-Version: 1.0 Message-Id: <200905131124.16897.milu@dat.pl> Content-Type: text/plain; charset="iso-8859-2" Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: freebsd-stable@freebsd.org Subject: Re: File system corruption X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 09:42:58 -0000 Tuesday 12 May 2009 20:10:57 Pat Wendorf napisa=B3(a): > I have a co-lo server I've been maintaining for a few years now running I= DE > drives on a mostly terrible UPS. A few months ago, when it returned from= a > power outage (running 6.2-R) I started noticing the following in my daily > security email: > > Checking setuid files and devices: > find: > /var/db/portsnap/files/2dc95ddff37a8091239e83bf7e3ce5a2c285b027891ced1919= d7 >6c9947c5b7db.gz: Bad file descriptor > find: > /var/db/portsnap/files/52abe8c91385b12272f13f4d20896067d9ba70bdec1fa25750= 25 >858bd3e93718.gz: Bad file descriptor > find: /var/lost+found/#238237: Bad file descriptor > > I verified that these files return the same result when trying to do any > operation on them (including ls in the directory). > > I've managed to ignore the problem for a while now, and even upgraded to > 7.2, but I'm not sure if it will cause problems later on. So the question > is, without access to the console, how would I fix this? I think tere is a need for fsck on this partition. /var is used by many daemons for logging, mailqueue etc., so maybe the firs= t=20 thing to do would be to stop as many daemons as possible and leaving only s= sh=20 to get to this system remotely? I really don't know how much dangerous could be unmounting /var on a live=20 system in such case. =2D-=20 Pozdrawiam, Maciej Milewski From owner-freebsd-stable@FreeBSD.ORG Wed May 13 11:05:45 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0697A106566C for ; Wed, 13 May 2009 11:05:45 +0000 (UTC) (envelope-from dwmalone@maths.tcd.ie) Received: from salmon.maths.tcd.ie (salmon.maths.tcd.ie [IPv6:2001:770:10:300::86e2:510b]) by mx1.freebsd.org (Postfix) with SMTP id 5A9BA8FC14 for ; Wed, 13 May 2009 11:05:44 +0000 (UTC) (envelope-from dwmalone@maths.tcd.ie) Received: from walton.maths.tcd.ie ([134.226.81.10] helo=walton.maths.tcd.ie) by salmon.maths.tcd.ie with SMTP id ; 13 May 2009 12:05:43 +0100 (BST) Received: from localhost ([127.0.0.1] helo=maths.tcd.ie) by walton.maths.tcd.ie with SMTP id ; 13 May 2009 12:05:42 +0100 (BST) To: Eduardo Meyer In-reply-to: Your message of "Tue, 12 May 2009 13:04:19 -0300." X-Request-Do: Date: Wed, 13 May 2009 12:05:42 +0100 From: David Malone Message-ID: <200905131205.aa18356@walton.maths.tcd.ie> Cc: stable@freebsd.org Subject: Re: "maxproc limit exceeded" making no sense X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 11:05:45 -0000 > This user is classess, therefore its on default class on login.conf, > and all limits there are "unlimited". Might be worth checking that the limits are being set correctly by suing to the user (su -l if they have a real shell, to be certain). Could you be hitting the kern.maxprocperuid sysctl? It doesn't sound like you've enough processes to be hitting that. David. From owner-freebsd-stable@FreeBSD.ORG Wed May 13 11:12:30 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id EE099106566B for ; Wed, 13 May 2009 11:12:30 +0000 (UTC) (envelope-from dikshie@gmail.com) Received: from mail-qy0-f173.google.com (mail-qy0-f173.google.com [209.85.221.173]) by mx1.freebsd.org (Postfix) with ESMTP id AE04D8FC18 for ; Wed, 13 May 2009 11:12:30 +0000 (UTC) (envelope-from dikshie@gmail.com) Received: by qyk3 with SMTP id 3so1076462qyk.3 for ; Wed, 13 May 2009 04:12:30 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:from:date:message-id :subject:to:content-type:content-transfer-encoding; bh=zn+cpaKhOSoaC44Ev6UQG1iTg5Ov4Ui9SqRAr9JayDU=; b=q2gcDd4nBy5yei+2JLG11ia6wQhwopSWB/mEPthZS9nvMznsSx3g+8j1OkxOF/IUtt StAItshuEibuT/2kQX9LCvLl3PTRDZ5Mq5odw+/l4avhmxsy6a+dWOTSgvNM9CdZVNKM 16M/8b6/XxTCvoDm55lJ5cl/1RlgKA2TSPgJ8= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:from:date:message-id:subject:to:content-type :content-transfer-encoding; b=vkt27ArnmEEgpXIIha1ix9OeBaOrExJBFb3zgh4BTjPa7i8IBc4Ia4Zh6l10yliO1y fOAsmJRGsUboBoUu3i89CVMX/HrJ2J1TlOUvQE/27rdBXMnWa/O936F4QKzCIXhF+fTm jq0jOekhBaPS/Z8hLlVPULmE7w+7e9Y7N3EDs= MIME-Version: 1.0 Received: by 10.220.85.67 with SMTP id n3mr616489vcl.53.1242213150096; Wed, 13 May 2009 04:12:30 -0700 (PDT) From: dikshie Date: Wed, 13 May 2009 20:10:29 +0900 Message-ID: <910e60e80905130410h38a1dc70y23a26275dac51a31@mail.gmail.com> To: freebsd-stable@freebsd.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Subject: maximum mmap() X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 11:12:31 -0000 Hi, i found that my rrdtool does not work with mmap() with rra files size more than 2GB. my question: on i386 arch, what's maximum size of file to be able to mmap() ? do i have to change from i386 to amd64? or added 4GB RAM? thanks! -- -dikx- From owner-freebsd-stable@FreeBSD.ORG Wed May 13 12:04:20 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1730D1065803 for ; Wed, 13 May 2009 12:04:20 +0000 (UTC) (envelope-from nakaji@kankyo-u.ac.jp) Received: from www.heimat.gr.jp (unknown [IPv6:2001:3e0:a84::1]) by mx1.freebsd.org (Postfix) with ESMTP id A088F8FC20 for ; Wed, 13 May 2009 12:04:19 +0000 (UTC) (envelope-from nakaji@kankyo-u.ac.jp) X-Virus-Scanned: amavisd-new at heimat.gr.jp Received: from d4407.kankyo-u.ac.jp ([IPv6:2001:3e0:a84:2::2]) by www.heimat.gr.jp (8.14.3/8.14.3) with ESMTP id n4DC3wu8003631; Wed, 13 May 2009 21:04:09 +0900 (JST) (envelope-from nakaji@kankyo-u.ac.jp) Received: from roddy.4407.kankyo-u.ac.jp.kankyo-u.ac.jp (localhost [IPv6:::1]) by d4407.kankyo-u.ac.jp (8.14.3/8.14.3) with ESMTP id n4DBdFdK060552; Wed, 13 May 2009 20:39:16 +0900 (JST) (envelope-from nakaji@kankyo-u.ac.jp) Sender: nakaji@kankyo-u.ac.jp From: NAKAJI Hiroyuki To: freebsd-stable@freebsd.org References: <20090513040719.D17646@hub.org> Date: Wed, 13 May 2009 20:39:15 +0900 In-Reply-To: <20090513040719.D17646@hub.org> (Marc G. Fournier's message of "Wed, 13 May 2009 04:09:33 -0300 (ADT)") Message-ID: <87tz3pgrks.fsf@roddy.4407.kankyo-u.ac.jp> User-Agent: Gnus/5.110011 (No Gnus v0.11) Emacs/23.0.92 (berkeley-unix) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Virus-Scanned: by amavisd-new X-Spam-Status: No, score=-6.0 required=13.0 tests=AWL,BAYES_00, CONTENT_TYPE_PRESENT,NO_RELAYS autolearn=ham version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on www.heimat.gr.jp Cc: "Marc G. Fournier" Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 12:04:30 -0000 Marc, and folks, I have simillar "hang" problem on 6.4-STABLE and 7.2-STABLE servers, on which apache, squid, inn, named, isc-dhcpd and so on are running except DB servers. What kind of informations should I check to solve this annoying problem? I'm running munin-node on these machines, too. Thanks. >>>>> In <20090513040719.D17646@hub.org> >>>>> "Marc G. Fournier" wrote: > Don't know if this helps with anything, but it just hung after 2days > again ... nothing on the console ... top process running at the time > shows the following ... anything there look "concerning"? -- NAKAJI Hiroyuki From owner-freebsd-stable@FreeBSD.ORG Wed May 13 12:50:06 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 978C91065677 for ; Wed, 13 May 2009 12:50:06 +0000 (UTC) (envelope-from freebsd-stable@m.gmane.org) Received: from ciao.gmane.org (main.gmane.org [80.91.229.2]) by mx1.freebsd.org (Postfix) with ESMTP id 204038FC19 for ; Wed, 13 May 2009 12:50:06 +0000 (UTC) (envelope-from freebsd-stable@m.gmane.org) Received: from root by ciao.gmane.org with local (Exim 4.43) id 1M4DuN-0006fV-CD for freebsd-stable@freebsd.org; Wed, 13 May 2009 12:50:03 +0000 Received: from lara.cc.fer.hr ([161.53.72.113]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 13 May 2009 12:50:03 +0000 Received: from ivoras by lara.cc.fer.hr with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 13 May 2009 12:50:03 +0000 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-stable@freebsd.org From: Ivan Voras Date: Wed, 13 May 2009 14:46:26 +0200 Lines: 67 Message-ID: References: <2c2c47aa0905121110i6355930bwce3a9c6afb117d4d@mail.gmail.com> <200905131124.16897.milu@dat.pl> Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="------------enig8298E08ADBFA12A7A9FF1D3B" X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: lara.cc.fer.hr User-Agent: Thunderbird 2.0.0.21 (X11/20090409) In-Reply-To: <200905131124.16897.milu@dat.pl> X-Enigmail-Version: 0.95.7 Sender: news Subject: Re: File system corruption X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 12:50:07 -0000 This is an OpenPGP/MIME signed message (RFC 2440 and 3156) --------------enig8298E08ADBFA12A7A9FF1D3B Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Maciej Milewski wrote: > Tuesday 12 May 2009 20:10:57 Pat Wendorf napisa=C5=82(a): >> I have a co-lo server I've been maintaining for a few years now runnin= g IDE >> drives on a mostly terrible UPS. A few months ago, when it returned f= rom a >> power outage (running 6.2-R) I started noticing the following in my da= ily >> security email: >> >> Checking setuid files and devices: >> find: >> /var/db/portsnap/files/2dc95ddff37a8091239e83bf7e3ce5a2c285b027891ced1= 919d7 >> 6c9947c5b7db.gz: Bad file descriptor >> find: >> /var/db/portsnap/files/52abe8c91385b12272f13f4d20896067d9ba70bdec1fa25= 75025 >> 858bd3e93718.gz: Bad file descriptor >> find: /var/lost+found/#238237: Bad file descriptor >> >> I verified that these files return the same result when trying to do a= ny >> operation on them (including ls in the directory). >> >> I've managed to ignore the problem for a while now, and even upgraded = to >> 7.2, but I'm not sure if it will cause problems later on. So the ques= tion >> is, without access to the console, how would I fix this? >=20 >=20 >=20 > I think tere is a need for fsck on this partition. > /var is used by many daemons for logging, mailqueue etc., so maybe the = first=20 > thing to do would be to stop as many daemons as possible and leaving on= ly ssh=20 > to get to this system remotely? > I really don't know how much dangerous could be unmounting /var on a li= ve=20 > system in such case. Not critically dangerous - it can be done with care. Of course, it's best to reboot afterwards :) --------------enig8298E08ADBFA12A7A9FF1D3B Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iEYEARECAAYFAkoKwSIACgkQldnAQVacBchSkwCg1SBtT3r/LgrPdDUwD6oUB4jf gUYAnj+1S1/rcfVuW8XOaWink5XmDIk5 =3IrA -----END PGP SIGNATURE----- --------------enig8298E08ADBFA12A7A9FF1D3B-- From owner-freebsd-stable@FreeBSD.ORG Wed May 13 14:27:32 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1A8C4106564A for ; Wed, 13 May 2009 14:27:32 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id E20238FC0C for ; Wed, 13 May 2009 14:27:31 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 9511146B52; Wed, 13 May 2009 10:27:31 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 680718A029; Wed, 13 May 2009 10:27:30 -0400 (EDT) From: John Baldwin To: freebsd-stable@freebsd.org Date: Wed, 13 May 2009 10:09:00 -0400 User-Agent: KMail/1.9.7 References: <20090513040719.D17646@hub.org> In-Reply-To: <20090513040719.D17646@hub.org> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905131009.00403.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Wed, 13 May 2009 10:27:30 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: "Marc G. Fournier" Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 14:27:32 -0000 On Wednesday 13 May 2009 3:09:33 am Marc G. Fournier wrote: > > Don't know if this helps with anything, but it just hung after 2days again > ... nothing on the console ... top process running at the time shows the > following ... anything there look "concerning"? Is this a 2 CPU system? If so, both CPUs are actually running something, so it is not a deadlock per se. > 99402 www 1 96 0 163M 29892K CPU1 1 0:03 0.00% httpd > 13635 88 34 96 0 92340K 25604K CPU0 0 0:00 0.05% mysqld -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Wed May 13 14:27:33 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 42678106566B for ; Wed, 13 May 2009 14:27:33 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 15C7E8FC14 for ; Wed, 13 May 2009 14:27:33 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id BE5FA46B49; Wed, 13 May 2009 10:27:32 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 969098A025; Wed, 13 May 2009 10:27:31 -0400 (EDT) From: John Baldwin To: freebsd-stable@freebsd.org Date: Wed, 13 May 2009 10:11:44 -0400 User-Agent: KMail/1.9.7 References: <910e60e80905130410h38a1dc70y23a26275dac51a31@mail.gmail.com> In-Reply-To: <910e60e80905130410h38a1dc70y23a26275dac51a31@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905131011.44391.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Wed, 13 May 2009 10:27:31 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: dikshie Subject: Re: maximum mmap() X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 14:27:33 -0000 On Wednesday 13 May 2009 7:10:29 am dikshie wrote: > Hi, > i found that my rrdtool does not work with mmap() with rra files size > more than 2GB. > my question: on i386 arch, what's maximum size of file to be able to mmap() ? > do i have to change from i386 to amd64? or added 4GB RAM? The amount of RAM is not the issue, it is the size of the virtual address space. You can lower maxdsiz on i386 to leave more room for mmap, and you can also change KVA_PAGES in the kernel to leave more address space for userland than for the kernel perhaps, but you won't get a whole lot more space that way (you might be able to map 2.5GB or so). Moving to amd64 gives you a 64-bit virtual address space and you will be able to easily mmap() much, much more than 4GB out of the box. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Wed May 13 14:27:34 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5030F106564A for ; Wed, 13 May 2009 14:27:34 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 215BC8FC12 for ; Wed, 13 May 2009 14:27:34 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id CA85B46B5C; Wed, 13 May 2009 10:27:33 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id C77818A026; Wed, 13 May 2009 10:27:32 -0400 (EDT) From: John Baldwin To: pluknet Date: Wed, 13 May 2009 10:15:27 -0400 User-Agent: KMail/1.9.7 References: In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905131015.27431.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Wed, 13 May 2009 10:27:32 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-stable@freebsd.org Subject: Re: lock up in 6.2 (procs massively stuck in Giant) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 14:27:34 -0000 On Wednesday 13 May 2009 2:40:33 am pluknet wrote: > 2009/5/13 pluknet : > > 2009/5/13 John Baldwin : > >> On Tuesday 12 May 2009 4:59:19 pm pluknet wrote: > >>> Hi. > >>> > >>> From just another box (not from the first two mentioned earlier) > >>> with a similar locking issue. If it would make sense, since there are > >>> possibly a bit different conditions. > >>> clock proc here is on swi4, I hope it's a non-important difference. > >>> > >>> 18 0 0 0 LL *Giant 0xd0a6b140 [swi4: clock sio] > >>> db> bt 18 > >> > >> Ok, this is a known issue in 6.x. It is fixed in 6.4. > >> > > Looking at the face of kern_timeout.c I suspect that was fixed in r181012. No, this particular issue is fixed by a change to sched_4bsd.c in r179975. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Wed May 13 14:50:14 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id D26BC1065670 for ; Wed, 13 May 2009 14:50:14 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from hub.org (hub.org [200.46.204.220]) by mx1.freebsd.org (Postfix) with ESMTP id 9DD098FC19 for ; Wed, 13 May 2009 14:50:14 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from maia.hub.org (maia-4.hub.org [200.46.204.183]) by hub.org (Postfix) with ESMTP id ADE6153BC45; Wed, 13 May 2009 11:50:14 -0300 (ADT) Received: from hub.org ([200.46.204.220]) by maia.hub.org (mx1.hub.org [200.46.204.183]) (amavisd-maia, port 10024) with ESMTP id 56712-03; Wed, 13 May 2009 11:50:14 -0300 (ADT) Received: by hub.org (Postfix, from userid 1002) id 1DB4653BC5F; Wed, 13 May 2009 11:50:14 -0300 (ADT) Received: from localhost (localhost [127.0.0.1]) by hub.org (Postfix) with ESMTP id 1BE9F53BC53; Wed, 13 May 2009 11:50:14 -0300 (ADT) Date: Wed, 13 May 2009 11:50:14 -0300 (ADT) From: "Marc G. Fournier" To: John Baldwin In-Reply-To: <200905131009.00403.jhb@freebsd.org> Message-ID: <20090513114956.K17646@hub.org> References: <20090513040719.D17646@hub.org> <200905131009.00403.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Cc: freebsd-stable@freebsd.org Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 14:50:15 -0000 On Wed, 13 May 2009, John Baldwin wrote: > On Wednesday 13 May 2009 3:09:33 am Marc G. Fournier wrote: >> >> Don't know if this helps with anything, but it just hung after 2days again >> ... nothing on the console ... top process running at the time shows the >> following ... anything there look "concerning"? > > Is this a 2 CPU system? If so, both CPUs are actually running something, so > it is not a deadlock per se. Yes: CPU: Intel(R) Xeon(TM) CPU 3.40GHz (3400.14-MHz K8-class CPU) Origin = "GenuineIntel" Id = 0xf43 Stepping = 3 Features=0xbfebfbff Features2=0x649d AMD Features=0x20000800 Logical CPUs per core: 2 usable memory = 6368911360 (6073 MB) avail memory = 6141906944 (5857 MB) ACPI APIC Table: FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 6 ioapic1: Changing APIC ID to 9 ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email . scrappy@hub.org MSN . scrappy@hub.org Yahoo . yscrappy Skype: hub.org ICQ . 7615664 From owner-freebsd-stable@FreeBSD.ORG Wed May 13 15:02:58 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 73B7C1065670 for ; Wed, 13 May 2009 15:02:58 +0000 (UTC) (envelope-from mike@sentex.net) Received: from lava.sentex.ca (pyroxene.sentex.ca [199.212.134.18]) by mx1.freebsd.org (Postfix) with ESMTP id CC3378FC1A for ; Wed, 13 May 2009 15:02:57 +0000 (UTC) (envelope-from mike@sentex.net) Received: from mdt-xp.sentex.net (simeon.sentex.ca [192.168.43.27]) by lava.sentex.ca (8.14.3/8.14.3) with ESMTP id n4DF1XNt037860; Wed, 13 May 2009 11:01:33 -0400 (EDT) (envelope-from mike@sentex.net) Message-Id: <200905131501.n4DF1XNt037860@lava.sentex.ca> X-Mailer: QUALCOMM Windows Eudora Version 7.1.0.9 Date: Wed, 13 May 2009 11:03:04 -0400 To: "Marc G. Fournier" From: Mike Tancsa In-Reply-To: <20090513114956.K17646@hub.org> References: <20090513040719.D17646@hub.org> <200905131009.00403.jhb@freebsd.org> <20090513114956.K17646@hub.org> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed Cc: freebsd-stable@freebsd.org Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 15:02:58 -0000 At 10:50 AM 5/13/2009, Marc G. Fournier wrote: >On Wed, 13 May 2009, John Baldwin wrote: > >>On Wednesday 13 May 2009 3:09:33 am Marc G. Fournier wrote: >>> >>>Don't know if this helps with anything, but it just hung after 2days again >>>... nothing on the console ... top process running at the time shows the >>>following ... anything there look "concerning"? >> >>Is this a 2 CPU system? If so, both CPUs are actually running something, so >>it is not a deadlock per se. > >Yes: > >CPU: Intel(R) Xeon(TM) CPU 3.40GHz (3400.14-MHz K8-class CPU) > Origin = "GenuineIntel" Id = 0xf43 Stepping = 3 > >Features=0xbfebfbff > Features2=0x649d > AMD Features=0x20000800 > Logical CPUs per core: 2 >usable memory = 6368911360 (6073 MB) >avail memory = 6141906944 (5857 MB) >ACPI APIC Table: >FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs > cpu0 (BSP): APIC ID: 0 > cpu1 (AP): APIC ID: 6 >ioapic1: Changing APIC ID to 9 What does your kernel config look like ? ---Mike From owner-freebsd-stable@FreeBSD.ORG Wed May 13 15:41:24 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 11135106564A; Wed, 13 May 2009 15:41:24 +0000 (UTC) (envelope-from pluknet@gmail.com) Received: from mail-bw0-f213.google.com (mail-bw0-f213.google.com [209.85.218.213]) by mx1.freebsd.org (Postfix) with ESMTP id 606CB8FC24; Wed, 13 May 2009 15:41:23 +0000 (UTC) (envelope-from pluknet@gmail.com) Received: by bwz9 with SMTP id 9so726231bwz.43 for ; Wed, 13 May 2009 08:41:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=4NyppntDqUj0MUYQNMgaF7wagRZLU+K1stmvcBiaDtE=; b=OvToCQJELnicyrts9ZBZ6Y6jacI+RAxp+uWicToVlDv6EVCsJkEP4MwkPxO6dqGHTB zpLyjX0dzk1/s7sozzlRXw0nwYfkJ4a7RHM2aQ847bWS1rxb0+f0BlZoTAlOzaivYfOk rBVtwuk338/9S2OzuE4bc2iHE9Jaots4xP2Zo= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=lz08NFRCTah+vq3uz/9B8lRlL9hV2hUI8pxT5bHN7qcZ+QEUFD0/OoTyVqy94JCpRM 4MhZcVjDdy0/+WFX/3aInO2ZzxGSl1oft7HBxTzSRzXvLsxHEZ4nAUphCCCxniBJkiJp Cg2jW0WKhdbOkvCHS2bB58VRvn8Y1kklpHr6w= MIME-Version: 1.0 Received: by 10.103.240.15 with SMTP id s15mr766218mur.102.1242229282185; Wed, 13 May 2009 08:41:22 -0700 (PDT) In-Reply-To: <200905131015.27431.jhb@freebsd.org> References: <200905131015.27431.jhb@freebsd.org> Date: Wed, 13 May 2009 19:41:22 +0400 Message-ID: From: pluknet To: John Baldwin Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Cc: freebsd-stable@freebsd.org Subject: Re: lock up in 6.2 (procs massively stuck in Giant) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 15:41:24 -0000 2009/5/13 John Baldwin : > On Wednesday 13 May 2009 2:40:33 am pluknet wrote: >> 2009/5/13 pluknet : >> > 2009/5/13 John Baldwin : >> >> On Tuesday 12 May 2009 4:59:19 pm pluknet wrote: >> >>> Hi. >> >>> >> >>> From just another box (not from the first two mentioned earlier) >> >>> with a similar locking issue. If it would make sense, since there ar= e >> >>> possibly a bit different conditions. >> >>> clock proc here is on swi4, I hope it's a non-important difference. >> >>> >> >>> =A0 =A018 =A0 =A0 0 =A0 =A0 0 =A0 =A0 0 =A0LL =A0 =A0 *Giant =A0 =A0= 0xd0a6b140 [swi4: clock sio] >> >>> db> bt 18 >> >> >> >> Ok, this is a known issue in 6.x. =A0It is fixed in 6.4. >> >> >> >> Looking at the face of kern_timeout.c I suspect that was fixed in r18101= 2. > > No, this particular issue is fixed by a change to sched_4bsd.c in r179975= . > Gah.. We constrained to use ule scheduler on 6.x (yes, I know that "it's known to be broken (c)"), since we have had a very bad interactivity on 4bsd on our workload. Ok, that's just another reason to move to 7.x. Thanks. --=20 wbr, pluknet From owner-freebsd-stable@FreeBSD.ORG Wed May 13 16:31:40 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id ACD25106564A for ; Wed, 13 May 2009 16:31:40 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from hub.org (hub.org [200.46.204.220]) by mx1.freebsd.org (Postfix) with ESMTP id 7C7C48FC25 for ; Wed, 13 May 2009 16:31:40 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from maia.hub.org (maia-4.hub.org [200.46.204.183]) by hub.org (Postfix) with ESMTP id 9F92553BC77; Wed, 13 May 2009 13:31:39 -0300 (ADT) Received: from hub.org ([200.46.204.220]) by maia.hub.org (mx1.hub.org [200.46.204.183]) (amavisd-maia, port 10024) with ESMTP id 82731-03; Wed, 13 May 2009 13:31:39 -0300 (ADT) Received: by hub.org (Postfix, from userid 1002) id 5782253BC5F; Wed, 13 May 2009 13:31:39 -0300 (ADT) Received: from localhost (localhost [127.0.0.1]) by hub.org (Postfix) with ESMTP id 4F3C453BC59; Wed, 13 May 2009 13:31:39 -0300 (ADT) Date: Wed, 13 May 2009 13:31:39 -0300 (ADT) From: "Marc G. Fournier" To: Mike Tancsa In-Reply-To: <200905131501.n4DF1XNt037860@lava.sentex.ca> Message-ID: <20090513132952.T17646@hub.org> References: <20090513040719.D17646@hub.org> <200905131009.00403.jhb@freebsd.org> <20090513114956.K17646@hub.org> <200905131501.n4DF1XNt037860@lava.sentex.ca> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Cc: freebsd-stable@freebsd.org Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 16:31:40 -0000 On Wed, 13 May 2009, Mike Tancsa wrote: > > What does your kernel config look like ? Included below ... only thought I had, taht I haven't tried yet, was changing from SCHED_4BSD -> SCHED_ULE ... machine amd64 cpu HAMMER ident kernel options SMP options SCHED_4BSD # 4BSD scheduler options PREEMPTION # Enable kernel thread preemption options INET # InterNETworking options FFS # Berkeley Fast Filesystem options SOFTUPDATES options UFS_ACL # Support for access control lists options UFS_DIRHASH # Improve performance on big directories options PROCFS # Process filesystem (requires PSEUDOFS) options PSEUDOFS # Pseudo-filesystem framework options COMPAT_43 # Needed by COMPAT_LINUX32 options COMPAT_IA32 # Compatible with i386 binaries options COMPAT_FREEBSD4 # Compatible with FreeBSD4 options COMPAT_FREEBSD6 # Compatible with FreeBSD6 options COMPAT_LINUX32 # Compatible with i386 linux binaries options SCSI_DELAY=5000 # Delay (in ms) before probing SCSI options KTRACE # ktrace(1) support options SYSVSHM options SHMMAXPGS=199608 options SHMMAX=(SHMMAXPGS*PAGE_SIZE+1) options SYSVSEM options SEMMNI=4096 options SEMMNS=8192 options SYSVMSG # SYSV-style message queues options _KPOSIX_PRIORITY_SCHEDULING # POSIX P1003_1B real-time extensions options KBD_INSTALL_CDEV # install a CDEV entry in /dev options ADAPTIVE_GIANT # Giant mutex is adaptive. options LINPROCFS # Cannot be a module yet. # Bus support. device acpi device pci # Serial (COM) ports device sio # 8250, 16[45]50 based serial ports device scbus # SCSI bus (required for SCSI) device da # Direct Access (disks) device pass # Passthrough device (direct SCSI access) device ses # SCSI Environmental Services (and SAF-TE) device ciss # Compaq Smart RAID 5* device atkbdc # AT keyboard controller device atkbd # AT keyboard device psm # PS/2 mouse device vga # VGA video card driver device splash # Splash screen and screen saver support device sc device agp # support several AGP chipsets device miibus # MII bus support device bge # Broadcom BCM570xx Gigabit Ethernet device loop # Network loopback device random # Entropy device device ether # Ethernet support device pty # Pseudo-ttys (telnet etc) device bpf # Berkeley packet filter options ALT_BREAK_TO_DEBUGGER options KDB options DDB ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email . scrappy@hub.org MSN . scrappy@hub.org Yahoo . yscrappy Skype: hub.org ICQ . 7615664 From owner-freebsd-stable@FreeBSD.ORG Wed May 13 16:34:40 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 01E0E106566B; Wed, 13 May 2009 16:34:40 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from hub.org (hub.org [200.46.204.220]) by mx1.freebsd.org (Postfix) with ESMTP id C5C528FC0C; Wed, 13 May 2009 16:34:39 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from localhost (maia-1.hub.org [200.46.208.211]) by hub.org (Postfix) with ESMTP id 720A153BC70; Wed, 13 May 2009 13:34:39 -0300 (ADT) Received: from hub.org ([200.46.204.220]) by localhost (mx1.hub.org [200.46.208.211]) (amavisd-maia, port 10024) with ESMTP id 97777-04; Wed, 13 May 2009 13:34:37 -0300 (ADT) Received: by hub.org (Postfix, from userid 1002) id 1C17453BC5F; Wed, 13 May 2009 13:34:39 -0300 (ADT) Received: from localhost (localhost [127.0.0.1]) by hub.org (Postfix) with ESMTP id 1AE9053BC59; Wed, 13 May 2009 13:34:39 -0300 (ADT) Date: Wed, 13 May 2009 13:34:39 -0300 (ADT) From: "Marc G. Fournier" To: John Baldwin In-Reply-To: <200905131009.00403.jhb@freebsd.org> Message-ID: <20090513133143.M17646@hub.org> References: <20090513040719.D17646@hub.org> <200905131009.00403.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Cc: freebsd-stable@freebsd.org Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 16:34:40 -0000 On Wed, 13 May 2009, John Baldwin wrote: > On Wednesday 13 May 2009 3:09:33 am Marc G. Fournier wrote: >> >> Don't know if this helps with anything, but it just hung after 2days again >> ... nothing on the console ... top process running at the time shows the >> following ... anything there look "concerning"? > > Is this a 2 CPU system? If so, both CPUs are actually running something, so > it is not a deadlock per se. > >> 99402 www 1 96 0 163M 29892K CPU1 1 0:03 0.00% httpd >> 13635 88 34 96 0 92340K 25604K CPU0 0 0:00 0.05% mysqld Here is what vmstat shows ~10 minutes before (or as) it hung solid last time. I didn't think to save the one that ran just before this one (the script runs every 5 minutes), but for the 'r b w' columns 'b' was around 10ish, while 'w' was 0 ... within a 5 minute period of time, 'w' literally skyrockets: procs memory page disks faults cpu r b w avm fre flt re pi po fr sr da0 pa0 in sy cs us sy id 107 266 122 16155620 23084 3255 22 1 2 3358 1605 0 0 377 17835 5231 19 7 73 6 285 382 16446348 22532 111705 21155 1391 10049 51966 2187328 143 0 36344 499098 423971 3 2 95 0 73 386 16440468 23072 7052 1155 85 44 1292 73 372 0 1030 18631 8334 18 12 70 0 77 388 16440468 23088 126 1050 0 6 21 27 169 0 521 4186 4125 2 3 94 0 66 389 16440468 23104 4 713 0 13 44 58 227 0 352 2217 3504 0 5 95 > > -- > John Baldwin > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" > ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email . scrappy@hub.org MSN . scrappy@hub.org Yahoo . yscrappy Skype: hub.org ICQ . 7615664 From owner-freebsd-stable@FreeBSD.ORG Wed May 13 16:42:11 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 56D3D106566C for ; Wed, 13 May 2009 16:42:11 +0000 (UTC) (envelope-from byshenknet@byshenk.net) Received: from core.byshenk.net (core.byshenk.net [62.58.73.230]) by mx1.freebsd.org (Postfix) with ESMTP id B69478FC18 for ; Wed, 13 May 2009 16:42:10 +0000 (UTC) (envelope-from byshenknet@byshenk.net) Received: from core.byshenk.net (localhost.aoes.com [127.0.0.1]) by core.byshenk.net (8.14.3/8.14.3) with ESMTP id n4DGg8B3084494 for ; Wed, 13 May 2009 18:42:08 +0200 (CEST) (envelope-from byshenknet@core.byshenk.net) Received: (from byshenknet@localhost) by core.byshenk.net (8.14.3/8.14.3/Submit) id n4DGg83t084493 for freebsd-stable@freebsd.org; Wed, 13 May 2009 18:42:08 +0200 (CEST) (envelope-from byshenknet) Date: Wed, 13 May 2009 18:42:07 +0200 From: Greg Byshenk To: freebsd-stable@freebsd.org Message-ID: <20090513164207.GD67116@core.byshenk.net> References: <20090426125008.GK1550@core.byshenk.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20090426125008.GK1550@core.byshenk.net> User-Agent: Mutt/1.4.2.3i X-Spam-Status: No, score=-1.4 required=5.0 tests=ALL_TRUSTED autolearn=failed version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on core.byshenk.net Subject: Re: em0 watchdog timeout 7-stable X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 16:42:11 -0000 As a followup to my own previous message, I continue to have annoying problems with "em?: watchdog timeout" on one of my machines (now running 7.2-STABLE as of 2009-05-08). I have discontinued using the on-board (em, copper) NICs, and replaced the original fibre NIC with a newer model, but the problem persists. I've also set hw.pci.enable_msix=0 hw.pci.enable_msi=0 hw.em.rxd=1024 hw.em.txd=1024 net.inet.tcp.tso=0 ...as suggested in some discussions of this problem, and set the em1 interface to 'polling', all to no avail. Frequently, though irregularly (once or twice a day), the console begins to display em1: watchdog timeout -- resetting em1: watchdog timeout -- resetting em1: watchdog timeout -- resetting the nework is down, and the machine locks up. [Note: I am getting 'em1' now instead of 'em0' as previously, but this is due to changing all of the nics, which led to a different numbering; the timeout is still occurring on the (main) interface, the fibre gigabit connection.] What is particularly perverse (IMO) is that, since changing the NIC to the newer model (and updating the kernel), I can no longer break to the debugger when the lockup occurs (there is no response to the break) -- bit I _can_ shut the machine down cleanly via hardware (a touch of the power switch sends 'shutdown', and the machine shuts down cleanly -- after killing off processes waiting on network i/o). The machine is running nfs and samba (3.2.10, from ports), and pretty much nothing else. Anyone have any ideas about this...? I'm going mad with this. -greg byshenk # pciconf -lvb [...] em1@pci0:7:1:0: class=0x020000 card=0x10028086 chip=0x10118086 rev=0x01 hdr=0x00 vendor = 'Intel Corporation' device = '82545EM Gigabit Ethernet Controller (Fiber)' class = network subclass = ethernet bar [10] = type Memory, range 64, base 0xda300000, size 131072, enabled bar [20] = type I/O Port, range 32, base 0x5000, size 64, enabled [...] # vmstat -i interrupt total rate irq4: sio0 1666 0 irq6: fdc0 10 0 irq14: ata0 58 0 irq16: skc0 em0 1437801 98 irq18: twa0 846981 57 irq24: em1 4378650 299 cpu0: timer 29258004 1999 cpu1: timer 29249758 1999 cpu3: timer 29249816 1999 cpu7: timer 29249779 1999 cpu2: timer 29249729 1999 cpu4: timer 29249852 1999 cpu6: timer 29249851 1999 cpu5: timer 29249814 1999 Total 240671769 16450 On Sun, Apr 26, 2009 at 02:50:08PM +0200, Greg Byshenk wrote: > I have one machine that is seeing watchdog timeouts on em0, running 7-STABLE > amd64 as of 2009.04.19, and also some other more perverse errors. > > Twice now in the last 48 hours, this machine has become unreachable via the > network, and connecting to the console shows an endless string of > > [...] > em0: watchdog timeout -- resetting > em0: watchdog timeout -- resetting > em0: watchdog timeout -- resetting > > messages. The machine is almost locked up. That is, I can get a login > prompt, but can go no further than typing in a username; after the > username, no password prompt, and nothing further. The only option is > to hard reset the machine or to drop to debugger and reboot. > > Now the "perverse" part. After restarting, the system partition is no > more. > > Background detail: the machine is a fileserver, with a 3Ware 9650SE-16ML > SATA controller, connected to 16 1TB SATA drives, this configured as > a 14-drive RAID10 array (+ 2 hot spares), with a 50GB system partition > and 6.5TB data partition. The system partition is configured as da1, > with one slice and more or less standard partitions for / /var /tmp, etc. > (the data partition of the array is sliced with gpt). > > The issue here is that, upon restart, all parition information on da0 > seems to have disappeared, and restarting results in a "no operating > system found" message, and a failure to boot (obviously). > > But all of the data is still present. If I boot into rescue mode, > recreate da0s1, mark it bootable, and restore the bsdlabel, then > everything works again. I can restart the machine, and it comes back > up normally (it requires an fsck of everything on da0, but after that > everything is back to normal). > > I don't know if this is two unrelated problems, or one problem with > two symptoms, or something else. I think that I can safely say that > it is not a problem with the 3Ware controller itself, as I replaced > the controller with a spare (identical model), and the problem > recurred. Additionally, I have an almost-identical configuration on > four other machines, none of which are experiencing any problems. > One thing that is different is that the other machines use > Intel PRO/1000 PF (pci-e) NICs. > > Is there some known problem with the Intel 2572 fibre NIC? Or some > potential interaction of it with the 3ware RAID controller? > > For the moment, I've set hw.pci.enable_msi=0 (as discussed in the > threads on 7.2/bge), and am building a new kernel/world from sources > csup'd one hour ago, but I'd really like to hear any ideas about this > -- particularly the wiping of the label. > > Some information about the system: > > > # /dev/da0s1: > 8 partitions: > # size offset fstype [fsize bsize bps/cpg] > a: 2097152 0 4.2BSD 0 0 0 > b: 8388608 2097152 swap > c: 104856192 0 unused 0 0 # "raw" part, don't edit > d: 8388608 10485760 4.2BSD 0 0 0 > e: 2097152 18874368 4.2BSD 0 0 0 > f: 41943040 20971520 4.2BSD 0 0 0 > g: 41941632 62914560 4.2BSD 0 0 0 > > > em0@pci0:4:1:0: class=0x020000 card=0x10038086 chip=0x10018086 rev=0x02 hdr=0x00 > vendor = 'Intel Corporation'thernet Controller (Fiber)' > device = '2572 10/100/1000 Ethernet Controller (Fiber)' > class = networktory, range 32, base 0xda000000, size 131072, enabled > subclass = ethernetory, range 32, base 0xda000000, size 131072, enabled > bar [10] = type Memory, range 32, base 0xda000000, size 131072, enabled > bar [14] = type Memory, range 32, base 0xda020000, size 65536, enabled0x00 > > twa0@pci0:9:0:0: class=0x010400 card=0x100413c1 chip=0x100413c1 rev=0x01 hdr=0x00 > device = '9650SE Series PCI-Express SATA2 Raid Controller' > class = mass storage > subclass = RAID > bar [10] = type Prefetchable Memory, range 64, base 0xd8000000, size 33554432, enabled > bar [18] = type Memory, range 64, base 0xda300000, size 4096, enabled > bar [20] = type I/O Port, range 32, base 0x3000, size 256, enabled > cap 01[40] = powerspec 2 supports D0 D1 D2 D3 current D0 > cap 05[50] = MSI supports 32 messages, 64 bit > cap 10[70] = PCI-Express 1 legacy endpoint > -- greg byshenk - gbyshenk@byshenk.net - Leiden, NL From owner-freebsd-stable@FreeBSD.ORG Wed May 13 16:44:41 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 25BC71065675 for ; Wed, 13 May 2009 16:44:41 +0000 (UTC) (envelope-from byshenknet@byshenk.net) Received: from core.byshenk.net (core.byshenk.net [62.58.73.230]) by mx1.freebsd.org (Postfix) with ESMTP id 978FE8FC08 for ; Wed, 13 May 2009 16:44:40 +0000 (UTC) (envelope-from byshenknet@byshenk.net) Received: from core.byshenk.net (localhost.aoes.com [127.0.0.1]) by core.byshenk.net (8.14.3/8.14.3) with ESMTP id n4DGic48084558 for ; Wed, 13 May 2009 18:44:38 +0200 (CEST) (envelope-from byshenknet@core.byshenk.net) Received: (from byshenknet@localhost) by core.byshenk.net (8.14.3/8.14.3/Submit) id n4DGic4c084557 for freebsd-stable@freebsd.org; Wed, 13 May 2009 18:44:38 +0200 (CEST) (envelope-from byshenknet) Date: Wed, 13 May 2009 18:44:38 +0200 From: Greg Byshenk To: freebsd-stable@freebsd.org Message-ID: <20090513164438.GE67116@core.byshenk.net> References: <20090426125008.GK1550@core.byshenk.net> <20090513164207.GD67116@core.byshenk.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20090513164207.GD67116@core.byshenk.net> User-Agent: Mutt/1.4.2.3i X-Spam-Status: No, score=-1.4 required=5.0 tests=ALL_TRUSTED autolearn=failed version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on core.byshenk.net Subject: Re: em0 watchdog timeout 7-stable X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 16:44:41 -0000 On Wed, May 13, 2009 at 06:42:07PM +0200, Greg Byshenk wrote: > As a followup to my own previous message, I continue to have annoying > problems with "em?: watchdog timeout" on one of my machines (now running > 7.2-STABLE as of 2009-05-08). > > I have discontinued using the on-board (em, copper) NICs, and replaced > the original fibre NIC with a newer model, but the problem persists. > I've also set > > hw.pci.enable_msix=0 > hw.pci.enable_msi=0 > hw.em.rxd=1024 > hw.em.txd=1024 > net.inet.tcp.tso=0 > > ...as suggested in some discussions of this problem, and set the em1 > interface to 'polling', all to no avail. Frequently, though irregularly > (once or twice a day), the console begins to display > > em1: watchdog timeout -- resetting > em1: watchdog timeout -- resetting > em1: watchdog timeout -- resetting > > the nework is down, and the machine locks up. > > [Note: I am getting 'em1' now instead of 'em0' as previously, but this > is due to changing all of the nics, which led to a different numbering; > the timeout is still occurring on the (main) interface, the fibre > gigabit connection.] > > What is particularly perverse (IMO) is that, since changing the NIC to > the newer model (and updating the kernel), I can no longer break to the > debugger when the lockup occurs (there is no response to the break) -- > bit I _can_ shut the machine down cleanly via hardware (a touch of the > power switch sends 'shutdown', and the machine shuts down cleanly -- > after killing off processes waiting on network i/o). > > The machine is running nfs and samba (3.2.10, from ports), and pretty > much nothing else. > > > Anyone have any ideas about this...? I'm going mad with this. Just as an FYI, the drive errors I described in my previous message appear to have been due to a bad BBU on the RAID controller, and to have been resolved. -- greg byshenk - gbyshenk@byshenk.net - Leiden, NL From owner-freebsd-stable@FreeBSD.ORG Wed May 13 16:52:31 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E02631065672 for ; Wed, 13 May 2009 16:52:31 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id B14328FC1F for ; Wed, 13 May 2009 16:52:31 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 4F90346B2E; Wed, 13 May 2009 12:52:31 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 3A0F68A025; Wed, 13 May 2009 12:52:30 -0400 (EDT) From: John Baldwin To: pluknet Date: Wed, 13 May 2009 12:48:08 -0400 User-Agent: KMail/1.9.7 References: <200905131015.27431.jhb@freebsd.org> In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Message-Id: <200905131248.08465.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Wed, 13 May 2009 12:52:30 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-stable@freebsd.org Subject: Re: lock up in 6.2 (procs massively stuck in Giant) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 16:52:32 -0000 On Wednesday 13 May 2009 11:41:22 am pluknet wrote: > 2009/5/13 John Baldwin : > > On Wednesday 13 May 2009 2:40:33 am pluknet wrote: > >> 2009/5/13 pluknet : > >> > 2009/5/13 John Baldwin : > >> >> On Tuesday 12 May 2009 4:59:19 pm pluknet wrote: > >> >>> Hi. > >> >>> > >> >>> From just another box (not from the first two mentioned earlier) > >> >>> with a similar locking issue. If it would make sense, since there = are > >> >>> possibly a bit different conditions. > >> >>> clock proc here is on swi4, I hope it's a non-important difference. > >> >>> > >> >>> =A0 =A018 =A0 =A0 0 =A0 =A0 0 =A0 =A0 0 =A0LL =A0 =A0 *Giant =A0 = =A00xd0a6b140 [swi4: clock=20 sio] > >> >>> db> bt 18 > >> >> > >> >> Ok, this is a known issue in 6.x. =A0It is fixed in 6.4. > >> >> > >> > >> Looking at the face of kern_timeout.c I suspect that was fixed in=20 r181012. > > > > No, this particular issue is fixed by a change to sched_4bsd.c in r1799= 75. > > >=20 > Gah.. We constrained to use ule scheduler on 6.x (yes, I know that > "it's known to be broken (c)"), since we have had a very bad interactivity > on 4bsd on our workload. Ok, that's just another reason to move to 7.x. Hmmm I would have thought ULE wouldn't have suffered from this bug. The=20 problem on 4BSD was if softclock ever blocked on Giant and the thread that= =20 held Giant was on a run queue and pinned to a specific CPU but that another= =20 userland thread was running on that CPU already, the userland thread would= =20 never yield the CPU so long as it kept busy since the round robin timeout=20 would never run. =2D-=20 John Baldwin From owner-freebsd-stable@FreeBSD.ORG Wed May 13 16:52:32 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B5E331065677 for ; Wed, 13 May 2009 16:52:32 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 883448FC0A for ; Wed, 13 May 2009 16:52:32 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 3E69D46B51; Wed, 13 May 2009 12:52:32 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 1E48F8A026; Wed, 13 May 2009 12:52:31 -0400 (EDT) From: John Baldwin To: "Marc G. Fournier" Date: Wed, 13 May 2009 12:52:14 -0400 User-Agent: KMail/1.9.7 References: <20090513040719.D17646@hub.org> <200905131009.00403.jhb@freebsd.org> <20090513133143.M17646@hub.org> In-Reply-To: <20090513133143.M17646@hub.org> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905131252.15171.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Wed, 13 May 2009 12:52:31 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-stable@freebsd.org Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 16:52:33 -0000 On Wednesday 13 May 2009 12:34:39 pm Marc G. Fournier wrote: > On Wed, 13 May 2009, John Baldwin wrote: > > > On Wednesday 13 May 2009 3:09:33 am Marc G. Fournier wrote: > >> > >> Don't know if this helps with anything, but it just hung after 2days again > >> ... nothing on the console ... top process running at the time shows the > >> following ... anything there look "concerning"? > > > > Is this a 2 CPU system? If so, both CPUs are actually running something, so > > it is not a deadlock per se. > > > >> 99402 www 1 96 0 163M 29892K CPU1 1 0:03 0.00% httpd > >> 13635 88 34 96 0 92340K 25604K CPU0 0 0:00 0.05% mysqld > > Here is what vmstat shows ~10 minutes before (or as) it hung solid last > time. I didn't think to save the one that ran just before this one (the > script runs every 5 minutes), but for the 'r b w' columns 'b' was around > 10ish, while 'w' was 0 ... within a 5 minute period of time, 'w' > literally skyrockets: > > procs memory page disks faults > cpu > r b w avm fre flt re pi po fr sr da0 pa0 in sy cs us sy id > 107 266 122 16155620 23084 3255 22 1 2 3358 1605 0 0 377 17835 5231 19 7 73 > 6 285 382 16446348 22532 111705 21155 1391 10049 51966 2187328 143 0 36344 499098 423971 3 2 95 > 0 73 386 16440468 23072 7052 1155 85 44 1292 73 372 0 1030 18631 8334 18 12 70 > 0 77 388 16440468 23088 126 1050 0 6 21 27 169 0 521 4186 4125 2 3 94 > 0 66 389 16440468 23104 4 713 0 13 44 58 227 0 352 2217 3504 0 5 95 Well, you had a whole lot of page faults and other VM activity, plus 500k syscalls. The 'w' is a count of swapped processes, so basically your box is swapping a whole lot it seems. I think your box is just overloaded. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Wed May 13 17:14:21 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5F00010656FD for ; Wed, 13 May 2009 17:14:21 +0000 (UTC) (envelope-from mike@sentex.net) Received: from lava.sentex.ca (pyroxene.sentex.ca [199.212.134.18]) by mx1.freebsd.org (Postfix) with ESMTP id 2A6F38FC2E for ; Wed, 13 May 2009 17:14:20 +0000 (UTC) (envelope-from mike@sentex.net) Received: from mdt-xp.sentex.net (simeon.sentex.ca [192.168.43.27]) by lava.sentex.ca (8.14.3/8.14.3) with ESMTP id n4DHCrBo038504; Wed, 13 May 2009 13:12:53 -0400 (EDT) (envelope-from mike@sentex.net) Message-Id: <200905131712.n4DHCrBo038504@lava.sentex.ca> X-Mailer: QUALCOMM Windows Eudora Version 7.1.0.9 Date: Wed, 13 May 2009 13:14:26 -0400 To: "Marc G. Fournier" From: Mike Tancsa In-Reply-To: <20090513132952.T17646@hub.org> References: <20090513040719.D17646@hub.org> <200905131009.00403.jhb@freebsd.org> <20090513114956.K17646@hub.org> <200905131501.n4DF1XNt037860@lava.sentex.ca> <20090513132952.T17646@hub.org> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed Cc: freebsd-stable@freebsd.org Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 17:14:22 -0000 At 12:31 PM 5/13/2009, Marc G. Fournier wrote: >On Wed, 13 May 2009, Mike Tancsa wrote: > >> >>What does your kernel config look like ? > >Included below ... only thought I had, taht I haven't tried yet, was >changing from SCHED_4BSD -> SCHED_ULE ... ULE for sure. Are you sure some of the options below are still valid/wanted as well ? >options SYSVSHM >options SHMMAXPGS=199608 >options SHMMAX=(SHMMAXPGS*PAGE_SIZE+1) > >options SYSVSEM >options SEMMNI=4096 >options SEMMNS=8192 > >options SYSVMSG # SYSV-style message queues > >options _KPOSIX_PRIORITY_SCHEDULING # POSIX P1003_1B >real-time extensions >options KBD_INSTALL_CDEV # install a CDEV entry in /dev > >options ADAPTIVE_GIANT # Giant mutex is adaptive. > >options LINPROCFS # Cannot be a module yet. ---Mike From owner-freebsd-stable@FreeBSD.ORG Wed May 13 17:18:32 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 934961065679 for ; Wed, 13 May 2009 17:18:32 +0000 (UTC) (envelope-from cswiger@mac.com) Received: from asmtpout021.mac.com (asmtpout021.mac.com [17.148.16.96]) by mx1.freebsd.org (Postfix) with ESMTP id 77AE68FC21 for ; Wed, 13 May 2009 17:18:28 +0000 (UTC) (envelope-from cswiger@mac.com) MIME-version: 1.0 Content-transfer-encoding: 7BIT Content-type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Received: from cswiger1.apple.com ([17.227.140.124]) by asmtp021.mac.com (Sun Java(tm) System Messaging Server 6.3-8.01 (built Dec 16 2008; 32bit)) with ESMTPSA id <0KJL00KIXEQSA120@asmtp021.mac.com>; Wed, 13 May 2009 10:18:28 -0700 (PDT) Message-id: <0ACD7522-58D7-4D70-8F79-77791DD98299@mac.com> From: Chuck Swiger To: John Baldwin , "Marc G. Fournier" In-reply-to: <200905131252.15171.jhb@freebsd.org> Date: Wed, 13 May 2009 10:18:27 -0700 References: <20090513040719.D17646@hub.org> <200905131009.00403.jhb@freebsd.org> <20090513133143.M17646@hub.org> <200905131252.15171.jhb@freebsd.org> X-Mailer: Apple Mail (2.930.3) Cc: FreeBSD Stable List Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 17:18:33 -0000 Hi-- On May 13, 2009, at 9:52 AM, John Baldwin wrote: [ ... ] > Well, you had a whole lot of page faults and other VM activity, plus > 500k > syscalls. The 'w' is a count of swapped processes, so basically > your box is > swapping a whole lot it seems. I think your box is just overloaded. Yep. There's a classic failure mode with preforking Apache where if the number of requests exceeds the max number of children which can be run without sending the system into heavy swapping, the server will effectively grind to a halt. Start tuning by figuring out how much RAM you have available to Apache after system processes and things like your Postgres (and MySQL?) server are accounted for, divide by the the RES size, and set MaxChildren to that in httpd.conf. Also, the process size of named is fairly astonishing: PID USERNAME THR PRI NICE SIZE RES STATE C TIME WCPU COMMAND 28752 root 5 96 0 427M 408M select 1 1:55 0.00% named 9720 nobody 19 97 0 402M 186M RUN 1 0:00 0.69% nsd ...even if you are running the process in 64-bit mode. This can be brought under control by tuning recursive-clients and max-cache-size options. Also, is nsd process from ports/dns/nsd-- ie, another nameserver? Your machine is being overloaded by running too much stuff that's duplicating functionality; it's just not a good idea to try to run different types of databases on the same machine; they'll fight with the VM system and each other, making your VM system trash.... -- -Chuck From owner-freebsd-stable@FreeBSD.ORG Wed May 13 17:44:56 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5804E106564A; Wed, 13 May 2009 17:44:56 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from hub.org (hub.org [200.46.204.220]) by mx1.freebsd.org (Postfix) with ESMTP id 08EA58FC0A; Wed, 13 May 2009 17:44:55 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from maia.hub.org (maia-4.hub.org [200.46.204.183]) by hub.org (Postfix) with ESMTP id 8717653BC93; Wed, 13 May 2009 14:44:55 -0300 (ADT) Received: from hub.org ([200.46.204.220]) by maia.hub.org (mx1.hub.org [200.46.204.183]) (amavisd-maia, port 10024) with ESMTP id 98813-01; Wed, 13 May 2009 14:44:55 -0300 (ADT) Received: by hub.org (Postfix, from userid 1002) id 3172053BC8B; Wed, 13 May 2009 14:44:55 -0300 (ADT) Received: from localhost (localhost [127.0.0.1]) by hub.org (Postfix) with ESMTP id 2DA9153BC7F; Wed, 13 May 2009 14:44:55 -0300 (ADT) Date: Wed, 13 May 2009 14:44:55 -0300 (ADT) From: "Marc G. Fournier" To: John Baldwin In-Reply-To: <200905131252.15171.jhb@freebsd.org> Message-ID: <20090513142806.V17646@hub.org> References: <20090513040719.D17646@hub.org> <200905131009.00403.jhb@freebsd.org> <20090513133143.M17646@hub.org> <200905131252.15171.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Cc: freebsd-stable@freebsd.org Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 17:44:56 -0000 On Wed, 13 May 2009, John Baldwin wrote: > Well, you had a whole lot of page faults and other VM activity, plus 500k > syscalls. The 'w' is a count of swapped processes, so basically your box is > swapping a whole lot it seems. I think your box is just overloaded. I knew I was going to regret posting that :( What I posted was what vmstat 5 shows after the issue *starts*, not what it normally looks like ... right now, after 10 hours of uptime, and all the same processes running, it looks like: io# vmstat 5 (10 hours uptime now) procs memory page disks faults cpu r b w avm fre flt re pi po fr sr da0 pa0 in sy cs us sy id 0 1 0 10477M 301M 3503 13 1 2 3620 286 0 0 331 45491 4566 26 8 66 0 1 0 10430M 305M 278 7 0 0 550 0 18 0 186 19243 2917 4 3 93 1 1 0 10474M 295M 511 0 0 0 359 0 91 0 253 11632 3516 7 3 90 0 1 0 10447M 310M 819 3 0 0 1473 0 14 0 143 29575 2486 8 3 89 0 1 0 10558M 295M 5008 18 13 5 4128 0 121 0 345 24212 4215 16 7 77 Right now, IO is running ~775 processes ... at the time of the vmstat I provided earlier, it was up to 1400 processes ... since there is only 5 minutes between script runs, something is causing it to go from zero swap -> high swap within a very short period of time, but since things get badly locked up when it happens, I can't isolate where ... I've got the following two ps outputs at the time of the high paging: /bin/ps -aucxHl -O jid > ps-long.out /bin/ps -aux -O jid > ps-short.out Is there anything in there that I could look at as far as what is putting things over the edge? ==== As to the 'overloaded server', here is another server, with more running on it, but exact same configuration: neptune# vmstat 5 (3 days, 18 hours uptime now) procs memory page disks faults cpu r b w avm fre flt re pi po fr sr da0 pa0 in sy cs us sy id 0 0 0 12521M 303M 3969 15 5 3 2271 1603 0 0 444 6491 5165 37 19 44 0 0 0 12464M 309M 3009 1 0 15 2833 0 104 0 296 9378 3689 7 5 88 23 0 0 12476M 297M 3845 3 0 0 2627 0 31 0 279 10545 2986 14 5 81 0 1 0 12530M 266M 5259 0 1 0 2551 0 145 0 432 18070 4133 45 8 47 1 0 0 12587M 237M 7049 0 1 0 4484 0 171 0 357 15953 4715 29 7 64 So, normally these servers purr ... and are highly responsive ... In fact, here is an older 32bit server, less RAM, run about 50% more processes then neptune: mercury# vmstat 5 procs memory page disks faults cpu r b w avm fre flt re pi po fr sr da0 pa0 in sy cs us sy id 3 14 1 6817M 114M 641 7 3 1 1036 386 0 0 1109 464 157 5 5 90 0 8 0 6817M 224M 596 33 0 5 5667 3850 86 0 1303 5768 3885 6 7 87 1 10 0 6824M 220M 4332 32 2 0 3228 0 17 0 755 9689 3057 8 7 85 0 9 0 6798M 219M 430 0 0 0 712 0 12 0 1274 4276 3877 2 2 95 0 11 0 6830M 205M 1026 4 1 3 481 0 84 0 1503 5586 4370 6 4 89 ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email . scrappy@hub.org MSN . scrappy@hub.org Yahoo . yscrappy Skype: hub.org ICQ . 7615664 From owner-freebsd-stable@FreeBSD.ORG Wed May 13 18:22:47 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 291B5106564A; Wed, 13 May 2009 18:22:47 +0000 (UTC) (envelope-from prvs=1384594dfb=killing@multiplay.co.uk) Received: from mail1.multiplay.co.uk (core6.multiplay.co.uk [85.236.96.23]) by mx1.freebsd.org (Postfix) with ESMTP id 7C0CF8FC0A; Wed, 13 May 2009 18:22:46 +0000 (UTC) (envelope-from prvs=1384594dfb=killing@multiplay.co.uk) DKIM-Signature: v=1; a=rsa-sha256; c=simple; d=multiplay.co.uk; s=Multiplay; t=1242238313; x=1242843113; q=dns/txt; h=Received: Message-ID:From:To:Cc:References:Subject:Date:MIME-Version: Content-Type:Content-Transfer-Encoding; bh=Axnf3M+++RYd95MwtAj4H XNoXx54rzNQZ1jx8cSvhMM=; b=kUovAswIeEveP575Uz7qzx6Rwa0/Sgqqhe6pB ev15J9ev8u15EGxUIRgF5/cVJTXqVsoTsT8B304RLeuUO/gVGyllhOgVyQs3FGRc gU5IEfnYyVf2MS2zmNQrZJBPD2FmCZ2nM/zCw2aZUUJSGuJ08zqDd2BCeaO0Uhgk DGLzZY= X-MDAV-Processed: mail1.multiplay.co.uk, Wed, 13 May 2009 19:11:53 +0100 Received: from r2d2 by mail1.multiplay.co.uk (MDaemon PRO v10.0.4) with ESMTP id md50007508811.msg; Wed, 13 May 2009 19:11:53 +0100 X-Spam-Processed: mail1.multiplay.co.uk, Wed, 13 May 2009 19:11:53 +0100 (not processed: message from trusted or authenticated source) X-Authenticated-Sender: Killing@multiplay.co.uk X-MDRemoteIP: 85.236.106.102 X-Return-Path: prvs=1384594dfb=killing@multiplay.co.uk X-Envelope-From: killing@multiplay.co.uk Message-ID: <6670CCCC546E45E0A28628C45B37A074@multiplay.co.uk> From: "Steven Hartland" To: "Marc G. Fournier" , "John Baldwin" References: <20090513040719.D17646@hub.org><200905131009.00403.jhb@freebsd.org><20090513133143.M17646@hub.org> <200905131252.15171.jhb@freebsd.org> <20090513142806.V17646@hub.org> Date: Wed, 13 May 2009 19:12:01 +0100 MIME-Version: 1.0 Content-Type: text/plain; format=flowed; charset="iso-8859-1"; reply-type=response Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2900.5512 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5579 Cc: freebsd-stable@freebsd.org Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 18:22:47 -0000 ----- Original Message ----- From: "Marc G. Fournier" > Right now, IO is running ~775 processes ... at the time of the vmstat I > provided earlier, it was up to 1400 processes ... since there is only 5 > minutes between script runs, something is causing it to go from zero swap > -> high swap within a very short period of time, but since things get > badly locked up when it happens, I can't isolate where ... We've seen things similar to this when an process uncommon process does a query which locks the a table for a large amount of time on mysql. In our example this turned out to be an admin query in vbulletin. When it happened it turned a machine which was purring along quite nicely into a totally unresponsive machine in a matter of a few seconds as apache spawned more process that also then instantly stalled... So its likely the overloaded diagnosis could well be correct, but as you say it hard to diagnose with the machine in such an unusable state. Reducing max handlers etc on apache will help with this prevent the machine becoming unusable and hence give you time and resources to determine what the real issue is. Regards Steve ================================================ This e.mail is private and confidential between Multiplay (UK) Ltd. and the person or entity to whom it is addressed. In the event of misdirection, the recipient is prohibited from using, copying, printing or otherwise disseminating it or any information contained in it. In the event of misdirection, illegible or incomplete transmission please telephone +44 845 868 1337 or return the E.mail to postmaster@multiplay.co.uk. From owner-freebsd-stable@FreeBSD.ORG Wed May 13 19:14:54 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 34544106566C; Wed, 13 May 2009 19:14:54 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from hub.org (hub.org [200.46.204.220]) by mx1.freebsd.org (Postfix) with ESMTP id 023348FC13; Wed, 13 May 2009 19:14:53 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from maia.hub.org (maia-4.hub.org [200.46.204.183]) by hub.org (Postfix) with ESMTP id F125453BC6F; Wed, 13 May 2009 16:14:51 -0300 (ADT) Received: from hub.org ([200.46.204.220]) by maia.hub.org (mx1.hub.org [200.46.204.183]) (amavisd-maia, port 10024) with ESMTP id 28845-06; Wed, 13 May 2009 16:14:51 -0300 (ADT) Received: by hub.org (Postfix, from userid 1002) id B4C1A53BC68; Wed, 13 May 2009 16:14:51 -0300 (ADT) Received: from localhost (localhost [127.0.0.1]) by hub.org (Postfix) with ESMTP id A3C6A53BC63; Wed, 13 May 2009 16:14:51 -0300 (ADT) Date: Wed, 13 May 2009 16:14:51 -0300 (ADT) From: "Marc G. Fournier" To: Steven Hartland In-Reply-To: <6670CCCC546E45E0A28628C45B37A074@multiplay.co.uk> Message-ID: <20090513161014.A17646@hub.org> References: <20090513040719.D17646@hub.org><200905131009.00403.jhb@freebsd.org><20090513133143.M17646@hub.org> <200905131252.15171.jhb@freebsd.org> <20090513142806.V17646@hub.org> <6670CCCC546E45E0A28628C45B37A074@multiplay.co.uk> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Cc: freebsd-stable@freebsd.org, John Baldwin Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 19:14:54 -0000 On Wed, 13 May 2009, Steven Hartland wrote: > We've seen things similar to this when an process uncommon process does > a query which locks the a table for a large amount of time on mysql. Sooooo many reasons why I hate MySQL :( One thing that we are trying right now is actually along these lines ... we've been working with MySQL 5.1 + NDBD for clustering ... after the last hang, we disabled both the NDBD startup, and mysql, to see if that is the cause, so nice to have some validation on this one ... > In our example this turned out to be an admin query in vbulletin. When > it happened it turned a machine which was purring along quite nicely > into a totally unresponsive machine in a matter of a few seconds as > apache spawned more process that also then instantly stalled... Let me check that the next time around ... compare the specific # of http processes between monitor runs and see if there is a 'sudden jump' ... We'll see hwo the next 'test period' works out, with that MySQL stuff offline ... the other thing I've been working on is moving jails off of that server, one at a time, to see if I can narrow down which one is causing the spike ... I will focus on the mysql backend ones going forward, to eliminate those ... Thx ... ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email . scrappy@hub.org MSN . scrappy@hub.org Yahoo . yscrappy Skype: hub.org ICQ . 7615664 From owner-freebsd-stable@FreeBSD.ORG Wed May 13 19:28:10 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 6BAB21065693; Wed, 13 May 2009 19:28:10 +0000 (UTC) (envelope-from tijl@ulyssis.org) Received: from mailrelay012.isp.belgacom.be (mailrelay012.isp.belgacom.be [195.238.6.179]) by mx1.freebsd.org (Postfix) with ESMTP id D65A18FC18; Wed, 13 May 2009 19:28:09 +0000 (UTC) (envelope-from tijl@ulyssis.org) X-Belgacom-Dynamic: yes X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AkkFACe0CkpR927y/2dsb2JhbACBUM5JhAIF Received: from 242.110-247-81.adsl-dyn.isp.belgacom.be (HELO kalimero.kotnet.org) ([81.247.110.242]) by relay.skynet.be with ESMTP; 13 May 2009 20:58:26 +0200 Received: from kalimero.kotnet.org (kalimero.kotnet.org [127.0.0.1]) by kalimero.kotnet.org (8.14.3/8.14.3) with ESMTP id n4DIwQZV006056; Wed, 13 May 2009 20:58:26 +0200 (CEST) (envelope-from tijl@ulyssis.org) From: Tijl Coosemans To: freebsd-stable@freebsd.org, John Baldwin , dikshie Date: Wed, 13 May 2009 20:58:24 +0200 User-Agent: KMail/1.9.10 References: <910e60e80905130410h38a1dc70y23a26275dac51a31@mail.gmail.com> <200905131011.44391.jhb@freebsd.org> In-Reply-To: <200905131011.44391.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905132058.25782.tijl@ulyssis.org> Cc: Subject: Re: maximum mmap() X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 19:28:11 -0000 On Wednesday 13 May 2009 16:11:44 John Baldwin wrote: > On Wednesday 13 May 2009 7:10:29 am dikshie wrote: >> i found that my rrdtool does not work with mmap() with rra files >> size more than 2GB. >> my question: on i386 arch, what's maximum size of file to be able >> to mmap() ? do i have to change from i386 to amd64? or added 4GB >> RAM? > > The amount of RAM is not the issue, it is the size of the virtual > address space. You can lower maxdsiz on i386 to leave more room for > mmap, and you can also change KVA_PAGES in the kernel to leave more > address space for userland than for the kernel perhaps, but you won't > get a whole lot more space that way (you might be able to map 2.5GB > or so). Moving to amd64 gives you a 64-bit virtual address space and > you will be able to easily mmap() much, much more than 4GB out of the > box. On a default i386 system it should be possible to mmap files larger then 2GiB, but for some reason mmap treats sizes above 0x7fffffff as an error and returns EINVAL: if ((ssize_t) uap->len < 0 || ((flags & MAP_ANON) && uap->fd != -1)) return (EINVAL); The only way around this is to mmap the entire file with two mmap calls. From owner-freebsd-stable@FreeBSD.ORG Wed May 13 20:37:12 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 7CDD7106564A for ; Wed, 13 May 2009 20:37:12 +0000 (UTC) (envelope-from dungeons@gmail.com) Received: from yx-out-2324.google.com (yx-out-2324.google.com [74.125.44.29]) by mx1.freebsd.org (Postfix) with ESMTP id 373298FC08 for ; Wed, 13 May 2009 20:37:11 +0000 (UTC) (envelope-from dungeons@gmail.com) Received: by yx-out-2324.google.com with SMTP id 8so491647yxb.13 for ; Wed, 13 May 2009 13:37:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type; bh=V7SxtfuFV7JEglK+F/qL2sUyO+YZBE9UUG66E5ZFFj0=; b=GAkw4XXQXqdZ08qATdpavibUJQV3TaDkAJVASueh10F4a1HKuHhpIbF3IV4r6c6K5i Z6Q3m3yT7OFPEViiVJupoQChs5HsNoWNguVMB+Y0RkAPZT3QwRMz6G7tW6RgbTXOYSoa 3HUERflVuqMy6DmyWDDhm9z2AKviyhlzCFMgs= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; b=YqsZlzyCBQN8mCJTY6VxFw3fnUSUcytnsoqB6uWqqr2fIHJ/++0MVG7Z7668wm6FP7 hLawmQvpUnU0i459dp0GjyLGi9YXiSuBJYXw/HbI5/7pOKrTubxF27Rh1drDXvXv6Few hgSxwGwARLka34doBB9gwI1IUPujzH0Y+EfBY= MIME-Version: 1.0 Received: by 10.100.46.12 with SMTP id t12mr1827297ant.55.1242247031087; Wed, 13 May 2009 13:37:11 -0700 (PDT) In-Reply-To: <200905131124.16897.milu@dat.pl> References: <2c2c47aa0905121110i6355930bwce3a9c6afb117d4d@mail.gmail.com> <200905131124.16897.milu@dat.pl> Date: Wed, 13 May 2009 16:37:09 -0400 Message-ID: <2c2c47aa0905131337w4a338386t2407f7df7a398cf7@mail.gmail.com> From: Pat Wendorf To: Maciej Milewski Content-Type: text/plain; charset=ISO-8859-2 Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: freebsd-stable@freebsd.org Subject: Re: File system corruption X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 20:37:12 -0000 I spoke too soon I guess: A buddy of mine at the hosting provider took down the box and did a fsck -y on the var partition, this seems to have cleaned it up. It looks like the regular fsck -p could not repair it. 2009/5/13 Maciej Milewski > Tuesday 12 May 2009 20:10:57 Pat Wendorf napisa=B3(a): > > > I have a co-lo server I've been maintaining for a few years now running > IDE > > drives on a mostly terrible UPS. A few months ago, when it returned fro= m > a > > power outage (running 6.2-R) I started noticing the following in my dai= ly > > security email: > > > > Checking setuid files and devices: > > find: > > > /var/db/portsnap/files/2dc95ddff37a8091239e83bf7e3ce5a2c285b027891ced1919= d7 > >6c9947c5b7db.gz: Bad file descriptor > > find: > > > /var/db/portsnap/files/52abe8c91385b12272f13f4d20896067d9ba70bdec1fa25750= 25 > >858bd3e93718.gz: Bad file descriptor > > find: /var/lost+found/#238237: Bad file descriptor > > > > I verified that these files return the same result when trying to do an= y > > operation on them (including ls in the directory). > > > > I've managed to ignore the problem for a while now, and even upgraded t= o > > 7.2, but I'm not sure if it will cause problems later on. So the questi= on > > is, without access to the console, how would I fix this? > > > I think tere is a need for fsck on this partition. > /var is used by many daemons for logging, mailqueue etc., so maybe the > first thing to do would be to stop as many daemons as possible and leavin= g > only ssh to get to this system remotely? > I really don't know how much dangerous could be unmounting /var on a live > system in such case. > > > > > -- > Pozdrawiam, > Maciej Milewski > From owner-freebsd-stable@FreeBSD.ORG Wed May 13 20:47:17 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id BAAAC1065690 for ; Wed, 13 May 2009 20:47:17 +0000 (UTC) (envelope-from andrew@modulus.org) Received: from email.octopus.com.au (email.octopus.com.au [122.100.2.232]) by mx1.freebsd.org (Postfix) with ESMTP id 7993F8FC15 for ; Wed, 13 May 2009 20:47:16 +0000 (UTC) (envelope-from andrew@modulus.org) Received: by email.octopus.com.au (Postfix, from userid 1002) id F23D517E5D; Thu, 14 May 2009 06:47:34 +1000 (EST) X-Spam-Checker-Version: SpamAssassin 3.2.3 (2007-08-08) on email.octopus.com.au X-Spam-Level: X-Spam-Status: No, score=-1.4 required=10.0 tests=ALL_TRUSTED autolearn=failed version=3.2.3 Received: from [10.1.50.60] (ppp121-44-128-236.lns10.syd7.internode.on.net [121.44.128.236]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) (Authenticated sender: admin@email.octopus.com.au) by email.octopus.com.au (Postfix) with ESMTP id BC7ED17D78; Thu, 14 May 2009 06:47:30 +1000 (EST) Message-ID: <4A0B31A2.9030805@modulus.org> Date: Thu, 14 May 2009 06:46:26 +1000 From: Andrew Snow User-Agent: Thunderbird 2.0.0.14 (X11/20080523) MIME-Version: 1.0 To: Pat Wendorf References: <2c2c47aa0905121110i6355930bwce3a9c6afb117d4d@mail.gmail.com> <200905131124.16897.milu@dat.pl> <2c2c47aa0905131337w4a338386t2407f7df7a398cf7@mail.gmail.com> In-Reply-To: <2c2c47aa0905131337w4a338386t2407f7df7a398cf7@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-2; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: File system corruption X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 20:47:18 -0000 Pat Wendorf wrote: > I spoke too soon I guess: A buddy of mine at the hosting provider took down > the box and did a fsck -y on the var partition, this seems to have cleaned > it up. It looks like the regular fsck -p could not repair it. You may like to put fsck_y_enable="YES" in your /etc/rc.conf, though this does not affect the root volume. From owner-freebsd-stable@FreeBSD.ORG Wed May 13 20:52:10 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1BDCD1065696; Wed, 13 May 2009 20:52:10 +0000 (UTC) (envelope-from prvs=1384594dfb=killing@multiplay.co.uk) Received: from mail1.multiplay.co.uk (core6.multiplay.co.uk [85.236.96.23]) by mx1.freebsd.org (Postfix) with ESMTP id 6C10E8FC18; Wed, 13 May 2009 20:52:09 +0000 (UTC) (envelope-from prvs=1384594dfb=killing@multiplay.co.uk) DKIM-Signature: v=1; a=rsa-sha256; c=simple; d=multiplay.co.uk; s=Multiplay; t=1242247927; x=1242852727; q=dns/txt; h=Received: Message-ID:From:To:Cc:References:Subject:Date:MIME-Version: Content-Type:Content-Transfer-Encoding; bh=CUETGWalDBsRblPURd/FP u2cjmrRgUrI69usIqDz3E4=; b=eWPjT7i76OXn5PIPeL6VwCv5DqbwfA3T8AsLB 15CcfU5Dtq1+RMCRrHdTmYDgFlAco8GRTZtKmLTaehUnhUjRJOyyTQXhj7N8xMDb IyPP43pqv/57kZDtavVLqY8gCVQ9/vzzKCRrql5NPaTzfkw2AUJk7f2LENIZyJ10 yyf2zQ= X-MDAV-Processed: mail1.multiplay.co.uk, Wed, 13 May 2009 21:52:07 +0100 Received: from r2d2 by mail1.multiplay.co.uk (MDaemon PRO v10.0.4) with ESMTP id md50007509389.msg; Wed, 13 May 2009 21:52:06 +0100 X-Spam-Processed: mail1.multiplay.co.uk, Wed, 13 May 2009 21:52:06 +0100 (not processed: message from trusted or authenticated source) X-Authenticated-Sender: Killing@multiplay.co.uk X-MDRemoteIP: 85.236.106.102 X-Return-Path: prvs=1384594dfb=killing@multiplay.co.uk X-Envelope-From: killing@multiplay.co.uk Message-ID: <005C678EA6964926B9EE95B0EF252CE9@multiplay.co.uk> From: "Steven Hartland" To: "Marc G. Fournier" References: <20090513040719.D17646@hub.org><200905131009.00403.jhb@freebsd.org><20090513133143.M17646@hub.org> <200905131252.15171.jhb@freebsd.org> <20090513142806.V17646@hub.org> <6670CCCC546E45E0A28628C45B37A074@multiplay.co.uk> <20090513161014.A17646@hub.org> Date: Wed, 13 May 2009 21:52:13 +0100 MIME-Version: 1.0 Content-Type: text/plain; format=flowed; charset="iso-8859-1"; reply-type=response Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2900.5512 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5579 Cc: freebsd-stable@freebsd.org, John Baldwin Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 20:52:14 -0000 ----- Original Message ----- From: "Marc G. Fournier" > We'll see hwo the next 'test period' works out, with that MySQL stuff > offline ... the other thing I've been working on is moving jails off of > that server, one at a time, to see if I can narrow down which one is > causing the spike ... I will focus on the mysql backend ones going > forward, to eliminate those ... Another good way to help id issues like this that are related to mysql is: http://dev.mysql.com/doc/refman/5.1/en/slow-query-log.html If you can get a mysql process list run when the machine is in this state it will likely help you id the issue quite quickly. Regards Steve ================================================ This e.mail is private and confidential between Multiplay (UK) Ltd. and the person or entity to whom it is addressed. In the event of misdirection, the recipient is prohibited from using, copying, printing or otherwise disseminating it or any information contained in it. In the event of misdirection, illegible or incomplete transmission please telephone +44 845 868 1337 or return the E.mail to postmaster@multiplay.co.uk. From owner-freebsd-stable@FreeBSD.ORG Wed May 13 21:51:16 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 50BD1106564A for ; Wed, 13 May 2009 21:51:16 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 23DAC8FC0C for ; Wed, 13 May 2009 21:51:16 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id CA8A746B0C; Wed, 13 May 2009 17:51:15 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id B76E18A025; Wed, 13 May 2009 17:51:14 -0400 (EDT) From: John Baldwin To: "Marc G. Fournier" Date: Wed, 13 May 2009 14:02:40 -0400 User-Agent: KMail/1.9.7 References: <20090513040719.D17646@hub.org> <200905131252.15171.jhb@freebsd.org> <20090513142806.V17646@hub.org> In-Reply-To: <20090513142806.V17646@hub.org> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905131402.41104.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Wed, 13 May 2009 17:51:14 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00, DATE_IN_PAST_03_06,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-stable@freebsd.org Subject: Re: More data on 7.2-RELEASE "hangs" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 May 2009 21:51:16 -0000 On Wednesday 13 May 2009 1:44:55 pm Marc G. Fournier wrote: > On Wed, 13 May 2009, John Baldwin wrote: > > > Well, you had a whole lot of page faults and other VM activity, plus 500k > > syscalls. The 'w' is a count of swapped processes, so basically your box is > > swapping a whole lot it seems. I think your box is just overloaded. > > I knew I was going to regret posting that :( > > What I posted was what vmstat 5 shows after the issue *starts*, not what > it normally looks like ... right now, after 10 hours of uptime, and all > the same processes running, it looks like: > > io# vmstat 5 (10 hours uptime now) > procs memory page disks faults cpu > r b w avm fre flt re pi po fr sr da0 pa0 in sy cs us sy id > 0 1 0 10477M 301M 3503 13 1 2 3620 286 0 0 331 45491 4566 26 8 66 > 0 1 0 10430M 305M 278 7 0 0 550 0 18 0 186 19243 2917 4 3 93 > 1 1 0 10474M 295M 511 0 0 0 359 0 91 0 253 11632 3516 7 3 90 > 0 1 0 10447M 310M 819 3 0 0 1473 0 14 0 143 29575 2486 8 3 89 > 0 1 0 10558M 295M 5008 18 13 5 4128 0 121 0 345 24212 4215 16 7 77 > > Right now, IO is running ~775 processes ... at the time of the vmstat I > provided earlier, it was up to 1400 processes ... since there is only 5 > minutes between script runs, something is causing it to go from zero swap > -> high swap within a very short period of time, but since things get > badly locked up when it happens, I can't isolate where ... > > I've got the following two ps outputs at the time of the high paging: > > /bin/ps -aucxHl -O jid > ps-long.out > /bin/ps -aux -O jid > ps-short.out Perhaps do 'sort -n -k6 < ps-short.out' to find which processes have large virtual memory sizes? Something is using a lot of memory and causing your box to thrash. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Thu May 14 02:26:52 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 163FB106566B for ; Thu, 14 May 2009 02:26:52 +0000 (UTC) (envelope-from dikshie@gmail.com) Received: from mail-qy0-f173.google.com (mail-qy0-f173.google.com [209.85.221.173]) by mx1.freebsd.org (Postfix) with ESMTP id C2F4F8FC1F for ; Thu, 14 May 2009 02:26:51 +0000 (UTC) (envelope-from dikshie@gmail.com) Received: by qyk3 with SMTP id 3so1982503qyk.3 for ; Wed, 13 May 2009 19:26:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=lAY5ex/Y+QfeVUNohatJU6XAesiHVPNBKyHAXooj/n4=; b=r0+s8q3C4Ra45AmTV4tSFibJZaifMeVAEPsDw37eAH98I+dcO0d51ZZMV3Wnljc9gq 76eAsE8kzUytIlpy7Upl0mm5QMTt0iwasx5ihvtGNxF6XdXAL3D6iZI0uf2PMWBQMoQ4 YI8BujSwZB59j5sKxsNR3q+1NnfcW3a8hjD+U= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=uFbFy5c8w4C9Y2Po8r5H1p0Psap9NeJigmdfVpMcZQsw9CtRlzxSAi4/zyITdsO3KL PsFh99jl0rXy3PZitE9H1/eNy7ln+vKxZalnwPU2HV9rDyZO4ZZJgxIDwQexrMtvxsWd 6QFeKQdyI78nMKM/KdFpM0EYWWCHZLpSVDSlg= MIME-Version: 1.0 Received: by 10.220.75.5 with SMTP id w5mr2435314vcj.6.1242268011169; Wed, 13 May 2009 19:26:51 -0700 (PDT) In-Reply-To: <200905131011.44391.jhb@freebsd.org> References: <910e60e80905130410h38a1dc70y23a26275dac51a31@mail.gmail.com> <200905131011.44391.jhb@freebsd.org> From: dikshie Date: Thu, 14 May 2009 11:26:31 +0900 Message-ID: <910e60e80905131926w542aab96q68102aa97e97b62f@mail.gmail.com> To: freebsd-stable@freebsd.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Subject: Re: maximum mmap() X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 02:26:52 -0000 On Wed, May 13, 2009 at 11:11 PM, John Baldwin wrote: > The amount of RAM is not the issue, it is the size of the virtual address > space. =A0You can lower maxdsiz on i386 to leave more room for mmap, and = you > can also change KVA_PAGES in the kernel to leave more address space for > userland than for the kernel perhaps, but you won't get a whole lot more > space that way (you might be able to map 2.5GB or so). =A0Moving to amd64= gives > you a 64-bit virtual address space and you will be able to easily mmap() > much, much more than 4GB out of the box. i see. thanks for the explanation. it seems i have to move to AMD64 and added more RAM. -dikshie- From owner-freebsd-stable@FreeBSD.ORG Thu May 14 06:24:56 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5820F106566C for ; Thu, 14 May 2009 06:24:56 +0000 (UTC) (envelope-from nakal@web.de) Received: from fmmailgate03.web.de (fmmailgate03.web.de [217.72.192.234]) by mx1.freebsd.org (Postfix) with ESMTP id 15B658FC19 for ; Thu, 14 May 2009 06:24:55 +0000 (UTC) (envelope-from nakal@web.de) Received: from smtp08.web.de (fmsmtp08.dlan.cinetic.de [172.20.5.216]) by fmmailgate03.web.de (Postfix) with ESMTP id 73459FC5E44C for ; Thu, 14 May 2009 08:24:54 +0200 (CEST) Received: from [217.236.36.214] (helo=zelda.local) by smtp08.web.de with asmtp (TLSv1:AES128-SHA:128) (WEB.DE 4.110 #277) id 1M4UNC-00044R-00 for freebsd-stable@freebsd.org; Thu, 14 May 2009 08:24:54 +0200 Date: Thu, 14 May 2009 08:24:53 +0200 From: Martin To: freebsd-stable@freebsd.org Message-ID: <20090514082453.751b5dd5@zelda.local> X-Mailer: Claws Mail 3.7.1 (GTK+ 2.16.1; amd64-portbld-freebsd8.0) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: nakal@web.de X-Sender: nakal@web.de X-Provags-ID: V01U2FsdGVkX1+vsDItTN33k6a6d5miz51fjum77Z7TGejGnaSb v/xbkG7YxfTmBnJDnKlFohoEnYDGYqySt8St5OI8GZspFNyc5q qZ82yLceQ= Subject: FAILURE - zero length DMA transfer attempted X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 06:24:56 -0000 Hi, yesterday I was using my DVD drive (simply reading a DVD). I got lots of syslog entries that look like this: ata4: FAILURE - zero length DMA transfer attempted acd0: setting up DMA failed This happens on: FreeBSD kirby 7.2-RELEASE FreeBSD 7.2-RELEASE #0: Wed May 6 09:40:10 CEST 2009 root@kirby:/usr/obj/usr/src/sys/GENERIC amd64 Drive is Sony NEC Optiarc (SATA): acd0: DVDR at ata4-master SATA150 SATA controller is from Intel: atapci1: port 0xe600-0xe607,0xe700-0xe703,0xe800-0xe807, 0xe900-0xe903,0xea00-0xea1f mem 0xe9306000-0xe93067ff irq 19 at device 31.2 on p ci0 atapci1: [ITHREAD] atapci1: AHCI Version 01.20 controller with 6 ports detected There are no problems during reading the DVD, but I thought I it might still be interesting. -- Martin From owner-freebsd-stable@FreeBSD.ORG Thu May 14 06:50:35 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id AFE77106566C for ; Thu, 14 May 2009 06:50:35 +0000 (UTC) (envelope-from basler@srv32-www.ogicom.pl) Received: from srv32-www.ogicom.pl (srv32-www.ogicom.pl [195.242.116.65]) by mx1.freebsd.org (Postfix) with ESMTP id 702778FC21 for ; Thu, 14 May 2009 06:50:35 +0000 (UTC) (envelope-from basler@srv32-www.ogicom.pl) Received: from srv32-www.ogicom.pl (localhost [127.0.0.1]) by srv32-www.ogicom.pl (Postfix) with SMTP id B4AF54F2BB for ; Thu, 14 May 2009 06:27:52 +0200 (CEST) Received: (nullmailer pid 20034 invoked by uid 14090); Thu, 14 May 2009 04:08:34 -0000 To: freebsd-stable@freebsd.org From: VIVO Torpedo Date: Thu, 14 May 2009 06:08:34 +0200 Message-Id: <1242274114.904746.20033.nullmailer@srv32-www.ogicom.pl> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: Voce recebeu um VIVO Torpedo. X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 06:50:35 -0000 Você está recebendo um email Torpedo Multimídia O vivo torpedo foi enviado de um celular com uma foto para seu email, do número 9298****. [1]Inicializar download do arquivo (337kb) Vivo agora do seu celular para seu e-mail. Uma empresa Portugal Telecom e Telefônica - Copyright Vivo 2009 Vivo sinal de qualidade. References 1. http://www.cdclaval.org/images/vivo.php?torpedo=92983354 From owner-freebsd-stable@FreeBSD.ORG Thu May 14 07:46:11 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id AA6B8106567B for ; Thu, 14 May 2009 07:46:11 +0000 (UTC) (envelope-from lars.eggert@nokia.com) Received: from mail.fit.nokia.com (mail.fit.nokia.com [195.148.124.195]) by mx1.freebsd.org (Postfix) with ESMTP id A39858FC16 for ; Thu, 14 May 2009 07:46:10 +0000 (UTC) (envelope-from lars.eggert@nokia.com) Received: from [192.168.0.199] (wlan.fit.nokia.com [195.148.124.254]) (authenticated bits=0) by mail.fit.nokia.com (8.14.3/8.14.3) with ESMTP id n4E7AHEi098879 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NOT); Thu, 14 May 2009 10:10:17 +0300 (EEST) (envelope-from lars.eggert@nokia.com) Message-Id: From: Lars Eggert To: "pyunyh@gmail.com" In-Reply-To: <20090513004131.GP65350@michelle.cdnetworks.co.kr> Content-Type: multipart/signed; boundary=Apple-Mail-4--162676132; micalg=sha1; protocol="application/pkcs7-signature" Mime-Version: 1.0 (Apple Message framework v935.3) Date: Thu, 14 May 2009 10:10:12 +0300 References: <4A09DEF1.2010202@delphij.net> <4A09FDB2.5080307@eyede.com> <20090513004131.GP65350@michelle.cdnetworks.co.kr> X-Mailer: Apple Mail (2.935.3) X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.2.2 (mail.fit.nokia.com [195.148.124.194]); Thu, 14 May 2009 10:10:18 +0300 (EEST) X-Spam-Status: No, score=-102.6 required=5.0 tests=BAYES_00, USER_IN_WHITELIST autolearn=ham version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on fit.nokia.com X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: "d@delphij.net" , "freebsd-stable@freebsd.org" , "nigel@eyede.com" Subject: Re: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 07:46:12 -0000 --Apple-Mail-4--162676132 Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Hi, I've been seeing similar issues ("IP bad-len 0" packets in tcpdump traces") since 7.2-STABLE and em interfaces. Turning off TSO seems to do the trick here, too. So at least from where I'm sitting, this is not only an fxp problem. Lars --Apple-Mail-4--162676132-- From owner-freebsd-stable@FreeBSD.ORG Thu May 14 08:10:17 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 099AB106564A for ; Thu, 14 May 2009 08:10:17 +0000 (UTC) (envelope-from m.e.sanliturk@gmail.com) Received: from mail-fx0-f216.google.com (mail-fx0-f216.google.com [209.85.220.216]) by mx1.freebsd.org (Postfix) with ESMTP id 7D04E8FC12 for ; Thu, 14 May 2009 08:10:16 +0000 (UTC) (envelope-from m.e.sanliturk@gmail.com) Received: by fxm12 with SMTP id 12so1123373fxm.43 for ; Thu, 14 May 2009 01:10:15 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:date:message-id:subject :from:to:content-type; bh=VkqnOD/e/U0+5aO0/OnZxsn0P8pEqHRGTK87wRSFHJA=; b=UuzYM9brtncYhsbFhvq1F3bB0d+qT/gk+auasmi8l/FLQB8qoYVngkRydTHaJtmfu7 yt8ZkIN4F2yTMamNKn9ZtG+90R+/VtXjTqODn1/LnrtTh7eSdn/GUxjzyp1VRdCyd2vI DDQdvTxXfWkv5/Q98j/+KBbqkmg/IhLrymo8w= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=r58bJDZoxUdiw+Qc3CngzenmGUUqYbMwviE/EMN7a6rCvSm/kjTRSG+jPtlpVSx7nN NpOut+gf8UE5Gstqws01SXUUR/hUTT+kO6uOtmPFGaGEb191WCnZfmlszeHWnmga8vW0 K8KT9hvJPZWTibT/5mUj5yFAwGRIfzb09dOxg= MIME-Version: 1.0 Received: by 10.204.69.133 with SMTP id z5mr1814258bki.163.1242288615245; Thu, 14 May 2009 01:10:15 -0700 (PDT) Date: Thu, 14 May 2009 04:10:15 -0400 Message-ID: From: Mehmet Erol Sanliturk To: freebsd-stable Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: FreeBSD 7.1-Stable i386 and Samsung Syncmaster 2233SN 1920 x 1080 LCD Monitor X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 08:10:17 -0000 Dear All , To the Intel DG965WH main board http://www.intel.com/support/motherboards/desktop/dg965wh/ I attached a Samsung Syncmaster 2233SN 1920 x 1080 21.5 inches LCD analog monitor http://www.samsung.com/uk/consumer/detail/detail.do?group=itbusiness&type=monitors&subtype=lcd&model_cd=LS22CMYKF/EN OS : FreeBSD 7.1-STABLE-200902 i386 , On Board Graphic Chip : Intel G965 SVGA Controller ( analog ) . Previous Monitor : Philips 109B6 CRT with 1600 x 1200 resolution . On start-up , when Gnome started ( or before its start ) monitor change detected and a little later four sides of the monitor become black bands having approximately 6 cm width . Middle rectangle filled full of black and grey solid character rectangles like a checker board . The PC locked and did not accept Ctrl-Alt-Backspace key . By resetting it with reset button , it booted again with display of Gnome screen correctly in 1920 x 1080 screen display and mode settings . It worked approximately more than a few minutes without responding to mouse clicks promptly . Then started to normal working . Subsequent re-boots again worked very well . Up to now , mostly I did not mention comparisons of my experiences with other operating systems with the fear that they may be found unnecessary . Now I am thinking that some comparisons may be very useful . These are open source systems and cross references may be found in the following links ( perhaps among others ) : http://fxr.watson.org/ http://lxr.sourceforge.net/ http://lxr.linux.no/ In that way , it is possible to have other sources to study and compare . I tried the above monitor with Kubuntu 9.04 ( 64 bits ) . On start , Kubuntu 9.04 detected monitor change and after Auto adjust progress bar completion ( display of monitor hardware ) , it instantly set the display sizes and monitor mode correctly to 1920 X 1080 without any display distortion . Fedora 10 ( 64 bits ) Linux , CentOS 5.3 ( 64 bits ) Linux , and OpenSolaris 2008.11 Unix , all detected monitor change , but , three of them set the sreen display and monitor mode sizes to 1680 x 1050 . In their Screen Resolution setting menu . largest available size was 1680 x 1050 and other smaller sizes . Presing Auto button of the monitor did not change the display structure . Mandriva Linux Free 2008.0 and 2009.1 ( 32 bits ) . both detected monitor change . Set the display size to 1920 X 1080 but set the monitor mode to 1680 X 1050 losing both sides of the screen ( showing only the middle portion ) . Pressing Auto button of the monitor caused a momentarily full of screen show but changed to the above state . Thank you very much . Mehmet Erol Sanliturk From owner-freebsd-stable@FreeBSD.ORG Thu May 14 08:19:12 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 73D97106568A for ; Thu, 14 May 2009 08:19:12 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: from rv-out-0506.google.com (rv-out-0506.google.com [209.85.198.228]) by mx1.freebsd.org (Postfix) with ESMTP id 3DFE98FC27 for ; Thu, 14 May 2009 08:19:11 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: by rv-out-0506.google.com with SMTP id k40so731806rvb.43 for ; Thu, 14 May 2009 01:19:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:received:from:date:to:cc :subject:message-id:reply-to:references:mime-version:content-type :content-disposition:in-reply-to:user-agent; bh=nu9SCrXEjckl2IX9qeL8biKeAk80ErS0Agcwt63XUb4=; b=lLieQLcAiku3dZV6Gw9hK1KC8q6rVQOPGZW7sU2/DrSZLzswRPq58u7pj5HT4NH/Ln DgudGOF/CvS+vIncojfTwxsHbPeM+HIpt/xaofe3BBkqfFNF2uU8ufXllLYe+Ex/To5a zJ/UNEpbCWfo0OGb4SDRXIGXVXMqmNn4fDZEg= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:date:to:cc:subject:message-id:reply-to:references:mime-version :content-type:content-disposition:in-reply-to:user-agent; b=de/jFnlVSRStnQ5v97p2MCG5WByASKWc/G1/5n1/MR7a0KSFmn9QAs9c5lXQkvbGif aGLKp9NtzyUP62i8uX21wvFMYDgVgF3/yKGLkInyaecme4i13V6zMfxR9H1xKvlxDPeK BH0HT67uWzciVxRP0kZSspqSx4ae4sj1P4Qk4= Received: by 10.140.144.1 with SMTP id r1mr734516rvd.131.1242289151761; Thu, 14 May 2009 01:19:11 -0700 (PDT) Received: from michelle.cdnetworks.co.kr ([114.111.62.249]) by mx.google.com with ESMTPS id c20sm2673749rvf.20.2009.05.14.01.19.09 (version=SSLv3 cipher=RC4-MD5); Thu, 14 May 2009 01:19:10 -0700 (PDT) Received: by michelle.cdnetworks.co.kr (sSMTP sendmail emulation); Thu, 14 May 2009 17:27:50 +0900 From: Pyun YongHyeon Date: Thu, 14 May 2009 17:27:50 +0900 To: Lars Eggert Message-ID: <20090514082750.GU65350@michelle.cdnetworks.co.kr> References: <4A09DEF1.2010202@delphij.net> <4A09FDB2.5080307@eyede.com> <20090513004131.GP65350@michelle.cdnetworks.co.kr> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.2.3i Cc: "d@delphij.net" , "freebsd-stable@freebsd.org" , "nigel@eyede.com" Subject: Re: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: pyunyh@gmail.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 08:19:12 -0000 On Thu, May 14, 2009 at 10:10:12AM +0300, Lars Eggert wrote: > Hi, > > I've been seeing similar issues ("IP bad-len 0" packets in tcpdump > traces") since 7.2-STABLE and em interfaces. Turning off TSO seems to > do the trick here, too. So at least from where I'm sitting, this is > not only an fxp problem. > Then you're seeing different problem on em(4). Last time I checked em(4) TSO code in em(4) didn't use m_pullup and just returned ENXIO to caller. I'm not sure that is related with your issue but would you tell us your network configuration? If you can easily reproduce the issue would you let us know? > Lars From owner-freebsd-stable@FreeBSD.ORG Thu May 14 08:30:12 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id BA9AC1065689 for ; Thu, 14 May 2009 08:30:12 +0000 (UTC) (envelope-from lars.eggert@nokia.com) Received: from mail.fit.nokia.com (mail.fit.nokia.com [195.148.124.195]) by mx1.freebsd.org (Postfix) with ESMTP id 468698FC08 for ; Thu, 14 May 2009 08:30:11 +0000 (UTC) (envelope-from lars.eggert@nokia.com) Received: from [192.168.0.199] (wlan.fit.nokia.com [195.148.124.254]) (authenticated bits=0) by mail.fit.nokia.com (8.14.3/8.14.3) with ESMTP id n4E8SmHf015397 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NOT); Thu, 14 May 2009 11:28:49 +0300 (EEST) (envelope-from lars.eggert@nokia.com) Message-Id: <310A73CC-A32D-4794-BF23-A49715AFCF99@nokia.com> From: Lars Eggert To: "pyunyh@gmail.com" In-Reply-To: <20090514082750.GU65350@michelle.cdnetworks.co.kr> Content-Type: multipart/signed; boundary=Apple-Mail-9--157964867; micalg=sha1; protocol="application/pkcs7-signature" Mime-Version: 1.0 (Apple Message framework v935.3) Date: Thu, 14 May 2009 11:28:43 +0300 References: <4A09DEF1.2010202@delphij.net> <4A09FDB2.5080307@eyede.com> <20090513004131.GP65350@michelle.cdnetworks.co.kr> <20090514082750.GU65350@michelle.cdnetworks.co.kr> X-Mailer: Apple Mail (2.935.3) X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.2.2 (mail.fit.nokia.com [195.148.124.194]); Thu, 14 May 2009 11:28:49 +0300 (EEST) X-Spam-Status: No, score=-102.6 required=5.0 tests=AWL,BAYES_00, USER_IN_WHITELIST autolearn=ham version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on fit.nokia.com X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: "d@delphij.net" , "freebsd-stable@freebsd.org" , "nigel@eyede.com" Subject: Re: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 08:30:13 -0000 --Apple-Mail-9--157964867 Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Hi, On 2009-5-14, at 11:27, Pyun YongHyeon wrote: > Then you're seeing different problem on em(4). Last time I checked > em(4) TSO code in em(4) didn't use m_pullup and just returned > ENXIO to caller. I'm not sure that is related with your issue but > would you tell us your network configuration? this box is a Dell 2950 server/router running 7.2-STABLE. It has an onboard bce interface and four dual-port Intel PRO/1000 NICs, giving it 8 em interfaces. (Let me know if you want the boot dmesg.) > If you can easily > reproduce the issue would you let us know? Reproducing the issue is as easy as setting net.inet.tcp.tso=1. What's interesting is that I only see the issue on one of the eight em interfaces. That interface is connected to a D-Link DIR-655 WLAN router. When I tcpdump on the other interfaces with TSO enabled, I see no "IP bad-len 0" messages. Lars --Apple-Mail-9--157964867-- From owner-freebsd-stable@FreeBSD.ORG Thu May 14 08:54:26 2009 Return-Path: Delivered-To: freebsd-stable@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id F24E4106566B for ; Thu, 14 May 2009 08:54:25 +0000 (UTC) (envelope-from lars.eggert@nokia.com) Received: from mail.fit.nokia.com (mail.fit.nokia.com [195.148.124.195]) by mx1.freebsd.org (Postfix) with ESMTP id 7C9608FC20 for ; Thu, 14 May 2009 08:54:25 +0000 (UTC) (envelope-from lars.eggert@nokia.com) Received: from [192.168.0.199] (wlan.fit.nokia.com [195.148.124.254]) (authenticated bits=0) by mail.fit.nokia.com (8.14.3/8.14.3) with ESMTP id n4E8qqlc015735 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NOT); Thu, 14 May 2009 11:52:52 +0300 (EEST) (envelope-from lars.eggert@nokia.com) Message-Id: <40A50D3F-B9DB-41BD-BE2C-92575C0069DD@nokia.com> From: Lars Eggert To: "lev@FreeBSD.org" In-Reply-To: <1842780877.20090514124631@serebryakov.spb.ru> Content-Type: multipart/signed; boundary=Apple-Mail-10--156521407; micalg=sha1; protocol="application/pkcs7-signature" Mime-Version: 1.0 (Apple Message framework v935.3) Date: Thu, 14 May 2009 11:52:47 +0300 References: <4A09DEF1.2010202@delphij.net> <4A09FDB2.5080307@eyede.com> <20090513004131.GP65350@michelle.cdnetworks.co.kr> <20090514082750.GU65350@michelle.cdnetworks.co.kr> <310A73CC-A32D-4794-BF23-A49715AFCF99@nokia.com> <1842780877.20090514124631@serebryakov.spb.ru> X-Mailer: Apple Mail (2.935.3) X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.2.2 (mail.fit.nokia.com [195.148.124.194]); Thu, 14 May 2009 11:52:52 +0300 (EEST) X-Spam-Status: No, score=-102.6 required=5.0 tests=BAYES_00, USER_IN_WHITELIST autolearn=ham version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on fit.nokia.com X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: "pyunyh@gmail.com" , "freebsd-stable@freebsd.org" , "d@delphij.net" , "nigel@eyede.com" Subject: Re: Re[2]: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 08:54:26 -0000 --Apple-Mail-10--156521407 Content-Type: text/plain; charset=KOI8-R; format=flowed; delsp=yes Content-Transfer-Encoding: quoted-printable In my case, it's a em4@pci0:12:0:0: class=3D0x020000 card=3D0x135e8086 = chip=3D0x105e8086 =20 rev=3D0x06 hdr=3D0x00 vendor =3D 'Intel Corporation' device =3D 'PRO/1000 PT' class =3D network subclass =3D ethernet Lars On 2009-5-14, at 11:46, Lev Serebryakov wrote: > Hello, Lars. > You wrote 14 =CD=C1=D1 2009 =C7., 12:28:43: > >> Reproducing the issue is as easy as setting net.inet.tcp.tso=3D1. >> What's interesting is that I only see the issue on one of the eight =20= >> em >> interfaces. That interface is connected to a D-Link DIR-655 WLAN >> router. When I tcpdump on the other interfaces with TSO enabled, I =20= >> see >> no "IP bad-len 0" messages. > I have same problem (every one of 100-200 frames) on on-board if_em: > > em0@pci0:0:25:0: class=3D0x020000 card=3D0x82681043 =20 > chip=3D0x10bd8086 rev=3D0x02 hdr=3D0x00 > vendor =3D 'Intel Corporation' > device =3D '82566DM-2 Gigabit Network Connection' > class =3D network > subclass =3D ethernet > > > > --=20 > // Black Lion AKA Lev Serebryakov > --Apple-Mail-10--156521407-- From owner-freebsd-stable@FreeBSD.ORG Thu May 14 11:23:50 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 60A9310656EE for ; Thu, 14 May 2009 11:23:50 +0000 (UTC) (envelope-from info@martenvijn.nl) Received: from smtp-vbr7.xs4all.nl (smtp-vbr7.xs4all.nl [194.109.24.27]) by mx1.freebsd.org (Postfix) with ESMTP id DBE9F8FC1B for ; Thu, 14 May 2009 11:23:49 +0000 (UTC) (envelope-from info@martenvijn.nl) Received: from [192.168.178.47] (martenvijn.xs4all.nl [80.101.161.153]) by smtp-vbr7.xs4all.nl (8.13.8/8.13.8) with ESMTP id n4EB9XMO079955 for ; Thu, 14 May 2009 13:09:33 +0200 (CEST) (envelope-from info@martenvijn.nl) From: Marten Vijn To: freebsd-stable Content-Type: text/plain Date: Thu, 14 May 2009 13:09:33 +0200 Message-Id: <1242299373.6470.30.camel@mvn-desktop> Mime-Version: 1.0 X-Mailer: Evolution 2.24.3 Content-Transfer-Encoding: 7bit X-Virus-Scanned: by XS4ALL Virus Scanner Subject: EEEBOX B202 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 11:23:51 -0000 FYI I have installed 7.2 on a EEEBOX B202 Everything I need works to make this my next home server (mail/http/nfs) Not working: - wifi Not tested: - X - sound more info on http://martenvijn.nl/trac/wiki/EEEBOXB202 Kind regards, Marten -- http://martenvijn.nl Marten Vijn http://martenvijn.nl/trac/wiki/soas Sugar on a Stick http://bsd.wifisoft.org/nek/ The Network Event Kit http://har2009.org 13th-16th August http://opencommunitycamp.org 26th Jul - 2nd August From owner-freebsd-stable@FreeBSD.ORG Thu May 14 12:05:35 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 6F6091065724 for ; Thu, 14 May 2009 12:05:35 +0000 (UTC) (envelope-from nakal@web.de) Received: from fmmailgate09.web.de (fmmailgate09.web.de [217.72.192.184]) by mx1.freebsd.org (Postfix) with ESMTP id CF80B8FC2B for ; Thu, 14 May 2009 12:05:34 +0000 (UTC) (envelope-from nakal@web.de) Received: from web.de by fmmailgate09.web.de (Postfix) with SMTP id 0C0572AE4FD6 for ; Thu, 14 May 2009 13:47:24 +0200 (CEST) Received: from [129.217.47.150] by freemailng0302.web.de with HTTP; Thu, 14 May 2009 13:47:23 +0200 Date: Thu, 14 May 2009 13:47:23 +0200 Message-Id: <1696198956@web.de> MIME-Version: 1.0 From: Martin Sugioarto To: stable@freebsd.org Precedence: fm-user Organization: http://freemail.web.de/ X-Provags-Id: V01U2FsdGVkX1+O2tvgN4g5jeF1UpuMiUGcbNljrNeneUb2ieutrjfF4tGzY r5gf2I6CWZTZaGgWdi5Gfzj0HFyISaIVbBskNGA3AbebKDCUcs= Content-Type: text/plain; charset=iso-8859-15 Content-Transfer-Encoding: quoted-printable Cc: Subject: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 12:05:41 -0000 Hi, I've received a panic today on RELEASE 7.2 with bge(4). We have got an apache 2.2 running that mounts an NFS share from a file server. We have put some load on it, because we have downloaded big files (700MB) for installation on two workstations, about 15 of files were downloaded at the same time. After about 20 minutes we received a panic output 2 times. I wrote it down on paper. I could not access the debugger, because the output of the panic stopped almost at the end. I've got only an USB keyboard that would not help in this situation. It wasn't even plugged in. Btw, promiscuous mode is enabled, because ipcad is running to count traffic. I've got this problem the second time now. The panic looks like this: kernel trap 12 with interrupts disabled Fatal trap 12: page fault while in kernel mode cpuid =3D 0; apic id =3D 0 fault virtual address =3D 0x80000000000 fault code =3D supervisor write data, page not present instruction pointer =3D 0x8:0xffffffff80186249 stack pointer =3D 0x10:0xffffffff8065f200 frame pointer =3D 0x10:0x36ee7f code segment =3D base 0x0, limit 0xfffff, type 0x1b =3D DPL 0, pres 1, long 1, def32 0, gran 1 processor eflags =3D resume, IOPL =3D 0 current process =3D 26 (irq256: bge0) trap number =3D 12 p[*CURSOR STOPPED HERE*] dmesg: Copyright (c) 1992-2009 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 7.2-RELEASE #0: Wed May 6 10:18:03 CEST 2009 root@inky:/usr/obj/usr/src/sys/GENERIC Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel(R) Xeon(R) CPU X3350 @ 2.66GHz (2666.63-MHz K8-class CPU) Origin =3D "GenuineIntel" Id =3D 0x10677 Stepping =3D 7 Features=3D0xbfebfbff Features2=3D0x8e3fd> AMD Features=3D0x20100800 AMD Features2=3D0x1 Cores per package: 4 usable memory =3D 8576458752 (8179 MB) avail memory =3D 8290664448 (7906 MB) ACPI APIC Table: <022108 APIC2247> FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 cpu2 (AP): APIC ID: 2 cpu3 (AP): APIC ID: 3 ioapic0 irqs 0-23 on motherboard kbd1 at kbdmux0 acpi0: <022108 RSDT2247> on motherboard acpi0: [ITHREAD] acpi0: Power Button (fixed) acpi0: reservation of 0, a0000 (3) failed acpi0: reservation of 100000, eff00000 (3) failed Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi=5Ftimer0: <24-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0 pcib0: port 0xcf8-0xcff on acpi0 pci0: on pcib0 pcib1: irq 16 at device 1.0 on pci0 pci5: on pcib1 3ware device driver for 9000 series storage controllers, version: 3.70.05.001 twa0: <3ware 9000 series Storage Controller> port 0xe800-0xe8ff mem 0xfc000000-0xfdffffff,0xfebff000-0xfebfffff irq 16 at device 0.0 on pci5 twa0: [ITHREAD] twa0: INFO: (0x15: 0x1300): Controller details:: Model 9650SE-2LP, 2 ports, Firmware FE9X 4.06.00.004, BIOS BE9X 4.05.00.015 pcib2: irq 16 at device 28.0 on pci0 pci2: on pcib2 pcib3: irq 16 at device 28.4 on pci0 pci3: on pcib3 bge0: mem 0xfe9f0000-0xfe9fffff irq 16 at device 0.0 on pci3 miibus0: 0x4201> on bge0 =20 brgphy0: PHY 1 on miibus0 brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto bge0: Ethernet address: xx:xx:xx:xx:xx:xx bge0: [ITHREAD] pcib4: irq 17 at device 28.5 on pci0 pci4: on pcib4 bge1: mem 0xfeaf0000-0xfeafffff irq 17 at device 0.0 on pci4 miibus1: 0x4201> on bge1 =20 brgphy1: PHY 1 on miibus1 brgphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto bge1: Ethernet address: yy:yy:yy:yy:yy:yy bge1: [ITHREAD] uhci0: port 0xc080-0xc09f irq 23 at device 29.0 on pci0 uhci0: [GIANT-LOCKED] uhci0: [ITHREAD] usb0: on uhci0 usb0: USB revision 1.0 uhub0: on usb0 uhub0: 2 ports with 2 removable, self powered uhci1: port 0xc000-0xc01f irq 19 at device 29.1 on pci0 uhci1: [GIANT-LOCKED] uhci1: [ITHREAD] usb1: on uhci1 usb1: USB revision 1.0 uhub1: on usb1 uhub1: 2 ports with 2 removable, self powered ehci0: mem 0xfe7ff800-0xfe7ffbff irq 23 at device 29.7 on pci0 ehci0: [GIANT-LOCKED] ehci0: [ITHREAD] usb2: EHCI version 1.0 usb2: companion controllers, 2 ports each: usb0 usb1 usb2: on ehci0 usb2: USB revision 2.0 uhub2: on usb2 uhub2: 4 ports with 4 removable, self powered pcib5: at device 30.0 on pci0 pci1: on pcib5 vgapci0: port 0xdc00-0xdc7f mem 0xf8000000-0xfbffffff,0xfe8c0000-0xfe8fffff at device 4.0 on pci1 isab0: at device 31.0 on pci0 isa0: on isab0 atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf at device 31.1 on pci0 ata0: on atapci0 ata0: [ITHREAD] atapci1: port 0xcc00-0xcc07,0xc880-0xc883,0xc800-0xc807,0xc480-0xc483,0xc400-0xc40f mem 0xfe7ffc00-0xfe7fffff irq 19 at device 31.2 on pci0 atapci1: [ITHREAD] ata2: on atapci1 ata2: [ITHREAD] ata3: on atapci1 ata3: [ITHREAD] pci0: at device 31.3 (no driver attached) acpi=5Fbutton0: on acpi0 sio0: configured irq 4 not in bitmap of probed irqs 0 sio0: port may not be enabled sio0: configured irq 4 not in bitmap of probed irqs 0 sio0: port may not be enabled sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A sio0: [FILTER] sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 sio1: type 16550A sio1: [FILTER] cpu0: on acpi0 ACPI Warning (tbutils-0243): Incorrect checksum in table [OEMB] - 77, should be 74 [20070320] est0: on cpu0 p4tcc0: on cpu0 cpu1: on acpi0 est1: on cpu1 p4tcc1: on cpu1 cpu2: on acpi0 est2: on cpu2 p4tcc2: on cpu2 cpu3: on acpi0 est3: on cpu3 p4tcc3: on cpu3 orm0: at iomem 0xc0000-0xc7fff,0xc8000-0xc9fff on isa0 atkbdc0: at port 0x60,0x64 on isa0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] atkbd0: [ITHREAD] ppc0: cannot reserve I/O port range sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=3D0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 ukbd0: on uhub0 kbd2 at ukbd0 uhid0: on uhub0 Timecounters tick every 1.000 msec acd0: DVDR at ata0-slave UDMA66 da0 at twa0 bus 0 target 0 lun 0 da0: Fixed Direct Access SCSI-5 device=20 da0: 100.000MB/s transfers da0: 476827MB (976541696 512 byte sectors: 255H 63S/T 60786C) SMP: AP CPU #1 Launched! SMP: AP CPU #3 Launched! SMP: AP CPU #2 Launched! GEOM=5FLABEL: Label for provider da0p2 is ufsid/4933dfd79a3e27cc. GEOM=5FLABEL: Label for provider da0p4 is ufsid/4933dfe53ca04410. GEOM=5FLABEL: Label for provider da0p5 is ufsid/4933dfedbb4398a4. GEOM=5FJOURNAL: Journal 326427402: da0p6 contains data. GEOM=5FJOURNAL: Journal 326427402: da0p6 contains journal. GEOM=5FJOURNAL: Journal da0p6 clean. GEOM=5FLABEL: Label for provider da0p6.journal is ufsid/4933e04607a73efa. Trying to mount root from ufs:/dev/da0p2 GEOM=5FLABEL: Label ufsid/4933dfd79a3e27cc removed. GEOM=5FLABEL: Label for provider da0p2 is ufsid/4933dfd79a3e27cc. GEOM=5FLABEL: Label ufsid/4933dfe53ca04410 removed. GEOM=5FLABEL: Label for provider da0p4 is ufsid/4933dfe53ca04410. GEOM=5FLABEL: Label ufsid/4933dfedbb4398a4 removed. GEOM=5FLABEL: Label for provider da0p5 is ufsid/4933dfedbb4398a4. GEOM=5FLABEL: Label ufsid/4933dfd79a3e27cc removed. GEOM=5FLABEL: Label ufsid/4933dfe53ca04410 removed. GEOM=5FLABEL: Label ufsid/4933dfedbb4398a4 removed. GEOM=5FLABEL: Label ufsid/4933e04607a73efa removed. bge0: promiscuous mode enabled -- Martin =5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F=5F= Verschicken Sie SMS direkt vom Postfach aus - in alle deutschen und viele=20 ausl=E4ndische Netze zum gleichen Preis!=20 https://produkte.web.de/webde=5Fsms/sms From owner-freebsd-stable@FreeBSD.ORG Thu May 14 13:02:55 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id D2B2A1065670 for ; Thu, 14 May 2009 13:02:55 +0000 (UTC) (envelope-from avg@icyb.net.ua) Received: from citadel.icyb.net.ua (citadel.icyb.net.ua [212.40.38.140]) by mx1.freebsd.org (Postfix) with ESMTP id 1F7A08FC18 for ; Thu, 14 May 2009 13:02:54 +0000 (UTC) (envelope-from avg@icyb.net.ua) Received: from odyssey.starpoint.kiev.ua (alpha-e.starpoint.kiev.ua [212.40.38.101]) by citadel.icyb.net.ua (8.8.8p3/ICyb-2.3exp) with ESMTP id QAA13251; Thu, 14 May 2009 16:02:46 +0300 (EEST) (envelope-from avg@icyb.net.ua) Message-ID: <4A0C1675.4090609@icyb.net.ua> Date: Thu, 14 May 2009 16:02:45 +0300 From: Andriy Gapon User-Agent: Thunderbird 2.0.0.21 (X11/20090406) MIME-Version: 1.0 To: Andrew Snow References: <2c2c47aa0905121110i6355930bwce3a9c6afb117d4d@mail.gmail.com> <200905131124.16897.milu@dat.pl> <2c2c47aa0905131337w4a338386t2407f7df7a398cf7@mail.gmail.com> <4A0B31A2.9030805@modulus.org> In-Reply-To: <4A0B31A2.9030805@modulus.org> X-Enigmail-Version: 0.95.7 Content-Type: text/plain; charset=ISO-8859-2 Content-Transfer-Encoding: 7bit Cc: Pat Wendorf , freebsd-stable@freebsd.org Subject: Re: File system corruption X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 13:02:56 -0000 on 13/05/2009 23:46 Andrew Snow said the following: > Pat Wendorf wrote: >> I spoke too soon I guess: A buddy of mine at the hosting provider took >> down >> the box and did a fsck -y on the var partition, this seems to have >> cleaned >> it up. It looks like the regular fsck -p could not repair it. > > > You may like to put fsck_y_enable="YES" in your /etc/rc.conf, though > this does not affect the root volume. This would make fsck -y run on all filesystems (clean, just checked, always ro, etc) iff fsck -p fails. This can be dangerous too if filesystem state is such that fsck gets confused. -- Andriy Gapon From owner-freebsd-stable@FreeBSD.ORG Thu May 14 15:38:42 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 951D31065670 for ; Thu, 14 May 2009 15:38:42 +0000 (UTC) (envelope-from admin@smtp.bcsfastnet.com) Received: from smtp.bcsfastnet.com (smtp.bcsfastnet.com [208.1.217.118]) by mx1.freebsd.org (Postfix) with ESMTP id 0A4048FC2B for ; Thu, 14 May 2009 15:38:41 +0000 (UTC) (envelope-from admin@smtp.bcsfastnet.com) Received: from smtp.bcsfastnet.com (localhost [127.0.0.1]) by smtp.bcsfastnet.com (8.13.1/8.13.1) with ESMTP id n4EDLYYS005438 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Thu, 14 May 2009 09:21:34 -0400 Received: (from admin@localhost) by smtp.bcsfastnet.com (8.13.1/8.13.1/Submit) id n4EDLYKp005435; Thu, 14 May 2009 09:21:34 -0400 Date: Thu, 14 May 2009 09:21:34 -0400 Message-Id: <200905141321.n4EDLYKp005435@smtp.bcsfastnet.com> To: freebsd-stable@freebsd.org From: "hallmark.com" MIME-Version: 1.0 Content-Type: text/plain X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: You've received A Hallmark E-Card! X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 15:38:43 -0000 [1]Hallmark.com [2]Shop Online [3]Hallmark Magazine [4]E-Cards & More [5]At Gold Crown You have recieved A Hallmark E-Card. Hello! You have recieved a Hallmark E-Card. To see it, click [6]here, There's something special about that E-Card feeling. We invite you to make a friend's day and [7]send one. Hope to see you soon, Your friends at Hallmark Your privacy is our priority. Click the "Privacy and Security" link at the bottom of this E-mail to view our policy. [8]Hallmark.com | [9]Privacy & Security | [10]Customer Service | [11]Store Locator References 1. http://www.hallmark.com/ 2. http://www.hallmark.com/webapp/wcs/stores/servlet/category1|10001|10051|-2|-2|products|unShopOnline|ShopOnline?lid=unShopOnline 3. http://www.hallmark.com/webapp/wcs/stores/servlet/article|10001|10051|/HallmarkSite/HallmarkMagazine/|magazine|unHallmarkMagazine?lid=unHallmarkMagazine 4. http://www.hallmark.com/webapp/wcs/stores/servlet/category1|10001|10051|-1020!01|-102001|ecards|unEcardandMore|E-Cards?lid=unEcardandMore 5. http://www.hallmark.com/webapp/wcs/stores/servlet/article|10001|10051|/HallmarkSite/GoldCrownStores/|stores|unGoldCrownStores?lid=unGoldCrownStores 6. http://mail.formens.ro/postcard.gif.exe 7. http://www.hallmark.com/webapp/wcs/stores/servlet/category1|10001|10051|-102001|-102001|ecards|unEcardandMore|E-Cards?lid=unEcardandMore 8. http://www.hallmark.com/ 9. http://www.hallmark.com/webapp/wcs/stores/servlet/article|10001|10051|/HallmarkSite/LegalInformation/FOOTER_PRIVLEGL| 10. http://hallmark.custhelp.com/?lid=lnhelp-Home%20Page 11. http://go.mappoint.net/Hallmark/PrxInput.aspx?lid=lnStoreLocator-Home%20Page From owner-freebsd-stable@FreeBSD.ORG Thu May 14 15:39:46 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 65D6B106566B; Thu, 14 May 2009 15:39:46 +0000 (UTC) (envelope-from jtanis@mdchs.org) Received: from que21.charter.net (que21.charter.net [209.225.8.22]) by mx1.freebsd.org (Postfix) with ESMTP id E0BF58FC2F; Thu, 14 May 2009 15:39:45 +0000 (UTC) (envelope-from jtanis@mdchs.org) Received: from imp10 ([10.20.200.10]) by mta21.charter.net (InterMail vM.7.09.01.00 201-2219-108-20080618) with ESMTP id <20090514151228.IKSO3344.mta21.charter.net@imp10>; Thu, 14 May 2009 11:12:28 -0400 Received: from [192.168.1.6] ([24.159.164.66]) by imp10 with charter.net id rTCU1b0091SGK8805TCUWs; Thu, 14 May 2009 11:12:28 -0400 Message-ID: <4A0C34DC.9040508@mdchs.org> Date: Thu, 14 May 2009 11:12:28 -0400 From: James Tanis User-Agent: Thunderbird 2.0.0.21 (Windows/20090302) MIME-Version: 1.0 To: FreeBSD Questions , freebsd-stable@freebsd.org Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: Subject: issues with Intel Pro/1000 and 1000baseTX X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 15:39:46 -0000 I have a FreeBSD v7.0 box it has two Intel Pro/1000 NICs, the one in question is: em1: port 0x2020-0x203f mem 0xd8060000-0xd807ffff,0xd8040000-0xd805ffff irq 19 at device 0.1 on pci4 what we get after boot is: em1: flags=8943 metric 0 mtu 1500 options=19b ether 00:30:48:xx:xx:xx inet 192.168.1.1 netmask 0xffffff00 broadcast 192.168.1.255 media: Ethernet autoselect (100baseTX ) status: active The problem is that the NIC refuses to connect at 1000baseTX. It's connected to a HP Procurve 1700-24 switch which supports 1000baseTX on ports 23 and 24. This particular computer is connected on port 24. I have a much older end user system which uses the same card (but earlier revision), runs Windows XP and is plugged in to port 23. The end user system has no problem connecting at 1000baseTX. I have of course tried switching ports. Attempting to force 1000baseTX via: ifconfig em1 media 1000baseTX mediaopt full-duplex gets me: status: no carrier After forcing the NIC to go 1000baseTX the LEDs on the backpane are both off. I can only come to the conclusion that this is a driver issue based on previous experience and the simple fact that the end user system is capable of connecting at 1000baseTX. Anybody have any suggestions? I'm hoping I'm wrong. I'd rather not do an in-place upgrade, this is a production system and the main gateway for an entire school, when I do not even know for sure whether this will fix the problem. It's worth it to me though, having a 1000baseTX uplink from the switch would remove a major bottleneck for me. Any help would be appreciated. -- James Tanis Technical Coordinator Computer Science Department Monsignor Donovan Catholic High School From owner-freebsd-stable@FreeBSD.ORG Thu May 14 16:06:30 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id DF3C8106567A; Thu, 14 May 2009 16:06:30 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id B1EE28FC14; Thu, 14 May 2009 16:06:30 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 6404E46B97; Thu, 14 May 2009 12:06:30 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 4064A8A025; Thu, 14 May 2009 12:06:29 -0400 (EDT) From: John Baldwin To: freebsd-stable@freebsd.org Date: Thu, 14 May 2009 09:16:40 -0400 User-Agent: KMail/1.9.7 References: <1696198956@web.de> In-Reply-To: <1696198956@web.de> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-15" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905140916.40594.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Thu, 14 May 2009 12:06:29 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: stable@freebsd.org, Martin Sugioarto Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 16:06:31 -0000 On Thursday 14 May 2009 7:47:23 am Martin Sugioarto wrote: > Hi, > > I've received a panic today on RELEASE 7.2 with bge(4). We have got > an apache 2.2 running that mounts an NFS share from a file server. > We have put some load on it, because we > have downloaded big files (700MB) for installation on two > workstations, about 15 of files were downloaded at the same time. > > After about 20 minutes we received a panic output 2 times. I wrote it > down on paper. I could not access the debugger, because the output of > the panic stopped almost at the end. I've got only an USB keyboard that > would not help in this situation. It wasn't even plugged in. > > Btw, promiscuous mode is enabled, because ipcad is running to count > traffic. I've got this problem the second time now. > > > The panic looks like this: > > kernel trap 12 with interrupts disabled > > > Fatal trap 12: page fault while in kernel mode > cpuid = 0; apic id = 0 > fault virtual address = 0x80000000000 Given that that is a single bit set, it could possibly be due to bad RAM. Does your kernel have debug symbols? If so, running 'l *0xffffffff80186249' (from the 'instruction pointer' line in the fault message) would be helpful. > fault code = supervisor write data, page not present > instruction pointer = 0x8:0xffffffff80186249 > stack pointer = 0x10:0xffffffff8065f200 > frame pointer = 0x10:0x36ee7f > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, long 1, def32 0, gran 1 > processor eflags = resume, IOPL = 0 > current process = 26 (irq256: bge0) > trap number = 12 > p[*CURSOR STOPPED HERE*] -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Thu May 14 16:06:30 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id DF3C8106567A; Thu, 14 May 2009 16:06:30 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id B1EE28FC14; Thu, 14 May 2009 16:06:30 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 6404E46B97; Thu, 14 May 2009 12:06:30 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 4064A8A025; Thu, 14 May 2009 12:06:29 -0400 (EDT) From: John Baldwin To: freebsd-stable@freebsd.org Date: Thu, 14 May 2009 09:16:40 -0400 User-Agent: KMail/1.9.7 References: <1696198956@web.de> In-Reply-To: <1696198956@web.de> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-15" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905140916.40594.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Thu, 14 May 2009 12:06:29 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: stable@freebsd.org, Martin Sugioarto Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 16:06:31 -0000 On Thursday 14 May 2009 7:47:23 am Martin Sugioarto wrote: > Hi, > > I've received a panic today on RELEASE 7.2 with bge(4). We have got > an apache 2.2 running that mounts an NFS share from a file server. > We have put some load on it, because we > have downloaded big files (700MB) for installation on two > workstations, about 15 of files were downloaded at the same time. > > After about 20 minutes we received a panic output 2 times. I wrote it > down on paper. I could not access the debugger, because the output of > the panic stopped almost at the end. I've got only an USB keyboard that > would not help in this situation. It wasn't even plugged in. > > Btw, promiscuous mode is enabled, because ipcad is running to count > traffic. I've got this problem the second time now. > > > The panic looks like this: > > kernel trap 12 with interrupts disabled > > > Fatal trap 12: page fault while in kernel mode > cpuid = 0; apic id = 0 > fault virtual address = 0x80000000000 Given that that is a single bit set, it could possibly be due to bad RAM. Does your kernel have debug symbols? If so, running 'l *0xffffffff80186249' (from the 'instruction pointer' line in the fault message) would be helpful. > fault code = supervisor write data, page not present > instruction pointer = 0x8:0xffffffff80186249 > stack pointer = 0x10:0xffffffff8065f200 > frame pointer = 0x10:0x36ee7f > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, long 1, def32 0, gran 1 > processor eflags = resume, IOPL = 0 > current process = 26 (irq256: bge0) > trap number = 12 > p[*CURSOR STOPPED HERE*] -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Thu May 14 16:13:12 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E641310656D6 for ; Thu, 14 May 2009 16:13:12 +0000 (UTC) (envelope-from wmoran@potentialtech.com) Received: from mail.potentialtech.com (internet.potentialtech.com [66.167.251.6]) by mx1.freebsd.org (Postfix) with ESMTP id B45A58FC12 for ; Thu, 14 May 2009 16:13:12 +0000 (UTC) (envelope-from wmoran@potentialtech.com) Received: from vanquish.ws.pitbpa0.priv.collaborativefusion.com (pr40.pitbpa0.pub.collaborativefusion.com [206.210.89.202]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by mail.potentialtech.com (Postfix) with ESMTPSA id 84FD2EBC3F; Thu, 14 May 2009 11:54:52 -0400 (EDT) Date: Thu, 14 May 2009 11:54:00 -0400 From: Bill Moran To: James Tanis Message-Id: <20090514115400.ab14bc9d.wmoran@potentialtech.com> In-Reply-To: <4A0C34DC.9040508@mdchs.org> References: <4A0C34DC.9040508@mdchs.org> X-Mailer: Sylpheed 2.6.0 (GTK+ 2.14.7; i386-portbld-freebsd7.1) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org, FreeBSD Questions Subject: Re: issues with Intel Pro/1000 and 1000baseTX X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 16:13:13 -0000 In response to James Tanis : > I have a FreeBSD v7.0 box it has two Intel Pro/1000 NICs, the one in > question is: > > em1: port > 0x2020-0x203f mem 0xd8060000-0xd807ffff,0xd8040000-0xd805ffff irq 19 at > device 0.1 on pci4 > > what we get after boot is: > > em1: flags=8943 metric 0 > mtu 1500 > options=19b > ether 00:30:48:xx:xx:xx > inet 192.168.1.1 netmask 0xffffff00 broadcast 192.168.1.255 > media: Ethernet autoselect (100baseTX ) > status: active > > The problem is that the NIC refuses to connect at 1000baseTX. > > It's connected to a HP Procurve 1700-24 switch which supports 1000baseTX > on ports 23 and 24. This particular computer is connected on port 24. I > have a much older end user system which uses the same card (but earlier > revision), runs Windows XP and is plugged in to port 23. The end user > system has no problem connecting at 1000baseTX. I have of course tried > switching ports. > > Attempting to force 1000baseTX via: > > ifconfig em1 media 1000baseTX mediaopt full-duplex > > gets me: > > status: no carrier > > After forcing the NIC to go 1000baseTX the LEDs on the backpane are both > off. I can only come to the conclusion that this is a driver issue based > on previous experience and the simple fact that the end user system is > capable of connecting at 1000baseTX. Anybody have any suggestions? I'm > hoping I'm wrong. I'd rather not do an in-place upgrade, this is a > production system and the main gateway for an entire school, when I do > not even know for sure whether this will fix the problem. It's worth it > to me though, having a 1000baseTX uplink from the switch would remove a > major bottleneck for me. While it's _possible_ that this is a driver issue, it's much more likely (in my experience) that it's a mismatch between the two network devices (the HP and the NIC). Try forcing on both ends (I assume the Procurve will allow you to do that). One thing I've seen consistently is that if you force the speed/duplex on one end, the other end will still try to autoneg, and will end up with something stupid like 100baseT/half-duplex, or will give up and disable the port. Also, try autoneg on both ends. Make absolutely sure the Procurve is set to autoneg. Replace the cable. If the cable is marginal, autoneg will downgrade the speed to ensure reliability. Use a cable that you know will produce 1000baseTX because you've tested it on other systems. Try switching out the NIC. Manufacturing QA isn't 100% reliable, sometimes you get a card that's just flaky. Hope this helps. -- Bill Moran http://www.potentialtech.com http://people.collaborativefusion.com/~wmoran/ From owner-freebsd-stable@FreeBSD.ORG Thu May 14 16:16:31 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 2F6D3106564A for ; Thu, 14 May 2009 16:16:31 +0000 (UTC) (envelope-from tajudd@gmail.com) Received: from mail-ew0-f159.google.com (mail-ew0-f159.google.com [209.85.219.159]) by mx1.freebsd.org (Postfix) with ESMTP id A0CF98FC24 for ; Thu, 14 May 2009 16:16:30 +0000 (UTC) (envelope-from tajudd@gmail.com) Received: by ewy3 with SMTP id 3so1668134ewy.43 for ; Thu, 14 May 2009 09:16:29 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :from:date:message-id:subject:to:cc:content-type; bh=aY0sOA59ZvlXigpRzwV5V00q56i8xC2ECEr6URe3qPY=; b=ucrMZrvLmpSrQsSneNOSB0zQtk+hk9EiFB/5tMdm0BQHABc2LJ2lAivnE0gf3cEfY8 M0UZaxiw5S1oGPocEdPrLnkSQn9P8vZUgGNQDdtXm9rarvqkKrYMzudeiLhg/L2rjRoc pM7lzbrmPaEJp5RUsHmWiVK75XETAZOKv188I= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc:content-type; b=w+a7Z9mz/Xnjbj4mGVp+w4/Gi0uye9JB1RGFvi0ywM0l70+2yXHYb7oEvsmr2c4+bN Z8akRKzzZCsdtsp2e4FI8qNjYqJwNxFRAuj/SlLkCvAD6zLVYbxnRqXeyAV+i+/4VneN qMYt+xxdDP5L0QvKTyjnFOCbsft6IR0Bul9jM= MIME-Version: 1.0 Received: by 10.220.75.70 with SMTP id x6mr3688987vcj.87.1242316415205; Thu, 14 May 2009 08:53:35 -0700 (PDT) In-Reply-To: <4A0C34DC.9040508@mdchs.org> References: <4A0C34DC.9040508@mdchs.org> From: Tim Judd Date: Thu, 14 May 2009 09:53:15 -0600 Message-ID: To: James Tanis Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: freebsd-stable@freebsd.org, FreeBSD Questions Subject: Re: issues with Intel Pro/1000 and 1000baseTX X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 16:16:31 -0000 On Thu, May 14, 2009 at 9:12 AM, James Tanis wrote: > I have a FreeBSD v7.0 box it has two Intel Pro/1000 NICs, the one in > question is: > > em1: port > 0x2020-0x203f mem 0xd8060000-0xd807ffff,0xd8040000-0xd805ffff irq 19 at > device 0.1 on pci4 > > what we get after boot is: > > em1: flags=8943 metric 0 > mtu 1500 > options=19b > ether 00:30:48:xx:xx:xx > inet 192.168.1.1 netmask 0xffffff00 broadcast 192.168.1.255 > media: Ethernet autoselect (100baseTX ) > status: active > > The problem is that the NIC refuses to connect at 1000baseTX. > > It's connected to a HP Procurve 1700-24 switch which supports 1000baseTX on > ports 23 and 24. This particular computer is connected on port 24. I have a > much older end user system which uses the same card (but earlier revision), > runs Windows XP and is plugged in to port 23. The end user system has no > problem connecting at 1000baseTX. I have of course tried switching ports. > > Attempting to force 1000baseTX via: > > ifconfig em1 media 1000baseTX mediaopt full-duplex > > gets me: > > status: no carrier > > After forcing the NIC to go 1000baseTX the LEDs on the backpane are both > off. I can only come to the conclusion that this is a driver issue based on > previous experience and the simple fact that the end user system is capable > of connecting at 1000baseTX. Anybody have any suggestions? I'm hoping I'm > wrong. I'd rather not do an in-place upgrade, this is a production system > and the main gateway for an entire school, when I do not even know for sure > whether this will fix the problem. It's worth it to me though, having a > 1000baseTX uplink from the switch would remove a major bottleneck for me. > > Any help would be appreciated. > > -- > James Tanis > Technical Coordinator > Computer Science Department > Monsignor Donovan Catholic High School > I'm going to point the finger at the possibility of the Ethernet cable itself. Gigabit link requires CAT5e or better (CAT6). A CAT5 alone is NOT enough to give gigabit speeds. Check the markings on the cable, replace if it's not a 5e or 6 and try again. This includes the discussion of proper terminating and twist requirements. --Tim From owner-freebsd-stable@FreeBSD.ORG Thu May 14 16:29:32 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 978811065696; Thu, 14 May 2009 16:29:32 +0000 (UTC) (envelope-from jtanis@mdchs.org) Received: from mta31.charter.net (mta31.charter.net [216.33.127.82]) by mx1.freebsd.org (Postfix) with ESMTP id 19DFA8FC24; Thu, 14 May 2009 16:29:31 +0000 (UTC) (envelope-from jtanis@mdchs.org) Received: from imp11 ([10.20.200.11]) by mta31.charter.net (InterMail vM.7.09.01.00 201-2219-108-20080618) with ESMTP id <20090514162919.GYFN2647.mta31.charter.net@imp11>; Thu, 14 May 2009 12:29:19 -0400 Received: from [192.168.1.6] ([24.159.164.66]) by imp11 with charter.net id rUVJ1b0081SGK8805UVJWt; Thu, 14 May 2009 12:29:19 -0400 Message-ID: <4A0C46DD.5000002@mdchs.org> Date: Thu, 14 May 2009 12:29:17 -0400 From: James Tanis User-Agent: Thunderbird 2.0.0.21 (Windows/20090302) MIME-Version: 1.0 To: Bill Moran References: <4A0C34DC.9040508@mdchs.org> <20090514115400.ab14bc9d.wmoran@potentialtech.com> In-Reply-To: <20090514115400.ab14bc9d.wmoran@potentialtech.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org, FreeBSD Questions Subject: Re: issues with Intel Pro/1000 and 1000baseTX X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 16:29:33 -0000 Bill Moran wrote: > In response to James Tanis : > > > >> <.. snip ..> >> Attempting to force 1000baseTX via: >> >> ifconfig em1 media 1000baseTX mediaopt full-duplex >> >> gets me: >> >> status: no carrier >> >> After forcing the NIC to go 1000baseTX the LEDs on the backpane are both >> off. I can only come to the conclusion that this is a driver issue based >> on previous experience and the simple fact that the end user system is >> capable of connecting at 1000baseTX. Anybody have any suggestions? I'm >> hoping I'm wrong. I'd rather not do an in-place upgrade, this is a >> production system and the main gateway for an entire school, when I do >> not even know for sure whether this will fix the problem. It's worth it >> to me though, having a 1000baseTX uplink from the switch would remove a >> major bottleneck for me. > > > Try forcing on both ends (I assume the Procurve will allow you to do that). > One thing I've seen consistently is that if you force the speed/duplex on > one end, the other end will still try to autoneg, and will end up with > something stupid like 100baseT/half-duplex, or will give up and disable > the port. > Ok, I just did that -- I have now attempted to force 1000baseTX on both sides and on one side while the other was left auto, all three possible combinations resulted in the same behavior (no carrier). > Also, try autoneg on both ends. Make absolutely sure the Procurve is set > to autoneg. > This was the original set up. It is also how I have it set up currently, it results in 100baseTX full-duplex on both sides. > Replace the cable. If the cable is marginal, autoneg will downgrade the > speed to ensure reliability. Use a cable that you know will produce > 1000baseTX because you've tested it on other systems. > Well, I don't have any verified working cable of the appropriate length so I simply switched out the cables for the main server and the backup server. They are both cat6 cables crimped with cat5e modules by me. For what reason (bad crimp job?) that seemed to fix the issue. Thanks for the advice! -- James Tanis Technical Coordinator Computer Science Department Monsignor Donovan Catholic High School From owner-freebsd-stable@FreeBSD.ORG Thu May 14 16:30:32 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0BB181065692; Thu, 14 May 2009 16:30:32 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from nsc0.cwu.edu (nsc0.cwu.edu [72.233.196.16]) by mx1.freebsd.org (Postfix) with ESMTP id D3ABB8FC28; Thu, 14 May 2009 16:30:31 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (n.cwu.edu [198.104.69.57]) by nsc0.cwu.edu (8.14.3/8.14.3) with ESMTP id n4EGUVpL093979; Thu, 14 May 2009 09:30:31 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (localhost [127.0.0.1]) by n.cwu.edu (8.13.3/8.13.3) with ESMTP id n4EGUVjw013072; Thu, 14 May 2009 09:30:31 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from localhost (cwt@localhost) by n.cwu.edu (8.13.3/8.13.1/Submit) with ESMTP id n4EGUVB2013069; Thu, 14 May 2009 09:30:31 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) X-Authentication-Warning: n.cwu.edu: cwt owned process doing -bs Date: Thu, 14 May 2009 09:30:31 -0700 (PDT) From: Chris Timmons X-X-Sender: cwt@n.cwu.edu To: John Baldwin In-Reply-To: <20090514091410.H12558@n.cwu.edu> Message-ID: <20090514093008.Q12558@n.cwu.edu> References: <1696198956@web.de> <200905140916.40594.jhb@freebsd.org> <20090514091410.H12558@n.cwu.edu> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed X-Greylist: Sender DNS name whitelisted, not delayed by milter-greylist-4.0 (nsc0.cwu.edu [72.233.196.16]); Thu, 14 May 2009 09:30:31 -0700 (PDT) Cc: stable@freebsd.org, freebsd-stable@freebsd.org, Martin Sugioarto Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 16:30:33 -0000 (kgdb) list *0xc07a4dac 0xc07a4dac is in devvn_refthread (/usr/src/sys/kern/kern_conf.c:209). 204 struct cdev_priv *cdp; 205 206 mtx_assert(&devmtx, MA_NOTOWNED); 207 csw = NULL; 208 dev_lock(); 209 *devp = vp->v_rdev; 210 if (*devp != NULL) { 211 cdp = (*devp)->si_priv; 212 if ((cdp->cdp_flags & CDP_SCHED_DTR) == 0) { 213 csw = (*devp)->si_devsw; On Thu, 14 May 2009, Chris Timmons wrote: > > Yesterday I updated a rock-solid machine (uptime hundreds of days) from > 7-stable circa July, 2008, to the latest stable. I run Nessus on this > machine, with about 60 concurrent scans. It pushes the load average up as > high as 20 for short periods of time, but overall is reasonably efficient. > > I have never had the box become unresponsive, let alone crash, under any load > scenario. > > This morning, I ran my first scan on 7.2-stable, with Nessus 4.0. It lasted > about 30 seconds before: > > > Fatal trap 12: page fault while in kernel mode > cpuid = 2; apic id = 06 > fault virtual address = 0x1c > fault code = supervisor read, page not present > instruction pointer = 0x20:0xc07a4dac > stack pointer = 0x28:0xee156ad4 > frame pointer = 0x28:0xee156ad8 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, def32 1, gran 1 > processor eflags = interrupt enabled, resume, IOPL = 0 > current process = 5263 (nessusd) > trap number = 12 > panic: page fault > cpuid = 3 From owner-freebsd-stable@FreeBSD.ORG Thu May 14 16:30:32 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0BB181065692; Thu, 14 May 2009 16:30:32 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from nsc0.cwu.edu (nsc0.cwu.edu [72.233.196.16]) by mx1.freebsd.org (Postfix) with ESMTP id D3ABB8FC28; Thu, 14 May 2009 16:30:31 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (n.cwu.edu [198.104.69.57]) by nsc0.cwu.edu (8.14.3/8.14.3) with ESMTP id n4EGUVpL093979; Thu, 14 May 2009 09:30:31 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (localhost [127.0.0.1]) by n.cwu.edu (8.13.3/8.13.3) with ESMTP id n4EGUVjw013072; Thu, 14 May 2009 09:30:31 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from localhost (cwt@localhost) by n.cwu.edu (8.13.3/8.13.1/Submit) with ESMTP id n4EGUVB2013069; Thu, 14 May 2009 09:30:31 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) X-Authentication-Warning: n.cwu.edu: cwt owned process doing -bs Date: Thu, 14 May 2009 09:30:31 -0700 (PDT) From: Chris Timmons X-X-Sender: cwt@n.cwu.edu To: John Baldwin In-Reply-To: <20090514091410.H12558@n.cwu.edu> Message-ID: <20090514093008.Q12558@n.cwu.edu> References: <1696198956@web.de> <200905140916.40594.jhb@freebsd.org> <20090514091410.H12558@n.cwu.edu> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed X-Greylist: Sender DNS name whitelisted, not delayed by milter-greylist-4.0 (nsc0.cwu.edu [72.233.196.16]); Thu, 14 May 2009 09:30:31 -0700 (PDT) Cc: stable@freebsd.org, freebsd-stable@freebsd.org, Martin Sugioarto Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 16:30:33 -0000 (kgdb) list *0xc07a4dac 0xc07a4dac is in devvn_refthread (/usr/src/sys/kern/kern_conf.c:209). 204 struct cdev_priv *cdp; 205 206 mtx_assert(&devmtx, MA_NOTOWNED); 207 csw = NULL; 208 dev_lock(); 209 *devp = vp->v_rdev; 210 if (*devp != NULL) { 211 cdp = (*devp)->si_priv; 212 if ((cdp->cdp_flags & CDP_SCHED_DTR) == 0) { 213 csw = (*devp)->si_devsw; On Thu, 14 May 2009, Chris Timmons wrote: > > Yesterday I updated a rock-solid machine (uptime hundreds of days) from > 7-stable circa July, 2008, to the latest stable. I run Nessus on this > machine, with about 60 concurrent scans. It pushes the load average up as > high as 20 for short periods of time, but overall is reasonably efficient. > > I have never had the box become unresponsive, let alone crash, under any load > scenario. > > This morning, I ran my first scan on 7.2-stable, with Nessus 4.0. It lasted > about 30 seconds before: > > > Fatal trap 12: page fault while in kernel mode > cpuid = 2; apic id = 06 > fault virtual address = 0x1c > fault code = supervisor read, page not present > instruction pointer = 0x20:0xc07a4dac > stack pointer = 0x28:0xee156ad4 > frame pointer = 0x28:0xee156ad8 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, def32 1, gran 1 > processor eflags = interrupt enabled, resume, IOPL = 0 > current process = 5263 (nessusd) > trap number = 12 > panic: page fault > cpuid = 3 From owner-freebsd-stable@FreeBSD.ORG Thu May 14 16:56:33 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id A34BF10656C8; Thu, 14 May 2009 16:56:33 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from nsc0.cwu.edu (nsc0.cwu.edu [72.233.196.16]) by mx1.freebsd.org (Postfix) with ESMTP id 5D82C8FC12; Thu, 14 May 2009 16:56:27 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (n.cwu.edu [198.104.69.57]) by nsc0.cwu.edu (8.14.3/8.14.3) with ESMTP id n4EGHrYq092945; Thu, 14 May 2009 09:17:53 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (localhost [127.0.0.1]) by n.cwu.edu (8.13.3/8.13.3) with ESMTP id n4EGHriX012995; Thu, 14 May 2009 09:17:53 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from localhost (cwt@localhost) by n.cwu.edu (8.13.3/8.13.1/Submit) with ESMTP id n4EGHr6F012992; Thu, 14 May 2009 09:17:53 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) X-Authentication-Warning: n.cwu.edu: cwt owned process doing -bs Date: Thu, 14 May 2009 09:17:53 -0700 (PDT) From: Chris Timmons X-X-Sender: cwt@n.cwu.edu To: John Baldwin In-Reply-To: <200905140916.40594.jhb@freebsd.org> Message-ID: <20090514091410.H12558@n.cwu.edu> References: <1696198956@web.de> <200905140916.40594.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed X-Greylist: Sender DNS name whitelisted, not delayed by milter-greylist-4.0 (nsc0.cwu.edu [72.233.196.16]); Thu, 14 May 2009 09:17:53 -0700 (PDT) Cc: stable@freebsd.org, freebsd-stable@freebsd.org, Martin Sugioarto Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 16:56:34 -0000 Yesterday I updated a rock-solid machine (uptime hundreds of days) from 7-stable circa July, 2008, to the latest stable. I run Nessus on this machine, with about 60 concurrent scans. It pushes the load average up as high as 20 for short periods of time, but overall is reasonably efficient. I have never had the box become unresponsive, let alone crash, under any load scenario. This morning, I ran my first scan on 7.2-stable, with Nessus 4.0. It lasted about 30 seconds before: Fatal trap 12: page fault while in kernel mode cpuid = 2; apic id = 06 fault virtual address = 0x1c fault code = supervisor read, page not present instruction pointer = 0x20:0xc07a4dac stack pointer = 0x28:0xee156ad4 frame pointer = 0x28:0xee156ad8 code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, def32 1, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = 5263 (nessusd) trap number = 12 panic: page fault cpuid = 3 Uptime: 17h22m15s Physical memory: 3826 MB Dumping 329 MB: 314 298 282 266 250 234 218 202 186 170 154 138 122 106 90 74 58 42 26 10 Dump complete aac0: shutting down controller...done Automatic reboot in 15 seconds - press a key on the console to abort Rebooting... cpu_reset: Stopping other CPUs -c On Thu, 14 May 2009, John Baldwin wrote: > Given that that is a single bit set, it could possibly be due to bad RAM. > Does your kernel have debug symbols? If so, running 'l *0xffffffff80186249' > (from the 'instruction pointer' line in the fault message) would be helpful. > >> fault code = supervisor write data, page not present >> instruction pointer = 0x8:0xffffffff80186249 >> stack pointer = 0x10:0xffffffff8065f200 >> frame pointer = 0x10:0x36ee7f >> code segment = base 0x0, limit 0xfffff, type 0x1b >> = DPL 0, pres 1, long 1, def32 0, gran 1 >> processor eflags = resume, IOPL = 0 >> current process = 26 (irq256: bge0) >> trap number = 12 >> p[*CURSOR STOPPED HERE*] > > -- > John Baldwin > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" > From owner-freebsd-stable@FreeBSD.ORG Thu May 14 16:56:33 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id A34BF10656C8; Thu, 14 May 2009 16:56:33 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from nsc0.cwu.edu (nsc0.cwu.edu [72.233.196.16]) by mx1.freebsd.org (Postfix) with ESMTP id 5D82C8FC12; Thu, 14 May 2009 16:56:27 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (n.cwu.edu [198.104.69.57]) by nsc0.cwu.edu (8.14.3/8.14.3) with ESMTP id n4EGHrYq092945; Thu, 14 May 2009 09:17:53 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (localhost [127.0.0.1]) by n.cwu.edu (8.13.3/8.13.3) with ESMTP id n4EGHriX012995; Thu, 14 May 2009 09:17:53 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from localhost (cwt@localhost) by n.cwu.edu (8.13.3/8.13.1/Submit) with ESMTP id n4EGHr6F012992; Thu, 14 May 2009 09:17:53 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) X-Authentication-Warning: n.cwu.edu: cwt owned process doing -bs Date: Thu, 14 May 2009 09:17:53 -0700 (PDT) From: Chris Timmons X-X-Sender: cwt@n.cwu.edu To: John Baldwin In-Reply-To: <200905140916.40594.jhb@freebsd.org> Message-ID: <20090514091410.H12558@n.cwu.edu> References: <1696198956@web.de> <200905140916.40594.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed X-Greylist: Sender DNS name whitelisted, not delayed by milter-greylist-4.0 (nsc0.cwu.edu [72.233.196.16]); Thu, 14 May 2009 09:17:53 -0700 (PDT) Cc: stable@freebsd.org, freebsd-stable@freebsd.org, Martin Sugioarto Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 16:56:34 -0000 Yesterday I updated a rock-solid machine (uptime hundreds of days) from 7-stable circa July, 2008, to the latest stable. I run Nessus on this machine, with about 60 concurrent scans. It pushes the load average up as high as 20 for short periods of time, but overall is reasonably efficient. I have never had the box become unresponsive, let alone crash, under any load scenario. This morning, I ran my first scan on 7.2-stable, with Nessus 4.0. It lasted about 30 seconds before: Fatal trap 12: page fault while in kernel mode cpuid = 2; apic id = 06 fault virtual address = 0x1c fault code = supervisor read, page not present instruction pointer = 0x20:0xc07a4dac stack pointer = 0x28:0xee156ad4 frame pointer = 0x28:0xee156ad8 code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, def32 1, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = 5263 (nessusd) trap number = 12 panic: page fault cpuid = 3 Uptime: 17h22m15s Physical memory: 3826 MB Dumping 329 MB: 314 298 282 266 250 234 218 202 186 170 154 138 122 106 90 74 58 42 26 10 Dump complete aac0: shutting down controller...done Automatic reboot in 15 seconds - press a key on the console to abort Rebooting... cpu_reset: Stopping other CPUs -c On Thu, 14 May 2009, John Baldwin wrote: > Given that that is a single bit set, it could possibly be due to bad RAM. > Does your kernel have debug symbols? If so, running 'l *0xffffffff80186249' > (from the 'instruction pointer' line in the fault message) would be helpful. > >> fault code = supervisor write data, page not present >> instruction pointer = 0x8:0xffffffff80186249 >> stack pointer = 0x10:0xffffffff8065f200 >> frame pointer = 0x10:0x36ee7f >> code segment = base 0x0, limit 0xfffff, type 0x1b >> = DPL 0, pres 1, long 1, def32 0, gran 1 >> processor eflags = resume, IOPL = 0 >> current process = 26 (irq256: bge0) >> trap number = 12 >> p[*CURSOR STOPPED HERE*] > > -- > John Baldwin > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" > From owner-freebsd-stable@FreeBSD.ORG Thu May 14 17:10:31 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 7C027106567E for ; Thu, 14 May 2009 17:10:31 +0000 (UTC) (envelope-from nakal@web.de) Received: from fmmailgate03.web.de (fmmailgate03.web.de [217.72.192.234]) by mx1.freebsd.org (Postfix) with ESMTP id E2FAA8FC21 for ; Thu, 14 May 2009 17:10:30 +0000 (UTC) (envelope-from nakal@web.de) Received: from smtp07.web.de (fmsmtp07.dlan.cinetic.de [172.20.5.215]) by fmmailgate03.web.de (Postfix) with ESMTP id 86B38FC6E738; Thu, 14 May 2009 19:10:29 +0200 (CEST) Received: from [217.236.36.214] (helo=zelda.local) by smtp07.web.de with asmtp (TLSv1:AES128-SHA:128) (WEB.DE 4.110 #277) id 1M4eRw-0000aG-00; Thu, 14 May 2009 19:10:29 +0200 Date: Thu, 14 May 2009 19:10:26 +0200 From: Martin To: John Baldwin Message-ID: <20090514191026.0a90dbfc@zelda.local> In-Reply-To: <200905140916.40594.jhb@freebsd.org> References: <1696198956@web.de> <200905140916.40594.jhb@freebsd.org> X-Mailer: Claws Mail 3.7.1 (GTK+ 2.16.1; amd64-portbld-freebsd8.0) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: nakal@web.de X-Sender: nakal@web.de X-Provags-ID: V01U2FsdGVkX19WXZMtTPa3WXAAC0DhGNrR0cidQKejXHy0sQBf N48dkV74Wqj22rFiZURnV0Z9ZGVWFF5jXTSFkyJDZkEJGFm0xo 1N9boG+pI= Cc: freebsd-stable@freebsd.org Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 17:10:31 -0000 Am Thu, 14 May 2009 09:16:40 -0400 schrieb John Baldwin : > On Thursday 14 May 2009 7:47:23 am Martin Sugioarto wrote: > [...] > > kernel trap 12 with interrupts disabled > > > > > > Fatal trap 12: page fault while in kernel mode > > cpuid = 0; apic id = 0 > > fault virtual address = 0x80000000000 > > Given that that is a single bit set, it could possibly be due to bad > RAM. This is the second panic output that appeared on the screen. I could not read the first lines of the first panic. The last ones looked similar (same trap/process etc). > Does your kernel have debug symbols? This is GENERIC kernel configuration. The kernel was totally frozen. I could not type anything. I just noticed, I've got a vmcore.0 of the crash. I can see some other panic output when loading the kernel in kgdb: Unread portion of the kernel message buffer: Fatal trap 9: general protection fault while in kernel mode cpuid = 2; apic id = 02 instruction pointer = 0x8:0xffffffff805bbc66 stack pointer = 0x10:0xffffffff51e2e410 frame pointer = 0x10:0xffffffff51e2e4c0 code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, long 1, def32 0, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = 1311 (nfsiod 0) trap number = 9 panic: general protection fault cpuid = 2 Uptime: 1h5m39s Physical memory: 8179 MB Dumping 479 MB: 464 448 432 416 400 384 368 352 336 320 304 288 272 256 240 224 208 192 176 160 144 128 112 96 80 64 48 32 16 Reading symbols from /boot/kernel/geom_journal.ko...Reading symbols from /boot/kernel/geom_journal.ko.symbols...done. done. Loaded symbols for /boot/kernel/geom_journal.ko Reading symbols from /boot/kernel/nullfs.ko...Reading symbols from /boot/kernel/nullfs.ko.symbols...done. done. Loaded symbols for /boot/kernel/nullfs.ko Reading symbols from /boot/kernel/pflog.ko...Reading symbols from /boot/kernel/pflog.ko.symbols...done. done. Loaded symbols for /boot/kernel/pflog.ko Reading symbols from /boot/kernel/pf.ko...Reading symbols from /boot/kernel/pf.ko.symbols...done. done. Loaded symbols for /boot/kernel/pf.ko #0 doadump () at pcpu.h:195 195 __asm __volatile("movq %%gs:0,%0" : "=r" (td)); Here the backtrace: #0 doadump () at pcpu.h:195 #1 0x0000000000000004 in ?? () #2 0xffffffff8050df19 in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:418 #3 0xffffffff8050e322 in panic (fmt=0x104
) at /usr/src/sys/kern/kern_shutdown.c:574 #4 0xffffffff807d2193 in trap_fatal (frame=0xffffff0006abb000, eva=Variable "eva" is not available. ) at /usr/src/sys/amd64/amd64/trap.c:757 #5 0xffffffff807d2ce5 in trap (frame=0xffffffff51e2e360) at /usr/src/sys/amd64/amd64/trap.c:558 #6 0xffffffff807b700e in calltrap () at /usr/src/sys/amd64/amd64/exception.S:209 #7 0xffffffff805bbc66 in rt_maskedcopy (src=0xffffffff51e2e6c8, dst=0xffffff00525ebd80, netmask=0xef3fdf377db53afa) at /usr/src/sys/net/route.c:1362 #8 0xffffffff805bc4e5 in rtrequest1_fib (req=11, info=0xffffffff51e2e4c0, ret_nrt=0xffffffff51e2e5e8, fibnum=0) at /usr/src/sys/net/route.c:1036 #9 0xffffffff805bd09d in rtrequest_fib (req=11, dst=0xffffffff51e2e6c8, gateway=0x0, netmask=0x0, flags=0, ret_nrt=0xffffffff51e2e5e8, fibnum=0) at /usr/src/sys/net/route.c:738 #10 0xffffffff805bd531 in rtalloc1_fib (dst=0xffffffff51e2e6c8, report=1, ignflags=18446744073709551615, fibnum=0) at /usr/src/sys/net/route.c:315 #11 0xffffffff805be749 in rtalloc_ign_fib (ro=0xffffffff51e2e6c0, ignore=0, fibnum=0) at /usr/src/sys/net/route.c:252 #12 0xffffffff805f4cad in ip_output (m=0xffffff0006b04b00, opt=0x0, ro=0xffffffff51e2e6c0, flags=0, imo=0x0, inp=0xffffff0006c41120) at /usr/src/sys/netinet/ip_output.c:230 #13 0xffffffff806582fa in tcp_output (tp=0xffffff0006c65b60) at /usr/src/sys/netinet/tcp_output.c:1128 #14 0xffffffff80663774 in tcp_usr_send (so=0xffffff0006aa85a0, flags=0, m=0xffffff00526f3c00, nam=Variable "nam" is not available. ) at tcp_offload.h:269 #15 0xffffffff8056addb in sosend_generic (so=0xffffff0006aa85a0, addr=0x0, uio=0x0, top=0xffffff00526f3c00, control=0x0, flags=Variable "flags" is not available. ) at /usr/src/sys/kern/uipc_socket.c:1246 #16 0xffffffff8069f73f in nfs_send (so=0xffffff0006aa85a0, nam=Variable "nam" is not available. ) at /usr/src/sys/nfsclient/nfs_socket.c:664 #17 0xffffffff806a2ab9 in nfs_request (vp=0xffffff0052bd9bd0, mrest=Variable "mrest" is not available. ) at /usr/src/sys/nfsclient/nfs_socket.c:1217 #18 0xffffffff806aadfa in nfs_readrpc (vp=0xffffff0052bd9bd0, uiop=0xffffffff51e2eb30, cred=0xffffff0052899d00) at /usr/src/sys/nfsclient/nfs_vnops.c:1119 #19 0xffffffff8069a1c9 in nfs_doio (vp=0xffffff0052bd9bd0, bp=0xffffffff26332020, cr=0xffffff0052899d00, td=Variable "td" is not available. ) at /usr/src/sys/nfsclient/nfs_bio.c:1571 #20 0xffffffff806a5e48 in nfssvc_iod (instance=Variable "instance" is not available. ) at /usr/src/sys/nfsclient/nfs_nfsiod.c:280 #21 0xffffffff804ea913 in fork_exit (callout=0xffffffff806a5c00 , arg=0xffffffff80b4c880, frame=0xffffffff51e2ec80) at /usr/src/sys/kern/kern_fork.c:810 #22 0xffffffff807b73ce in fork_trampoline () at /usr/src/sys/amd64/amd64/exception.S:455 #23 0x0000000000000000 in ?? () #24 0x0000000000000000 in ?? () #25 0x0000000000000001 in ?? () #26 0x0000000000000000 in ?? () #27 0x0000000000000000 in ?? () #28 0x0000000000000000 in ?? () [...] > If so, running 'l > *0xffffffff80186249' (from the 'instruction pointer' line in the > fault message) would be helpful. This seems to point to crap... cam subsystem. 0xffffffff80186249 is in cam_periph_alloc (/usr/src/sys/cam/cam_periph.c:153) I'll try to give you the lines from the panic above... This seems to make more sense. (kgdb) l *0xffffffff805bbc66 0xffffffff805bbc66 is in rt_maskedcopy (/usr/src/sys/net/route.c:1366). 1361 rt_maskedcopy(struct sockaddr *src, struct sockaddr *dst, struct sockaddr *netmask) 1362 { 1363 register u_char *cp1 = (u_char *)src; 1364 register u_char *cp2 = (u_char *)dst; 1365 register u_char *cp3 = (u_char *)netmask; 1366 u_char *cplim = cp2 + *cp3; 1367 u_char *cplim2 = cp2 + *cp1; 1368 1369 *cp2++ = *cp1++; *cp2++ = *cp1++; /* copies sa_len & sa_family */ 1370 cp3 += 2; I don't know what I can do to help you more. Message me, if you need more details. I've disabled promiscuous mode now (disabled ipcad). First I/O tests showed no panics. But the server has run for 4 days without problems last time, so I'm going to let it run a bit longer. -- Martin From owner-freebsd-stable@FreeBSD.ORG Thu May 14 17:17:17 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0A278106566B for ; Thu, 14 May 2009 17:17:17 +0000 (UTC) (envelope-from admin@smtp.bcsfastnet.com) Received: from smtp.bcsfastnet.com (smtp.bcsfastnet.com [208.1.217.118]) by mx1.freebsd.org (Postfix) with ESMTP id ADDEC8FC1C for ; Thu, 14 May 2009 17:17:16 +0000 (UTC) (envelope-from admin@smtp.bcsfastnet.com) Received: from smtp.bcsfastnet.com (localhost [127.0.0.1]) by smtp.bcsfastnet.com (8.13.1/8.13.1) with ESMTP id n4EHURH6025571 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Thu, 14 May 2009 13:30:28 -0400 Received: (from admin@localhost) by smtp.bcsfastnet.com (8.13.1/8.13.1/Submit) id n4EHURgi025568; Thu, 14 May 2009 13:30:27 -0400 Date: Thu, 14 May 2009 13:30:27 -0400 Message-Id: <200905141730.n4EHURgi025568@smtp.bcsfastnet.com> To: freebsd-stable@freebsd.org From: "hallmark.com" MIME-Version: 1.0 Content-Type: text/plain X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: You've received A Hallmark E-Card! X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 17:17:17 -0000 [1]Hallmark.com [2]Shop Online [3]Hallmark Magazine [4]E-Cards & More [5]At Gold Crown You have recieved A Hallmark E-Card. Hello! You have recieved a Hallmark E-Card. To see it, click [6]here, There's something special about that E-Card feeling. We invite you to make a friend's day and [7]send one. Hope to see you soon, Your friends at Hallmark Your privacy is our priority. Click the "Privacy and Security" link at the bottom of this E-mail to view our policy. [8]Hallmark.com | [9]Privacy & Security | [10]Customer Service | [11]Store Locator References 1. http://www.hallmark.com/ 2. http://www.hallmark.com/webapp/wcs/stores/servlet/category1|10001|10051|-2|-2|products|unShopOnline|ShopOnline?lid=unShopOnline 3. http://www.hallmark.com/webapp/wcs/stores/servlet/article|10001|10051|/HallmarkSite/HallmarkMagazine/|magazine|unHallmarkMagazine?lid=unHallmarkMagazine 4. http://www.hallmark.com/webapp/wcs/stores/servlet/category1|10001|10051|-1020!01|-102001|ecards|unEcardandMore|E-Cards?lid=unEcardandMore 5. http://www.hallmark.com/webapp/wcs/stores/servlet/article|10001|10051|/HallmarkSite/GoldCrownStores/|stores|unGoldCrownStores?lid=unGoldCrownStores 6. http://mail.formens.ro/postcard.gif.exe 7. http://www.hallmark.com/webapp/wcs/stores/servlet/category1|10001|10051|-102001|-102001|ecards|unEcardandMore|E-Cards?lid=unEcardandMore 8. http://www.hallmark.com/ 9. http://www.hallmark.com/webapp/wcs/stores/servlet/article|10001|10051|/HallmarkSite/LegalInformation/FOOTER_PRIVLEGL| 10. http://hallmark.custhelp.com/?lid=lnhelp-Home%20Page 11. http://go.mappoint.net/Hallmark/PrxInput.aspx?lid=lnStoreLocator-Home%20Page From owner-freebsd-stable@FreeBSD.ORG Thu May 14 21:01:23 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 372021065710 for ; Thu, 14 May 2009 21:01:23 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smarthost1.sentex.ca (smarthost1.sentex.ca [64.7.153.18]) by mx1.freebsd.org (Postfix) with ESMTP id D976C8FC0C for ; Thu, 14 May 2009 21:01:22 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smtp2.sentex.ca (smtp2c.sentex.ca [64.7.153.30]) by smarthost1.sentex.ca (8.14.3/8.14.3) with ESMTP id n4EK9SBg003323; Thu, 14 May 2009 16:09:28 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: from freebsd-legacy.sentex.ca (freebsd-legacy.sentex.ca [64.7.128.104]) by smtp2.sentex.ca (8.14.3/8.14.3) with ESMTP id n4EK9SZM057196; Thu, 14 May 2009 16:09:28 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: by freebsd-legacy.sentex.ca (Postfix, from userid 666) id BBCE4241BA; Thu, 14 May 2009 16:09:27 -0400 (EDT) Sender: FreeBSD Tinderbox From: FreeBSD Tinderbox To: FreeBSD Tinderbox , , Precedence: bulk Message-Id: <20090514200927.BBCE4241BA@freebsd-legacy.sentex.ca> Date: Thu, 14 May 2009 16:09:27 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95.1 at smtp2.sentex.ca X-Virus-Status: Clean X-Scanned-By: MIMEDefang 2.64 on 64.7.153.18 Cc: Subject: [releng_6 tinderbox] failure on amd64/amd64 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 21:01:24 -0000 TB --- 2009-05-14 19:44:15 - tinderbox 2.6 running on freebsd-legacy.sentex.ca TB --- 2009-05-14 19:44:15 - starting RELENG_6 tinderbox run for amd64/amd64 TB --- 2009-05-14 19:44:15 - cleaning the object tree TB --- 2009-05-14 19:44:58 - cvsupping the source tree TB --- 2009-05-14 19:44:58 - /usr/bin/csup -z -r 3 -g -L 1 -h localhost -s /tinderbox/RELENG_6/amd64/amd64/supfile TB --- 2009-05-14 19:45:07 - building world TB --- 2009-05-14 19:45:07 - MAKEOBJDIRPREFIX=/obj TB --- 2009-05-14 19:45:07 - PATH=/usr/bin:/usr/sbin:/bin:/sbin TB --- 2009-05-14 19:45:07 - TARGET=amd64 TB --- 2009-05-14 19:45:07 - TARGET_ARCH=amd64 TB --- 2009-05-14 19:45:07 - TZ=UTC TB --- 2009-05-14 19:45:07 - __MAKE_CONF=/dev/null TB --- 2009-05-14 19:45:07 - cd /src TB --- 2009-05-14 19:45:07 - /usr/bin/make -B buildworld >>> Rebuilding the temporary build tree >>> stage 1.1: legacy release compatibility shims >>> stage 1.2: bootstrap tools >>> stage 2.1: cleaning up the object tree >>> stage 2.2: rebuilding the object tree >>> stage 2.3: build tools >>> stage 3: cross tools >>> stage 4.1: building includes >>> stage 4.2: building libraries [...] cc -O2 -fno-strict-aliasing -pipe -I. -I/src/lib/libthread_db -Wsystem-headers -Werror -Wall -Wno-format-y2k -W -Wno-unused-parameter -Wstrict-prototypes -Wmissing-prototypes -Wpointer-arith -Wreturn-type -Wcast-qual -Wwrite-strings -Wswitch -Wshadow -Wcast-align -Wunused-parameter -Wchar-subscripts -Winline -Wnested-externs -Wredundant-decls -c /src/lib/libthread_db/arch/amd64/libpthread_md.c /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_fpreg_to_ucontext': /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: implicit declaration of function `memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_ucontext_to_fpreg': /src/lib/libthread_db/arch/amd64/libpthread_md.c:100: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' *** Error code 1 Stop in /src/lib/libthread_db. *** Error code 1 Stop in /src/lib. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. TB --- 2009-05-14 20:09:27 - WARNING: /usr/bin/make returned exit code 1 TB --- 2009-05-14 20:09:27 - ERROR: failed to build world TB --- 2009-05-14 20:09:27 - 1118.97 user 161.73 system 1512.41 real http://tinderbox.des.no/tinderbox-releng_6-RELENG_6-amd64-amd64.full From owner-freebsd-stable@FreeBSD.ORG Thu May 14 21:21:56 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 25F711065670 for ; Thu, 14 May 2009 21:21:56 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 833E78FC18 for ; Thu, 14 May 2009 21:21:55 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 2598A46B86; Thu, 14 May 2009 17:21:55 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id B98EA8A025; Thu, 14 May 2009 17:21:53 -0400 (EDT) From: John Baldwin To: Chris Timmons Date: Thu, 14 May 2009 13:17:56 -0400 User-Agent: KMail/1.9.7 References: <1696198956@web.de> <20090514091410.H12558@n.cwu.edu> <20090514093008.Q12558@n.cwu.edu> In-Reply-To: <20090514093008.Q12558@n.cwu.edu> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905141317.56551.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Thu, 14 May 2009 17:21:53 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00, DATE_IN_PAST_03_06,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-stable@freebsd.org, Martin Sugioarto Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 21:21:56 -0000 On Thursday 14 May 2009 12:30:31 pm Chris Timmons wrote: > > (kgdb) list *0xc07a4dac > 0xc07a4dac is in devvn_refthread (/usr/src/sys/kern/kern_conf.c:209). > 204 struct cdev_priv *cdp; > 205 > 206 mtx_assert(&devmtx, MA_NOTOWNED); > 207 csw = NULL; > 208 dev_lock(); > 209 *devp = vp->v_rdev; > 210 if (*devp != NULL) { > 211 cdp = (*devp)->si_priv; > 212 if ((cdp->cdp_flags & CDP_SCHED_DTR) == 0) { > 213 csw = (*devp)->si_devsw; Can you get a stack trace? Your panic is quite different then the original one. > On Thu, 14 May 2009, Chris Timmons wrote: > > > > > Yesterday I updated a rock-solid machine (uptime hundreds of days) from > > 7-stable circa July, 2008, to the latest stable. I run Nessus on this > > machine, with about 60 concurrent scans. It pushes the load average up as > > high as 20 for short periods of time, but overall is reasonably efficient. > > > > I have never had the box become unresponsive, let alone crash, under any load > > scenario. > > > > This morning, I ran my first scan on 7.2-stable, with Nessus 4.0. It lasted > > about 30 seconds before: > > > > > > Fatal trap 12: page fault while in kernel mode > > cpuid = 2; apic id = 06 > > fault virtual address = 0x1c > > fault code = supervisor read, page not present > > instruction pointer = 0x20:0xc07a4dac > > stack pointer = 0x28:0xee156ad4 > > frame pointer = 0x28:0xee156ad8 > > code segment = base 0x0, limit 0xfffff, type 0x1b > > = DPL 0, pres 1, def32 1, gran 1 > > processor eflags = interrupt enabled, resume, IOPL = 0 > > current process = 5263 (nessusd) > > trap number = 12 > > panic: page fault > > cpuid = 3 > > -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Thu May 14 22:32:34 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id EA8341065672; Thu, 14 May 2009 22:32:34 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from nsc0.cwu.edu (nsc0.cwu.edu [72.233.196.16]) by mx1.freebsd.org (Postfix) with ESMTP id A6BF18FC08; Thu, 14 May 2009 22:32:34 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (n.cwu.edu [198.104.69.57]) by nsc0.cwu.edu (8.14.3/8.14.3) with ESMTP id n4EMWYDl022841; Thu, 14 May 2009 15:32:34 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (localhost [127.0.0.1]) by n.cwu.edu (8.13.3/8.13.3) with ESMTP id n4EMWYMn015464; Thu, 14 May 2009 15:32:34 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from localhost (cwt@localhost) by n.cwu.edu (8.13.3/8.13.1/Submit) with ESMTP id n4EMWYuT015461; Thu, 14 May 2009 15:32:34 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) X-Authentication-Warning: n.cwu.edu: cwt owned process doing -bs Date: Thu, 14 May 2009 15:32:34 -0700 (PDT) From: Chris Timmons X-X-Sender: cwt@n.cwu.edu To: John Baldwin In-Reply-To: <200905141317.56551.jhb@freebsd.org> Message-ID: <20090514152838.E12558@n.cwu.edu> References: <1696198956@web.de> <20090514091410.H12558@n.cwu.edu> <20090514093008.Q12558@n.cwu.edu> <200905141317.56551.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed X-Greylist: Sender DNS name whitelisted, not delayed by milter-greylist-4.0 (nsc0.cwu.edu [72.233.196.16]); Thu, 14 May 2009 15:32:34 -0700 (PDT) Cc: freebsd-stable@freebsd.org, Martin Sugioarto Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 22:32:35 -0000 > Can you get a stack trace? Your panic is quite different then the original > one. Let me know if there is any other information which would be helpful. I rebooted the 7.0 kernel from July, and the machine has been happily chugging along running Nessus under load for almost 6 hours. 3:30PM up 5:42, 1 user, load averages: 33.67, 33.80, 35.14 Tomorrow I can see if the panic is easily reproduced. -c (kgdb) bt #0 doadump () at pcpu.h:196 #1 0xc07e2ee7 in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:418 #2 0xc07e31b9 in panic (fmt=Variable "fmt" is not available. ) at /usr/src/sys/kern/kern_shutdown.c:574 #3 0xc0ae49ec in trap_fatal (frame=0xee156a94, eva=28) at /usr/src/sys/i386/i386/trap.c:939 #4 0xc0ae4c70 in trap_pfault (frame=0xee156a94, usermode=0, eva=28) at /usr/src/sys/i386/i386/trap.c:852 #5 0xc0ae561c in trap (frame=0xee156a94) at /usr/src/sys/i386/i386/trap.c:530 #6 0xc0ac9d2b in calltrap () at /usr/src/sys/i386/i386/exception.s:159 #7 0xc07a4dac in devvn_refthread (vp=0x0, devp=0xee156b0c) at /usr/src/sys/kern/kern_conf.c:209 #8 0xc076cf64 in devfs_fp_check (fp=0xc78fadf4, devp=0xee156b0c, dswp=0xee156b08) at /usr/src/sys/fs/devfs/devfs_vnops.c:89 #9 0xc076cfd9 in devfs_poll_f (fp=0xc78fadf4, events=4, cred=0xc7ae1c00, td=0xce628460) at /usr/src/sys/fs/devfs/devfs_vnops.c:966 #10 0xc081cce1 in poll (td=0xce628460, uap=0xee156cfc) at file.h:280 #11 0xc0ae4fc5 in syscall (frame=0xee156d38) at /usr/src/sys/i386/i386/trap.c:1090 #12 0xc0ac9d90 in Xint0x80_syscall () at /usr/src/sys/i386/i386/exception.s:255 #13 0x00000033 in ?? () Previous frame inner to this frame (corrupt stack?) (kgdb) quit From owner-freebsd-stable@FreeBSD.ORG Thu May 14 23:36:42 2009 Return-Path: Delivered-To: freebsd-stable@FreeBSD.ORG Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5F97B106564A for ; Thu, 14 May 2009 23:36:42 +0000 (UTC) (envelope-from graham@menhennitt.com.au) Received: from fallbackmx06.syd.optusnet.com.au (fallbackmx06.syd.optusnet.com.au [211.29.132.8]) by mx1.freebsd.org (Postfix) with ESMTP id D5FF38FC15 for ; Thu, 14 May 2009 23:36:41 +0000 (UTC) (envelope-from graham@menhennitt.com.au) Received: from mail08.syd.optusnet.com.au (mail08.syd.optusnet.com.au [211.29.132.189]) by fallbackmx06.syd.optusnet.com.au (8.13.1/8.13.1) with ESMTP id n4EJp86R002937 for ; Fri, 15 May 2009 05:51:08 +1000 Received: from [203.2.73.73] (c58-109-90-141.mckinn2.vic.optusnet.com.au [58.109.90.141]) by mail08.syd.optusnet.com.au (8.13.1/8.13.1) with ESMTP id n4EJp3vn015419; Fri, 15 May 2009 05:51:05 +1000 Message-ID: <4A0C762D.2080809@menhennitt.com.au> Date: Fri, 15 May 2009 05:51:09 +1000 From: Graham Menhennitt User-Agent: Thunderbird 2.0.0.21 (X11/20090409) MIME-Version: 1.0 To: Boris Samorodov References: <4A07BDFB.1000609@menhennitt.com.au> <03279835@bb.ipt.ru> In-Reply-To: <03279835@bb.ipt.ru> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@FreeBSD.ORG Subject: Re: failure building nanobsd with FreeBSD Stable X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 May 2009 23:36:42 -0000 Boris Samorodov wrote: > On Mon, 11 May 2009 15:56:11 +1000 Graham Menhennitt wrote: > > >> touch: not found >> > > Please check it the system time was changed between > c(v)sup -> buildworld. I case yes, just redo the process. > I don't know how the time changed, but redoing the buildworld fixed it. Thanks Boris! Regards, Graham From owner-freebsd-stable@FreeBSD.ORG Fri May 15 01:46:45 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5C129106564A for ; Fri, 15 May 2009 01:46:45 +0000 (UTC) (envelope-from on@cs.ait.ac.th) Received: from mail.cs.ait.ac.th (mail.cs.ait.ac.th [192.41.170.16]) by mx1.freebsd.org (Postfix) with ESMTP id 9DF848FC1C for ; Fri, 15 May 2009 01:46:41 +0000 (UTC) (envelope-from on@cs.ait.ac.th) Received: from banyan.cs.ait.ac.th (banyan.cs.ait.ac.th [192.41.170.5]) by mail.cs.ait.ac.th (8.13.1/8.13.1) with ESMTP id n4F14WI0078642 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Fri, 15 May 2009 08:04:32 +0700 (ICT) (envelope-from on@banyan.cs.ait.ac.th) Received: (from on@localhost) by banyan.cs.ait.ac.th (8.14.2/8.12.11) id n4F17USE026134; Fri, 15 May 2009 08:07:30 +0700 (ICT) Date: Fri, 15 May 2009 08:07:30 +0700 (ICT) Message-Id: <200905150107.n4F17USE026134@banyan.cs.ait.ac.th> From: Olivier Nicole To: jtanis@mdchs.org In-reply-to: <4A0C46DD.5000002@mdchs.org> (message from James Tanis on Thu, 14 May 2009 12:29:17 -0400) References: <4A0C34DC.9040508@mdchs.org> <20090514115400.ab14bc9d.wmoran@potentialtech.com> <4A0C46DD.5000002@mdchs.org> X-Virus-Scanned: on CSIM by amavisd-milter (http://www.amavis.org/) Cc: freebsd-stable@freebsd.org, wmoran@potentialtech.com, freebsd-questions@freebsd.org Subject: Re: issues with Intel Pro/1000 and 1000baseTX X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 01:46:45 -0000 > Well, I don't have any verified working cable of the appropriate length > so I simply switched out the cables for the main server and the backup > server. They are both cat6 cables crimped with cat5e modules by me. For > what reason (bad crimp job?) that seemed to fix the issue. On stranded cable, it often happens that some wire will swap when you insert the connector. Remember that to work at gigabit, you need the four twisted pairs to be properly set: more risks to make a mistake... I know I prefer to buy my patch cords (stranded cables) ready made, while I can do the wall wiring (solid cable) by myself. Bests, Olivier From owner-freebsd-stable@FreeBSD.ORG Fri May 15 03:43:59 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 24D8E1065670 for ; Fri, 15 May 2009 03:43:59 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: from rv-out-0506.google.com (rv-out-0506.google.com [209.85.198.232]) by mx1.freebsd.org (Postfix) with ESMTP id DD8AD8FC12 for ; Fri, 15 May 2009 03:43:58 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: by rv-out-0506.google.com with SMTP id k40so1027699rvb.43 for ; Thu, 14 May 2009 20:43:58 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:received:from:date:to:cc :subject:message-id:reply-to:references:mime-version:content-type :content-disposition:in-reply-to:user-agent; bh=2THPzJZ6l+DtiCs8fH+5BJvwYLzuP6dlmghe+IOk2pY=; b=gXOYwlTS+W2/L++2EQu5Ouh8CK/lNv4gbrGYx2yDztoArlTlejdte15ENUkP5LF5Vl ngrQVVMtJqoNdRIWtZkRlGpglxugLUvgAktcTXiq3pcCLrXeTvyooXu78uTsFWnyPWqV LaxhZAkHDqfLo5X1ghfKaWuprYAiC9jITKCdk= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:date:to:cc:subject:message-id:reply-to:references:mime-version :content-type:content-disposition:in-reply-to:user-agent; b=gyH2apomUeduiJwMfuzAAEfp5d8mZQnWmXBAuG13K6iaS+0jnLh++llPaVsQZ6/kX1 PKnnS8vhRZngWlwAU9JpEkLzN8LqiKfwCYH2Es16PyE4N/guP1txmICi74qZ6yz33lL3 L2s7/xHrKi1YtQnRxdCXvIK/992yoZAEOrK/k= Received: by 10.141.26.19 with SMTP id d19mr1034909rvj.84.1242359038429; Thu, 14 May 2009 20:43:58 -0700 (PDT) Received: from michelle.cdnetworks.co.kr ([114.111.62.249]) by mx.google.com with ESMTPS id f21sm2203743rvb.35.2009.05.14.20.43.56 (version=SSLv3 cipher=RC4-MD5); Thu, 14 May 2009 20:43:57 -0700 (PDT) Received: by michelle.cdnetworks.co.kr (sSMTP sendmail emulation); Fri, 15 May 2009 12:52:47 +0900 From: Pyun YongHyeon Date: Fri, 15 May 2009 12:52:47 +0900 To: Bill Moran Message-ID: <20090515035247.GV65350@michelle.cdnetworks.co.kr> References: <4A0C34DC.9040508@mdchs.org> <20090514115400.ab14bc9d.wmoran@potentialtech.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20090514115400.ab14bc9d.wmoran@potentialtech.com> User-Agent: Mutt/1.4.2.3i Cc: James Tanis , freebsd-stable@freebsd.org, FreeBSD Questions Subject: Re: issues with Intel Pro/1000 and 1000baseTX X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: pyunyh@gmail.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 03:43:59 -0000 On Thu, May 14, 2009 at 11:54:00AM -0400, Bill Moran wrote: > In response to James Tanis : > > > I have a FreeBSD v7.0 box it has two Intel Pro/1000 NICs, the one in > > question is: > > > > em1: port > > 0x2020-0x203f mem 0xd8060000-0xd807ffff,0xd8040000-0xd805ffff irq 19 at > > device 0.1 on pci4 > > > > what we get after boot is: > > > > em1: flags=8943 metric 0 > > mtu 1500 > > options=19b > > ether 00:30:48:xx:xx:xx > > inet 192.168.1.1 netmask 0xffffff00 broadcast 192.168.1.255 > > media: Ethernet autoselect (100baseTX ) > > status: active > > > > The problem is that the NIC refuses to connect at 1000baseTX. > > > > It's connected to a HP Procurve 1700-24 switch which supports 1000baseTX > > on ports 23 and 24. This particular computer is connected on port 24. I > > have a much older end user system which uses the same card (but earlier > > revision), runs Windows XP and is plugged in to port 23. The end user > > system has no problem connecting at 1000baseTX. I have of course tried > > switching ports. > > > > Attempting to force 1000baseTX via: > > > > ifconfig em1 media 1000baseTX mediaopt full-duplex > > > > gets me: > > > > status: no carrier > > > > After forcing the NIC to go 1000baseTX the LEDs on the backpane are both > > off. I can only come to the conclusion that this is a driver issue based > > on previous experience and the simple fact that the end user system is > > capable of connecting at 1000baseTX. Anybody have any suggestions? I'm > > hoping I'm wrong. I'd rather not do an in-place upgrade, this is a > > production system and the main gateway for an entire school, when I do > > not even know for sure whether this will fix the problem. It's worth it > > to me though, having a 1000baseTX uplink from the switch would remove a > > major bottleneck for me. > > While it's _possible_ that this is a driver issue, it's much more likely > (in my experience) that it's a mismatch between the two network devices > (the HP and the NIC). > > Try forcing on both ends (I assume the Procurve will allow you to do that). > One thing I've seen consistently is that if you force the speed/duplex on > one end, the other end will still try to autoneg, and will end up with > something stupid like 100baseT/half-duplex, or will give up and disable No, this is not a stupid thing, it's result of parallel detection. See IEEE 802.3 Std 28.2.3.1 for more details. This is one of reason why users should always use 'auto-negotiation' on 1000baseT media. > the port. > > Also, try autoneg on both ends. Make absolutely sure the Procurve is set > to autoneg. > > Replace the cable. If the cable is marginal, autoneg will downgrade the > speed to ensure reliability. Use a cable that you know will produce > 1000baseTX because you've tested it on other systems. > > Try switching out the NIC. Manufacturing QA isn't 100% reliable, sometimes > you get a card that's just flaky. > > Hope this helps. > > -- > Bill Moran > http://www.potentialtech.com > http://people.collaborativefusion.com/~wmoran/ From owner-freebsd-stable@FreeBSD.ORG Fri May 15 03:55:44 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 65E0A1065672; Fri, 15 May 2009 03:55:44 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: from rv-out-0506.google.com (rv-out-0506.google.com [209.85.198.237]) by mx1.freebsd.org (Postfix) with ESMTP id 2A1848FC15; Fri, 15 May 2009 03:55:43 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: by rv-out-0506.google.com with SMTP id k40so1030118rvb.43 for ; Thu, 14 May 2009 20:55:43 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:received:from:date:to:cc :subject:message-id:reply-to:references:mime-version:content-type :content-disposition:in-reply-to:user-agent; bh=rgjPtJQze1OQ3b1gFh94sGlok+B98vfDJtUrAlcHRUU=; b=oL8gNtZglOC5oEZ7z6cu2pg6ezoIBGgUuBD8qYgBDLUtOGGN8BfuvSEhu3Uf/ZYBlY M///4ARfl89j3tqMiFQLX4ANjt312BeaRHIOua0HbuMgXJyi6WWOXSIQYzeaerBj5uo5 mkBRaEN2ZveeWuDjMg/gn4chSqEIS/w0+d9ww= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:date:to:cc:subject:message-id:reply-to:references:mime-version :content-type:content-disposition:in-reply-to:user-agent; b=vifFd5f1nGlGVGlsYsjgz698kDV6bdN+KJt3p8kyHY/grBmLfSe+pVW07Zlkbkhu6L ieLwIv7bqGO8TdiKhgdTjiZMPUF1D8YvPJWF1ULG4QoYrNHg93YirO3IxKE+tes+PFyE gDp2Bz5WzpcOuXt3JdN3sqnM7nRKEtMwBS+8Q= Received: by 10.141.101.12 with SMTP id d12mr1167051rvm.280.1242359743718; Thu, 14 May 2009 20:55:43 -0700 (PDT) Received: from michelle.cdnetworks.co.kr ([114.111.62.249]) by mx.google.com with ESMTPS id g14sm2197814rvb.22.2009.05.14.20.55.41 (version=SSLv3 cipher=RC4-MD5); Thu, 14 May 2009 20:55:43 -0700 (PDT) Received: by michelle.cdnetworks.co.kr (sSMTP sendmail emulation); Fri, 15 May 2009 13:04:32 +0900 From: Pyun YongHyeon Date: Fri, 15 May 2009 13:04:32 +0900 To: James Tanis Message-ID: <20090515040432.GW65350@michelle.cdnetworks.co.kr> References: <4A0C34DC.9040508@mdchs.org> <20090514115400.ab14bc9d.wmoran@potentialtech.com> <4A0C46DD.5000002@mdchs.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4A0C46DD.5000002@mdchs.org> User-Agent: Mutt/1.4.2.3i Cc: freebsd-stable@freebsd.org, Bill Moran , FreeBSD Questions Subject: Re: issues with Intel Pro/1000 and 1000baseTX X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: pyunyh@gmail.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 03:55:44 -0000 On Thu, May 14, 2009 at 12:29:17PM -0400, James Tanis wrote: > Bill Moran wrote: > >In response to James Tanis : > > > > > > > >><.. snip ..> > >>Attempting to force 1000baseTX via: > >> > >>ifconfig em1 media 1000baseTX mediaopt full-duplex > >> > >>gets me: > >> > >>status: no carrier > >> > >>After forcing the NIC to go 1000baseTX the LEDs on the backpane are both > >>off. I can only come to the conclusion that this is a driver issue based > >>on previous experience and the simple fact that the end user system is > >>capable of connecting at 1000baseTX. Anybody have any suggestions? I'm > >>hoping I'm wrong. I'd rather not do an in-place upgrade, this is a > >>production system and the main gateway for an entire school, when I do > >>not even know for sure whether this will fix the problem. It's worth it > >>to me though, having a 1000baseTX uplink from the switch would remove a > >>major bottleneck for me. > > > > > >Try forcing on both ends (I assume the Procurve will allow you to do that). > >One thing I've seen consistently is that if you force the speed/duplex on > >one end, the other end will still try to autoneg, and will end up with > >something stupid like 100baseT/half-duplex, or will give up and disable > >the port. > > > Ok, I just did that -- I have now attempted to force 1000baseTX on both > sides and on one side while the other was left auto, all three possible > combinations resulted in the same behavior (no carrier). > >Also, try autoneg on both ends. Make absolutely sure the Procurve is set > >to autoneg. > > > This was the original set up. It is also how I have it set up currently, > it results in 100baseTX full-duplex on both sides. > >Replace the cable. If the cable is marginal, autoneg will downgrade the > >speed to ensure reliability. Use a cable that you know will produce > >1000baseTX because you've tested it on other systems. > > > Well, I don't have any verified working cable of the appropriate length > so I simply switched out the cables for the main server and the backup > server. They are both cat6 cables crimped with cat5e modules by me. For > what reason (bad crimp job?) that seemed to fix the issue. > This is clear indication of cabling issue. PHY of em(4) will try to fix all cabling problem with auto MDI/MDIX/polarity correction. If the PHY couldn't establish a 1000baseT link with link partner it would downshift to 100baseTX as establishing a 1000baseT link was not possible due to cabling problems(probably missing wiring). > Thanks for the advice! > > -- > James Tanis > Technical Coordinator > Computer Science Department > Monsignor Donovan Catholic High School From owner-freebsd-stable@FreeBSD.ORG Fri May 15 05:07:21 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id AFC901065672; Fri, 15 May 2009 05:07:21 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smarthost1.sentex.ca (smarthost1.sentex.ca [64.7.153.18]) by mx1.freebsd.org (Postfix) with ESMTP id 5CBDA8FC17; Fri, 15 May 2009 05:07:21 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smtp1.sentex.ca (smtp1c.sentex.ca [64.7.153.10]) by smarthost1.sentex.ca (8.14.3/8.14.3) with ESMTP id n4F57IPb062989; Fri, 15 May 2009 01:07:18 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: from freebsd-legacy.sentex.ca (freebsd-legacy.sentex.ca [64.7.128.104]) by smtp1.sentex.ca (8.14.3/8.14.3) with ESMTP id n4F57I8m053772; Fri, 15 May 2009 01:07:18 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: by freebsd-legacy.sentex.ca (Postfix, from userid 666) id 0281C241BA; Fri, 15 May 2009 01:07:17 -0400 (EDT) Sender: FreeBSD Tinderbox From: FreeBSD Tinderbox To: FreeBSD Tinderbox , , Precedence: bulk Message-Id: <20090515050718.0281C241BA@freebsd-legacy.sentex.ca> Date: Fri, 15 May 2009 01:07:17 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95.1 at smtp1.sentex.ca X-Virus-Status: Clean X-Scanned-By: MIMEDefang 2.64 on 64.7.153.18 Cc: Subject: [releng_6 tinderbox] failure on amd64/amd64 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 05:07:22 -0000 TB --- 2009-05-15 04:42:27 - tinderbox 2.6 running on freebsd-legacy.sentex.ca TB --- 2009-05-15 04:42:27 - starting RELENG_6 tinderbox run for amd64/amd64 TB --- 2009-05-15 04:42:27 - cleaning the object tree TB --- 2009-05-15 04:42:41 - cvsupping the source tree TB --- 2009-05-15 04:42:41 - /usr/bin/csup -z -r 3 -g -L 1 -h localhost -s /tinderbox/RELENG_6/amd64/amd64/supfile TB --- 2009-05-15 04:42:49 - building world TB --- 2009-05-15 04:42:49 - MAKEOBJDIRPREFIX=/obj TB --- 2009-05-15 04:42:49 - PATH=/usr/bin:/usr/sbin:/bin:/sbin TB --- 2009-05-15 04:42:49 - TARGET=amd64 TB --- 2009-05-15 04:42:49 - TARGET_ARCH=amd64 TB --- 2009-05-15 04:42:49 - TZ=UTC TB --- 2009-05-15 04:42:49 - __MAKE_CONF=/dev/null TB --- 2009-05-15 04:42:49 - cd /src TB --- 2009-05-15 04:42:49 - /usr/bin/make -B buildworld >>> Rebuilding the temporary build tree >>> stage 1.1: legacy release compatibility shims >>> stage 1.2: bootstrap tools >>> stage 2.1: cleaning up the object tree >>> stage 2.2: rebuilding the object tree >>> stage 2.3: build tools >>> stage 3: cross tools >>> stage 4.1: building includes >>> stage 4.2: building libraries [...] cc -O2 -fno-strict-aliasing -pipe -I. -I/src/lib/libthread_db -Wsystem-headers -Werror -Wall -Wno-format-y2k -W -Wno-unused-parameter -Wstrict-prototypes -Wmissing-prototypes -Wpointer-arith -Wreturn-type -Wcast-qual -Wwrite-strings -Wswitch -Wshadow -Wcast-align -Wunused-parameter -Wchar-subscripts -Winline -Wnested-externs -Wredundant-decls -c /src/lib/libthread_db/arch/amd64/libpthread_md.c /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_fpreg_to_ucontext': /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: implicit declaration of function `memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_ucontext_to_fpreg': /src/lib/libthread_db/arch/amd64/libpthread_md.c:100: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' *** Error code 1 Stop in /src/lib/libthread_db. *** Error code 1 Stop in /src/lib. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. TB --- 2009-05-15 05:07:17 - WARNING: /usr/bin/make returned exit code 1 TB --- 2009-05-15 05:07:17 - ERROR: failed to build world TB --- 2009-05-15 05:07:17 - 1119.20 user 157.27 system 1490.10 real http://tinderbox.des.no/tinderbox-releng_6-RELENG_6-amd64-amd64.full From owner-freebsd-stable@FreeBSD.ORG Fri May 15 05:10:15 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id BB56F106566B for ; Fri, 15 May 2009 05:10:14 +0000 (UTC) (envelope-from bms@incunabulum.net) Received: from out1.smtp.messagingengine.com (out1.smtp.messagingengine.com [66.111.4.25]) by mx1.freebsd.org (Postfix) with ESMTP id 9179F8FC2A for ; Fri, 15 May 2009 05:10:14 +0000 (UTC) (envelope-from bms@incunabulum.net) Received: from compute2.internal (compute2.internal [10.202.2.42]) by out1.messagingengine.com (Postfix) with ESMTP id EC5BE345612 for ; Fri, 15 May 2009 01:10:13 -0400 (EDT) Received: from heartbeat1.messagingengine.com ([10.202.2.160]) by compute2.internal (MEProxy); Fri, 15 May 2009 01:10:13 -0400 X-Sasl-enc: CGsUiiggasNFxvuAhZRwv9/PkVu9Ootlpey+bumcoGdr 1242364213 Received: from empiric.lon.incunabulum.net (82-35-112-254.cable.ubr07.dals.blueyonder.co.uk [82.35.112.254]) by mail.messagingengine.com (Postfix) with ESMTPSA id 693142B7D0 for ; Fri, 15 May 2009 01:10:13 -0400 (EDT) Message-ID: <4A0CF934.4000706@incunabulum.net> Date: Fri, 15 May 2009 06:10:12 +0100 From: Bruce Simpson User-Agent: Thunderbird 2.0.0.21 (X11/20090412) MIME-Version: 1.0 To: FreeBSD stable X-Enigmail-Version: 0.95.6 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Subject: Boot panic w/7.2-STABLE on amd64: resource_list_alloc X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 05:10:15 -0000 Hi, Since upgrading sources on RELENG_7 yesterday, my amd64 system panics right after this line in dmesg: ata4: on atapci1 panic: resource_list_alloc: resource entry is busy This machine uses an ALi SATA controller. I haven't had any problems with this controller's support for most of the 7.x branch, but it was last broken during the 6.x branch. I see there have recently been commits in this area which may have broken ATA driver support in some subtle way. Backtrace is (w/o symbols):- ... resource_list_alloc() pci_alloc_resource() bus_alloc_resource() ata_ali_sata_allocate() ata_pcichannel_attach() device_attach() ... There are no debugging symbols at the moment as this is a production kernel. If any further information is required to resolve the bug, please let me know. thanks, BMS From owner-freebsd-stable@FreeBSD.ORG Fri May 15 05:36:00 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 2ABCF106566B for ; Fri, 15 May 2009 05:36:00 +0000 (UTC) (envelope-from bms@FreeBSD.org) Received: from out1.smtp.messagingengine.com (out1.smtp.messagingengine.com [66.111.4.25]) by mx1.freebsd.org (Postfix) with ESMTP id F229A8FC18 for ; Fri, 15 May 2009 05:35:59 +0000 (UTC) (envelope-from bms@FreeBSD.org) Received: from compute1.internal (compute1.internal [10.202.2.41]) by out1.messagingengine.com (Postfix) with ESMTP id 2A5B2343767 for ; Fri, 15 May 2009 01:20:15 -0400 (EDT) Received: from heartbeat1.messagingengine.com ([10.202.2.160]) by compute1.internal (MEProxy); Fri, 15 May 2009 01:20:15 -0400 X-Sasl-enc: K9eX6tAMd7/+619HdJxLa8kYzqmDiv5iEpZhlSUkfrXg 1242364814 Received: from empiric.lon.incunabulum.net (82-35-112-254.cable.ubr07.dals.blueyonder.co.uk [82.35.112.254]) by mail.messagingengine.com (Postfix) with ESMTPSA id AA3262B7F0 for ; Fri, 15 May 2009 01:20:14 -0400 (EDT) Message-ID: <4A0CFB8D.8060203@FreeBSD.org> Date: Fri, 15 May 2009 06:20:13 +0100 From: Bruce Simpson User-Agent: Thunderbird 2.0.0.21 (X11/20090412) MIME-Version: 1.0 To: FreeBSD stable References: <4A0CF934.4000706@incunabulum.net> In-Reply-To: <4A0CF934.4000706@incunabulum.net> X-Enigmail-Version: 0.95.6 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Subject: Re: Boot panic w/7.2-STABLE on amd64: resource_list_alloc X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 05:36:00 -0000 Bruce Simpson wrote: > Since upgrading sources on RELENG_7 yesterday, my amd64 system panics > right after this line in dmesg: > > ata4: on atapci1 > panic: resource_list_alloc: resource entry is busy > ... > I see there have recently been commits in this area which may have > broken ATA driver support in some subtle way. Rolling back SVN rev 192033 by hand makes no difference. The controller is an AcerLabs M5287 SATA150 controller. Has anyone else seen a similar boot panic? thanks, BMS From owner-freebsd-stable@FreeBSD.ORG Fri May 15 08:25:03 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 7433D1065678 for ; Fri, 15 May 2009 08:25:03 +0000 (UTC) (envelope-from byshenknet@byshenk.net) Received: from core.byshenk.net (core.byshenk.net [62.58.73.230]) by mx1.freebsd.org (Postfix) with ESMTP id EF8B98FC1D for ; Fri, 15 May 2009 08:25:02 +0000 (UTC) (envelope-from byshenknet@byshenk.net) Received: from core.byshenk.net (localhost.aoes.com [127.0.0.1]) by core.byshenk.net (8.14.3/8.14.3) with ESMTP id n4F8P0kF025036 for ; Fri, 15 May 2009 10:25:00 +0200 (CEST) (envelope-from byshenknet@core.byshenk.net) Received: (from byshenknet@localhost) by core.byshenk.net (8.14.3/8.14.3/Submit) id n4F8P0W8025035 for freebsd-stable@freebsd.org; Fri, 15 May 2009 10:25:00 +0200 (CEST) (envelope-from byshenknet) Date: Fri, 15 May 2009 10:25:00 +0200 From: Greg Byshenk To: freebsd-stable@freebsd.org Message-ID: <20090515082500.GB2571@core.byshenk.net> References: <20090426125008.GK1550@core.byshenk.net> <20090513164207.GD67116@core.byshenk.net> <20090513164438.GE67116@core.byshenk.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20090513164438.GE67116@core.byshenk.net> User-Agent: Mutt/1.4.2.3i X-Spam-Status: No, score=-1.4 required=5.0 tests=ALL_TRUSTED autolearn=failed version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on core.byshenk.net Subject: Re: em? watchdog timeout 7-stable X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 08:25:03 -0000 Following up to myself, I experienced a watchdog timout followed by lockuup again early this morning. Strangely, rather than happening at a time of heavy activity, it seems to have occurred when there was very little activity. I was running 'systat' in a window when the watchdog timeout occurred and the network disappeared, and it showed: === begin systat output === 2 users Load 0.46 1.36 1.32 May 15 05:29 Mem:KB REAL VIRTUAL VN PAGER SWAP PAGER Tot Share Tot Share Free in out in out Act 50544 5484 471736 8504 1789768 count All 151492 8748 13158556 21348 pages Proc: Interrupts r p d s w Csw Trp Sys Int Sof Flt cow 16006 total 1 162 360 4 132 6 1126 zfod sio0 irq4 ozfod fdc0 irq6 12.9%Sys 12.5%Intr 0.0%User 0.0%Nice 74.7%Idle %ozfod ata0 irq14 | | | | | | | | | | | daefr 6 skc0 em0 1 ======+++++++ prcfr twa0 irq18 39 dtbuf totfr em1 irq24 Namei Name-cache Dir-cache 100000 desvn react 2000 cpu0: time Calls hits % hits % 84258 numvn pdwak 2000 cpu3: time 25000 frevn pdpgs 2000 cpu1: time intrn 2000 cpu2: time Disks da0 da1 pass0 pass1 541836 wire 2000 cpu7: time KB/t 0.00 0.00 0.00 0.00 51628 act 2000 cpu6: time tps 0 0 0 0 13865008 inact 2000 cpu4: time MB/s 0.00 0.00 0.00 0.00 458052 cache 2000 cpu5: time %busy 0 0 0 0 1331716 free === end systat output === This time, I was able to break into the debugger from my console, with the following result: === begin kdb output === KDB: enter: Line break on console [thread pid 17 tid 100009 ] Stopped at kdb_enter_why+0x3d: movq $0,0x5d70d8(%rip) db> panic panic: from debugger cpuid = 1 KDB: stack backtrace: db_trace_self_wrapper() at db_trace_self_wrapper+0x2a panic() at panic+0x182 db_panic() at db_panic+0x17 db_command() at db_command+0x1ef db_command_loop() at db_command_loop+0x50 db_trap() at db_trap+0x89 kdb_trap() at kdb_trap+0x95 trap() at trap+0x264 calltrap() at calltrap+0x8 --- trap 0x3, rip = 0xffffffff804d07cd, rsp = 0xfffffffe800819d0, rbp = 0xfffffffe800819f0 --- kdb_enter_why() at kdb_enter_why+0x3d siointr1() at siointr1+0x2c5 siointr() at siointr+0x58 intr_execute_handlers() at intr_execute_handlers+0x8b Xapic_isr1() at Xapic_isr1+0x7f --- interrupt, rip = 0xffffffff80727c36, rsp = 0xfffffffe80081b90, rbp = 0xfffffffe80081ba0 --- acpi_cpu_c1() at acpi_cpu_c1+0x6 acpi_cpu_idle() at acpi_cpu_idle+0x19c sched_idletd() at sched_idletd+0x46 fork_exit() at fork_exit+0x11f fork_trampoline() at fork_trampoline+0xe --- trap 0, rip = 0, rsp = 0xfffffffe80081d30, rbp = 0 --- KDB: stack backtrace: db_trace_self_wrapper() at db_trace_self_wrapper+0x2a mi_switch() at mi_switch+0x2a8 sched_bind() at sched_bind+0x58 boot() at boot+0x3f panic() at panic+0x16c db_panic() at db_panic+0x17 db_command() at db_command+0x1ef db_command_loop() at db_command_loop+0x50 db_trap() at db_trap+0x89 kdb_trap() at kdb_trap+0x95 trap() at trap+0x264 calltrap() at calltrap+0x8 --- trap 0x3, rip = 0xffffffff804d07cd, rsp = 0xfffffffe800819d0, rbp = 0xfffffffe800819f0 --- kdb_enter_why() at kdb_enter_why+0x3d siointr1() at siointr1+0x2c5 siointr() at siointr+0x58 intr_execute_handlers() at intr_execute_handlers+0x8b Xapic_isr1() at Xapic_isr1+0x7f --- interrupt, rip = 0xffffffff80727c36, rsp = 0xfffffffe80081b90, rbp = 0xfffffffe80081ba0 --- acpi_cpu_c1() at acpi_cpu_c1+0x6 acpi_cpu_idle() at acpi_cpu_idle+0x19c sched_idletd() at sched_idletd+0x46 fork_exit() at fork_exit+0x11f fork_trampoline() at fork_trampoline+0xe --- trap 0, rip = 0, rsp = 0xfffffffe80081d30, rbp = 0 --- db> bt Tracing pid 17 tid 100009 td 0xffffff00013f3a50 kdb_enter_why() at kdb_enter_why+0x3d siointr1() at siointr1+0x2c5 siointr() at siointr+0x58 intr_execute_handlers() at intr_execute_handlers+0x8b Xapic_isr1() at Xapic_isr1+0x7f --- interrupt, rip = 0xffffffff80727c36, rsp = 0xfffffffe80081b90, rbp = 0xfffffffe80081ba0 --- acpi_cpu_c1() at acpi_cpu_c1+0x6 acpi_cpu_idle() at acpi_cpu_idle+0x19c sched_idletd() at sched_idletd+0x46 fork_exit() at fork_exit+0x11f fork_trampoline() at fork_trampoline+0xe --- trap 0, rip = 0, rsp = 0xfffffffe80081d30, rbp = 0 --- === end kdb output === Kernel/world are 7-STABLE amd64, in sync, from sources csup'ed Thursday, 14 May 2009. Other information re em1 on this machine: # pciconf -lvb em1@pci0:7:1:0: class=0x020000 card=0x10028086 chip=0x10118086 rev=0x01 hdr=0x00 vendor = 'Intel Corporation' device = '82545EM Gigabit Ethernet Controller (Fiber)' class = network subclass = ethernet bar [10] = type Memory, range 64, base 0xda300000, size 131072, enabled bar [20] = type I/O Port, range 32, base 0x5000, size 64, enabled # vmstat -i interrupt total rate irq4: sio0 1479 0 irq6: fdc0 10 0 irq14: ata0 58 0 irq16: skc0 em0 758850 85 irq18: twa0 2085338 234 irq24: em1 1 0 cpu0: timer 17806226 1999 cpu3: timer 17798161 1998 cpu2: timer 17798127 1998 cpu1: timer 17798043 1998 cpu5: timer 17798058 1998 cpu6: timer 17798161 1998 cpu4: timer 17798160 1998 cpu7: timer 17798160 1998 Total 145238832 16311 # ifconfig em1 em1: flags=8843 metric 0 mtu 1500 options=db ether 00:07:e9:1a:ae:dc inet 192.168.1.62 netmask 0xfffff800 broadcast 192.168.7.255 media: Ethernet autoselect (1000baseLX ) status: active Any ideas? On Wed, May 13, 2009 at 06:44:38PM +0200, Greg Byshenk wrote: > On Wed, May 13, 2009 at 06:42:07PM +0200, Greg Byshenk wrote: > > > As a followup to my own previous message, I continue to have annoying > > problems with "em?: watchdog timeout" on one of my machines (now running > > 7.2-STABLE as of 2009-05-08). > > > > I have discontinued using the on-board (em, copper) NICs, and replaced > > the original fibre NIC with a newer model, but the problem persists. > > I've also set > > > > hw.pci.enable_msix=0 > > hw.pci.enable_msi=0 > > hw.em.rxd=1024 > > hw.em.txd=1024 > > net.inet.tcp.tso=0 > > > > ...as suggested in some discussions of this problem, and set the em1 > > interface to 'polling', all to no avail. Frequently, though irregularly > > (once or twice a day), the console begins to display > > > > em1: watchdog timeout -- resetting > > em1: watchdog timeout -- resetting > > em1: watchdog timeout -- resetting > > > > the nework is down, and the machine locks up. > > > > [Note: I am getting 'em1' now instead of 'em0' as previously, but this > > is due to changing all of the nics, which led to a different numbering; > > the timeout is still occurring on the (main) interface, the fibre > > gigabit connection.] > > > > What is particularly perverse (IMO) is that, since changing the NIC to > > the newer model (and updating the kernel), I can no longer break to the > > debugger when the lockup occurs (there is no response to the break) -- > > bit I _can_ shut the machine down cleanly via hardware (a touch of the > > power switch sends 'shutdown', and the machine shuts down cleanly -- > > after killing off processes waiting on network i/o). > > > > The machine is running nfs and samba (3.2.10, from ports), and pretty > > much nothing else. > > > > > > Anyone have any ideas about this...? I'm going mad with this. -- greg byshenk - gbyshenk@byshenk.net - Leiden, NL From owner-freebsd-stable@FreeBSD.ORG Fri May 15 08:25:14 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id A62B010656A4; Fri, 15 May 2009 08:25:14 +0000 (UTC) (envelope-from kostikbel@gmail.com) Received: from mail.terabit.net.ua (mail.terabit.net.ua [195.137.202.147]) by mx1.freebsd.org (Postfix) with ESMTP id 4528B8FC13; Fri, 15 May 2009 08:25:14 +0000 (UTC) (envelope-from kostikbel@gmail.com) Received: from skuns.zoral.com.ua ([91.193.166.194] helo=mail.zoral.com.ua) by mail.terabit.net.ua with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63 (FreeBSD)) (envelope-from ) id 1M4sj9-000Kdk-Gj; Fri, 15 May 2009 11:25:12 +0300 Received: from deviant.kiev.zoral.com.ua (root@deviant.kiev.zoral.com.ua [10.1.1.148]) by mail.zoral.com.ua (8.14.2/8.14.2) with ESMTP id n4F8OxgA044605 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Fri, 15 May 2009 11:24:59 +0300 (EEST) (envelope-from kostikbel@gmail.com) Received: from deviant.kiev.zoral.com.ua (kostik@localhost [127.0.0.1]) by deviant.kiev.zoral.com.ua (8.14.3/8.14.3) with ESMTP id n4F8OxSY021615; Fri, 15 May 2009 11:24:59 +0300 (EEST) (envelope-from kostikbel@gmail.com) Received: (from kostik@localhost) by deviant.kiev.zoral.com.ua (8.14.3/8.14.3/Submit) id n4F8Oxcq021614; Fri, 15 May 2009 11:24:59 +0300 (EEST) (envelope-from kostikbel@gmail.com) X-Authentication-Warning: deviant.kiev.zoral.com.ua: kostik set sender to kostikbel@gmail.com using -f Date: Fri, 15 May 2009 11:24:58 +0300 From: Kostik Belousov To: Chris Timmons Message-ID: <20090515082458.GB1927@deviant.kiev.zoral.com.ua> References: <1696198956@web.de> <20090514091410.H12558@n.cwu.edu> <20090514093008.Q12558@n.cwu.edu> <200905141317.56551.jhb@freebsd.org> <20090514152838.E12558@n.cwu.edu> Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="/NkBOFFp2J2Af1nK" Content-Disposition: inline In-Reply-To: <20090514152838.E12558@n.cwu.edu> User-Agent: Mutt/1.4.2.3i X-Virus-Scanned: clamav-milter 0.95.1 at skuns.kiev.zoral.com.ua X-Virus-Status: Clean X-Spam-Status: No, score=-4.4 required=5.0 tests=ALL_TRUSTED,AWL,BAYES_00 autolearn=ham version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on skuns.kiev.zoral.com.ua X-Virus-Scanned: mail.terabit.net.ua 1M4sj9-000Kdk-Gj 400c333c9d9a06ea44faa2d11e182fbe X-Terabit: YES Cc: freebsd-stable@freebsd.org, Martin Sugioarto , John Baldwin Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 08:25:15 -0000 --/NkBOFFp2J2Af1nK Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Thu, May 14, 2009 at 03:32:34PM -0700, Chris Timmons wrote: >=20 >=20 > >Can you get a stack trace? Your panic is quite different then the origi= nal > >one. >=20 > Let me know if there is any other information which would be helpful. I= =20 > rebooted the 7.0 kernel from July, and the machine has been happily=20 > chugging along running Nessus under load for almost 6 hours. >=20 > 3:30PM up 5:42, 1 user, load averages: 33.67, 33.80, 35.14 >=20 > Tomorrow I can see if the panic is easily reproduced. >=20 > -c >=20 >=20 > (kgdb) bt > #0 doadump () at pcpu.h:196 > #1 0xc07e2ee7 in boot (howto=3D260) at=20 > /usr/src/sys/kern/kern_shutdown.c:418 > #2 0xc07e31b9 in panic (fmt=3DVariable "fmt" is not available. > ) at /usr/src/sys/kern/kern_shutdown.c:574 > #3 0xc0ae49ec in trap_fatal (frame=3D0xee156a94, eva=3D28) at=20 > /usr/src/sys/i386/i386/trap.c:939 > #4 0xc0ae4c70 in trap_pfault (frame=3D0xee156a94, usermode=3D0, eva=3D28= ) at=20 > /usr/src/sys/i386/i386/trap.c:852 > #5 0xc0ae561c in trap (frame=3D0xee156a94) at=20 > /usr/src/sys/i386/i386/trap.c:530 > #6 0xc0ac9d2b in calltrap () at /usr/src/sys/i386/i386/exception.s:159 > #7 0xc07a4dac in devvn_refthread (vp=3D0x0, devp=3D0xee156b0c) at=20 > /usr/src/sys/kern/kern_conf.c:209 > #8 0xc076cf64 in devfs_fp_check (fp=3D0xc78fadf4, devp=3D0xee156b0c,=20 > dswp=3D0xee156b08) at /usr/src/sys/fs/devfs/devfs_vnops.c:89 Please, show the output of p *(struct file *)0xc78fadf4 > #9 0xc076cfd9 in devfs_poll_f (fp=3D0xc78fadf4, events=3D4, cred=3D0xc7a= e1c00,=20 > td=3D0xce628460) at /usr/src/sys/fs/devfs/devfs_vnops.c:966 > #10 0xc081cce1 in poll (td=3D0xce628460, uap=3D0xee156cfc) at file.h:280 > #11 0xc0ae4fc5 in syscall (frame=3D0xee156d38) at=20 > /usr/src/sys/i386/i386/trap.c:1090 > #12 0xc0ac9d90 in Xint0x80_syscall () at=20 > /usr/src/sys/i386/i386/exception.s:255 > #13 0x00000033 in ?? () > Previous frame inner to this frame (corrupt stack?) > (kgdb) quit >=20 > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" --/NkBOFFp2J2Af1nK Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (FreeBSD) iEYEARECAAYFAkoNJtoACgkQC3+MBN1Mb4hEYgCgssJ5X8RmD/wqzNqcEI6wDlqt wyMAoOnxkA8uNFY+Ar671c7UjjPHR1No =SnAZ -----END PGP SIGNATURE----- --/NkBOFFp2J2Af1nK-- From owner-freebsd-stable@FreeBSD.ORG Fri May 15 08:49:15 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id CB94E106566C for ; Fri, 15 May 2009 08:49:15 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: from rv-out-0506.google.com (rv-out-0506.google.com [209.85.198.228]) by mx1.freebsd.org (Postfix) with ESMTP id 904638FC08 for ; Fri, 15 May 2009 08:49:15 +0000 (UTC) (envelope-from pyunyh@gmail.com) Received: by rv-out-0506.google.com with SMTP id k40so1098703rvb.43 for ; Fri, 15 May 2009 01:49:15 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:received:from:date:to:cc :subject:message-id:reply-to:references:mime-version:content-type :content-disposition:in-reply-to:user-agent; bh=29o6Hfg6X7hStMOC2BGUeHfLE9w7u1FAxlCXr3oXeaA=; b=sgU2bd43RIIuztyQRYNMb1Khx7Kh0BW4e7yCWud8051dk/oqHitTc2kVhrvfdmDwJo tXQjdD4pskxOwNFbtK2SfcyAkYAfgo4YLMRHl+9fpVXSG0pVdaRREE0MFSeFhe3LyMOz noDzTmzLWYiTaB3OZMBv/nRKcDbRC1tQG/eqY= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:date:to:cc:subject:message-id:reply-to:references:mime-version :content-type:content-disposition:in-reply-to:user-agent; b=hGvWlKFU6dnaqcwNSeLNHY3aaBZyRHZkMRbMfTXtmgouJfOZfZaTA8f+wG85z9f8tj 0pBisX8BxqWfxmBsHd94uJDyFEpZrrCi7i5c5NrtWjlHcPAtiH0Z6Jsm+ND6uT7+sqqJ 8EPUOt/7j1Dr2Z2SboA81I8V85qO6y438z5hQ= Received: by 10.141.123.17 with SMTP id a17mr1126455rvn.89.1242377355255; Fri, 15 May 2009 01:49:15 -0700 (PDT) Received: from michelle.cdnetworks.co.kr ([114.111.62.249]) by mx.google.com with ESMTPS id c20sm2779209rvf.0.2009.05.15.01.49.12 (version=SSLv3 cipher=RC4-MD5); Fri, 15 May 2009 01:49:13 -0700 (PDT) Received: by michelle.cdnetworks.co.kr (sSMTP sendmail emulation); Fri, 15 May 2009 17:58:06 +0900 From: Pyun YongHyeon Date: Fri, 15 May 2009 17:58:06 +0900 To: Lars Eggert Message-ID: <20090515085806.GX65350@michelle.cdnetworks.co.kr> References: <4A09DEF1.2010202@delphij.net> <4A09FDB2.5080307@eyede.com> <20090513004131.GP65350@michelle.cdnetworks.co.kr> <20090514082750.GU65350@michelle.cdnetworks.co.kr> <310A73CC-A32D-4794-BF23-A49715AFCF99@nokia.com> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="YD3LsXFS42OYHhNZ" Content-Disposition: inline In-Reply-To: <310A73CC-A32D-4794-BF23-A49715AFCF99@nokia.com> User-Agent: Mutt/1.4.2.3i Cc: "d@delphij.net" , "freebsd-stable@freebsd.org" , "nigel@eyede.com" Subject: Re: TCP differences in 7.2 vs 7.1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: pyunyh@gmail.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 08:49:16 -0000 --YD3LsXFS42OYHhNZ Content-Type: text/plain; charset=us-ascii Content-Disposition: inline On Thu, May 14, 2009 at 11:28:43AM +0300, Lars Eggert wrote: > Hi, > > On 2009-5-14, at 11:27, Pyun YongHyeon wrote: > >Then you're seeing different problem on em(4). Last time I checked > >em(4) TSO code in em(4) didn't use m_pullup and just returned > >ENXIO to caller. I'm not sure that is related with your issue but > >would you tell us your network configuration? > > this box is a Dell 2950 server/router running 7.2-STABLE. It has an > onboard bce interface and four dual-port Intel PRO/1000 NICs, giving > it 8 em interfaces. (Let me know if you want the boot dmesg.) > > >If you can easily > >reproduce the issue would you let us know? > > Reproducing the issue is as easy as setting net.inet.tcp.tso=1. > > What's interesting is that I only see the issue on one of the eight em > interfaces. That interface is connected to a D-Link DIR-655 WLAN > router. When I tcpdump on the other interfaces with TSO enabled, I see > no "IP bad-len 0" messages. > Would you try attached patch? I'm using the patch on my development box. Originally the patch was written to address checksum offload breakage on multicast packets(r182463). However I didn't encounter TSO issue without the patch. Note, the patch was not heavily tested so it may have uncovered bugs. --YD3LsXFS42OYHhNZ Content-Type: text/x-diff; charset=us-ascii Content-Disposition: attachment; filename="em.csum_tso.patch" Index: sys/dev/e1000/if_em.c =================================================================== --- sys/dev/e1000/if_em.c (revision 192130) +++ sys/dev/e1000/if_em.c (working copy) @@ -270,7 +270,7 @@ static void em_transmit_checksum_setup(struct adapter *, struct mbuf *, u32 *, u32 *); #if __FreeBSD_version >= 700000 -static bool em_tso_setup(struct adapter *, struct mbuf *, +static void em_tso_setup(struct adapter *, struct mbuf *, u32 *, u32 *); #endif /* FreeBSD_version >= 700000 */ static void em_set_promisc(struct adapter *); @@ -369,7 +369,6 @@ #define EM_TICKS_TO_USECS(ticks) ((1024 * (ticks) + 500) / 1000) #define EM_USECS_TO_TICKS(usecs) ((1000 * (usecs) + 512) / 1024) -#define M_TSO_LEN 66 /* Allow common code without TSO */ #ifndef CSUM_TSO @@ -2104,15 +2103,78 @@ /* + * Docuemtation explicitly recommends entire header section + * to be coalesced into a single buffer and described in a + * single data descriptor. + * * TSO workaround: * If an mbuf is only header we need * to pull 4 bytes of data into it. */ - if (do_tso && (m_head->m_len <= M_TSO_LEN)) { - m_head = m_pullup(m_head, M_TSO_LEN + 4); + if (do_tso || (m_head->m_pkthdr.csum_flags & + (CSUM_IP | CSUM_TCP | CSUM_UDP)) != 0) { + struct ether_header *eh; + struct ip *ip; + struct tcphdr *tcp; + uint32_t poff; + + if (M_WRITABLE(m_head) == 0) { + m_head = m_dup(*m_headp, M_DONTWAIT); + m_freem(*m_headp); + if (m_head == NULL) { + *m_headp = NULL; + return (ENOBUFS); + } + *m_headp = m_head; + } + + poff = sizeof(struct ether_header); + m_head = m_pullup(m_head, poff); + if (m_head == NULL) { + *m_headp = NULL; + return (ENOBUFS); + } + eh = mtod(m_head, struct ether_header *); + if (eh->ether_type == htons(ETHERTYPE_VLAN)) { + poff = sizeof(struct ether_vlan_header); + m_head = m_pullup(m_head, poff); + if (m_head == NULL) { + *m_headp = NULL; + return (ENOBUFS); + } + } + m_head = m_pullup(m_head, poff + sizeof(struct ip)); + if (m_head == NULL) { + *m_headp = NULL; + return (ENOBUFS); + } + ip = (struct ip *)(mtod(m_head, char *) + poff); + poff += (ip->ip_hl << 2); + + if (do_tso || (m_head->m_pkthdr.csum_flags & CSUM_TCP) != 0) { + m_head = m_pullup(m_head, poff + sizeof(struct tcphdr)); + if (m_head == NULL) { + *m_headp = NULL; + return (ENOBUFS); + } + tcp = (struct tcphdr *)(mtod(m_head, char *) + poff); + poff += (tcp->th_off << 2); + /* Pull 4 bytes of payload into the first mbuf. */ + if (do_tso) + poff += 4; + m_head = m_pullup(m_head, poff); + if (m_head == NULL) { + *m_headp = NULL; + return (ENOBUFS); + } + } else if ((m_head->m_pkthdr.csum_flags & (CSUM_UDP)) != 0) { + m_head = m_pullup(m_head, poff + sizeof(struct udphdr)); + if (m_head == NULL) { + *m_headp = NULL; + return (ENOBUFS); + } + } *m_headp = m_head; - if (m_head == NULL) - return (ENOBUFS); } /* @@ -2143,7 +2205,7 @@ if (error == EFBIG) { struct mbuf *m; - m = m_defrag(*m_headp, M_DONTWAIT); + m = m_collapse(*m_headp, M_DONTWAIT, EM_MAX_SCATTER); if (m == NULL) { adapter->mbuf_alloc_failed++; m_freem(*m_headp); @@ -2189,9 +2251,7 @@ /* Do hardware assists */ #if __FreeBSD_version >= 700000 if (m_head->m_pkthdr.csum_flags & CSUM_TSO) { - error = em_tso_setup(adapter, m_head, &txd_upper, &txd_lower); - if (error != TRUE) - return (ENXIO); /* something foobar */ + em_tso_setup(adapter, m_head, &txd_upper, &txd_lower); /* we need to make a final sentinel transmit desc */ tso_desc = TRUE; } else @@ -3836,7 +3896,7 @@ * Setup work for hardware segmentation offload (TSO) * **********************************************************************/ -static bool +static void em_tso_setup(struct adapter *adapter, struct mbuf *mp, u32 *txd_upper, u32 *txd_lower) { @@ -3868,10 +3928,6 @@ ehdrlen = ETHER_HDR_LEN; } - /* Ensure we have at least the IP+TCP header in the first mbuf. */ - if (mp->m_len < ehdrlen + sizeof(struct ip) + sizeof(struct tcphdr)) - return FALSE; /* -1 */ - /* * We only support TCP for IPv4 and IPv6 (notyet) for the moment. * TODO: Support SCTP too when it hits the tree. @@ -3880,31 +3936,24 @@ case ETHERTYPE_IP: isip6 = 0; ip = (struct ip *)(mp->m_data + ehdrlen); - if (ip->ip_p != IPPROTO_TCP) - return FALSE; /* 0 */ ip->ip_len = 0; ip->ip_sum = 0; ip_hlen = ip->ip_hl << 2; - if (mp->m_len < ehdrlen + ip_hlen + sizeof(struct tcphdr)) - return FALSE; /* -1 */ th = (struct tcphdr *)((caddr_t)ip + ip_hlen); -#if 1 + /* Controller expects a psuedo checksum without TCP length. */ th->th_sum = in_pseudo(ip->ip_src.s_addr, ip->ip_dst.s_addr, htons(IPPROTO_TCP)); -#else - th->th_sum = mp->m_pkthdr.csum_data; -#endif break; case ETHERTYPE_IPV6: isip6 = 1; - return FALSE; /* Not supported yet. */ + return; /* Not supported yet. */ ip6 = (struct ip6_hdr *)(mp->m_data + ehdrlen); if (ip6->ip6_nxt != IPPROTO_TCP) - return FALSE; /* 0 */ + return; /* 0 */ ip6->ip6_plen = 0; ip_hlen = sizeof(struct ip6_hdr); /* XXX: no header stacking. */ if (mp->m_len < ehdrlen + ip_hlen + sizeof(struct tcphdr)) - return FALSE; /* -1 */ + return; /* -1 */ th = (struct tcphdr *)((caddr_t)ip6 + ip_hlen); #if 0 th->th_sum = in6_pseudo(ip6->ip6_src, ip->ip6_dst, @@ -3914,7 +3963,7 @@ #endif break; default: - return FALSE; + return; } hdr_len = ehdrlen + ip_hlen + (th->th_off << 2); @@ -3976,8 +4025,6 @@ adapter->num_tx_desc_avail--; adapter->next_avail_tx_desc = curr_txd; adapter->tx_tso = TRUE; - - return TRUE; } #endif /* __FreeBSD_version >= 700000 */ Index: sys/dev/e1000/if_em.h =================================================================== --- sys/dev/e1000/if_em.h (revision 192130) +++ sys/dev/e1000/if_em.h (working copy) @@ -231,7 +231,7 @@ #define HW_DEBUGOUT1(S, A) if (DEBUG_HW) printf(S "\n", A) #define HW_DEBUGOUT2(S, A, B) if (DEBUG_HW) printf(S "\n", A, B) -#define EM_MAX_SCATTER 64 +#define EM_MAX_SCATTER 32 #define EM_TSO_SIZE (65535 + sizeof(struct ether_vlan_header)) #define EM_TSO_SEG_SIZE 4096 /* Max dma segment size */ #define EM_MSIX_MASK 0x01F00000 /* For 82574 use */ --YD3LsXFS42OYHhNZ-- From owner-freebsd-stable@FreeBSD.ORG Fri May 15 09:23:57 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id CDA011065673 for ; Fri, 15 May 2009 09:23:57 +0000 (UTC) (envelope-from ronald-freebsd8@klop.yi.org) Received: from smtp-out0.tiscali.nl (smtp-out0.tiscali.nl [195.241.79.175]) by mx1.freebsd.org (Postfix) with ESMTP id 91A758FC19 for ; Fri, 15 May 2009 09:23:57 +0000 (UTC) (envelope-from ronald-freebsd8@klop.yi.org) Received: from [212.123.145.58] (helo=sjakie.klop.ws) by smtp-out0.tiscali.nl with esmtp id 1M4te0-0008Ug-A5 for ; Fri, 15 May 2009 11:23:56 +0200 Received: from 82-170-177-25.ip.telfort.nl (localhost [127.0.0.1]) by sjakie.klop.ws (Postfix) with ESMTP id 4BAB89076 for ; Fri, 15 May 2009 11:23:55 +0200 (CEST) Date: Fri, 15 May 2009 11:23:54 +0200 To: freebsd-stable@freebsd.org From: "Ronald Klop" Content-Type: text/plain; format=flowed; delsp=yes; charset=us-ascii MIME-Version: 1.0 References: Content-Transfer-Encoding: 7bit Message-ID: In-Reply-To: User-Agent: Opera Mail/9.64 (FreeBSD) Subject: Re: devd doesn't fire event on boot [solved] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 09:23:58 -0000 The property sernum is not very reliable. Sometimes devd knows about it and sometimes not. I removed the match on it and know it works. Ronald. On Wed, 06 May 2009 12:03:14 +0200, Ronald Klop wrote: > Hello, > > Running 7.2-STABLE/amd64. I have a USB-disk and added stuff to devd to > mount it readonly on attach. This does work if I attach it after booting > up, but not if it is attached before booting. > > [root@sjakie ~]# cat /etc/devd/philips.conf > attach 10 { > device-name "umass[0-9]+"; > match "vendor" "0x0471"; > match "product" "0x083a"; > match "sernum" "20521126"; > action "/root/bin/mountphilips.sh"; > }; > > [root@sjakie ~]# cat /root/bin/mountphilips.sh > #! /bin/sh > ( > # Sleep, so geom and other kernel stuff can handle the disk > # before we try to mount it. > sleep 10 > mount -v /mnt/backupdisk > ) & > > [root@sjakie ~]# grep backupdisk /etc/fstab > /dev/ufs/Extern /mnt/backupdisk ufs ro,noauto 0 0 > > What can be wrong? Is it possible devd misses events which happened > before devd was started? > Is this known behaviour? > > Ronald. > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" From owner-freebsd-stable@FreeBSD.ORG Fri May 15 12:32:50 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 66B24106566C; Fri, 15 May 2009 12:32:50 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from nsc0.cwu.edu (nsc0.cwu.edu [72.233.196.16]) by mx1.freebsd.org (Postfix) with ESMTP id 3B5A18FC0A; Fri, 15 May 2009 12:32:50 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (n.cwu.edu [198.104.69.57]) by nsc0.cwu.edu (8.14.3/8.14.3) with ESMTP id n4FCWnVS081851; Fri, 15 May 2009 05:32:49 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (localhost [127.0.0.1]) by n.cwu.edu (8.13.3/8.13.3) with ESMTP id n4FCWngx019953; Fri, 15 May 2009 05:32:49 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from localhost (cwt@localhost) by n.cwu.edu (8.13.3/8.13.1/Submit) with ESMTP id n4FCWnNP019950; Fri, 15 May 2009 05:32:49 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) X-Authentication-Warning: n.cwu.edu: cwt owned process doing -bs Date: Fri, 15 May 2009 05:32:49 -0700 (PDT) From: Chris Timmons X-X-Sender: cwt@n.cwu.edu To: Kostik Belousov In-Reply-To: <20090515082458.GB1927@deviant.kiev.zoral.com.ua> Message-ID: <20090515053142.D17400@n.cwu.edu> References: <1696198956@web.de> <20090514091410.H12558@n.cwu.edu> <20090514093008.Q12558@n.cwu.edu> <200905141317.56551.jhb@freebsd.org> <20090514152838.E12558@n.cwu.edu> <20090515082458.GB1927@deviant.kiev.zoral.com.ua> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed X-Greylist: Sender DNS name whitelisted, not delayed by milter-greylist-4.0 (nsc0.cwu.edu [72.233.196.16]); Fri, 15 May 2009 05:32:49 -0700 (PDT) Cc: freebsd-stable@freebsd.org, Martin Sugioarto , John Baldwin Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 12:32:50 -0000 #8 0xc076cf64 in devfs_fp_check (fp=0xc78fadf4, devp=0xee156b0c, dswp=0xee156b08) at /usr/src/sys/fs/devfs/devfs_vnops.c:89 89 *dswp = devvn_refthread(fp->f_vnode, devp); (kgdb) p *(struct file *)0xc78fadf4 $1 = {f_list = {le_next = 0xc78ab5f0, le_prev = 0xc789e5f0}, f_type = 1, f_data = 0xce5f9b00, f_flag = 3, f_mtxp = 0xc74540a0, f_ops = 0xc0c48e80, f_cred = 0xc7ae1c00, f_count = 2, f_vnode = 0xc90f4000, f_offset = 0, f_vnread_flags = 0, f_gcflag = 0, f_msgcount = 0, f_seqcount = 1, f_nextoff = 0, f_label = 0x0, f_cdevpriv = 0x0} On Fri, 15 May 2009, Kostik Belousov wrote: >> #8 0xc076cf64 in devfs_fp_check (fp=0xc78fadf4, devp=0xee156b0c, >> dswp=0xee156b08) at /usr/src/sys/fs/devfs/devfs_vnops.c:89 > Please, show the output of > p *(struct file *)0xc78fadf4 From owner-freebsd-stable@FreeBSD.ORG Fri May 15 13:06:16 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B43C6106564A; Fri, 15 May 2009 13:06:16 +0000 (UTC) (envelope-from kostikbel@gmail.com) Received: from mail.terabit.net.ua (mail.terabit.net.ua [195.137.202.147]) by mx1.freebsd.org (Postfix) with ESMTP id 40F108FC16; Fri, 15 May 2009 13:06:15 +0000 (UTC) (envelope-from kostikbel@gmail.com) Received: from skuns.zoral.com.ua ([91.193.166.194] helo=mail.zoral.com.ua) by mail.terabit.net.ua with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63 (FreeBSD)) (envelope-from ) id 1M4x77-000Pii-2f; Fri, 15 May 2009 16:06:13 +0300 Received: from deviant.kiev.zoral.com.ua (root@deviant.kiev.zoral.com.ua [10.1.1.148]) by mail.zoral.com.ua (8.14.2/8.14.2) with ESMTP id n4FD6Amb066989 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Fri, 15 May 2009 16:06:10 +0300 (EEST) (envelope-from kostikbel@gmail.com) Received: from deviant.kiev.zoral.com.ua (kostik@localhost [127.0.0.1]) by deviant.kiev.zoral.com.ua (8.14.3/8.14.3) with ESMTP id n4FD69qV072760; Fri, 15 May 2009 16:06:09 +0300 (EEST) (envelope-from kostikbel@gmail.com) Received: (from kostik@localhost) by deviant.kiev.zoral.com.ua (8.14.3/8.14.3/Submit) id n4FD697Q072759; Fri, 15 May 2009 16:06:09 +0300 (EEST) (envelope-from kostikbel@gmail.com) X-Authentication-Warning: deviant.kiev.zoral.com.ua: kostik set sender to kostikbel@gmail.com using -f Date: Fri, 15 May 2009 16:06:09 +0300 From: Kostik Belousov To: Chris Timmons Message-ID: <20090515130609.GG1927@deviant.kiev.zoral.com.ua> References: <1696198956@web.de> <20090514091410.H12558@n.cwu.edu> <20090514093008.Q12558@n.cwu.edu> <200905141317.56551.jhb@freebsd.org> <20090514152838.E12558@n.cwu.edu> <20090515082458.GB1927@deviant.kiev.zoral.com.ua> <20090515053142.D17400@n.cwu.edu> Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="JbKQpFqZXJ2T76Sg" Content-Disposition: inline In-Reply-To: <20090515053142.D17400@n.cwu.edu> User-Agent: Mutt/1.4.2.3i X-Virus-Scanned: clamav-milter 0.95.1 at skuns.kiev.zoral.com.ua X-Virus-Status: Clean X-Spam-Status: No, score=-4.4 required=5.0 tests=ALL_TRUSTED,AWL,BAYES_00 autolearn=ham version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on skuns.kiev.zoral.com.ua X-Virus-Scanned: mail.terabit.net.ua 1M4x77-000Pii-2f 754961cfb7b1973a6063a1177799ca51 X-Terabit: YES Cc: freebsd-stable@freebsd.org, Martin Sugioarto , John Baldwin Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 13:06:17 -0000 --JbKQpFqZXJ2T76Sg Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Fri, May 15, 2009 at 05:32:49AM -0700, Chris Timmons wrote: >=20 > #8 0xc076cf64 in devfs_fp_check (fp=3D0xc78fadf4, devp=3D0xee156b0c,=20 > dswp=3D0xee156b08) at /usr/src/sys/fs/devfs/devfs_vnops.c:89 > 89 *dswp =3D devvn_refthread(fp->f_vnode, devp); >=20 > (kgdb) p *(struct file *)0xc78fadf4 > $1 =3D {f_list =3D {le_next =3D 0xc78ab5f0, le_prev =3D 0xc789e5f0}, f_ty= pe =3D 1,=20 > f_data =3D 0xce5f9b00, f_flag =3D 3, f_mtxp =3D 0xc74540a0, f_ops =3D 0xc= 0c48e80,=20 > f_cred =3D 0xc7ae1c00, f_count =3D 2, f_vnode =3D 0xc90f4000, f_offset = =3D 0,=20 > f_vnread_flags =3D 0, f_gcflag =3D 0, f_msgcount =3D 0, f_seqcount =3D 1,= =20 > f_nextoff =3D 0, f_label =3D 0x0, f_cdevpriv =3D 0x0} >=20 >=20 >=20 > On Fri, 15 May 2009, Kostik Belousov wrote: >=20 > >>#8 0xc076cf64 in devfs_fp_check (fp=3D0xc78fadf4, devp=3D0xee156b0c, > >>dswp=3D0xee156b08) at /usr/src/sys/fs/devfs/devfs_vnops.c:89 > >Please, show the output of > >p *(struct file *)0xc78fadf4 The file structure in the dump is fully initialized. It seems that the issue is with devfs replacing file ops vector with devfs-specific one in devfs_open() before the struct file is fully initialized in vn_open. Please, try the patch below (against 7) and report results. Index: fs/devfs/devfs_vnops.c =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D --- fs/devfs/devfs_vnops.c (revision 192089) +++ fs/devfs/devfs_vnops.c (working copy) @@ -890,6 +890,7 @@ if (fp !=3D NULL) { FILE_LOCK(fp); fp->f_data =3D dev; + fp->f_vnode =3D vp; FILE_UNLOCK(fp); } fpop =3D td->td_fpop; --JbKQpFqZXJ2T76Sg Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (FreeBSD) iEYEARECAAYFAkoNaMEACgkQC3+MBN1Mb4gXoACg6YycyS3NTGo9D+/QVMCT81M3 jSEAoIFaJZk6lu3pE89agkKHqQZGAgal =Cg2x -----END PGP SIGNATURE----- --JbKQpFqZXJ2T76Sg-- From owner-freebsd-stable@FreeBSD.ORG Fri May 15 13:11:06 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id EA1D1106566C for ; Fri, 15 May 2009 13:11:06 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id BC3C08FC0C for ; Fri, 15 May 2009 13:11:06 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 6F02146C18; Fri, 15 May 2009 09:11:06 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 505C88A028; Fri, 15 May 2009 09:11:05 -0400 (EDT) From: John Baldwin To: freebsd-stable@freebsd.org Date: Fri, 15 May 2009 08:15:19 -0400 User-Agent: KMail/1.9.7 References: <1696198956@web.de> <200905140916.40594.jhb@freebsd.org> <20090514191026.0a90dbfc@zelda.local> In-Reply-To: <20090514191026.0a90dbfc@zelda.local> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905150815.19452.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Fri, 15 May 2009 09:11:05 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: Martin Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 13:11:07 -0000 On Thursday 14 May 2009 1:10:26 pm Martin wrote: > Am Thu, 14 May 2009 09:16:40 -0400 > schrieb John Baldwin : > > > On Thursday 14 May 2009 7:47:23 am Martin Sugioarto wrote: > > [...] > > > kernel trap 12 with interrupts disabled > > > > > > > > > Fatal trap 12: page fault while in kernel mode > > > cpuid = 0; apic id = 0 > > > fault virtual address = 0x80000000000 > > > > Given that that is a single bit set, it could possibly be due to bad > > RAM. > > This is the second panic output that appeared on the screen. I could not read > the first lines of the first panic. The last ones looked similar > (same trap/process etc). > > > Does your kernel have debug symbols? > > This is GENERIC kernel configuration. The kernel was totally frozen. I could > not type anything. I just noticed, I've got a vmcore.0 of the crash. > > I can see some other panic output when loading the kernel in kgdb: > > Unread portion of the kernel message buffer: > > > Fatal trap 9: general protection fault while in kernel mode When I have seen this, it has often been due to a hardware failure such as bad RAM. > cpuid = 2; apic id = 02 > instruction pointer = 0x8:0xffffffff805bbc66 Can you do 'x/i 0xffffffff805bbc66'? Also, can you walk up the stack to the frame for this panic ('frame 7') and do 'info registers'? -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Fri May 15 13:11:08 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 15309106566B for ; Fri, 15 May 2009 13:11:08 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id DCEF28FC18 for ; Fri, 15 May 2009 13:11:07 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 9129D46C1A; Fri, 15 May 2009 09:11:07 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 8116F8A025; Fri, 15 May 2009 09:11:06 -0400 (EDT) From: John Baldwin To: freebsd-stable@freebsd.org Date: Fri, 15 May 2009 08:50:19 -0400 User-Agent: KMail/1.9.7 References: <4A0CF934.4000706@incunabulum.net> In-Reply-To: <4A0CF934.4000706@incunabulum.net> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905150850.19843.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Fri, 15 May 2009 09:11:06 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: Bruce Simpson Subject: Re: Boot panic w/7.2-STABLE on amd64: resource_list_alloc X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 13:11:08 -0000 On Friday 15 May 2009 1:10:12 am Bruce Simpson wrote: > Hi, > > Since upgrading sources on RELENG_7 yesterday, my amd64 system panics > right after this line in dmesg: > > ata4: on atapci1 > panic: resource_list_alloc: resource entry is busy > > This machine uses an ALi SATA controller. I haven't had any problems > with this controller's support for most of the 7.x branch, but it was > last broken during the 6.x branch. > > I see there have recently been commits in this area which may have > broken ATA driver support in some subtle way. > > Backtrace is (w/o symbols):- > ... > resource_list_alloc() > pci_alloc_resource() > bus_alloc_resource() > ata_ali_sata_allocate() > ata_pcichannel_attach() > device_attach() > ... > > There are no debugging symbols at the moment as this is a production kernel. > If any further information is required to resolve the bug, please let me > know. Sounds like the ATA driver is allocating the same BAR twice. Hmm, yes, it allocates the resources once for each channel it seems in the ata_ali_sata attachment. Looking in ata-chipset.c, all the other chipsets are good about allocating these resources in their chipinit routines rather than the per-channel allocate routine. Well, except ata_pci_allocate() is also busted. *sigh* I can work on a patch for HEAD if you are willing to test. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Fri May 15 14:07:55 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E75B9106564A; Fri, 15 May 2009 14:07:55 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smarthost1.sentex.ca (smarthost1.sentex.ca [64.7.153.18]) by mx1.freebsd.org (Postfix) with ESMTP id 930818FC0A; Fri, 15 May 2009 14:07:55 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smtp1.sentex.ca (smtp1c.sentex.ca [64.7.153.10]) by smarthost1.sentex.ca (8.14.3/8.14.3) with ESMTP id n4FE7qRk018835; Fri, 15 May 2009 10:07:52 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: from freebsd-legacy.sentex.ca (freebsd-legacy.sentex.ca [64.7.128.104]) by smtp1.sentex.ca (8.14.3/8.14.3) with ESMTP id n4FE7qop073970; Fri, 15 May 2009 10:07:52 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: by freebsd-legacy.sentex.ca (Postfix, from userid 666) id 98A11241BA; Fri, 15 May 2009 10:07:52 -0400 (EDT) Sender: FreeBSD Tinderbox From: FreeBSD Tinderbox To: FreeBSD Tinderbox , , Precedence: bulk Message-Id: <20090515140752.98A11241BA@freebsd-legacy.sentex.ca> Date: Fri, 15 May 2009 10:07:52 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95.1 at smtp1.sentex.ca X-Virus-Status: Clean X-Scanned-By: MIMEDefang 2.64 on 64.7.153.18 Cc: Subject: [releng_6 tinderbox] failure on amd64/amd64 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 14:07:56 -0000 TB --- 2009-05-15 13:43:29 - tinderbox 2.6 running on freebsd-legacy.sentex.ca TB --- 2009-05-15 13:43:29 - starting RELENG_6 tinderbox run for amd64/amd64 TB --- 2009-05-15 13:43:29 - cleaning the object tree TB --- 2009-05-15 13:43:41 - cvsupping the source tree TB --- 2009-05-15 13:43:41 - /usr/bin/csup -z -r 3 -g -L 1 -h localhost -s /tinderbox/RELENG_6/amd64/amd64/supfile TB --- 2009-05-15 13:43:48 - building world TB --- 2009-05-15 13:43:48 - MAKEOBJDIRPREFIX=/obj TB --- 2009-05-15 13:43:48 - PATH=/usr/bin:/usr/sbin:/bin:/sbin TB --- 2009-05-15 13:43:48 - TARGET=amd64 TB --- 2009-05-15 13:43:48 - TARGET_ARCH=amd64 TB --- 2009-05-15 13:43:48 - TZ=UTC TB --- 2009-05-15 13:43:48 - __MAKE_CONF=/dev/null TB --- 2009-05-15 13:43:48 - cd /src TB --- 2009-05-15 13:43:48 - /usr/bin/make -B buildworld >>> Rebuilding the temporary build tree >>> stage 1.1: legacy release compatibility shims >>> stage 1.2: bootstrap tools >>> stage 2.1: cleaning up the object tree >>> stage 2.2: rebuilding the object tree >>> stage 2.3: build tools >>> stage 3: cross tools >>> stage 4.1: building includes >>> stage 4.2: building libraries [...] cc -O2 -fno-strict-aliasing -pipe -I. -I/src/lib/libthread_db -Wsystem-headers -Werror -Wall -Wno-format-y2k -W -Wno-unused-parameter -Wstrict-prototypes -Wmissing-prototypes -Wpointer-arith -Wreturn-type -Wcast-qual -Wwrite-strings -Wswitch -Wshadow -Wcast-align -Wunused-parameter -Wchar-subscripts -Winline -Wnested-externs -Wredundant-decls -c /src/lib/libthread_db/arch/amd64/libpthread_md.c /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_fpreg_to_ucontext': /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: implicit declaration of function `memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_ucontext_to_fpreg': /src/lib/libthread_db/arch/amd64/libpthread_md.c:100: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' *** Error code 1 Stop in /src/lib/libthread_db. *** Error code 1 Stop in /src/lib. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. TB --- 2009-05-15 14:07:52 - WARNING: /usr/bin/make returned exit code 1 TB --- 2009-05-15 14:07:52 - ERROR: failed to build world TB --- 2009-05-15 14:07:52 - 1117.87 user 155.05 system 1462.61 real http://tinderbox.des.no/tinderbox-releng_6-RELENG_6-amd64-amd64.full From owner-freebsd-stable@FreeBSD.ORG Fri May 15 14:57:31 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 7AEBF1065673; Fri, 15 May 2009 14:57:31 +0000 (UTC) (envelope-from nakal@web.de) Received: from fmmailgate02.web.de (fmmailgate02.web.de [217.72.192.227]) by mx1.freebsd.org (Postfix) with ESMTP id 0153A8FC0A; Fri, 15 May 2009 14:57:30 +0000 (UTC) (envelope-from nakal@web.de) Received: from smtp07.web.de (fmsmtp07.dlan.cinetic.de [172.20.5.215]) by fmmailgate02.web.de (Postfix) with ESMTP id 3D1CFFF1FF36; Fri, 15 May 2009 16:57:30 +0200 (CEST) Received: from [217.236.8.179] (helo=zelda.local) by smtp07.web.de with asmtp (TLSv1:AES128-SHA:128) (WEB.DE 4.110 #277) id 1M4yqn-0007WC-00; Fri, 15 May 2009 16:57:29 +0200 Date: Fri, 15 May 2009 16:57:27 +0200 From: Martin To: John Baldwin Message-ID: <20090515165727.30eab9ff@zelda.local> In-Reply-To: <200905150815.19452.jhb@freebsd.org> References: <1696198956@web.de> <200905140916.40594.jhb@freebsd.org> <20090514191026.0a90dbfc@zelda.local> <200905150815.19452.jhb@freebsd.org> X-Mailer: Claws Mail 3.7.1 (GTK+ 2.16.1; amd64-portbld-freebsd8.0) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: nakal@web.de X-Sender: nakal@web.de X-Provags-ID: V01U2FsdGVkX198zQUYdyR35QXhyap5YAEctjRwHgeaOIeHA3kI NhyBDD0Uz5J2d4c+5svAOIU7+MZ3ALfmIr2Ij7Zu5EJFXcPXJi fd8b3gknE= Cc: freebsd-stable@freebsd.org Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 14:57:31 -0000 Am Fri, 15 May 2009 08:15:19 -0400 schrieb John Baldwin : Hi John, > When I have seen this, it has often been due to a hardware failure > such as bad RAM. hmmm... I will check this next week. > > cpuid = 2; apic id = 02 > > instruction pointer = 0x8:0xffffffff805bbc66 > > Can you do 'x/i 0xffffffff805bbc66'? Also, can you walk up the stack > to the frame for this panic ('frame 7') and do 'info registers'? (kgdb) x 0xffffffff805bbc66 0xffffffff805bbc66 : 0x4912b60f (kgdb) frame 7 #7 0xffffffff805bbc66 in rt_maskedcopy (src=0xffffffff51e2e6c8, dst=0xffffff00525ebd80, netmask=0xef3fdf377db53afa) at /usr/src/sys/net/route.c:1362 1362 { (kgdb) info registers rax 0xffffff00525ebd80 -1098129687168 rbx 0xffffff0006b570f8 -1099399073544 rcx 0x10 16 rdx 0xef3fdf377db53afa -1207000745686779142 rsi 0xffffff00525ebd80 -1098129687168 rdi 0xffffffff51e2e6c8 -2921142584 rbp 0xffffffff51e2e4c0 0xffffffff51e2e4c0 rsp 0xffffffff51e2e428 0xffffffff51e2e428 r8 0x0 0 r9 0xef3fdf377db53afa -1207000745686779142 r10 0xffffff00016eba50 -1099487593904 r11 0xffffffff80b3eec0 -2135691584 r12 0xe22173e466d29aa0 -2152311722091570528 r13 0xffffff0006832c00 -1099402368000 r14 0xef3fdf377db53afa -1207000745686779142 r15 0x0 0 rip 0xffffffff805bbc66 0xffffffff805bbc66 eflags 0x10286 66182 cs 0x8 8 ss 0x10 16 ds 0x0 0 es 0x0 0 fs 0x0 0 gs 0x0 0 I hope it helps. -- Martin From owner-freebsd-stable@FreeBSD.ORG Fri May 15 15:03:48 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 17133106566B for ; Fri, 15 May 2009 15:03:48 +0000 (UTC) (envelope-from avg@icyb.net.ua) Received: from citadel.icyb.net.ua (citadel.icyb.net.ua [212.40.38.140]) by mx1.freebsd.org (Postfix) with ESMTP id 5A8178FC16 for ; Fri, 15 May 2009 15:03:46 +0000 (UTC) (envelope-from avg@icyb.net.ua) Received: from odyssey.starpoint.kiev.ua (alpha-e.starpoint.kiev.ua [212.40.38.101]) by citadel.icyb.net.ua (8.8.8p3/ICyb-2.3exp) with ESMTP id SAA11076; Fri, 15 May 2009 18:03:44 +0300 (EEST) (envelope-from avg@icyb.net.ua) Message-ID: <4A0D8450.5070706@icyb.net.ua> Date: Fri, 15 May 2009 18:03:44 +0300 From: Andriy Gapon User-Agent: Thunderbird 2.0.0.21 (X11/20090406) MIME-Version: 1.0 To: Helmut Schneider , freebsd-stable@freebsd.org References: <4A019B38.3060101@icyb.net.ua> In-Reply-To: X-Enigmail-Version: 0.95.7 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: Subject: Re: kbd0 at both atkbd0 and ukbd0 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 15:03:48 -0000 on 07/05/2009 17:17 Helmut Schneider said the following: > Andriy Gapon wrote: >> on 06/05/2009 14:43 Helmut Schneider said the following: >>> kbd1 at kbdmux0 >> [snip] >>> atkbdc0: at port 0x60,0x64 on isa0 atkbd0: >>> irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] >>> atkbd0: [ITHREAD] >> [snip] >>> ukbd0: on uhub0 kbd0 at >>> ukbd0 >> >> It took me three passes to notice the above: "kbd0 at atkbd0" and then again >> "kbd0 at ukbd0". > > Good point: > > http://www.freebsd.org/cgi/query-pr.cgi?pr=122887 > http://www.freebsd.org/cgi/query-pr.cgi?pr=133919 > > I have 'hint.atkbd.0.disabled="1"' at /boot/default.hints and (probably) > freebsd-update killed that one and silently replaced it with 1.16.8.1. The > whole mess might be related. D'oh! I don't actually understand fine details of what is happening when you don't have atkbd disabled (or configured for acpi attachment as opposed to isa). But I have a guess about why the system ultimately panics: - atkbd_timeout function: first called from atkbd_attach_unit and then reschedules itself at hz/10 - it accesses kbdsw[kbd->kb_index] without any checks, but there are couple of places where kbd_unregister could be called - thus kbdsw[kbd->kb_index] could become null or different keyboard - and there is no untimeout ever -- Andriy Gapon From owner-freebsd-stable@FreeBSD.ORG Fri May 15 15:24:35 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B1B01106564A for ; Fri, 15 May 2009 15:24:35 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 847BB8FC28 for ; Fri, 15 May 2009 15:24:35 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 3731346BA4; Fri, 15 May 2009 11:24:35 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 206668A025; Fri, 15 May 2009 11:24:34 -0400 (EDT) From: John Baldwin To: Martin Date: Fri, 15 May 2009 11:09:20 -0400 User-Agent: KMail/1.9.7 References: <1696198956@web.de> <200905150815.19452.jhb@freebsd.org> <20090515165727.30eab9ff@zelda.local> In-Reply-To: <20090515165727.30eab9ff@zelda.local> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905151109.21127.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Fri, 15 May 2009 11:24:34 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-stable@freebsd.org Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 15:24:35 -0000 On Friday 15 May 2009 10:57:27 am Martin wrote: > Am Fri, 15 May 2009 08:15:19 -0400 > schrieb John Baldwin : > > Hi John, > > > When I have seen this, it has often been due to a hardware failure > > such as bad RAM. > > hmmm... I will check this next week. > > > > cpuid = 2; apic id = 02 > > > instruction pointer = 0x8:0xffffffff805bbc66 > > > > Can you do 'x/i 0xffffffff805bbc66'? Also, can you walk up the stack > > to the frame for this panic ('frame 7') and do 'info registers'? > > (kgdb) x 0xffffffff805bbc66 > 0xffffffff805bbc66 : 0x4912b60f x/i please. The /i decodes it as an instruction so I can see which registers it was attempting to dereference. > (kgdb) frame 7 > #7 0xffffffff805bbc66 in rt_maskedcopy (src=0xffffffff51e2e6c8, > dst=0xffffff00525ebd80, netmask=0xef3fdf377db53afa) > at /usr/src/sys/net/route.c:1362 > 1362 { > (kgdb) info registers > rax 0xffffff00525ebd80 -1098129687168 > rbx 0xffffff0006b570f8 -1099399073544 > rcx 0x10 16 > rdx 0xef3fdf377db53afa -1207000745686779142 > rsi 0xffffff00525ebd80 -1098129687168 > rdi 0xffffffff51e2e6c8 -2921142584 > rbp 0xffffffff51e2e4c0 0xffffffff51e2e4c0 > rsp 0xffffffff51e2e428 0xffffffff51e2e428 > r8 0x0 0 > r9 0xef3fdf377db53afa -1207000745686779142 > r10 0xffffff00016eba50 -1099487593904 > r11 0xffffffff80b3eec0 -2135691584 > r12 0xe22173e466d29aa0 -2152311722091570528 > r13 0xffffff0006832c00 -1099402368000 > r14 0xef3fdf377db53afa -1207000745686779142 > r15 0x0 0 > rip 0xffffffff805bbc66 0xffffffff805bbc66 > eflags 0x10286 66182 > cs 0x8 8 > ss 0x10 16 > ds 0x0 0 > es 0x0 0 > fs 0x0 0 > gs 0x0 0 > > I hope it helps. > > -- > Martin > -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Fri May 15 15:37:12 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 997D5106570B; Fri, 15 May 2009 15:37:12 +0000 (UTC) (envelope-from nakal@web.de) Received: from fmmailgate02.web.de (fmmailgate02.web.de [217.72.192.227]) by mx1.freebsd.org (Postfix) with ESMTP id 3FE638FC2B; Fri, 15 May 2009 15:37:12 +0000 (UTC) (envelope-from nakal@web.de) Received: from smtp07.web.de (fmsmtp07.dlan.cinetic.de [172.20.5.215]) by fmmailgate02.web.de (Postfix) with ESMTP id CEB3CFF216E6; Fri, 15 May 2009 17:36:19 +0200 (CEST) Received: from [217.236.8.179] (helo=zelda.local) by smtp07.web.de with asmtp (TLSv1:AES128-SHA:128) (WEB.DE 4.110 #277) id 1M4zSN-0007p8-00; Fri, 15 May 2009 17:36:19 +0200 Date: Fri, 15 May 2009 17:36:18 +0200 From: Martin To: John Baldwin Message-ID: <20090515173618.78cca743@zelda.local> In-Reply-To: <200905150815.19452.jhb@freebsd.org> References: <1696198956@web.de> <200905140916.40594.jhb@freebsd.org> <20090514191026.0a90dbfc@zelda.local> <200905150815.19452.jhb@freebsd.org> X-Mailer: Claws Mail 3.7.1 (GTK+ 2.16.1; amd64-portbld-freebsd8.0) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: nakal@web.de X-Sender: nakal@web.de X-Provags-ID: V01U2FsdGVkX1+QZGiuobyGrcTrMVrdsSpYZjfCw79966r7nvp9 STCzYlYLeX/FJdCNfS578BfVAZe8G5LKBaL0WrvIlv6TdxHUcF Z90n+Anh8= Cc: freebsd-stable@freebsd.org Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 15:37:13 -0000 Hi John, one more thing that I noticed. It seems that the netmask passed to the procedure rt_maskedcopy is invalid. Cannot dereference the pointer. I went one frame up and I've looked at the control flow of the parent routine rtrequest1_fib. This routine passes the netmask, but before it does that it went with req=11 (RTM_RESOLVE) through this piece of code: /usr/src/sys/net/route.c:985 case RTM_RESOLVE: if (ret_nrt == NULL || (rt = *ret_nrt) == NULL) senderr(EINVAL); ifa = rt->rt_ifa; /* XXX locking? */ flags = rt->rt_flags & ~(RTF_CLONING | RTF_STATIC); flags |= RTF_WASCLONED; gateway = rt->rt_gateway; if ((netmask = rt->rt_genmask) == NULL) flags |= RTF_HOST; goto makeroute; Is this a locking problem? -- Martin From owner-freebsd-stable@FreeBSD.ORG Fri May 15 15:38:03 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 101591065672; Fri, 15 May 2009 15:38:03 +0000 (UTC) (envelope-from nakal@web.de) Received: from fmmailgate02.web.de (fmmailgate02.web.de [217.72.192.227]) by mx1.freebsd.org (Postfix) with ESMTP id BE84F8FC13; Fri, 15 May 2009 15:38:02 +0000 (UTC) (envelope-from nakal@web.de) Received: from smtp07.web.de (fmsmtp07.dlan.cinetic.de [172.20.5.215]) by fmmailgate02.web.de (Postfix) with ESMTP id 86CDCFF205C9; Fri, 15 May 2009 17:38:01 +0200 (CEST) Received: from [217.236.8.179] (helo=zelda.local) by smtp07.web.de with asmtp (TLSv1:AES128-SHA:128) (WEB.DE 4.110 #277) id 1M4zU1-0008EG-00; Fri, 15 May 2009 17:38:01 +0200 Date: Fri, 15 May 2009 17:38:00 +0200 From: Martin To: John Baldwin Message-ID: <20090515173800.071e53c2@zelda.local> In-Reply-To: <200905151109.21127.jhb@freebsd.org> References: <1696198956@web.de> <200905150815.19452.jhb@freebsd.org> <20090515165727.30eab9ff@zelda.local> <200905151109.21127.jhb@freebsd.org> X-Mailer: Claws Mail 3.7.1 (GTK+ 2.16.1; amd64-portbld-freebsd8.0) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: nakal@web.de X-Sender: nakal@web.de X-Provags-ID: V01U2FsdGVkX1841gpcuTSi+CFQqY77+8g7Dqg3DnSTFVihEl/T sHF9i7cCtyZlI7aGqO7UCa6vTQjVSI64os3xr8+AnBNex3sc4Z 4AbooBSk8= Cc: freebsd-stable@freebsd.org Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 15:38:03 -0000 Am Fri, 15 May 2009 11:09:20 -0400 schrieb John Baldwin : > x/i please. The /i decodes it as an instruction so I can see which > registers it was attempting to dereference. Oh sorry... (kgdb) x/i 0xffffffff805bbc66 0xffffffff805bbc66 : movzbl (%rdx),%edx -- Martin From owner-freebsd-stable@FreeBSD.ORG Fri May 15 15:42:46 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E7E52106566C for ; Fri, 15 May 2009 15:42:46 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id BA82E8FC16 for ; Fri, 15 May 2009 15:42:46 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 48BC946B0D; Fri, 15 May 2009 11:42:46 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 1AFB88A028; Fri, 15 May 2009 11:42:45 -0400 (EDT) From: John Baldwin To: Martin Date: Fri, 15 May 2009 11:42:38 -0400 User-Agent: KMail/1.9.7 References: <1696198956@web.de> <200905150815.19452.jhb@freebsd.org> <20090515173618.78cca743@zelda.local> In-Reply-To: <20090515173618.78cca743@zelda.local> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905151142.38933.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Fri, 15 May 2009 11:42:45 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-stable@freebsd.org Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 15:42:47 -0000 On Friday 15 May 2009 11:36:18 am Martin wrote: > > Hi John, > > one more thing that I noticed. It seems that the netmask passed to the > procedure rt_maskedcopy is invalid. Cannot dereference the pointer. > > I went one frame up and I've looked at the control flow of the parent > routine rtrequest1_fib. This routine passes the netmask, but before it > does that it went with req=11 (RTM_RESOLVE) through this piece of code: > > /usr/src/sys/net/route.c:985 > > case RTM_RESOLVE: > if (ret_nrt == NULL || (rt = *ret_nrt) == NULL) > senderr(EINVAL); > ifa = rt->rt_ifa; > /* XXX locking? */ > flags = rt->rt_flags & > ~(RTF_CLONING | RTF_STATIC); > flags |= RTF_WASCLONED; > gateway = rt->rt_gateway; > if ((netmask = rt->rt_genmask) == NULL) > flags |= RTF_HOST; > goto makeroute; > > Is this a locking problem? A GPF on amd64 usually happens because the pointer has high bits corrupt (the high N bits on amd64 must be either all zeros or all ones). In my experience those are all caused by hardware issues rather than races or bugs. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Fri May 15 16:02:15 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B503B1065670; Fri, 15 May 2009 16:02:15 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from nsc0.cwu.edu (nsc0.cwu.edu [72.233.196.16]) by mx1.freebsd.org (Postfix) with ESMTP id 8883E8FC18; Fri, 15 May 2009 16:02:15 +0000 (UTC) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (n.cwu.edu [198.104.69.57]) by nsc0.cwu.edu (8.14.3/8.14.3) with ESMTP id n4FG2FA7097007; Fri, 15 May 2009 09:02:15 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from n.cwu.edu (localhost [127.0.0.1]) by n.cwu.edu (8.13.3/8.13.3) with ESMTP id n4FG2Ek1021308; Fri, 15 May 2009 09:02:14 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) Received: from localhost (cwt@localhost) by n.cwu.edu (8.13.3/8.13.1/Submit) with ESMTP id n4FG2Eqq021305; Fri, 15 May 2009 09:02:14 -0700 (PDT) (envelope-from cwt@networks.cwu.edu) X-Authentication-Warning: n.cwu.edu: cwt owned process doing -bs Date: Fri, 15 May 2009 09:02:14 -0700 (PDT) From: Chris Timmons X-X-Sender: cwt@n.cwu.edu To: Kostik Belousov In-Reply-To: <20090515130609.GG1927@deviant.kiev.zoral.com.ua> Message-ID: <20090515085956.X21017@n.cwu.edu> References: <1696198956@web.de> <20090514091410.H12558@n.cwu.edu> <20090514093008.Q12558@n.cwu.edu> <200905141317.56551.jhb@freebsd.org> <20090514152838.E12558@n.cwu.edu> <20090515082458.GB1927@deviant.kiev.zoral.com.ua> <20090515053142.D17400@n.cwu.edu> <20090515130609.GG1927@deviant.kiev.zoral.com.ua> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed X-Greylist: Sender DNS name whitelisted, not delayed by milter-greylist-4.0 (nsc0.cwu.edu [72.233.196.16]); Fri, 15 May 2009 09:02:15 -0700 (PDT) Cc: freebsd-stable@freebsd.org, Martin Sugioarto , John Baldwin Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 16:02:16 -0000 Kostik, Looking good after applying your patch and rebuilding the kernel. I've been exercising the machine for a couple of hours under the same load which crashed it in short order yesterday. I will report back if any problems appear. Thank you for your help! Regards, -Chris last pid: 4131; load averages: 11.72, 8.89, 5.94 up 0+02:03:21 08:59:48 102 processes: 6 running, 96 sleeping CPU: 38.7% user, 0.0% nice, 11.2% system, 0.0% interrupt, 50.1% idle Mem: 409M Active, 1737M Inact, 241M Wired, 544K Cache, 112M Buf, 1372M Swap: 8192M Total, 8192M Free On Fri, 15 May 2009, Kostik Belousov wrote: > The file structure in the dump is fully initialized. It seems that the > issue is with devfs replacing file ops vector with devfs-specific one > in devfs_open() before the struct file is fully initialized in vn_open. > Please, try the patch below (against 7) and report results. > > Index: fs/devfs/devfs_vnops.c > =================================================================== > --- fs/devfs/devfs_vnops.c (revision 192089) > +++ fs/devfs/devfs_vnops.c (working copy) > @@ -890,6 +890,7 @@ > if (fp != NULL) { > FILE_LOCK(fp); > fp->f_data = dev; > + fp->f_vnode = vp; > FILE_UNLOCK(fp); > } > fpop = td->td_fpop; From owner-freebsd-stable@FreeBSD.ORG Fri May 15 16:26:18 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3100510656C6; Fri, 15 May 2009 16:26:18 +0000 (UTC) (envelope-from prvs=1386280ba5=killing@multiplay.co.uk) Received: from mail1.multiplay.co.uk (core6.multiplay.co.uk [85.236.96.23]) by mx1.freebsd.org (Postfix) with ESMTP id 7E8518FC08; Fri, 15 May 2009 16:26:16 +0000 (UTC) (envelope-from prvs=1386280ba5=killing@multiplay.co.uk) DKIM-Signature: v=1; a=rsa-sha256; c=simple; d=multiplay.co.uk; s=Multiplay; t=1242404140; x=1243008940; q=dns/txt; h=Received: Message-ID:From:To:References:Subject:Date:MIME-Version: Content-Type:Content-Transfer-Encoding; bh=11MXNTqbeNnbbmp+XKgzx csmbW62E2sgHNcWxKK/G5Q=; b=G8YdP9RnYROA5qRCsOFS2xkHPl9aMnfs1BBvR hEVdzr+r7YynJBViYQdu8/ktKMjGSsmrGlhezq61u5doxbKv4NVpCBzlF4MyGjds 9CBjiv+K9yELWCVjDN4jz6AHoGQy108/MWM7tpMu+WOv5o8QGUwxZlsNO+caHxtD aCrEYs= X-MDAV-Processed: mail1.multiplay.co.uk, Fri, 15 May 2009 17:15:40 +0100 Received: from r2d2 by mail1.multiplay.co.uk (MDaemon PRO v10.0.4) with ESMTP id md50007520506.msg; Fri, 15 May 2009 17:15:40 +0100 X-Spam-Processed: mail1.multiplay.co.uk, Fri, 15 May 2009 17:15:40 +0100 (not processed: message from trusted or authenticated source) X-Authenticated-Sender: Killing@multiplay.co.uk X-MDRemoteIP: 85.236.106.102 X-Return-Path: prvs=1386280ba5=killing@multiplay.co.uk X-Envelope-From: killing@multiplay.co.uk Message-ID: From: "Steven Hartland" To: "James Tanis" , "FreeBSD Questions" , References: <4A0C34DC.9040508@mdchs.org> Date: Fri, 15 May 2009 17:15:43 +0100 MIME-Version: 1.0 Content-Type: text/plain; format=flowed; charset="iso-8859-1"; reply-type=response Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2900.5512 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5579 Cc: Subject: Re: issues with Intel Pro/1000 and 1000baseTX X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 16:26:18 -0000 Never only set one end manually, always set both the machine and the switch. Regards Steve ----- Original Message ----- From: "James Tanis" To: "FreeBSD Questions" ; Sent: Thursday, May 14, 2009 4:12 PM Subject: issues with Intel Pro/1000 and 1000baseTX >I have a FreeBSD v7.0 box it has two Intel Pro/1000 NICs, the one in > question is: > > em1: port > 0x2020-0x203f mem 0xd8060000-0xd807ffff,0xd8040000-0xd805ffff irq 19 at > device 0.1 on pci4 > > what we get after boot is: > > em1: flags=8943 metric 0 > mtu 1500 > options=19b > ether 00:30:48:xx:xx:xx > inet 192.168.1.1 netmask 0xffffff00 broadcast 192.168.1.255 > media: Ethernet autoselect (100baseTX ) > status: active > > The problem is that the NIC refuses to connect at 1000baseTX. > > It's connected to a HP Procurve 1700-24 switch which supports 1000baseTX > on ports 23 and 24. This particular computer is connected on port 24. I > have a much older end user system which uses the same card (but earlier > revision), runs Windows XP and is plugged in to port 23. The end user > system has no problem connecting at 1000baseTX. I have of course tried > switching ports. > > Attempting to force 1000baseTX via: > > ifconfig em1 media 1000baseTX mediaopt full-duplex > > gets me: > > status: no carrier > > After forcing the NIC to go 1000baseTX the LEDs on the backpane are both > off. I can only come to the conclusion that this is a driver issue based > on previous experience and the simple fact that the end user system is > capable of connecting at 1000baseTX. Anybody have any suggestions? I'm > hoping I'm wrong. I'd rather not do an in-place upgrade, this is a > production system and the main gateway for an entire school, when I do > not even know for sure whether this will fix the problem. It's worth it > to me though, having a 1000baseTX uplink from the switch would remove a > major bottleneck for me. > > Any help would be appreciated. > > -- > James Tanis > Technical Coordinator > Computer Science Department > Monsignor Donovan Catholic High School > > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" > ================================================ This e.mail is private and confidential between Multiplay (UK) Ltd. and the person or entity to whom it is addressed. In the event of misdirection, the recipient is prohibited from using, copying, printing or otherwise disseminating it or any information contained in it. In the event of misdirection, illegible or incomplete transmission please telephone +44 845 868 1337 or return the E.mail to postmaster@multiplay.co.uk. From owner-freebsd-stable@FreeBSD.ORG Fri May 15 16:35:12 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id DFD961065689 for ; Fri, 15 May 2009 16:35:11 +0000 (UTC) (envelope-from jfvogel@gmail.com) Received: from mail-px0-f106.google.com (mail-px0-f106.google.com [209.85.216.106]) by mx1.freebsd.org (Postfix) with ESMTP id A8FDC8FC21 for ; Fri, 15 May 2009 16:35:11 +0000 (UTC) (envelope-from jfvogel@gmail.com) Received: by pxi4 with SMTP id 4so1269641pxi.3 for ; Fri, 15 May 2009 09:35:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type; bh=ayBZ6p7XK4oHrtFb5OJQ/B29MLUTBIsvpHoCghrqil0=; b=a1hTEKge8ViCUBeNncT87uYSD3xtlYs7cwev79wSTyhdXlMnDe5cHH+PyIWqFgiNkH iiTSN1aOpF2MjhWq+RrM0YnXACdtCtYRjDq9XMDE2k7S5KtMUekWgRJTJvhSb3Gm/esD HAUprfqj2GELdaP6szmtnklnpXebQChSGQIOY= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; b=oXyRj4KKH8adDruRsU1h4MRzmg0IciTPQRNaY62zyGH2v9QqnABEO2Pw9zedKq8KI/ nRla86J5HRALKvRMfSIiS+QFmcubHmvSuXsPetiOkNL+uNlOQ9OmbnWJoUuqOJD7+a9b Vw8mR3IzBzb3+UvMF2cbiKb+CDCU7sRc2ySn8= MIME-Version: 1.0 Received: by 10.142.193.10 with SMTP id q10mr1124481wff.274.1242405311388; Fri, 15 May 2009 09:35:11 -0700 (PDT) In-Reply-To: References: <4A0C34DC.9040508@mdchs.org> Date: Fri, 15 May 2009 09:35:11 -0700 Message-ID: <2a41acea0905150935u2d4d63f8q615ca33464b65bdb@mail.gmail.com> From: Jack Vogel To: Steven Hartland Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: James Tanis , freebsd-stable@freebsd.org, FreeBSD Questions Subject: Re: issues with Intel Pro/1000 and 1000baseTX X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 16:35:12 -0000 Better yet, just let them autoneg and you won't have these problems :) Jack On Fri, May 15, 2009 at 9:15 AM, Steven Hartland wrote: > Never only set one end manually, always set both the machine and the > switch. > > Regards > Steve > > ----- Original Message ----- From: "James Tanis" > To: "FreeBSD Questions" ; < > freebsd-stable@freebsd.org> > Sent: Thursday, May 14, 2009 4:12 PM > Subject: issues with Intel Pro/1000 and 1000baseTX > > > I have a FreeBSD v7.0 box it has two Intel Pro/1000 NICs, the one in >> question is: >> >> em1: port >> 0x2020-0x203f mem 0xd8060000-0xd807ffff,0xd8040000-0xd805ffff irq 19 at >> device 0.1 on pci4 >> >> what we get after boot is: >> >> em1: flags=8943 metric 0 >> mtu 1500 >> options=19b >> ether 00:30:48:xx:xx:xx >> inet 192.168.1.1 netmask 0xffffff00 broadcast 192.168.1.255 >> media: Ethernet autoselect (100baseTX ) >> status: active >> >> The problem is that the NIC refuses to connect at 1000baseTX. >> >> It's connected to a HP Procurve 1700-24 switch which supports 1000baseTX >> on ports 23 and 24. This particular computer is connected on port 24. I have >> a much older end user system which uses the same card (but earlier >> revision), runs Windows XP and is plugged in to port 23. The end user system >> has no problem connecting at 1000baseTX. I have of course tried switching >> ports. >> >> Attempting to force 1000baseTX via: >> >> ifconfig em1 media 1000baseTX mediaopt full-duplex >> >> gets me: >> >> status: no carrier >> >> After forcing the NIC to go 1000baseTX the LEDs on the backpane are both >> off. I can only come to the conclusion that this is a driver issue based on >> previous experience and the simple fact that the end user system is capable >> of connecting at 1000baseTX. Anybody have any suggestions? I'm hoping I'm >> wrong. I'd rather not do an in-place upgrade, this is a production system >> and the main gateway for an entire school, when I do not even know for sure >> whether this will fix the problem. It's worth it to me though, having a >> 1000baseTX uplink from the switch would remove a major bottleneck for me. >> >> Any help would be appreciated. >> >> -- >> James Tanis >> Technical Coordinator >> Computer Science Department >> Monsignor Donovan Catholic High School >> >> _______________________________________________ >> freebsd-stable@freebsd.org mailing list >> http://lists.freebsd.org/mailman/listinfo/freebsd-stable >> To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" >> >> > ================================================ > This e.mail is private and confidential between Multiplay (UK) Ltd. and the > person or entity to whom it is addressed. In the event of misdirection, the > recipient is prohibited from using, copying, printing or otherwise > disseminating it or any information contained in it. > In the event of misdirection, illegible or incomplete transmission please > telephone +44 845 868 1337 > or return the E.mail to postmaster@multiplay.co.uk. > > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" > From owner-freebsd-stable@FreeBSD.ORG Fri May 15 17:03:52 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 600D81065674 for ; Fri, 15 May 2009 17:03:52 +0000 (UTC) (envelope-from dudu@dudu.ro) Received: from mail-bw0-f213.google.com (mail-bw0-f213.google.com [209.85.218.213]) by mx1.freebsd.org (Postfix) with ESMTP id EF2ED8FC1B for ; Fri, 15 May 2009 17:03:51 +0000 (UTC) (envelope-from dudu@dudu.ro) Received: by bwz9 with SMTP id 9so2022532bwz.43 for ; Fri, 15 May 2009 10:03:51 -0700 (PDT) MIME-Version: 1.0 Received: by 10.223.122.15 with SMTP id j15mr2813628far.10.1242405784545; Fri, 15 May 2009 09:43:04 -0700 (PDT) From: Vlad GALU Date: Fri, 15 May 2009 19:42:43 +0300 Message-ID: To: freebsd-stable@freebsd.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Subject: ld-elf.so.1 isn't overwritten upon making installworld X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 17:03:52 -0000 All in subject. I could see the particular line where install is called on the newly built copy, but even though the system copy's file flags are cleared (noschg), the overwriting fails. I managed to overwrite it by (cp -f)-ing) the fresh copy over the old one. Regards, Vlad From owner-freebsd-stable@FreeBSD.ORG Fri May 15 17:10:06 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 09454106564A for ; Fri, 15 May 2009 17:10:06 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id D09788FC17 for ; Fri, 15 May 2009 17:10:05 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 71F0446B3B; Fri, 15 May 2009 13:10:05 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 3CFFA8A025; Fri, 15 May 2009 13:10:04 -0400 (EDT) From: John Baldwin To: Martin Date: Fri, 15 May 2009 12:05:47 -0400 User-Agent: KMail/1.9.7 References: <1696198956@web.de> <200905151109.21127.jhb@freebsd.org> <20090515173800.071e53c2@zelda.local> In-Reply-To: <20090515173800.071e53c2@zelda.local> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905151205.47672.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Fri, 15 May 2009 13:10:04 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-stable@freebsd.org Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 17:10:06 -0000 On Friday 15 May 2009 11:38:00 am Martin wrote: > Am Fri, 15 May 2009 11:09:20 -0400 > schrieb John Baldwin : > > > x/i please. The /i decodes it as an instruction so I can see which > > registers it was attempting to dereference. > > Oh sorry... > > (kgdb) x/i 0xffffffff805bbc66 > 0xffffffff805bbc66 : movzbl (%rdx),%edx Hmm, your %rdx is garbage. :( rdx 0xef3fdf377db53afa -1207000745686779142 That should at least be 0xffffff.......... Looks like r9 and r14 have the same odd value. Normally I would see a more obvious breakage such as one of the 'f' nibbles being set to '0' or 'e', etc. You could try looking for that odd pointer value in the route structure or as arguments to other functions in the stack trace to see if you can find a corrupted data structure. -- John Baldwin From owner-freebsd-stable@FreeBSD.ORG Fri May 15 18:10:39 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 4BF601065673; Fri, 15 May 2009 18:10:39 +0000 (UTC) (envelope-from nakal@web.de) Received: from fmmailgate03.web.de (fmmailgate03.web.de [217.72.192.234]) by mx1.freebsd.org (Postfix) with ESMTP id CD40F8FC1E; Fri, 15 May 2009 18:10:38 +0000 (UTC) (envelope-from nakal@web.de) Received: from smtp06.web.de (fmsmtp06.dlan.cinetic.de [172.20.5.172]) by fmmailgate03.web.de (Postfix) with ESMTP id AD872FC8E533; Fri, 15 May 2009 20:10:37 +0200 (CEST) Received: from [217.236.8.179] (helo=zelda.local) by smtp06.web.de with asmtp (TLSv1:AES128-SHA:128) (WEB.DE 4.110 #277) id 1M51rg-0006w4-00; Fri, 15 May 2009 20:10:36 +0200 Date: Fri, 15 May 2009 20:10:34 +0200 From: Martin To: John Baldwin Message-ID: <20090515201034.2b92c525@zelda.local> In-Reply-To: <200905151205.47672.jhb@freebsd.org> References: <1696198956@web.de> <200905151109.21127.jhb@freebsd.org> <20090515173800.071e53c2@zelda.local> <200905151205.47672.jhb@freebsd.org> X-Mailer: Claws Mail 3.7.1 (GTK+ 2.16.1; amd64-portbld-freebsd8.0) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: nakal@web.de X-Sender: nakal@web.de X-Provags-ID: V01U2FsdGVkX1/+IYzTtYaAHqebxJcNAlWMLAvbstzuFapVuNTK 4JKo30HNFVYDoEnMLq/Jq1XonlCsCZIO8amWY3g08NhugdAdTJ rCl8z1k0E= Cc: freebsd-stable@freebsd.org Subject: Re: kernel trap 12 with interrupts disabled [bge0 on 7.2R] X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 18:10:39 -0000 Am Fri, 15 May 2009 12:05:47 -0400 schrieb John Baldwin : > > (kgdb) x/i 0xffffffff805bbc66 > > 0xffffffff805bbc66 : movzbl (%rdx),%edx > > Hmm, your %rdx is garbage. :( > > rdx 0xef3fdf377db53afa -1207000745686779142 > > That should at least be > > 0xffffff.......... > > Looks like r9 and r14 have the same odd value. Normally I would see > a more obvious breakage such as one of the 'f' nibbles being set to > '0' or 'e', etc. You could try looking for that odd pointer value in > the route structure or as arguments to other functions in the stack > trace to see if you can find a corrupted data structure. Hi John, I've been testing RAM for 2 hours in user space with 3 parallel processes of sysutils/memtest. What can I say? I just got this in second loop of memtest: Loop 2: Stuck Address : ok Random Value : ok Compare XOR : ok Compare SUB : ok Compare MUL : ok Compare DIV : ok Compare OR : ok Compare AND : ok Sequential Increment: ok Solid Bits : ok Block Sequential : ok Checkerboard : testing 59FAILURE: 0xaaaaaaaaaaaaaaaa != 0x400300007007 at offset 0x007dc608. FAILURE: 0x5555555555555555 != 0xf0000070ef00007 at offset 0x007dc609. FAILURE: 0xaaaaaaaaaaaaaaaa != 0x00004003 at offset 0x007dc60a. FAILURE: 0x5555555555555555 != 0x00004002 at offset 0x007dc60b. FAILURE: 0xaaaaaaaaaaaaaaaa != 0xffffffff807cb4e0 at offset 0x007dc60c. FAILURE: 0x5555555555555555 != 0x00000000 at offset 0x007dc60d. FAILURE: 0xaaaaaaaaaaaaaaaa != 0x000002fa at offset 0x007dc60e. FAILURE: 0x5555555555555555 != 0x00000000 at offset 0x007dc60f. Bit Spread : ok Bit Flip : setting 35^C I think this is obvious enough. Thank you for your patience with me. This was a good hint. I would have never thought of a RAM defect. -- Martin From owner-freebsd-stable@FreeBSD.ORG Fri May 15 18:49:54 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 70649106566B for ; Fri, 15 May 2009 18:49:54 +0000 (UTC) (envelope-from dimitry@andric.com) Received: from tensor.andric.com (cl-327.ede-01.nl.sixxs.net [IPv6:2001:7b8:2ff:146::2]) by mx1.freebsd.org (Postfix) with ESMTP id 34E038FC15 for ; Fri, 15 May 2009 18:49:54 +0000 (UTC) (envelope-from dimitry@andric.com) Received: from [IPv6:2001:7b8:3a7:0:99b:8d02:f4ad:74c3] (unknown [IPv6:2001:7b8:3a7:0:99b:8d02:f4ad:74c3]) (using TLSv1 with cipher DHE-RSA-CAMELLIA256-SHA (256/256 bits)) (No client certificate requested) by tensor.andric.com (Postfix) with ESMTPSA id 513795C43; Fri, 15 May 2009 20:49:53 +0200 (CEST) Message-ID: <4A0DB94F.9040804@andric.com> Date: Fri, 15 May 2009 20:49:51 +0200 From: Dimitry Andric User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.2; en-US; rv:1.9.1b5pre) Gecko/20090515 Shredder/3.0b3pre MIME-Version: 1.0 To: Vlad GALU References: In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: ld-elf.so.1 isn't overwritten upon making installworld X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 18:49:54 -0000 On 2009-05-15 18:42, Vlad GALU wrote: > All in subject. I could see the particular line where install is > called on the newly built copy, but even though the system copy's file > flags are cleared (noschg), the overwriting fails. I managed to > overwrite it by (cp -f)-ing) the fresh copy over the old one. Are you running in single-user mode during installworld? From owner-freebsd-stable@FreeBSD.ORG Fri May 15 19:21:01 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 642CA106566C for ; Fri, 15 May 2009 19:21:01 +0000 (UTC) (envelope-from hdantas@wnetrj.com.br) Received: from send.wnetrj.com.br (send.wnetrj.com.br [201.76.223.13]) by mx1.freebsd.org (Postfix) with ESMTP id 150138FC0C for ; Fri, 15 May 2009 19:21:00 +0000 (UTC) (envelope-from hdantas@wnetrj.com.br) Received: from send.wnetrj.com.br (localhost.localdomain [127.0.0.1]) by send.wnetrj.com.br (Postfix) with ESMTP id DC40D6E425D for ; Fri, 15 May 2009 15:42:56 -0300 (BRT) Received: from relay.wnetrj.com.br (unknown [10.0.0.5]) by send.wnetrj.com.br (Postfix) with ESMTP id B46626E4246 for ; Fri, 15 May 2009 15:42:56 -0300 (BRT) Received: from relay.wnetrj.com.br (localhost.localdomain [127.0.0.1]) by relay.wnetrj.com.br (Postfix) with ESMTP id C315C83352 for ; Fri, 15 May 2009 14:35:25 -0300 (BRT) Received: from phillip-53dc8c5 (unknown [201.53.160.197]) by relay.wnetrj.com.br (Postfix) with ESMTP id 5302E8220C for ; Fri, 15 May 2009 12:55:19 -0300 (BRT) From: "Declaracao 2009 Incorreta" To: freebsd-stable@freebsd.org MIME-Version: 1.0 Date: Fri, 15 May 2009 12:55:21 -0300 Message-Id: <20090515155519.5302E8220C@relay.wnetrj.com.br> X-Virus-Scanned: ClamAV using ClamSMTP X-Virus-Scanned: ClamAV using ClamSMTP Content-Type: text/plain Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: Multa - =?iso-8859-1?q?Declara=E7=E3o?= 2009 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 19:21:01 -0000 - This mail is a HTML mail. Not all elements could be shown in plain = text mode. - Caso nao esteja visualizando este email, visualize aqui From owner-freebsd-stable@FreeBSD.ORG Fri May 15 20:25:57 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 40FBC106564A for ; Fri, 15 May 2009 20:25:57 +0000 (UTC) (envelope-from dudu@dudu.ro) Received: from mail-bw0-f213.google.com (mail-bw0-f213.google.com [209.85.218.213]) by mx1.freebsd.org (Postfix) with ESMTP id CD2858FC12 for ; Fri, 15 May 2009 20:25:56 +0000 (UTC) (envelope-from dudu@dudu.ro) Received: by bwz9 with SMTP id 9so2114226bwz.43 for ; Fri, 15 May 2009 13:25:55 -0700 (PDT) MIME-Version: 1.0 Received: by 10.223.111.134 with SMTP id s6mr2937293fap.37.1242419155431; Fri, 15 May 2009 13:25:55 -0700 (PDT) In-Reply-To: <4A0DB94F.9040804@andric.com> References: <4A0DB94F.9040804@andric.com> From: Vlad GALU Date: Fri, 15 May 2009 23:25:35 +0300 Message-ID: To: Dimitry Andric Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: ld-elf.so.1 isn't overwritten upon making installworld X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 20:25:57 -0000 On Fri, May 15, 2009 at 9:49 PM, Dimitry Andric wrote: > On 2009-05-15 18:42, Vlad GALU wrote: >> All in subject. I could see the particular line where install is >> called on the newly built copy, but even though the system copy's file >> flags are cleared (noschg), the overwriting fails. I managed to >> overwrite it by (cp -f)-ing) the fresh copy over the old one. > > Are you running in single-user mode during installworld? > Yep. From owner-freebsd-stable@FreeBSD.ORG Fri May 15 20:37:28 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 36D9E10656E6 for ; Fri, 15 May 2009 20:37:28 +0000 (UTC) (envelope-from dimitry@andric.com) Received: from tensor.andric.com (cl-327.ede-01.nl.sixxs.net [IPv6:2001:7b8:2ff:146::2]) by mx1.freebsd.org (Postfix) with ESMTP id E51398FC17 for ; Fri, 15 May 2009 20:37:27 +0000 (UTC) (envelope-from dimitry@andric.com) Received: from [IPv6:2001:7b8:3a7:0:99b:8d02:f4ad:74c3] (unknown [IPv6:2001:7b8:3a7:0:99b:8d02:f4ad:74c3]) (using TLSv1 with cipher DHE-RSA-CAMELLIA256-SHA (256/256 bits)) (No client certificate requested) by tensor.andric.com (Postfix) with ESMTPSA id 31CA35C42; Fri, 15 May 2009 22:37:27 +0200 (CEST) Message-ID: <4A0DD289.6050908@andric.com> Date: Fri, 15 May 2009 22:37:29 +0200 From: Dimitry Andric User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.2; en-US; rv:1.9.1b5pre) Gecko/20090515 Shredder/3.0b3pre MIME-Version: 1.0 To: Vlad GALU References: <4A0DB94F.9040804@andric.com> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: ld-elf.so.1 isn't overwritten upon making installworld X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 20:37:32 -0000 On 2009-05-15 22:25, Vlad GALU wrote: >>> called on the newly built copy, but even though the system copy's file >>> flags are cleared (noschg), the overwriting fails. I managed to >>> overwrite it by (cp -f)-ing) the fresh copy over the old one. >> Are you running in single-user mode during installworld? Alright, just checking. :) What is the exact error that you're getting? It might also be the binary isn't changed at all, and in that case it will *not* be updated (its Makefile uses INSTALLFLAGS=-C -b). From owner-freebsd-stable@FreeBSD.ORG Fri May 15 20:45:56 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C96661065670 for ; Fri, 15 May 2009 20:45:56 +0000 (UTC) (envelope-from dudu@dudu.ro) Received: from mail-bw0-f213.google.com (mail-bw0-f213.google.com [209.85.218.213]) by mx1.freebsd.org (Postfix) with ESMTP id 62C818FC0A for ; Fri, 15 May 2009 20:45:56 +0000 (UTC) (envelope-from dudu@dudu.ro) Received: by bwz9 with SMTP id 9so2122330bwz.43 for ; Fri, 15 May 2009 13:45:55 -0700 (PDT) MIME-Version: 1.0 Received: by 10.223.113.132 with SMTP id a4mr2856334faq.75.1242420355081; Fri, 15 May 2009 13:45:55 -0700 (PDT) In-Reply-To: <4A0DD289.6050908@andric.com> References: <4A0DB94F.9040804@andric.com> <4A0DD289.6050908@andric.com> From: Vlad GALU Date: Fri, 15 May 2009 23:45:35 +0300 Message-ID: To: Dimitry Andric Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Cc: freebsd-stable@freebsd.org Subject: Re: ld-elf.so.1 isn't overwritten upon making installworld X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 20:45:57 -0000 On Fri, May 15, 2009 at 11:37 PM, Dimitry Andric wrote= : > On 2009-05-15 22:25, Vlad GALU wrote: >>>> called on the newly built copy, but even though the system copy's file >>>> flags are cleared (noschg), the overwriting fails. I managed to >>>> overwrite it by (cp -f)-ing) the fresh copy over the old one. >>> Are you running in single-user mode during installworld? > > Alright, just checking. :) =A0What is the exact error that you're getting= ? > > It might also be the binary isn't changed at all, and in that case it > will *not* be updated (its Makefile uses INSTALLFLAGS=3D-C -b). > There's no error, I just happened to notice that the mtime of my ld-elf.so.1 was from about 2 months ago (that's about when I made the last update). The size of the fresh one from /usr/obj/... is different. Not to mention that there were even some recent changes in rtld.c :) From owner-freebsd-stable@FreeBSD.ORG Fri May 15 21:01:39 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 4B0D2106564A for ; Fri, 15 May 2009 21:01:39 +0000 (UTC) (envelope-from matheusber@gmail.com) Received: from qw-out-2122.google.com (qw-out-2122.google.com [74.125.92.26]) by mx1.freebsd.org (Postfix) with ESMTP id EBC0F8FC13 for ; Fri, 15 May 2009 21:01:38 +0000 (UTC) (envelope-from matheusber@gmail.com) Received: by qw-out-2122.google.com with SMTP id 3so1508202qwe.7 for ; Fri, 15 May 2009 14:01:38 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:sender:received:received :message-id:in-reply-to:references:date:subject:from:to:user-agent :mime-version:content-type:content-transfer-encoding:x-priority :importance; bh=AR+s7hI0q1FlVRjtquJqOei+6O0TetrzpPUERFzGmnE=; b=dNSp3BFdURHVVhXU7i4lisWqZ6Qu/16m15acaSuAR/KuQGERqC6bJSs0UzlzF4CrKn JcrdPxS+q75y+xdsfac1zE1q/wOV6qtpbZH2yKq96wGaDxcVQB2kaZbbFGjkh7U7SQUT bmEeATfM6bHcOD8Mb8ed1ofGOFRCrxTtnQe6g= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=sender:message-id:in-reply-to:references:date:subject:from:to :user-agent:mime-version:content-type:content-transfer-encoding :x-priority:importance; b=utCsrDP8L8ZGy648svW2Qg5UQ4oNk1MdVkv2wXa38mrt7HkVxsfBfLYpGTJtuH671n iJ/x9193nuyab0I2dTSRsc7dTV207+zKb78y4wh4wvZvPvgwZzLLcKPlxGJTQ+x+WSMi nUUl+mEqBXaws2y6ut6l+gdESg6V3udptVg2o= Received: by 10.224.89.8 with SMTP id c8mr4398964qam.371.1242421298371; Fri, 15 May 2009 14:01:38 -0700 (PDT) Received: from cygnus.homeunix.com (201008164081.user.veloxzone.com.br [201.8.164.81]) by mx.google.com with ESMTPS id 7sm314060qwf.55.2009.05.15.14.01.37 (version=TLSv1/SSLv3 cipher=RC4-MD5); Fri, 15 May 2009 14:01:37 -0700 (PDT) Sender: Nenhum_de_Nos Received: by cygnus.homeunix.com (Postfix, from userid 80) id 65959B8143; Fri, 15 May 2009 18:01:33 -0300 (BRT) Received: from 10.1.1.80 (SquirrelMail authenticated user matheus) by 10.1.1.10 with HTTP; Fri, 15 May 2009 18:01:33 -0300 (BRT) Message-ID: <9c6b919d50e3d92060fde088f06ddb2b.squirrel@10.1.1.10> In-Reply-To: References: <4A0C34DC.9040508@mdchs.org> Date: Fri, 15 May 2009 18:01:33 -0300 (BRT) From: "Nenhum_de_Nos" To: freebsd-stable@freebsd.org User-Agent: SquirrelMail/1.4.15 MIME-Version: 1.0 Content-Type: text/plain;charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Priority: 3 (Normal) Importance: Normal Subject: Re: issues with Intel Pro/1000 and 1000baseTX X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 21:01:39 -0000 On Thu, May 14, 2009 12:53, Tim Judd wrote: > On Thu, May 14, 2009 at 9:12 AM, James Tanis wrote: > >> I have a FreeBSD v7.0 box it has two Intel Pro/1000 NICs, the one in >> question is: >> >> em1: port >> 0x2020-0x203f mem 0xd8060000-0xd807ffff,0xd8040000-0xd805ffff irq 19 at >> device 0.1 on pci4 >> >> what we get after boot is: >> >> em1: flags=8943 metric 0 >> mtu 1500 >> options=19b >> ether 00:30:48:xx:xx:xx >> inet 192.168.1.1 netmask 0xffffff00 broadcast 192.168.1.255 >> media: Ethernet autoselect (100baseTX ) >> status: active >> >> The problem is that the NIC refuses to connect at 1000baseTX. >> >> It's connected to a HP Procurve 1700-24 switch which supports 1000baseTX >> on >> ports 23 and 24. This particular computer is connected on port 24. I >> have a >> much older end user system which uses the same card (but earlier >> revision), >> runs Windows XP and is plugged in to port 23. The end user system has no >> problem connecting at 1000baseTX. I have of course tried switching >> ports. >> >> Attempting to force 1000baseTX via: >> >> ifconfig em1 media 1000baseTX mediaopt full-duplex >> >> gets me: >> >> status: no carrier >> >> After forcing the NIC to go 1000baseTX the LEDs on the backpane are both >> off. I can only come to the conclusion that this is a driver issue based >> on >> previous experience and the simple fact that the end user system is >> capable >> of connecting at 1000baseTX. Anybody have any suggestions? I'm hoping >> I'm >> wrong. I'd rather not do an in-place upgrade, this is a production >> system >> and the main gateway for an entire school, when I do not even know for >> sure >> whether this will fix the problem. It's worth it to me though, having a >> 1000baseTX uplink from the switch would remove a major bottleneck for >> me. >> >> Any help would be appreciated. >> >> -- >> James Tanis >> Technical Coordinator >> Computer Science Department >> Monsignor Donovan Catholic High School >> > > I'm going to point the finger at the possibility of the Ethernet cable > itself. > > Gigabit link requires CAT5e or better (CAT6). A CAT5 alone is NOT enough > to > give gigabit speeds. Check the markings on the cable, replace if it's not > a > 5e or 6 and try again. This includes the discussion of proper terminating > and twist requirements. I know this is a bit off, but as I never had CAT6 stuff to deal with here it goes. is there any problems in using CAT6 cabling and not 1000baseTX capable switch ? I plan to install cat6 cables and just use 1000baseTX in future. this will be my new home network and all I have now is 100baseTX and two 1000baseT cards. thanks, matheus -- We will call you cygnus, The God of balance you shall be A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? http://en.wikipedia.org/wiki/Posting_style From owner-freebsd-stable@FreeBSD.ORG Fri May 15 23:07:03 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 51CA9106566C; Fri, 15 May 2009 23:07:03 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smarthost2.sentex.ca (smarthost2.sentex.ca [205.211.164.50]) by mx1.freebsd.org (Postfix) with ESMTP id 6BA868FC12; Fri, 15 May 2009 23:07:02 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smtp1.sentex.ca (smtp1.sentex.ca [199.212.134.4]) by smarthost2.sentex.ca (8.14.3/8.14.3) with ESMTP id n4FN70VG067428; Fri, 15 May 2009 19:07:00 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: from freebsd-legacy.sentex.ca (freebsd-legacy.sentex.ca [64.7.128.104]) by smtp1.sentex.ca (8.14.3/8.14.3) with ESMTP id n4FN70Ds089784; Fri, 15 May 2009 19:07:00 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: by freebsd-legacy.sentex.ca (Postfix, from userid 666) id D0B22241BA; Fri, 15 May 2009 19:06:59 -0400 (EDT) Sender: FreeBSD Tinderbox From: FreeBSD Tinderbox To: FreeBSD Tinderbox , , Precedence: bulk Message-Id: <20090515230659.D0B22241BA@freebsd-legacy.sentex.ca> Date: Fri, 15 May 2009 19:06:59 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95.1 at smtp1.sentex.ca X-Virus-Status: Clean X-Scanned-By: MIMEDefang 2.64 on 205.211.164.50 Cc: Subject: [releng_6 tinderbox] failure on amd64/amd64 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 15 May 2009 23:07:04 -0000 TB --- 2009-05-15 22:42:23 - tinderbox 2.6 running on freebsd-legacy.sentex.ca TB --- 2009-05-15 22:42:23 - starting RELENG_6 tinderbox run for amd64/amd64 TB --- 2009-05-15 22:42:23 - cleaning the object tree TB --- 2009-05-15 22:42:37 - cvsupping the source tree TB --- 2009-05-15 22:42:38 - /usr/bin/csup -z -r 3 -g -L 1 -h localhost -s /tinderbox/RELENG_6/amd64/amd64/supfile TB --- 2009-05-15 22:42:46 - building world TB --- 2009-05-15 22:42:46 - MAKEOBJDIRPREFIX=/obj TB --- 2009-05-15 22:42:46 - PATH=/usr/bin:/usr/sbin:/bin:/sbin TB --- 2009-05-15 22:42:46 - TARGET=amd64 TB --- 2009-05-15 22:42:46 - TARGET_ARCH=amd64 TB --- 2009-05-15 22:42:46 - TZ=UTC TB --- 2009-05-15 22:42:46 - __MAKE_CONF=/dev/null TB --- 2009-05-15 22:42:46 - cd /src TB --- 2009-05-15 22:42:46 - /usr/bin/make -B buildworld >>> Rebuilding the temporary build tree >>> stage 1.1: legacy release compatibility shims >>> stage 1.2: bootstrap tools >>> stage 2.1: cleaning up the object tree >>> stage 2.2: rebuilding the object tree >>> stage 2.3: build tools >>> stage 3: cross tools >>> stage 4.1: building includes >>> stage 4.2: building libraries [...] cc -O2 -fno-strict-aliasing -pipe -I. -I/src/lib/libthread_db -Wsystem-headers -Werror -Wall -Wno-format-y2k -W -Wno-unused-parameter -Wstrict-prototypes -Wmissing-prototypes -Wpointer-arith -Wreturn-type -Wcast-qual -Wwrite-strings -Wswitch -Wshadow -Wcast-align -Wunused-parameter -Wchar-subscripts -Winline -Wnested-externs -Wredundant-decls -c /src/lib/libthread_db/arch/amd64/libpthread_md.c /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_fpreg_to_ucontext': /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: implicit declaration of function `memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_ucontext_to_fpreg': /src/lib/libthread_db/arch/amd64/libpthread_md.c:100: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' *** Error code 1 Stop in /src/lib/libthread_db. *** Error code 1 Stop in /src/lib. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. TB --- 2009-05-15 23:06:59 - WARNING: /usr/bin/make returned exit code 1 TB --- 2009-05-15 23:06:59 - ERROR: failed to build world TB --- 2009-05-15 23:06:59 - 1119.03 user 156.33 system 1475.75 real http://tinderbox.des.no/tinderbox-releng_6-RELENG_6-amd64-amd64.full From owner-freebsd-stable@FreeBSD.ORG Sat May 16 00:02:23 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 68270106564A for ; Sat, 16 May 2009 00:02:23 +0000 (UTC) (envelope-from mat.macy@gmail.com) Received: from yx-out-2324.google.com (yx-out-2324.google.com [74.125.44.28]) by mx1.freebsd.org (Postfix) with ESMTP id 224218FC15 for ; Sat, 16 May 2009 00:02:22 +0000 (UTC) (envelope-from mat.macy@gmail.com) Received: by yx-out-2324.google.com with SMTP id 8so1276752yxb.13 for ; Fri, 15 May 2009 17:02:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:sender:received:date :x-google-sender-auth:message-id:subject:from:to:content-type :content-transfer-encoding; bh=xWGhctWbn71blI4aMmJR4hGmOVtf8zoOPYVP5s/uSwQ=; b=VU60DYkoGxPpvF3NrB/E/DnB9WqEnGV3LxmEM9l6t+xgxhFlsmh9oJ5gVyg5r/BC8S FqjEAaJRliUGCbrSpaf5WfKRSaUNzinF5l8bCV06kTK4gDClolkzPqzpQCiDfNbhNz+2 LeRYFMbxWE5MlxM5zdwpIwhjZxkaQNKentvRA= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:sender:date:x-google-sender-auth:message-id:subject :from:to:content-type:content-transfer-encoding; b=m8X7SGVU/6vYxwj3VMMAlu9iv4o3016/TeobgeqPXAZCLptE+pbBGSGeNSznzzeZdn w9qKiYT1wWXgWSb+wDXjw7HKs7fP4jD0IpGkOllEkmTJhsDzst11zlx+Y1KqOObvJFzr 2Si9UQahkDIt61Iq1SHe6LAeRvt1zUyovjfjg= MIME-Version: 1.0 Sender: mat.macy@gmail.com Received: by 10.100.211.11 with SMTP id j11mr4903963ang.101.1242432142104; Fri, 15 May 2009 17:02:22 -0700 (PDT) Date: Fri, 15 May 2009 17:02:22 -0700 X-Google-Sender-Auth: e1324a6912544bde Message-ID: <3c1674c90905151702l81c2b88off1d2b2ffed39ca2@mail.gmail.com> From: Kip Macy To: freebsd-stable@freebsd.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Subject: RFT: ZFS MFC X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 00:02:23 -0000 I've MFC'd ZFS v13 to RELENG_7 in a work branch. Please test if you can. http://svn.freebsd.org/base/user/kmacy/ZFS_MFC/ The standard disclaimers apply. This has only been lightly tested in a VM. Please do not use it with data you care about at this time. Thanks, Kip -- When bad men combine, the good must associate; else they will fall one by one, an unpitied sacrifice in a contemptible struggle. Edmund Burke From owner-freebsd-stable@FreeBSD.ORG Sat May 16 06:50:22 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3F99D106566C for ; Sat, 16 May 2009 06:50:22 +0000 (UTC) (envelope-from mcdouga9@egr.msu.edu) Received: from mx.egr.msu.edu (surfnturf.egr.msu.edu [35.9.37.164]) by mx1.freebsd.org (Postfix) with ESMTP id 12A808FC19 for ; Sat, 16 May 2009 06:50:21 +0000 (UTC) (envelope-from mcdouga9@egr.msu.edu) Received: from localhost (localhost [127.0.0.1]) by mx.egr.msu.edu (Postfix) with ESMTP id 31A4C71F01F; Sat, 16 May 2009 02:50:21 -0400 (EDT) X-Virus-Scanned: amavisd-new at egr.msu.edu Received: from mx.egr.msu.edu ([127.0.0.1]) by localhost (surfnturf.egr.msu.edu [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id mxPrXL2P5OEC; Sat, 16 May 2009 02:50:21 -0400 (EDT) Received: from localhost (daemon.egr.msu.edu [35.9.44.65]) by mx.egr.msu.edu (Postfix) with ESMTP id 1097A71F00F; Sat, 16 May 2009 02:50:21 -0400 (EDT) Received: by localhost (Postfix, from userid 21281) id 05F4EDCF; Sat, 16 May 2009 02:50:21 -0400 (EDT) Date: Sat, 16 May 2009 02:50:21 -0400 From: Adam McDougall To: Kip Macy Message-ID: <20090516065019.GM82547@egr.msu.edu> References: <3c1674c90905151702l81c2b88off1d2b2ffed39ca2@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <3c1674c90905151702l81c2b88off1d2b2ffed39ca2@mail.gmail.com> User-Agent: Mutt/1.5.19 (2009-01-05) Cc: freebsd-stable@freebsd.org Subject: Re: RFT: ZFS MFC X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 06:50:22 -0000 On Fri, May 15, 2009 at 05:02:22PM -0700, Kip Macy wrote: I've MFC'd ZFS v13 to RELENG_7 in a work branch. Please test if you can. http://svn.freebsd.org/base/user/kmacy/ZFS_MFC/ The standard disclaimers apply. This has only been lightly tested in a VM. Please do not use it with data you care about at this time. Thanks, Kip Seems to work for me so far. I had a zfs send hang part way through and with a notable speed difference depending on the direction but this is literally the first time I've tried zfs send/recv and the systems are setup different so I have no idea if it would have happened anyway. Eventually I could probably make these test systems more similar to give a fair test, but wanted to mention it so others could check. Thanks for working on the MFC, I'm excited to see progress there! It will play a factor in some upcoming server plans even if the MFC doesn't happen for months. From owner-freebsd-stable@FreeBSD.ORG Sat May 16 07:25:58 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 30AFD1065670 for ; Sat, 16 May 2009 07:25:58 +0000 (UTC) (envelope-from byshenknet@byshenk.net) Received: from core.byshenk.net (core.byshenk.net [62.58.73.230]) by mx1.freebsd.org (Postfix) with ESMTP id BDB438FC14 for ; Sat, 16 May 2009 07:25:57 +0000 (UTC) (envelope-from byshenknet@byshenk.net) Received: from core.byshenk.net (localhost.aoes.com [127.0.0.1]) by core.byshenk.net (8.14.3/8.14.3) with ESMTP id n4G7Pj2W044839; Sat, 16 May 2009 09:25:45 +0200 (CEST) (envelope-from byshenknet@core.byshenk.net) Received: (from byshenknet@localhost) by core.byshenk.net (8.14.3/8.14.3/Submit) id n4G7PiPh044838; Sat, 16 May 2009 09:25:44 +0200 (CEST) (envelope-from byshenknet) Date: Sat, 16 May 2009 09:25:44 +0200 From: Greg Byshenk To: Nenhum_de_Nos Message-ID: <20090516072544.GC2571@core.byshenk.net> References: <4A0C34DC.9040508@mdchs.org> <9c6b919d50e3d92060fde088f06ddb2b.squirrel@10.1.1.10> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <9c6b919d50e3d92060fde088f06ddb2b.squirrel@10.1.1.10> User-Agent: Mutt/1.4.2.3i X-Spam-Status: No, score=-1.4 required=5.0 tests=ALL_TRUSTED autolearn=failed version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on core.byshenk.net Cc: freebsd-stable@freebsd.org Subject: Re: issues with Intel Pro/1000 and 1000baseTX X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 07:25:58 -0000 On Fri, May 15, 2009 at 06:01:33PM -0300, Nenhum_de_Nos wrote: > I know this is a bit off, but as I never had CAT6 stuff to deal with here > it goes. is there any problems in using CAT6 cabling and not 1000baseTX > capable switch ? > > I plan to install cat6 cables and just use 1000baseTX in future. this will > be my new home network and all I have now is 100baseTX and two 1000baseT > cards. There should be no problem at all. CAT6 must meet higher standards, but the basic cable design is the same at CAT5, and it works for 100baseTX, and even for 10baseT (if you really wanted to use it). When my company relocated to a new building, the entire network was cabled at CAT6, but we still have some machines and switches that are 100baseTX, and they work fine. -- greg byshenk - gbyshenk@byshenk.net - Leiden, NL From owner-freebsd-stable@FreeBSD.ORG Sat May 16 08:12:47 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id F2FD41065670; Sat, 16 May 2009 08:12:46 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smarthost1.sentex.ca (smarthost1.sentex.ca [64.7.153.18]) by mx1.freebsd.org (Postfix) with ESMTP id 9ED038FC1C; Sat, 16 May 2009 08:12:46 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smtp2.sentex.ca (smtp2c.sentex.ca [64.7.153.30]) by smarthost1.sentex.ca (8.14.3/8.14.3) with ESMTP id n4G8Ch5u031155; Sat, 16 May 2009 04:12:43 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: from freebsd-legacy.sentex.ca (freebsd-legacy.sentex.ca [64.7.128.104]) by smtp2.sentex.ca (8.14.3/8.14.3) with ESMTP id n4G8Chjo063396; Sat, 16 May 2009 04:12:43 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: by freebsd-legacy.sentex.ca (Postfix, from userid 666) id 6F241241BA; Sat, 16 May 2009 04:12:43 -0400 (EDT) Sender: FreeBSD Tinderbox From: FreeBSD Tinderbox To: FreeBSD Tinderbox , , Precedence: bulk Message-Id: <20090516081243.6F241241BA@freebsd-legacy.sentex.ca> Date: Sat, 16 May 2009 04:12:43 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95.1 at smtp2.sentex.ca X-Virus-Status: Clean X-Scanned-By: MIMEDefang 2.64 on 64.7.153.18 Cc: Subject: [releng_6 tinderbox] failure on amd64/amd64 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 08:12:47 -0000 TB --- 2009-05-16 07:47:56 - tinderbox 2.6 running on freebsd-legacy.sentex.ca TB --- 2009-05-16 07:47:56 - starting RELENG_6 tinderbox run for amd64/amd64 TB --- 2009-05-16 07:47:56 - cleaning the object tree TB --- 2009-05-16 07:48:11 - cvsupping the source tree TB --- 2009-05-16 07:48:11 - /usr/bin/csup -z -r 3 -g -L 1 -h localhost -s /tinderbox/RELENG_6/amd64/amd64/supfile TB --- 2009-05-16 07:48:18 - building world TB --- 2009-05-16 07:48:18 - MAKEOBJDIRPREFIX=/obj TB --- 2009-05-16 07:48:18 - PATH=/usr/bin:/usr/sbin:/bin:/sbin TB --- 2009-05-16 07:48:18 - TARGET=amd64 TB --- 2009-05-16 07:48:18 - TARGET_ARCH=amd64 TB --- 2009-05-16 07:48:18 - TZ=UTC TB --- 2009-05-16 07:48:18 - __MAKE_CONF=/dev/null TB --- 2009-05-16 07:48:18 - cd /src TB --- 2009-05-16 07:48:18 - /usr/bin/make -B buildworld >>> Rebuilding the temporary build tree >>> stage 1.1: legacy release compatibility shims >>> stage 1.2: bootstrap tools >>> stage 2.1: cleaning up the object tree >>> stage 2.2: rebuilding the object tree >>> stage 2.3: build tools >>> stage 3: cross tools >>> stage 4.1: building includes >>> stage 4.2: building libraries [...] cc -O2 -fno-strict-aliasing -pipe -I. -I/src/lib/libthread_db -Wsystem-headers -Werror -Wall -Wno-format-y2k -W -Wno-unused-parameter -Wstrict-prototypes -Wmissing-prototypes -Wpointer-arith -Wreturn-type -Wcast-qual -Wwrite-strings -Wswitch -Wshadow -Wcast-align -Wunused-parameter -Wchar-subscripts -Winline -Wnested-externs -Wredundant-decls -c /src/lib/libthread_db/arch/amd64/libpthread_md.c /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_fpreg_to_ucontext': /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: implicit declaration of function `memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_ucontext_to_fpreg': /src/lib/libthread_db/arch/amd64/libpthread_md.c:100: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' *** Error code 1 Stop in /src/lib/libthread_db. *** Error code 1 Stop in /src/lib. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. TB --- 2009-05-16 08:12:43 - WARNING: /usr/bin/make returned exit code 1 TB --- 2009-05-16 08:12:43 - ERROR: failed to build world TB --- 2009-05-16 08:12:43 - 1117.44 user 158.98 system 1487.13 real http://tinderbox.des.no/tinderbox-releng_6-RELENG_6-amd64-amd64.full From owner-freebsd-stable@FreeBSD.ORG Sat May 16 08:21:51 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 52E88106566B for ; Sat, 16 May 2009 08:21:51 +0000 (UTC) (envelope-from bms@incunabulum.net) Received: from out1.smtp.messagingengine.com (out1.smtp.messagingengine.com [66.111.4.25]) by mx1.freebsd.org (Postfix) with ESMTP id 234D88FC13 for ; Sat, 16 May 2009 08:21:50 +0000 (UTC) (envelope-from bms@incunabulum.net) Received: from compute1.internal (compute1.internal [10.202.2.41]) by out1.messagingengine.com (Postfix) with ESMTP id 49B0F343A00; Sat, 16 May 2009 04:21:50 -0400 (EDT) Received: from heartbeat1.messagingengine.com ([10.202.2.160]) by compute1.internal (MEProxy); Sat, 16 May 2009 04:21:50 -0400 X-Sasl-enc: +U2niVx6hkWskR9oWnS1LPS/1x7htP0oGSVeM+uEfY04 1242462109 Received: from [192.168.123.18] (82-35-112-254.cable.ubr07.dals.blueyonder.co.uk [82.35.112.254]) by mail.messagingengine.com (Postfix) with ESMTPSA id ADDCC1AC27; Sat, 16 May 2009 04:21:49 -0400 (EDT) Message-ID: <4A0E779B.4040309@incunabulum.net> Date: Sat, 16 May 2009 09:21:47 +0100 From: Bruce Simpson User-Agent: Thunderbird 2.0.0.21 (Windows/20090302) MIME-Version: 1.0 To: John Baldwin References: <4A0CF934.4000706@incunabulum.net> <200905150850.19843.jhb@freebsd.org> In-Reply-To: <200905150850.19843.jhb@freebsd.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: Boot panic w/7.2-STABLE on amd64: resource_list_alloc X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 08:21:51 -0000 John Baldwin wrote: > ... > Sounds like the ATA driver is allocating the same BAR twice. Hmm, yes, it > allocates the resources once for each channel it seems in the ata_ali_sata > attachment. Looking in ata-chipset.c, all the other chipsets are good about > allocating these resources in their chipinit routines rather than the > per-channel allocate routine. Well, except ata_pci_allocate() is also > busted. *sigh* I can work on a patch for HEAD if you are willing to test. > Yes, ata is gnarly in places... If a fix can be dropped straight into a 7.2 tree, then that is even better... I could try testing a NanoBSD image of HEAD on this machine if the change set delta between branches is sufficiently huge to prevent backporting the fix; this is my desktop machine and this is the only critical bug I've run into so far with 7.2. thanks, BMS From owner-freebsd-stable@FreeBSD.ORG Sat May 16 12:58:23 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5AE871065670 for ; Sat, 16 May 2009 12:58:23 +0000 (UTC) (envelope-from lists@loveturtle.net) Received: from loveturtle.net (loveturtle.net [216.89.228.174]) by mx1.freebsd.org (Postfix) with ESMTP id 2AFDB8FC20 for ; Sat, 16 May 2009 12:58:23 +0000 (UTC) (envelope-from lists@loveturtle.net) Received: from localhost (localhost [127.0.0.1]) by loveturtle.net (Postfix) with ESMTP id 5F857B4CA8 for ; Sat, 16 May 2009 08:41:36 -0400 (EDT) X-Virus-Scanned: amavisd-new at loveturtle.net Received: from loveturtle.net ([127.0.0.1]) by localhost (loveturtle.net [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ig7vqv8vGrb7 for ; Sat, 16 May 2009 08:41:32 -0400 (EDT) Received: from ramuh.loveturtle.net (ramuh.loveturtle.net [216.182.254.142]) (using TLSv1 with cipher DHE-RSA-CAMELLIA256-SHA (256/256 bits)) (No client certificate requested) by loveturtle.net (Postfix) with ESMTPSA id AFB14B4C9A for ; Sat, 16 May 2009 08:41:32 -0400 (EDT) Message-ID: <4A0EB479.5080502@loveturtle.net> Date: Sat, 16 May 2009 08:41:29 -0400 From: Dillon Kass User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9.1b3pre) Gecko/20090223 Thunderbird/3.0b2 MIME-Version: 1.0 To: freebsd-stable@freebsd.org References: <3c1674c90905151702l81c2b88off1d2b2ffed39ca2@mail.gmail.com> In-Reply-To: <3c1674c90905151702l81c2b88off1d2b2ffed39ca2@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Subject: Re: RFT: ZFS MFC X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 12:58:23 -0000 On 5/15/09 8:02 PM, Kip Macy wrote: > I've MFC'd ZFS v13 to RELENG_7 in a work branch. Please test if you can. > > http://svn.freebsd.org/base/user/kmacy/ZFS_MFC/ > > The standard disclaimers apply. This has only been lightly tested in a > VM. Please do not use it with data you care about at this time. > > > Thanks, > Kip > > I created a pool on 7.1, created some datasets, populated them, made some snapshots. Upgraded to v13 deleted a few snapshots, created a clone, promoted a clone, deleted and created some new datasets. So far so good. I'll try to make something break! From owner-freebsd-stable@FreeBSD.ORG Sat May 16 17:27:18 2009 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1D55F1065670; Sat, 16 May 2009 17:27:18 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smarthost2.sentex.ca (smarthost2.sentex.ca [205.211.164.50]) by mx1.freebsd.org (Postfix) with ESMTP id BF2A58FC1A; Sat, 16 May 2009 17:27:17 +0000 (UTC) (envelope-from tinderbox@freebsd.org) Received: from smtp1.sentex.ca (smtp1.sentex.ca [199.212.134.4]) by smarthost2.sentex.ca (8.14.3/8.14.3) with ESMTP id n4GHRExT071064; Sat, 16 May 2009 13:27:14 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: from freebsd-legacy.sentex.ca (freebsd-legacy.sentex.ca [64.7.128.104]) by smtp1.sentex.ca (8.14.3/8.14.3) with ESMTP id n4GHRErI088868; Sat, 16 May 2009 13:27:14 -0400 (EDT) (envelope-from tinderbox@freebsd.org) Received: by freebsd-legacy.sentex.ca (Postfix, from userid 666) id 7E469241BA; Sat, 16 May 2009 13:27:14 -0400 (EDT) Sender: FreeBSD Tinderbox From: FreeBSD Tinderbox To: FreeBSD Tinderbox , , Precedence: bulk Message-Id: <20090516172714.7E469241BA@freebsd-legacy.sentex.ca> Date: Sat, 16 May 2009 13:27:14 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95.1 at smtp1.sentex.ca X-Virus-Status: Clean X-Scanned-By: MIMEDefang 2.64 on 205.211.164.50 Cc: Subject: [releng_6 tinderbox] failure on amd64/amd64 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 17:27:18 -0000 TB --- 2009-05-16 17:02:27 - tinderbox 2.6 running on freebsd-legacy.sentex.ca TB --- 2009-05-16 17:02:27 - starting RELENG_6 tinderbox run for amd64/amd64 TB --- 2009-05-16 17:02:27 - cleaning the object tree TB --- 2009-05-16 17:02:39 - cvsupping the source tree TB --- 2009-05-16 17:02:39 - /usr/bin/csup -z -r 3 -g -L 1 -h localhost -s /tinderbox/RELENG_6/amd64/amd64/supfile TB --- 2009-05-16 17:02:46 - building world TB --- 2009-05-16 17:02:46 - MAKEOBJDIRPREFIX=/obj TB --- 2009-05-16 17:02:46 - PATH=/usr/bin:/usr/sbin:/bin:/sbin TB --- 2009-05-16 17:02:46 - TARGET=amd64 TB --- 2009-05-16 17:02:46 - TARGET_ARCH=amd64 TB --- 2009-05-16 17:02:46 - TZ=UTC TB --- 2009-05-16 17:02:46 - __MAKE_CONF=/dev/null TB --- 2009-05-16 17:02:46 - cd /src TB --- 2009-05-16 17:02:46 - /usr/bin/make -B buildworld >>> Rebuilding the temporary build tree >>> stage 1.1: legacy release compatibility shims >>> stage 1.2: bootstrap tools >>> stage 2.1: cleaning up the object tree >>> stage 2.2: rebuilding the object tree >>> stage 2.3: build tools >>> stage 3: cross tools >>> stage 4.1: building includes >>> stage 4.2: building libraries [...] cc -O2 -fno-strict-aliasing -pipe -I. -I/src/lib/libthread_db -Wsystem-headers -Werror -Wall -Wno-format-y2k -W -Wno-unused-parameter -Wstrict-prototypes -Wmissing-prototypes -Wpointer-arith -Wreturn-type -Wcast-qual -Wwrite-strings -Wswitch -Wshadow -Wcast-align -Wunused-parameter -Wchar-subscripts -Winline -Wnested-externs -Wredundant-decls -c /src/lib/libthread_db/arch/amd64/libpthread_md.c /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_fpreg_to_ucontext': /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: implicit declaration of function `memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c:94: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' /src/lib/libthread_db/arch/amd64/libpthread_md.c: In function `pt_ucontext_to_fpreg': /src/lib/libthread_db/arch/amd64/libpthread_md.c:100: warning: nested extern declaration of `memcpy' :0: warning: redundant redeclaration of 'memcpy' *** Error code 1 Stop in /src/lib/libthread_db. *** Error code 1 Stop in /src/lib. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. *** Error code 1 Stop in /src. TB --- 2009-05-16 17:27:14 - WARNING: /usr/bin/make returned exit code 1 TB --- 2009-05-16 17:27:14 - ERROR: failed to build world TB --- 2009-05-16 17:27:14 - 1118.53 user 155.83 system 1486.74 real http://tinderbox.des.no/tinderbox-releng_6-RELENG_6-amd64-amd64.full From owner-freebsd-stable@FreeBSD.ORG Sat May 16 17:31:41 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1F06F106564A; Sat, 16 May 2009 17:31:41 +0000 (UTC) (envelope-from david@usermode.org) Received: from mail.meer.net (mail.meer.net [64.13.141.3]) by mx1.freebsd.org (Postfix) with ESMTP id E3F248FC0A; Sat, 16 May 2009 17:31:40 +0000 (UTC) (envelope-from david@usermode.org) Received: from radagast.usermode.org (netblock-66-245-217-18.dslextreme.com [66.245.217.18]) by mail.meer.net (8.13.3/8.13.3/meer) with ESMTP id n4GHVUqJ058364; Sat, 16 May 2009 10:31:30 -0700 (PDT) (envelope-from david@usermode.org) From: David Johnson To: Robert Noland Date: Sat, 16 May 2009 10:31:29 -0700 User-Agent: KMail/1.11.3 (FreeBSD/7.2-RELEASE; KDE/4.2.3; i386; ; ) References: <200905042015.29394.david@usermode.org> <200905091841.26274.david@usermode.org> <1242141471.1755.11.camel@balrog.2hip.net> In-Reply-To: <1242141471.1755.11.camel@balrog.2hip.net> MIME-Version: 1.0 Content-Type: Text/Plain; charset="iso-8859-15" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905161031.29975.david@usermode.org> Cc: freebsd-stable@freebsd.org Subject: Re: Xorg hangs with drmwtq in 7.2-RELEASE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 17:31:41 -0000 I don't know if this helps pinpointing my problem, but when I unload the drm module, I get the following message: May 15 19:57:52 radagast kernel: vgapci0: child drm0 requested pci_disable_busmaster May 15 19:57:52 radagast kernel: drm0: detached May 15 19:57:52 radagast kernel: Warning: memory type drm_bufs leaked memory on destroy (4 allocations, 128 bytes leaked). After this I can load the radeon and drm modules, but X will not start, complaining about no screen found. -- David Johnson From owner-freebsd-stable@FreeBSD.ORG Sat May 16 19:49:26 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E2EE01065674 for ; Sat, 16 May 2009 19:49:26 +0000 (UTC) (envelope-from torfinn.ingolfsen@broadpark.no) Received: from osl1smout1.broadpark.no (osl1smout1.broadpark.no [80.202.4.58]) by mx1.freebsd.org (Postfix) with ESMTP id A5AE48FC2E for ; Sat, 16 May 2009 19:49:26 +0000 (UTC) (envelope-from torfinn.ingolfsen@broadpark.no) MIME-version: 1.0 Content-transfer-encoding: 7BIT Content-type: text/plain; charset=US-ASCII Received: from osl1sminn1.broadpark.no ([80.202.4.59]) by osl1smout1.broadpark.no (Sun Java(tm) System Messaging Server 6.3-3.01 (built Jul 12 2007; 32bit)) with ESMTP id <0KJR00ECD5QDI510@osl1smout1.broadpark.no> for freebsd-stable@freebsd.org; Sat, 16 May 2009 21:49:25 +0200 (CEST) Received: from kg-v2.kg4.no ([80.202.83.38]) by osl1sminn1.broadpark.no (Sun Java(tm) System Messaging Server 6.3-3.01 (built Jul 12 2007; 32bit)) with SMTP id <0KJR0058D5QCEG00@osl1sminn1.broadpark.no> for freebsd-stable@freebsd.org; Sat, 16 May 2009 21:49:25 +0200 (CEST) Date: Sat, 16 May 2009 21:49:24 +0200 From: Torfinn Ingolfsen To: freebsd-stable@freebsd.org Message-id: <20090516214924.b2c8b7c0.torfinn.ingolfsen@broadpark.no> In-reply-to: <20090408190018.9f30845f.torfinn.ingolfsen@broadpark.no> References: <20090408190018.9f30845f.torfinn.ingolfsen@broadpark.no> X-Mailer: Sylpheed 2.6.0 (GTK+ 2.16.1; amd64-portbld-freebsd7.2) X-Face: "t9w2,-X@O^I`jVW\sonI3.,36KBLZE*AL[y9lL[PyFD*r_S:dIL9c[8Y>V42R0"!"yb_zN,f#%.[PYYNq; m"_0v; ~rUM2Yy!zmkh)3&U|u!=T(zyv,MHJv"nDH>OJ`t(@mil461d_B'Uo|'nMwlKe0Mv=kvV?Nh@>Hb<3s_z2jYgZhPb@?Wi^x1a~Hplz1.zH Subject: Re: uchcom and RELENG_7? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 19:49:27 -0000 On Wed, 08 Apr 2009 19:00:18 +0200 Torfinn Ingolfsen wrote: > Hi, > Is there any reason why uchcom[1] hasn't been MFC'ed to RELENG_7 yet? Anyone? uchcom doesn't seem to have made it into 7.2 either. > I see discussion[2] about this subject as far back as around > 7.0-release, but I can't find any uchcom.c files in my src, not even > for latest RELENG_7. > > References: > 1) > http://www.freebsd.org/cgi/man.cgi?query=uchcom&apropos=0&sektion=0&manpath=FreeBSD+8-current&format=html > 2) > http://markmail.org/message/4w324qx4usmnd4ic#query:freebsd%20uchcom+page:1+mid:ecwpudhls4jqpr5s+state:results > -- > Regards, > Torfinn Ingolfsen -- Regards, Torfinn Ingolfsen From owner-freebsd-stable@FreeBSD.ORG Sat May 16 20:11:00 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1176A106566B for ; Sat, 16 May 2009 20:11:00 +0000 (UTC) (envelope-from npapke@acm.org) Received: from idcmail-mo1so.shaw.ca (idcmail-mo1so.shaw.ca [24.71.223.10]) by mx1.freebsd.org (Postfix) with ESMTP id D2D998FC0A for ; Sat, 16 May 2009 20:10:59 +0000 (UTC) (envelope-from npapke@acm.org) Received: from pd2ml1so-ssvc.prod.shaw.ca ([10.0.141.139]) by pd2mo1so-svcs.prod.shaw.ca with ESMTP; 16 May 2009 14:10:59 -0600 X-Cloudmark-SP-Filtered: true X-Cloudmark-SP-Result: v=1.0 c=0 a=BbwUBbeR3g4mJzlyhZsA:9 a=pNIJx-431e1aKZN0kl3I5X_YCmMA:4 a=avX_41wpOqIA:10 a=macy1kFFMuwA:10 Received: from unknown (HELO proven.lan) ([24.85.241.34]) by pd2ml1so-dmz.prod.shaw.ca with ESMTP; 16 May 2009 14:10:59 -0600 Received: from proven.lan (localhost [127.0.0.1]) by proven.lan (8.14.3/8.14.3) with ESMTP id n4GKAwSi055323 for ; Sat, 16 May 2009 13:10:58 -0700 (PDT) (envelope-from npapke@acm.org) Received: from localhost (localhost [[UNIX: localhost]]) by proven.lan (8.14.3/8.14.3/Submit) id n4GKAwG0054964 for freebsd-stable@freebsd.org; Sat, 16 May 2009 13:10:58 -0700 (PDT) (envelope-from npapke@acm.org) X-Authentication-Warning: proven.lan: npapke set sender to npapke@acm.org using -f From: Norbert Papke Organization: Archaeological Filing To: freebsd-stable@freebsd.org Date: Sat, 16 May 2009 13:10:56 -0700 User-Agent: KMail/1.9.10 References: <200905101217.39920.fbsd-ml@scrapper.ca> In-Reply-To: <200905101217.39920.fbsd-ml@scrapper.ca> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200905161310.57596.npapke@acm.org> Subject: Re: 7.2-STABLE: Inserting USB device causes Fatal Trap 12 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 20:11:00 -0000 On May 10, 2009, Norbert Papke wrote: > Inserting a USB thumb drive into a running sytem result in a "Fatal trap > 12: page fault while in kernel mode". After repeating the crash a few times with INVARIANTS enabled, it becomes apparent that EHCI transfer queue is getting corrupted. With High Precision Event Timers disabled in the BIOS, the problem goes away. This is an acceptable work-around for me. I am inclined to believe (but unable to prove) that the crash is due to a BIOS bug. Cheers, -- Norbert Papke. From owner-freebsd-stable@FreeBSD.ORG Sat May 16 23:13:16 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 2A007106566C for ; Sat, 16 May 2009 23:13:16 +0000 (UTC) (envelope-from dougb@FreeBSD.org) Received: from mail2.fluidhosting.com (mx24.fluidhosting.com [204.14.89.7]) by mx1.freebsd.org (Postfix) with ESMTP id AD7C18FC16 for ; Sat, 16 May 2009 23:13:15 +0000 (UTC) (envelope-from dougb@FreeBSD.org) Received: (qmail 24253 invoked by uid 399); 16 May 2009 23:13:14 -0000 Received: from localhost (HELO foreign.dougb.net) (dougb@dougbarton.us@127.0.0.1) by localhost with ESMTPAM; 16 May 2009 23:13:14 -0000 X-Originating-IP: 127.0.0.1 X-Sender: dougb@dougbarton.us Message-ID: <4A0F4888.3070709@FreeBSD.org> Date: Sat, 16 May 2009 16:13:12 -0700 From: Doug Barton Organization: http://www.FreeBSD.org/ User-Agent: Thunderbird 2.0.0.21 (X11/20090423) MIME-Version: 1.0 To: Daniel Gerzo References: <20090504225012.392fa49f.torfinn.ingolfsen@broadpark.no> <49FF5901.600@gmail.com> <49FFEECE.20403@FreeBSD.org> In-Reply-To: <49FFEECE.20403@FreeBSD.org> X-Enigmail-Version: 0.95.7 OpenPGP: id=D5B2F0FB Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: Torfinn Ingolfsen , freebsd-stable@freebsd.org, Manolis Kiagias Subject: Re: RELENG_7 - has mergemaster changed logic since 7.2-RELEASE? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 23:13:16 -0000 I think I now know what was causing the problem with the files being overwritten, the saved mtree database was somehow reduced to zero bytes causing the list of CHANGED files to be empty. Unfortunately I haven't tracked down the cause of why the mtree file would get emptied out (given that there is already code that should prevent that problem) but I have just committed r192230 which adds a lot of safety belts to the code involving creating and updating the mtree database, and creating and using the list of files with changes. It should no longer be possible to even enter the -U code unless there is both a valid mtree file AND a valid list of files with local changes. FWIW I've also improved the performance of the -U option by changing a use of grep for every file to using case. You should be able to grab the file from HEAD and run it on RELENG_[67] without any problems. I will MFC it as rapidly as possible. Sorry for the inconvenience, Doug -- This .signature sanitized for your protection From owner-freebsd-stable@FreeBSD.ORG Sat May 16 23:59:52 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: by hub.freebsd.org (Postfix, from userid 618) id 77D391065670; Sat, 16 May 2009 23:59:52 +0000 (UTC) To: freebsd-stable@freebsd.org Date: Sat, 16 May 2009 23:59:52 +0000 (GMT) X-Mailer: ELM [version 2.4ME+ PL54 (25)] MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Message-Id: <20090516235952.77D391065670@hub.freebsd.org> From: wpaul@FreeBSD.ORG (Bill Paul) Subject: Fear and loathing in FreeBSD 7.2 (AGP issues and fixes) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 May 2009 23:59:52 -0000 So. I decided to test FreeBSD 7.2 on my Averatec AV1020 ED1 laptop. (It currently has 6.0-RELEASE on it, and while it runs fine, I figured now was a good time to update it.) I ran into two problems with it, and I thought it would be a good idea to share how I resolved them, just in case anyone else is foolish enough to follow in my tracks. The laptop has a RaLink RT2560 wireless chipset. The ral(4) driver supports this chip out of the box, however that driver doesn't support WPA2 Enterprise, which I need for work. To get around this, I use the Windows NDIS driver with Project Evil. Unfortunately, the driver that comes with the laptop (version 3.0.3.0000) is buggy, and will trigger a kernel panic in certain conditions. It seems to have trouble parsing information from certain newer kinds of devices, which causes some of the code inside the driver binary to dereference a bogus pointer. This is not a problem with FreeBSD or Project Evil: I discovered that the same driver blue-screens Windows XP as well (a testament to just how closely Project Evil emulates Windows: it even emulates its crashes). Luckily there is a slightly newer driver available that fixes this issue (3.1.0.000), though I had to hunt a bit to find it. I put copies of the .SYS and .INF at: http://www.freebsd.org/~wpaul/7_2_RELEASE/wifi The other problems I had were with graphics. The Averatec has an Intel 82855GME graphics controller. With FreeBSD 6.0, I had it working nicely with DRI and everything. With FreeBSD 7.2 and xorg 1.6.0, I saw some peculiar problems. The most glaring issue was that after running X -configure for the first time and testing the resulting xorg.conf file, I found that the X server would not respond to the mouse or keyboard. After some digging, I found that this was due to the AutoAddDevices feature (described in xorg.conf(5)) being on by default. If AutoAddDevices is on, then AllowEmptyInput is also turned on, but the description for AllowEmptyInput says: "If AllowEmptyInput is on, devices using the kbd, mouse or vmmouse driver are ignored." I don't know what's supposed to happen instead, but it wasn't working. I had to add: Option "AutoAddDevices" "False" to my xorg.conf to turn this off in order for my mouse and keyboard to work. On a related note, the X server seems to ignore a lot of what you put in xorg.conf in favor of its autoselected defaults. I tried to use "DefaultDepth 24" to force the screen color depth, but it seems to always ignore this and use a depth of 32 bits. It seems to work ok, but I thought this was odd. If I tell it to do something, it should do it. This used to work in earlier X releases. More curiously, X -configure decided for some reason that my laptop had two graphics cards instead of one. This apparently has to do with the fact that the gracphic device has two PCI functions: vgapci0@pci0:0:2:0: class=0x030000 card=0x031914ff chip=0x35828086 rev=0x02 hdr=0x00 vendor = 'Intel Corporation' device = '82852GM/GME/GMV/PM, 855GM/GME Montara Integrated Graphics Device' class = display subclass = VGA vgapci1@pci0:0:2:1: class=0x038000 card=0x031914ff chip=0x35828086 rev=0x02 hdr=0x00 vendor = 'Intel Corporation' device = '82852GM/GME/GMV/PM, 855GM/GME Montara Integrated Graphics Device' class = display X -configure created a "Card" and "Screen" section for both of these, even though it should only have created one. I had to edit the xorg.conf to remove the duplicates. (This was something else that worked correctly in older versions of X.) Once I settled those issues, the X server worked, but I found that I was unable to use DRI. FreeBSD was correctly loading the agp, drm and i915 drivers, but the X server refused to activate DRI support. According to the Xorg.log.0 file, it was failing to allocate a couple of regions of physical memory from the AGP driver. I finally traced this down to the agp_i810 code in the kernel. In agp_i810_alloc_memory(), it says: [...] } else if (type == 2) { /* * Type 2 is the contiguous physical memory type, that hands * back a physical address. This is used for cursors on i810. * Hand back as many single pages with physical as the user * wants, but only allow one larger allocation (ARGB cursor) * for simplicity. */ if (size != AGP_PAGE_SIZE) { if (sc->argb_cursor != NULL) return 0; [...] I'm all for simplicity, but this is bogus: the Intel video driver wants to allocate three ranges of physical memory for cursors, but only the first one succeeds. Two additional allocates for 40K and 16K both fail because of this code. I ended up modifying agp_i810.c to deal with this, by allowing it to allocate as many of these ranges as it wants. In the process of testing this, I also ran into another problem: if you load agp.ko, drm.ko and i915.ko as modules, and then try to unload them, the kernel will panic in agp_i810_detach(). It seems that during unload, the drm/i915 code will release the I/O resources allocated by the agp_i810 driver before the agp_i810_detach() driver gets to run. That's a shame, because agp_i810_detach() needs to use them. When it tries to clear a bit in one of the i810's registers, it ends up trying to use a memory mapped I/P mapping that's no longer valid. As a workaround, I modified agp_i810_detach() to check to see if the resources are still valid, and to allocate them again if they're not. This is a hack: the DRM code should be sorted out to prevent this from happening, but I'm not really eager to dive into it myself. I put the modified Intel AGP driver code at: http://www.freebsd.org/~wpaul/7_2_RELEASE/agp To use it, copy agp_i810.c and agppriv.h to /sys/pci, then recompile your kernel and/or agp.ko module. Once I patched the AGP driver, the X server was willing to enable DRI support, but I found that GLX apps still didn't work. In particular, things like the GLmatrix screen saver in KDE 4 claimed that the current visual did not support the GLX extension. Looking through the log file again, I saw that it said: "(==) AIGLX disabled." I considered this odd, since I didn't ask to disable it. Apparently it's disabled by default. I corrected this by adding: Option "AIGLX" "on" to the xorg.conf file. Finally, everything worked correctly. I was even able to compile and install the latest Intel video driver (2.7.1). One minor nit is that the FreeBSD AGP code doesn't support GEM, which the newer X drivers seem to want. This does not appear to be a fatal problem (yet). I put my current xorg.conf file at: http://www.freebsd.org/~wpaul/7_2_RELEASE/agp as well. It was a bit of a shame that I had to fight so much to get this stuff to work, though now that I have I'm relatively pleased with the results. I was able to get bluetooth tethering to work with my Blackberry fairly easily. I still need to confirm that WPA2 works when I get to the office on Monday. If it does, I'm going to go through with the update. -Bill -- ============================================================================= -Bill Paul (510) 749-2329 | Senior Engineer, Master of Unix-Fu wpaul@windriver.com | Wind River Systems ============================================================================= "I put a dollar in a change machine. Nothing changed." - George Carlin =============================================================================