From owner-freebsd-sparc64@FreeBSD.ORG Mon May 30 11:07:08 2011 Return-Path: Delivered-To: freebsd-sparc64@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 021D31065673 for ; Mon, 30 May 2011 11:07:08 +0000 (UTC) (envelope-from owner-bugmaster@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id E5B2C8FC12 for ; Mon, 30 May 2011 11:07:07 +0000 (UTC) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.4/8.14.4) with ESMTP id p4UB77tA050505 for ; Mon, 30 May 2011 11:07:07 GMT (envelope-from owner-bugmaster@FreeBSD.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.4/8.14.4/Submit) id p4UB77mZ050503 for freebsd-sparc64@FreeBSD.org; Mon, 30 May 2011 11:07:07 GMT (envelope-from owner-bugmaster@FreeBSD.org) Date: Mon, 30 May 2011 11:07:07 GMT Message-Id: <201105301107.p4UB77mZ050503@freefall.freebsd.org> X-Authentication-Warning: freefall.freebsd.org: gnats set sender to owner-bugmaster@FreeBSD.org using -f From: FreeBSD bugmaster To: freebsd-sparc64@FreeBSD.org Cc: Subject: Current problem reports assigned to freebsd-sparc64@FreeBSD.org X-BeenThere: freebsd-sparc64@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Porting FreeBSD to the Sparc List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 30 May 2011 11:07:08 -0000 Note: to view an individual PR, use: http://www.freebsd.org/cgi/query-pr.cgi?pr=(number). The following is a listing of current problems submitted by FreeBSD users. These represent problem reports covering all versions including experimental development code and obsolete releases. S Tracker Resp. Description -------------------------------------------------------------------------------- f sparc/145211 sparc64 [panic] Memory modified after free o sparc/142102 sparc64 [nfs] [panic] FreeBSD 8.0 kernel panics on sparc64 whe o sparc/141918 sparc64 [ehci] ehci_interrupt: unrecoverable error, controller s sparc/139134 sparc64 kernel output corruption f sparc/108732 sparc64 ping(8) reports 14 digit time on sparc64 s sparc/107087 sparc64 [hang] system is hung during boot from CD o sparc/105048 sparc64 [trm] trm(4) panics on sparc64 o sparc/104428 sparc64 [nullfs] nullfs panics on E4500 (but not E420) o sparc/80890 sparc64 [panic] kmem_malloc(73728): kmem_map too small running o sparc/71729 sparc64 printf in kernel thread causes panic on SPARC 10 problems total. From owner-freebsd-sparc64@FreeBSD.ORG Wed Jun 1 10:14:59 2011 Return-Path: Delivered-To: freebsd-sparc@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 8B71A106567B for ; Wed, 1 Jun 2011 10:14:59 +0000 (UTC) (envelope-from plaxo@mx.plaxo.com) Received: from plaxomx1.plaxo.com (plaxomx1.plaxo.com [204.15.240.53]) by mx1.freebsd.org (Postfix) with ESMTP id 6866F8FC17 for ; Wed, 1 Jun 2011 10:14:59 +0000 (UTC) Received: from 127.0.0.1 ([10.1.2.12]) by plaxomx1.plaxo.com (StrongMail Enterprise 4.1.2(4.1.2-51177)); Wed, 01 Jun 2011 02:59:57 -0700 X-VirtualServer: Pulse-Member, plaxomx1.plaxo.com, 10.1.6.54 X-MailingID: 00000::00000::00000::00000::79-0-0-244816035251-0-20110601::910946 X-SMHeaderMap: mid="X-MailingID" X-Destination-ID: freebsd-sparc@freebsd.org X-SMFBL: ZnJlZWJzZC1zcGFyY0BmcmVlYnNkLm9yZw== DKIM-Signature: v=1; a=rsa-sha1; c=relaxed/relaxed; d=mx.plaxo.com; s=plaxo; l=6979; t=1306922397; i=@mx.plaxo.com; h=From:To:Subject:Message-Id:X-VirtualServerGroup: X-PlaxoMailType:X-PlaxoMailID:List-Unsubscribe:Reply-To: Errors-To:Date:X-Mailer:Content-Type:MIME-Version; bh=9uXs99EmQ6 NxlA7M+ZimJZ8CsoI=; b=QRb0EpLDjWrufdSXnr7OuOzep2p3Vs9xP/qwp6TJsL d9H+ldjGKmQc1tmGjafDIrZItLiD6fKlQOq1oIQ9jjbRi//cWxMunyoUJZxnhPOG fecMvQxCybJCkf7S59n9by From: "John Styler via Plaxo" To: freebsd-sparc@freebsd.org Message-Id: <88afa0768621dfd487c74aa998cec86c@xpertmailer.com> X-VirtualServerGroup: Pulse-Member X-PlaxoMailType: Pulse-NonMember X-PlaxoMailID: 79-0-0-244816035251-0-20110601 Errors-To: plaxo@mx.plaxo.com Date: Wed, 01 Jun 2011 02:59:57 -0700 X-Mailer: XPM4 v.0.3 < www.xpertmailer.com > MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: Subject: John Styler has invited you to join Plaxo X-BeenThere: freebsd-sparc64@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: noreply@plaxo.com List-Id: Porting FreeBSD to the Sparc List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 01 Jun 2011 10:14:59 -0000 John invited you to join Plaxo!=20 John Styler Web & Graphic Specialist, Get more people to see your business @ www.BizOppers.biz John uses Plaxo, a smart address book that keeps your address book up-to-date and notifies your important contacts when you change your information. Learn more. http://www.plaxo.com/products?src=3Demail&et=3D79&el=3Den_us Join now for free http://www.plaxo.com/join/taf?src=3Demail&et=3D79&el=3Den_us&rem=3D2rlB1zL1y4m= EmPQuzHKKxgCa3LZZDUa2oDB7ak2U0To%3D&suid=3DWwCFRqYnbpMkAONcISCRzHWRMlgGub8lK2E= ZdQ09YYk%3D Plaxo, Inc. - 203 Ravendale Drive - Mountain View - CA - 94043 - USA=20 Never Send Me Emails http://www.plaxo.com/stop?src=3Demail&et=3D79&el=3Den_us&email=3Dfreebsd-sparc= %40freebsd.org From owner-freebsd-sparc64@FreeBSD.ORG Wed Jun 1 23:12:47 2011 Return-Path: Delivered-To: freebsd-sparc64@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id A11B8106566B for ; Wed, 1 Jun 2011 23:12:47 +0000 (UTC) (envelope-from peterjeremy@acm.org) Received: from mail17.syd.optusnet.com.au (mail17.syd.optusnet.com.au [211.29.132.198]) by mx1.freebsd.org (Postfix) with ESMTP id 35B1E8FC17 for ; Wed, 1 Jun 2011 23:12:45 +0000 (UTC) Received: from server.vk2pj.dyndns.org (c220-239-116-103.belrs4.nsw.optusnet.com.au [220.239.116.103]) by mail17.syd.optusnet.com.au (8.13.1/8.13.1) with ESMTP id p51NCc6N031089 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Thu, 2 Jun 2011 09:12:39 +1000 X-Bogosity: Ham, spamicity=0.000000 Received: from server.vk2pj.dyndns.org (localhost.vk2pj.dyndns.org [127.0.0.1]) by server.vk2pj.dyndns.org (8.14.4/8.14.4) with ESMTP id p51NCcYL005306; Thu, 2 Jun 2011 09:12:38 +1000 (EST) (envelope-from peter@server.vk2pj.dyndns.org) Received: (from peter@localhost) by server.vk2pj.dyndns.org (8.14.4/8.14.4/Submit) id p51NCbGp005305; Thu, 2 Jun 2011 09:12:37 +1000 (EST) (envelope-from peter) Date: Thu, 2 Jun 2011 09:12:37 +1000 From: Peter Jeremy To: Marius Strobl Message-ID: <20110601231237.GA5267@server.vk2pj.dyndns.org> References: <20110526234728.GA69750@server.vk2pj.dyndns.org> <20110527120659.GA78000@alchemy.franken.de> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="UugvWAfsgieZRqgk" Content-Disposition: inline In-Reply-To: <20110527120659.GA78000@alchemy.franken.de> X-PGP-Key: http://members.optusnet.com.au/peterjeremy/pubkey.asc User-Agent: Mutt/1.5.21 (2010-09-15) Cc: freebsd-sparc64@freebsd.org Subject: Re: 'make -j16 universe' gives SIReset X-BeenThere: freebsd-sparc64@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Porting FreeBSD to the Sparc List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 01 Jun 2011 23:12:47 -0000 --UugvWAfsgieZRqgk Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On 2011-May-27 14:06:59 +0200, Marius Strobl wr= ote: >On Fri, May 27, 2011 at 09:47:28AM +1000, Peter Jeremy wrote: >> I tried a "make -j16 universe" using a recent 8-stable on a 16-CPU >> V890 and after about 11 minutes, I got the following. This box >It probably doesn't hurt to check the hardware with SunVTS though. I ran a comprehensive SunVTS test for just under 24 hours without any errors and then checked that an unmodified kernel still SIR'd (which it did). >ignores a certain amount of these). You could try whether the following >patch makes any difference to the SIR you're seeing: >http://people.freebsd.org/~marius/sparc64_intr_vector_stray.diff Running -current, I didn't see any stray interrupts but I did get some "spin lock held too long" panics. Unfortunately, I don't seem to get crashdumps - with DDB/KDB in the kernel, I get a hard hang - RSC break doesn't work and the only way out is a RSC reset. spin lock 0xc07914d8 (sched lock) held by 0xfffff8a29749d980 (tid 100443) t= oo long panic: spin lock held too long cpuid =3D 14 KDB: stack backtrace: panic() at panic+0x1c8 _mtx_lock_spin_failed() at _mtx_lock_spin_failed+0x50 _mtx_lock_spin() at _mtx_lock_spin+0xac sched_add() at sched_add+0x9c sched_wakeup() at sched_wakeup+0x44 setrunnable() at setrunnable+0xb8 sleepq_resume_thread() at sleepq_resume_thread+0xfc sleepq_broadcast() at sleepq_broadcast+0xbc wakeup() at wakeup+0x18 exit1() at exit1+0x10b0 sys_exit() at sys_exit+0x10 syscallenter() at syscallenter+0x2f4 syscall() at syscall+0x30 -- syscall (1, FreeBSD ELF64, sys_exit) %o7=3D0x128ad0 -- userland() at 0x14b748 user trace: trap %o7=3D0x128ad0 pc 0x14b748, sp 0x7fdffffdb81 spin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) t= oo long spin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) t= oo long spin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) t= oo long panic: spin lock held too long spin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) t= oo long spin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) t= oo long cpuid =3D 3 spin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) t= oo long spin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) t= oo long KDB: stack backtrace: spin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) t= oo long spin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) t= oo long pspin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) = too long aspin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) = too long nspin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) = too long ispin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) = too long cspin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) = too long (spin lock 0xc07914d8 (sched lock) held by 0xfffff8a0168f3980 (tid 100312) = too long ) at panic+0x1c8 _mtx_lock_spin_failed() at _mtx_lock_spin_failed+0x50 _thread_lock_flags() at _thread_lock_flags+0xf8 statclock() at statclock+0x144 handleevents() at handleevents+0x120 timercb() at timercb+0x4c0 tick_intr() at tick_intr+0x1fc -- interrupt level=3D0xe pil=3D0 %o7=3D0x5faa28 -- userland() at 0x5faa40 user trace: trap %o7=3D0x5faa28 pc 0x5faa40, sp 0x7fdffffb5f1 pc 0x5feab0, sp 0x7fdffffbf01 pc 0x600310, sp 0x7fdffffc011 pc 0x600628, sp 0x7fdffffc141 pc 0x6035f4, sp 0x7fdffffc251 pc 0x603b1c, sp 0x7fdffffc361 pc 0x605b14, sp 0x7fdffffc471 pc 0x5e6aa8, sp 0x7fdffffc551 pc 0x19c594, sp 0x7fdffffc611 pc 0x5a4aa0, sp 0x7fdffffc6d1 pc 0x454e08, sp 0x7fdffffc791 pc 0x4a65c0, sp 0x7fdffffc8c1 pc 0x4a0558, sp 0x7fdffffc981 pc 0x4a06c8, sp 0x7fdffffca41 pc 0x1924e0, sp 0x7fdffffcb01 pc 0x192684, sp 0x7fdffffcbc1 pc 0x1926a4, sp 0x7fdffffcc81 pc 0x1926a4, sp 0x7fdffffcd41 pc 0x54c710, sp 0x7fdffffce01 pc 0x144080, sp 0x7fdffffcec1 pc 0x17d698, sp 0x7fdffffcf81 pc 0x17e280, sp 0x7fdffffd041 pc 0x148710, sp 0x7fdffffd101 pc 0x1b9264, sp 0x7fdffffd1d1 pc 0x1001d0, sp 0x7fdffffd291 pc 0, sp 0x7fdffffd351 done KDB: enter: panic On -stable, I didn't see any stray interrupts but had the system hang several times, needing a RSC reset to recover. >and once with: > val &=3D ~DCR_SI; I tried adding this and the system survived a "make -j30 universe" on -stable (BTW "make universe" seems to have issues cross-building x86 derivatives). I'm now trying that on -current. I'm not sure what the implications of the above change are. --=20 Peter Jeremy --UugvWAfsgieZRqgk Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.17 (FreeBSD) iEYEARECAAYFAk3mx2UACgkQ/opHv/APuIcazgCfeS5I0HM5ODCLabzxtfkz42zC uvQAn3rjYotdp/KYnUNY3cYdk1gEyc8g =1v5M -----END PGP SIGNATURE----- --UugvWAfsgieZRqgk-- From owner-freebsd-sparc64@FreeBSD.ORG Fri Jun 3 07:26:53 2011 Return-Path: Delivered-To: freebsd-sparc64@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 84935106564A; Fri, 3 Jun 2011 07:26:53 +0000 (UTC) (envelope-from nwf@cs.jhu.edu) Received: from blaze.cs.jhu.edu (blaze.cs.jhu.edu [128.220.13.50]) by mx1.freebsd.org (Postfix) with ESMTP id 5976A8FC08; Fri, 3 Jun 2011 07:26:53 +0000 (UTC) Received: from gradx.cs.jhu.edu (gradx.cs.jhu.edu [128.220.13.52]) by blaze.cs.jhu.edu (8.14.3/8.14.3) with ESMTP id p5373ucI020158 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NOT); Fri, 3 Jun 2011 03:03:56 -0400 (EDT) Received: from gradx.cs.jhu.edu (localhost [127.0.0.1]) by gradx.cs.jhu.edu (8.14.3/8.13.1) with ESMTP id p5373u38026976; Fri, 3 Jun 2011 03:03:56 -0400 Received: (from nwf@localhost) by gradx.cs.jhu.edu (8.14.3/8.13.8/Submit) id p5373uU0026975; Fri, 3 Jun 2011 03:03:56 -0400 Date: Fri, 3 Jun 2011 03:03:56 -0400 From: Nathaniel W Filardo To: freebsd-current@freebsd.org, freebsd-sparc64@freebsd.org Message-ID: <20110603070356.GJ7129@gradx.cs.jhu.edu> References: <20110406080043.GQ609@gradx.cs.jhu.edu> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="Ev+J9+115SO9Z5Eo" Content-Disposition: inline In-Reply-To: <20110406080043.GQ609@gradx.cs.jhu.edu> User-Agent: Mutt/1.5.20 (2009-08-17) Cc: Subject: Re: ZFS panic with concurrent recv and read-heavy workload X-BeenThere: freebsd-sparc64@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Porting FreeBSD to the Sparc List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 03 Jun 2011 07:26:53 -0000 --Ev+J9+115SO9Z5Eo Content-Type: text/plain; charset=us-ascii Content-Disposition: inline I just got this on another machine, no heavy workload needed, just booting and starting some jails. Of interest, perhaps, both this and the machine triggering the below panic are SMP V240s with 1.5GHz CPUs (though I will confess that the machine in the original report may have had bad RAM). I have run a UP 1.2GHz V240 for months and never seen this panic. This time the kernel is > FreeBSD 9.0-CURRENT #9: Fri Jun 3 02:32:13 EDT 2011 csup'd immediately before building. The full panic this time is > panic: Lock buf_hash_table.ht_locks[i].ht_lock not exclusively locked @ > /usr/src/sys/modules/zfs/../../cddl/contrib/opensolaris/uts/common/fs/zfs/arc.c:4659 > > cpuid = 1 > KDB: stack backtrace: > panic() at panic+0x1c8 > _sx_assert() at _sx_assert+0xc4 > _sx_xunlock() at _sx_xunlock+0x98 > l2arc_feed_thread() at l2arc_feed_thread+0xeac > fork_exit() at fork_exit+0x9c > fork_trampoline() at fork_trampoline+0x8 > > SC Alert: SC Request to send Break to host. > KDB: enter: Line break on console > [ thread pid 27 tid 100121 ] > Stopped at kdb_enter+0x80: ta %xcc, 1 > db> reset > ttiimmeeoouutt sshhuuttttiinngg ddoowwnn CCPPUUss.. Half of the memory in this machine is new (well, came with the machine) and half is from the aforementioned UP V240 which seemed to work fine (I was attempting an upgrade when this happened); none of it (or indeed any of the hardware save the disk controller and disks) are common between this and the machine reporting below. Thoughts? Any help would be greatly appreciated. Thanks. --nwf; On Wed, Apr 06, 2011 at 04:00:43AM -0400, Nathaniel W Filardo wrote: >[...] > panic: Lock buf_hash_table.ht_locks[i].ht_lock not exclusively locked @ /usr/src/sys/modules/zfs/../../cddl/contrib/opensolaris/uts/common/fs/zfs/arc.c:1869 > > cpuid = 1 > KDB: stack backtrace: > panic() at panic+0x1c8 > _sx_assert() at _sx_assert+0xc4 > _sx_xunlock() at _sx_xunlock+0x98 > arc_evict() at arc_evict+0x614 > arc_get_data_buf() at arc_get_data_buf+0x360 > arc_buf_alloc() at arc_buf_alloc+0x94 > dmu_buf_will_fill() at dmu_buf_will_fill+0xfc > dmu_write() at dmu_write+0xec > dmu_recv_stream() at dmu_recv_stream+0x8a8 > zfs_ioc_recv() at zfs_ioc_recv+0x354 > zfsdev_ioctl() at zfsdev_ioctl+0xe0 > devfs_ioctl_f() at devfs_ioctl_f+0xe8 > kern_ioctl() at kern_ioctl+0x294 > ioctl() at ioctl+0x198 > syscallenter() at syscallenter+0x270 > syscall() at syscall+0x74 > -- syscall (54, FreeBSD ELF64, ioctl) %o7=0x40c13e24 -- > userland() at 0x40e72cc8 > user trace: trap %o7=0x40c13e24 > pc 0x40e72cc8, sp 0x7fdffff4641 > pc 0x40c158f4, sp 0x7fdffff4721 > pc 0x40c1e878, sp 0x7fdffff47f1 > pc 0x40c1ce54, sp 0x7fdffff8b01 > pc 0x40c1dbe0, sp 0x7fdffff9431 > pc 0x40c1f718, sp 0x7fdffffd741 > pc 0x10731c, sp 0x7fdffffd831 > pc 0x10c90c, sp 0x7fdffffd8f1 > pc 0x103ef0, sp 0x7fdffffe1d1 > pc 0x4021aff4, sp 0x7fdffffe291 > done >[...] --Ev+J9+115SO9Z5Eo Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) iEYEARECAAYFAk3oh1wACgkQTeQabvr9Tc+FLACcCBa0wdug6xIFknSxPi9f4IaN SbMAn1jJ9GqKWVs0lSJDMQXvCyxjc9nQ =JDAt -----END PGP SIGNATURE----- --Ev+J9+115SO9Z5Eo--