From owner-freebsd-scsi@FreeBSD.ORG Sun Mar 6 14:13:20 2011 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C12081065670 for ; Sun, 6 Mar 2011 14:13:20 +0000 (UTC) (envelope-from mavbsd@gmail.com) Received: from mail-bw0-f54.google.com (mail-bw0-f54.google.com [209.85.214.54]) by mx1.freebsd.org (Postfix) with ESMTP id 2362E8FC0A for ; Sun, 6 Mar 2011 14:13:19 +0000 (UTC) Received: by bwz12 with SMTP id 12so3455698bwz.13 for ; Sun, 06 Mar 2011 06:13:18 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:sender:message-id:date:from:user-agent :mime-version:to:cc:subject:x-enigmail-version:content-type :content-transfer-encoding; bh=br+3XBbsyreUqbV9PbHn233XBZiNok0Ys86pg/mmp6I=; b=IXLTmnqCUH45aLq8EcN6AES1Cx/jWLwzQt6u2rqqbjQynbVTH6OGIYRDAkG3OAxYv1 6jxwkSPmwTVbKIUftn+p1bs4sKmxphDAYNdOPxP6YtJHfYaUQdkqrI66F2SzALzu1/Tb he/8hTG+OLytGIrR+c3XFspDELqSc7qB7DR8k= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=sender:message-id:date:from:user-agent:mime-version:to:cc:subject :x-enigmail-version:content-type:content-transfer-encoding; b=C2r+aNyOqkrj354aYpYazXfBIdUMTW2dIrvxrUrnrhO1Xw5VR13WABTKmsdxPRZZX7 us7fN9cNtoQHS1cHHf0Xcfjt9TECIoofIRVt9saoiqBjTvPrGKds95GDlL+YGtLZyMfr +uqc41BEvYMoPaLVnmlomWz6W+NtNCk06TSqM= Received: by 10.204.153.20 with SMTP id i20mr2386345bkw.208.1299420798458; Sun, 06 Mar 2011 06:13:18 -0800 (PST) Received: from mavbook2.mavhome.dp.ua (pc.mavhome.dp.ua [212.86.226.226]) by mx.google.com with ESMTPS id l1sm1023679bkl.13.2011.03.06.06.13.16 (version=SSLv3 cipher=OTHER); Sun, 06 Mar 2011 06:13:17 -0800 (PST) Sender: Alexander Motin Message-ID: <4D73962D.3080908@FreeBSD.org> Date: Sun, 06 Mar 2011 16:11:57 +0200 From: Alexander Motin User-Agent: Thunderbird 2.0.0.23 (X11/20091212) MIME-Version: 1.0 To: FreeBSD-Current , freebsd-geom@freebsd.org, freebsd-scsi@freebsd.org X-Enigmail-Version: 0.96.0 Content-Type: text/plain; charset=KOI8-R Content-Transfer-Encoding: 7bit Cc: Warner Losh Subject: RFC: graid, first step done. X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 06 Mar 2011 14:13:20 -0000 Hi. I would like to present for review and testing results of my and M. Warner Losh work on new GEOM-based ataraid(4) replacement - graid. At this moment we have implemented all required core functionality and the first set of transformation and metadata modules. Now we have implemented transformation modules to support such RAID levels: RAID0, RAID1, RAID1E, RAID10 and CONCAT. For all these levels all such features as reading, writing, rebuilding, I/O errors recovery and kernel dumping are supported. Metadata modules now written for Intel and JMicron RAID BIOS formats. JMicron format is minimalistic by itself, while Intel format is much more sophisticated, and now we support all of it features, except RAID level migration. Work on Promise (used by AMD) and SiliconImage formats is now in progress. Support for other metadata formats and RAID levels planned later. If somebody wants to help -- feel free to contact me to coordinate work. Patches are existing for HEAD, 8-STABLE and 7-STABLE branches: http://people.freebsd.org/~mav/graid/graid-head-20110305.diff http://people.freebsd.org/~mav/graid/graid-8-20110305.diff http://people.freebsd.org/~mav/graid/graid-7-20110305.diff Patches for HEAD and 8-STABLE are not very invasive and, if everything goes well, going to be committed to HEAD in few weeks. Patch for 7-STABLE, except graid, also includes massive update of the ATA and CAM code from 9-CURRENT. It will not be committed due to numerous ABI breakages, but can be interesting to some people. Just be aware that main focus there was on ATA subsystem -- SCSI drivers were not tested. Once completed, this project should remove last major show stopper for enabling CAM-based ATA subsystem by default in 9.x branch. I would really like it to be done before 9.0 release. Special thanks to Cisco Systems, Inc. and iXsystems, Inc. for sponsoring this project. -- Alexander Motin From owner-freebsd-scsi@FreeBSD.ORG Mon Mar 7 11:07:07 2011 Return-Path: Delivered-To: freebsd-scsi@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 484681065674 for ; Mon, 7 Mar 2011 11:07:07 +0000 (UTC) (envelope-from owner-bugmaster@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id 34FA88FC14 for ; Mon, 7 Mar 2011 11:07:07 +0000 (UTC) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.4/8.14.4) with ESMTP id p27B773s097052 for ; Mon, 7 Mar 2011 11:07:07 GMT (envelope-from owner-bugmaster@FreeBSD.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.4/8.14.4/Submit) id p27B76KF097050 for freebsd-scsi@FreeBSD.org; Mon, 7 Mar 2011 11:07:06 GMT (envelope-from owner-bugmaster@FreeBSD.org) Date: Mon, 7 Mar 2011 11:07:06 GMT Message-Id: <201103071107.p27B76KF097050@freefall.freebsd.org> X-Authentication-Warning: freefall.freebsd.org: gnats set sender to owner-bugmaster@FreeBSD.org using -f From: FreeBSD bugmaster To: freebsd-scsi@FreeBSD.org Cc: Subject: Current problem reports assigned to freebsd-scsi@FreeBSD.org X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 07 Mar 2011 11:07:07 -0000 Note: to view an individual PR, use: http://www.freebsd.org/cgi/query-pr.cgi?pr=(number). The following is a listing of current problems submitted by FreeBSD users. These represent problem reports covering all versions including experimental development code and obsolete releases. S Tracker Resp. Description -------------------------------------------------------------------------------- o kern/154432 scsi [xpt] run_interrupt_driven_hooks: still waiting after o kern/153361 scsi [ciss] Smart Array 5300 boot/detect drive problem o kern/152250 scsi [ciss] [patch] Kernel panic when hw.ciss.expose_hidden o kern/151564 scsi [ciss] ciss(4) should increase CISS_MAX_LOGICAL to 10 o docs/151336 scsi Missing documentation of scsi_ and ata_ functions in c s kern/149927 scsi [cam] hard drive not stopped before removing power dur o kern/148083 scsi [aac] Strange device reporting o kern/147704 scsi [mpt] sys/dev/mpt: new chip revision, partially unsupp o kern/146287 scsi [ciss] ciss(4) cannot see more than one SmartArray con o kern/145768 scsi [mpt] can't perform I/O on SAS based SAN disk in freeb o kern/144648 scsi [aac] Strange values of speed and bus width in dmesg o kern/144301 scsi [ciss] [hang] HP proliant server locks when using ciss o kern/142351 scsi [mpt] LSILogic driver performance problems o kern/141934 scsi [cam] [patch] add support for SEAGATE DAT Scopion 130 o kern/134488 scsi [mpt] MPT SCSI driver probes max. 8 LUNs per device o kern/132250 scsi [ciss] ciss driver does not support more then 15 drive o kern/132206 scsi [mpt] system panics on boot when mirroring and 2nd dri o kern/130621 scsi [mpt] tranfer rate is inscrutable slow when use lsi213 o kern/129602 scsi [ahd] ahd(4) gets confused and wedges SCSI bus o kern/128452 scsi [sa] [panic] Accessing SCSI tape drive randomly crashe o kern/128245 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o kern/127927 scsi [isp] isp(4) target driver crashes kernel when set up o kern/127717 scsi [ata] [patch] [request] - support write cache toggling o kern/124667 scsi [amd] [panic] FreeBSD-7 kernel page faults at amd-scsi o kern/123674 scsi [ahc] ahc driver dumping o kern/123520 scsi [ahd] unable to boot from net while using ahd o sparc/121676 scsi [iscsi] iscontrol do not connect iscsi-target on sparc o kern/120487 scsi [sg] scsi_sg incompatible with scanners o kern/120247 scsi [mpt] FreeBSD 6.3 and LSI Logic 1030 = only 3.300MB/s o kern/114597 scsi [sym] System hangs at SCSI bus reset with dual HBAs o kern/110847 scsi [ahd] Tyan U320 onboard problem with more than 3 disks o kern/99954 scsi [ahc] reading from DVD failes on 6.x [regression] f kern/94838 scsi Kernel panic while mounting SD card with lock switch o o kern/92798 scsi [ahc] SCSI problem with timeouts o kern/90282 scsi [sym] SCSI bus resets cause loss of ch device o kern/76178 scsi [ahd] Problem with ahd and large SCSI Raid system o kern/74627 scsi [ahc] [hang] Adaptec 2940U2W Can't boot 5.3 s kern/61165 scsi [panic] kernel page fault after calling cam_send_ccb o kern/60641 scsi [sym] Sporadic SCSI bus resets with 53C810 under load o kern/60598 scsi wire down of scsi devices conflicts with config s kern/57398 scsi [mly] Current fails to install on mly(4) based RAID di o bin/57088 scsi [cam] [patch] for a possible fd leak in libcam.c o kern/52638 scsi [panic] SCSI U320 on SMP server won't run faster than o kern/44587 scsi dev/dpt/dpt.h is missing defines required for DPT_HAND f kern/40895 scsi wierd kernel / device driver bug o kern/39388 scsi ncr/sym drivers fail with 53c810 and more than 256MB m o kern/35234 scsi World access to /dev/pass? (for scanner) requires acce 47 problems total. From owner-freebsd-scsi@FreeBSD.ORG Mon Mar 7 11:29:37 2011 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id DE34F106564A for ; Mon, 7 Mar 2011 11:29:37 +0000 (UTC) (envelope-from freebsd-scsi@m.gmane.org) Received: from lo.gmane.org (lo.gmane.org [80.91.229.12]) by mx1.freebsd.org (Postfix) with ESMTP id 986D08FC08 for ; Mon, 7 Mar 2011 11:29:37 +0000 (UTC) Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1PwYOX-0001EN-B9 for freebsd-scsi@freebsd.org; Mon, 07 Mar 2011 12:14:33 +0100 Received: from lara.cc.fer.hr ([161.53.72.113]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 07 Mar 2011 12:14:33 +0100 Received: from ivoras by lara.cc.fer.hr with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 07 Mar 2011 12:14:33 +0100 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-scsi@freebsd.org From: Ivan Voras Date: Mon, 07 Mar 2011 12:14:20 +0100 Lines: 25 Message-ID: References: <4D73962D.3080908@FreeBSD.org> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: lara.cc.fer.hr User-Agent: Mozilla/5.0 (X11; U; FreeBSD amd64; en-US; rv:1.9.2.12) Gecko/20101102 Thunderbird/3.1.6 In-Reply-To: <4D73962D.3080908@FreeBSD.org> X-Enigmail-Version: 1.1.2 Cc: freebsd-current@freebsd.org, freebsd-geom@freebsd.org Subject: Re: RFC: graid, first step done. X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 07 Mar 2011 11:29:37 -0000 On 06/03/2011 15:11, Alexander Motin wrote: > Hi. > > I would like to present for review and testing results of my and M. > Warner Losh work on new GEOM-based ataraid(4) replacement - graid. > implemented transformation modules to support such RAID levels: RAID0, > RAID1, RAID1E, RAID10 and CONCAT. For all these levels all such features > Once completed, this project should remove last major show stopper for > enabling CAM-based ATA subsystem by default in 9.x branch. I would > really like it to be done before 9.0 release. Excellent! Two questions: will there be more RAID levels supported, and would graid be usable for people without these controllers (I assume that it would, but it's better to check :) ). Some nits about the patches: * graid.8: Should "strip" be "stripe"? (search/replace) * graid.8: teh -> the (search replace) I'look at it more later! From owner-freebsd-scsi@FreeBSD.ORG Tue Mar 8 23:26:45 2011 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3E7071065675 for ; Tue, 8 Mar 2011 23:26:45 +0000 (UTC) (envelope-from josh@tcbug.org) Received: from out5.smtp.messagingengine.com (out5.smtp.messagingengine.com [66.111.4.29]) by mx1.freebsd.org (Postfix) with ESMTP id 09E668FC18 for ; Tue, 8 Mar 2011 23:26:44 +0000 (UTC) Received: from compute1.internal (compute1.nyi.mail.srv.osa [10.202.2.41]) by gateway1.messagingengine.com (Postfix) with ESMTP id C0FBA208E2; Tue, 8 Mar 2011 18:10:19 -0500 (EST) Received: from frontend1.messagingengine.com ([10.202.2.160]) by compute1.internal (MEProxy); Tue, 08 Mar 2011 18:10:19 -0500 DKIM-Signature: v=1; a=rsa-sha1; c=relaxed/relaxed; d=messagingengine.com; h=from:to:subject:date:cc:references:in-reply-to:mime-version:content-type:content-transfer-encoding:message-id; s=smtpout; bh=xkI173sGGt03cAd8DBHtOJkVZMI=; b=S+oPGnK4n1yZ3iUbcNja9PWRi3duk0dJUNdmxx2AbucQr7bWel5EY24fuIf7RzxBEkLMbtJ2hCRPAgfFqorAOp6hYzvZS+Cc4WmiTjyb6sN6wRivpi2R7cPUIuYIk3reqMN4HtctGx+LmMtx/MpLh7y+UpekMkyj9oJftzuzwOM= X-Sasl-enc: 5wmdvrTkoc+JxuauMsAzkxoLuNglq+ls3TmZahmm47+E 1299625819 Received: from tcbug.ixsystems.com (74-34-19-98.dr01.rsmt.mn.frontiernet.net [74.34.19.98]) by mail.messagingengine.com (Postfix) with ESMTPSA id 6EB8140933A; Tue, 8 Mar 2011 18:10:19 -0500 (EST) From: Josh Paetzel To: freebsd-scsi@freebsd.org Date: Tue, 8 Mar 2011 17:10:11 -0600 User-Agent: KMail/1.13.5 (FreeBSD/8.2-RELEASE; KDE/4.5.5; amd64; ; ) References: <20110218164209.GA77903@nargothrond.kdm.org> <20110219044237.GA89045@nargothrond.kdm.org> <4D5FC5E0.5030203@mullet.se> In-Reply-To: <4D5FC5E0.5030203@mullet.se> MIME-Version: 1.0 Content-Type: multipart/signed; boundary="nextPart1703595.eV2gyhUhhj"; protocol="application/pgp-signature"; micalg=pgp-sha1 Content-Transfer-Encoding: 7bit Message-Id: <201103081710.17409.josh@tcbug.org> Cc: Martin Nilsson Subject: Re: mps(4) driver (LSI 6Gb SAS) commited to stable/8 X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 08 Mar 2011 23:26:45 -0000 --nextPart1703595.eV2gyhUhhj Content-Type: Text/Plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable On Saturday, February 19, 2011 07:30:08 am Martin Nilsson wrote: > >> Thank you Ken for getting this done. Any plan to support the LSI 9240 > >> (skinny) cards? if its a matter of hardware to develop with, contact > >> me off list and I can provide you with a development system. > >=20 > > That'll be up to LSI most likely. > >=20 > > If it has the mpt2 interface, then I'm sure they'll support it through > > the mps driver. > >=20 > > If it has the MegaRAID interface, then the mfi driver would need to be > > modified. >=20 > It is the mfi interface and the driver at LSI:s web supports them, the > new driver should compile on both i386 and amd64 the previous one were > i386 only. >=20 > /Martin The mfi driver on the LSI web page supports the 9240, whereas the stock=20 =46reeBSD mfi driver does not. The LSI driver breaks mfiutil however, so m= ake=20 sure you have megacli or the like working. Xin Li is looking at merging support for the 9240 into FreeBSD mfi, but the= =20 two have diverged so it's not a slam-dunk. =2D-=20 Thanks, Josh Paetzel --nextPart1703595.eV2gyhUhhj Content-Type: application/pgp-signature; name=signature.asc Content-Description: This is a digitally signed message part. -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.16 (FreeBSD) iQEcBAABAgAGBQJNdrdZAAoJEKFq1/n1feG2iJcH/RqHESm6WwbAuAc4qlxOEj9X 9Jcyku6uaPg4a1BwSeCdhFFHQD0B2gJ6sOu4USCsJwcOq67VKAHxHIemPT+oTboM TZD9PfYSKo/MpuWwWCLR2wz6FVAa6xWdITycWUkvWbK0zzWV/QufhHwdBV8q5hCo cT10gOSZG0OovT/hpSm4yChbASdGoxMQ06xKdfDmW1kX+XkELyw809GDVBXLKNca /fJlK1J7KpsoWpz8xbJXN0c1F072R2nYxGd/SAR2X+OU8bUQ1Qh8JgWnHvz3q0Oc 7ye9cYD7WZIXsCCKp7PWDnSzdPO3UIH6T5ZZI4wLOCcN3JAscxa98CWxzja7fEo= =UayM -----END PGP SIGNATURE----- --nextPart1703595.eV2gyhUhhj-- From owner-freebsd-scsi@FreeBSD.ORG Wed Mar 9 00:49:35 2011 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3548E1065673 for ; Wed, 9 Mar 2011 00:49:35 +0000 (UTC) (envelope-from nschelly@dyn.com) Received: from dynmail-01-mht.dyndns.com (dynmail-01-mht.dyndns.com [216.146.45.13]) by mx1.freebsd.org (Postfix) with ESMTP id 9762E8FC17 for ; Wed, 9 Mar 2011 00:49:34 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by dynmail-01-mht.dyndns.com (Postfix) with ESMTP id 008231752014 for ; Tue, 8 Mar 2011 19:33:17 -0500 (EST) X-Virus-Scanned: amavisd-new at dynmail-01-mht.dyndns.com Received: from dynmail-01-mht.dyndns.com ([127.0.0.1]) by localhost (dynmail-01-mht.dyndns.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ybup2-eL0wlS for ; Tue, 8 Mar 2011 19:33:16 -0500 (EST) Received: from mail.corp.dyndns.com (mail.corp.dyndns.com [216.146.45.14]) by dynmail-01-mht.dyndns.com (Postfix) with ESMTP id 5CE981752012 for ; Tue, 8 Mar 2011 19:33:16 -0500 (EST) Date: Tue, 8 Mar 2011 19:33:16 -0500 (EST) From: Neil Schelly To: freebsd-scsi@freebsd.org Message-ID: <4139036.97089.1299630796345.JavaMail.root@mail.corp> In-Reply-To: <28269840.97080.1299630735538.JavaMail.root@mail.corp> MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_97088_3026873.1299630796344" X-Originating-IP: [172.16.252.166] X-Mailer: Zimbra 6.0.7_GA_2473.UBUNTU8 (ZimbraWebClient - SAF3 (Linux)/6.0.7_GA_2473.UBUNTU8) X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: Re: Serious Dell Sadness - H200, H700, and H800 X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 09 Mar 2011 00:49:35 -0000 ------=_Part_97088_3026873.1299630796344 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit I'm sorry, I left out the attachment in my earlier post. Please see it here. -- Neil Schelly Director of Uptime Dynamic Network Services, Inc. W: 603-296-1581 M: 508-410-4776 http://www.dyndns.com ----- Original Message ----- > We've got some more information about the mpt testing we've been doing > here. The setup we're testing is Dell PowerEdge r610 servers with PERC > H800 SAS/RAID cards connected to MD1200 shelves full of 12 SAS drives. > We've recreated the same problem on other configurations, including > combinations of r510s, MD1220 shelves, PERC H700 cards, etc. We've > also eliminated any particular piece of hardware as faulty by running > these on identical hardware configurations in mirrored setups on > different physical pieces of hardware. We've experienced these issues > in FreeBSD 7.3, 8.1, and 8.2. We've experienced this issue with either > RAID10 logical drive configurations formatted with UFS or 6-disk JBOD > configurations setup in a ZFS raidz volume. We've triggered the > problem with both bonnie++ and iozone. All machines are runnning the > latest firmware on the H700 and H800 cards. > > The easiest method to reproduce this problem is with a ZFS > configuration and using `iozone -a`. We have a 6-disk raidz partition > with a ZFS filesystem on it. We just run `iozone -a` from within that > filesystem, and I'd say 3 out of 4 times, it will eventually pause. > After 45-50 seconds of pausing, you'll start seeing the console and > /var/log/messages output that looks something like: > mfi0: COMMAND 0xffffff8000db5fe0 TIMEOUT AFTER 105 SECONDS > > If we let it go for a few days, it may actually "finish" and recover, > but it's essentially just stuck and not recovering. The system is > responsive and fully operational except the dead controller at this > point. We cannot kill the iozone process that is hung on these IO > operations, even with `kill -9`. Like others have reported, we can run > any of the mfiutil commands and the controller immediately begins to > respond normally again. Usually, the iozone test will complete, but > sometimes it will even get stuck again on the same run. > > We compiled mfiutil with debugging symbols so we could run it with gdb > and see exactly what was causing the controller to become responsive > again. It's the ioctl() call that does it. For example: > > `mfiutil show volumes` eventually gets to something like: > mfi_dcmd_command (fd=7, opcode=50397184, buf=0x7fffffffe4a0, > bufsize=1032, mbox=0x0, mboxlen=0, statusp=0x0) > at /usr/src/usr.sbin/mfiutil/mfi_cmd.c:257 > * fd=7 is /dev/mfi0, where the command will be sent with an ioctl > command > * opcode=50397184 is the MFI_DCMD_LD_GET_LIST command > > `mfiutil show battery` eventually gets to something like: > mfi_dcmd_command (fd=7, opcode=84017152, buf=0x7fffffffea20, > bufsize=48, mbox=0x0, mboxlen=0, statusp=0x7fffffffe9cf "") > at /usr/src/usr.sbin/mfiutil/mfi_cmd.c:257 > * fd=7 is /dev/mfi0, where the command will be sent with an ioctl > command > * opcode=84017152 is the MFI_DCMD_BBU_GET_CAPACITY_INFO command > > I wrote a small self-contained C program that can easily be modified > to run any ioctl command you'd like and send it to /dev/mfi0 > (attached). Use it if you'd like at your own risk, but it's > essentially just running an arbitrary command with ioctl, putting > nothing into the memory range normally passed by the *buf pointer. I > did try sending random opcodes, and it didn't work, so it does have to > be an opcode that the firmware will recognize at least, but it doesn't > seem to matter which one. > > I'm not sure where else we should be looking for a fix. We can > reliably reproduce the problem, analyze the system during the issue, > and recover the system to a normal state. If there's anyone who can > help us troubleshoot this with any information we can gather or even a > local login remotely accessible, we're open to ideas. > > -- > Neil Schelly > Director of Uptime > Dynamic Network Services, Inc. > W: 603-296-1581 > M: 508-410-4776 > http://www.dyndns.com ------=_Part_97088_3026873.1299630796344-- From owner-freebsd-scsi@FreeBSD.ORG Wed Mar 9 00:49:35 2011 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 4E1911065675 for ; Wed, 9 Mar 2011 00:49:35 +0000 (UTC) (envelope-from nschelly@dyn.com) Received: from dynmail-01-mht.dyndns.com (dynmail-01-mht.dyndns.com [216.146.45.13]) by mx1.freebsd.org (Postfix) with ESMTP id 976908FC18 for ; Wed, 9 Mar 2011 00:49:34 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by dynmail-01-mht.dyndns.com (Postfix) with ESMTP id 053191752013 for ; Tue, 8 Mar 2011 19:32:16 -0500 (EST) X-Virus-Scanned: amavisd-new at dynmail-01-mht.dyndns.com Received: from dynmail-01-mht.dyndns.com ([127.0.0.1]) by localhost (dynmail-01-mht.dyndns.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 8yh+ogdKoBqP for ; Tue, 8 Mar 2011 19:32:15 -0500 (EST) Received: from mail.corp.dyndns.com (mail.corp.dyndns.com [216.146.45.14]) by dynmail-01-mht.dyndns.com (Postfix) with ESMTP id 954C31752012 for ; Tue, 8 Mar 2011 19:32:15 -0500 (EST) Date: Tue, 8 Mar 2011 19:32:15 -0500 (EST) From: Neil Schelly To: freebsd-scsi@freebsd.org Message-ID: <28269840.97080.1299630735538.JavaMail.root@mail.corp> In-Reply-To: <4187606.97023.1299629795456.JavaMail.root@mail.corp> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Originating-IP: [172.16.252.166] X-Mailer: Zimbra 6.0.7_GA_2473.UBUNTU8 (ZimbraWebClient - SAF3 (Linux)/6.0.7_GA_2473.UBUNTU8) Subject: Re: Serious Dell Sadness - H200, H700, and H800 X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 09 Mar 2011 00:49:35 -0000 We've got some more information about the mpt testing we've been doing here. The setup we're testing is Dell PowerEdge r610 servers with PERC H800 SAS/RAID cards connected to MD1200 shelves full of 12 SAS drives. We've recreated the same problem on other configurations, including combinations of r510s, MD1220 shelves, PERC H700 cards, etc. We've also eliminated any particular piece of hardware as faulty by running these on identical hardware configurations in mirrored setups on different physical pieces of hardware. We've experienced these issues in FreeBSD 7.3, 8.1, and 8.2. We've experienced this issue with either RAID10 logical drive configurations formatted with UFS or 6-disk JBOD configurations setup in a ZFS raidz volume. We've triggered the problem with both bonnie++ and iozone. All machines are runnning the latest firmware on the H700 and H800 cards. The easiest method to reproduce this problem is with a ZFS configuration and using `iozone -a`. We have a 6-disk raidz partition with a ZFS filesystem on it. We just run `iozone -a` from within that filesystem, and I'd say 3 out of 4 times, it will eventually pause. After 45-50 seconds of pausing, you'll start seeing the console and /var/log/messages output that looks something like: mfi0: COMMAND 0xffffff8000db5fe0 TIMEOUT AFTER 105 SECONDS If we let it go for a few days, it may actually "finish" and recover, but it's essentially just stuck and not recovering. The system is responsive and fully operational except the dead controller at this point. We cannot kill the iozone process that is hung on these IO operations, even with `kill -9`. Like others have reported, we can run any of the mfiutil commands and the controller immediately begins to respond normally again. Usually, the iozone test will complete, but sometimes it will even get stuck again on the same run. We compiled mfiutil with debugging symbols so we could run it with gdb and see exactly what was causing the controller to become responsive again. It's the ioctl() call that does it. For example: `mfiutil show volumes` eventually gets to something like: mfi_dcmd_command (fd=7, opcode=50397184, buf=0x7fffffffe4a0, bufsize=1032, mbox=0x0, mboxlen=0, statusp=0x0) at /usr/src/usr.sbin/mfiutil/mfi_cmd.c:257 * fd=7 is /dev/mfi0, where the command will be sent with an ioctl command * opcode=50397184 is the MFI_DCMD_LD_GET_LIST command `mfiutil show battery` eventually gets to something like: mfi_dcmd_command (fd=7, opcode=84017152, buf=0x7fffffffea20, bufsize=48, mbox=0x0, mboxlen=0, statusp=0x7fffffffe9cf "") at /usr/src/usr.sbin/mfiutil/mfi_cmd.c:257 * fd=7 is /dev/mfi0, where the command will be sent with an ioctl command * opcode=84017152 is the MFI_DCMD_BBU_GET_CAPACITY_INFO command I wrote a small self-contained C program that can easily be modified to run any ioctl command you'd like and send it to /dev/mfi0 (attached). Use it if you'd like at your own risk, but it's essentially just running an arbitrary command with ioctl, putting nothing into the memory range normally passed by the *buf pointer. I did try sending random opcodes, and it didn't work, so it does have to be an opcode that the firmware will recognize at least, but it doesn't seem to matter which one. I'm not sure where else we should be looking for a fix. We can reliably reproduce the problem, analyze the system during the issue, and recover the system to a normal state. If there's anyone who can help us troubleshoot this with any information we can gather or even a local login remotely accessible, we're open to ideas. -- Neil Schelly Director of Uptime Dynamic Network Services, Inc. W: 603-296-1581 M: 508-410-4776 http://www.dyndns.com From owner-freebsd-scsi@FreeBSD.ORG Wed Mar 9 22:49:49 2011 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B98B9106566B for ; Wed, 9 Mar 2011 22:49:49 +0000 (UTC) (envelope-from ambrisko@ambrisko.com) Received: from mail.ambrisko.com (mail.ambrisko.com [64.174.51.43]) by mx1.freebsd.org (Postfix) with ESMTP id 7632F8FC18 for ; Wed, 9 Mar 2011 22:49:49 +0000 (UTC) X-Ambrisko-Me: Yes Received: from server2.ambrisko.com (HELO internal.ambrisko.com) ([192.168.1.2]) by ironport.ambrisko.com with ESMTP; 09 Mar 2011 14:20:57 -0800 Received: from ambrisko.com (localhost [127.0.0.1]) by internal.ambrisko.com (8.14.4/8.14.4) with ESMTP id p29MKvDq010175; Wed, 9 Mar 2011 14:20:57 -0800 (PST) (envelope-from ambrisko@ambrisko.com) Received: (from ambrisko@localhost) by ambrisko.com (8.14.4/8.14.4/Submit) id p29MKuip010174; Wed, 9 Mar 2011 14:20:56 -0800 (PST) (envelope-from ambrisko) From: Doug Ambrisko Message-Id: <201103092220.p29MKuip010174@ambrisko.com> In-Reply-To: <201103081710.17409.josh@tcbug.org> To: Josh Paetzel Date: Wed, 9 Mar 2011 14:20:56 -0800 (PST) X-Mailer: ELM [version 2.4ME+ PL124d (25)] MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset="US-ASCII" Cc: Martin Nilsson , freebsd-scsi@freebsd.org Subject: Re: mps(4) driver (LSI 6Gb SAS) commited to stable/8 X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 09 Mar 2011 22:49:49 -0000 Josh Paetzel writes: | On Saturday, February 19, 2011 07:30:08 am Martin Nilsson wrote: | > >> Thank you Ken for getting this done. Any plan to support the LSI 9240 | > >> (skinny) cards? if its a matter of hardware to develop with, contact | > >> me off list and I can provide you with a development system. | > > | > > That'll be up to LSI most likely. | > > | > > If it has the mpt2 interface, then I'm sure they'll support it through | > > the mps driver. | > > | > > If it has the MegaRAID interface, then the mfi driver would need to be | > > modified. | > | > It is the mfi interface and the driver at LSI:s web supports them, the | > new driver should compile on both i386 and amd64 the previous one were | > i386 only. | > | > /Martin | | The mfi driver on the LSI web page supports the 9240, whereas the stock | FreeBSD mfi driver does not. The LSI driver breaks mfiutil however, so make | sure you have megacli or the like working. | | Xin Li is looking at merging support for the 9240 into FreeBSD mfi, but the | two have diverged so it's not a slam-dunk. You should have Xin have Scott and myself review changes. However, LSI sent me changes (a relative long time ago) to integrate. I need to go through them, test them and have them reviewed. The version on their web site had some concerns and I don't know if this card is one of them. I should be getting back to doing some real FreeBSD work at work but been busy with other work. There are a few things I need to get back into FreeBSD. Hopefully LSI didn't give up on me. Atleast iXsystems was able to off-load some work for me via graid etc. I've been helping to get them to converge versus diverge. However, what is put on their web site can be an early version of work. It's great that LSI is supporting FreeBSD. So far any LSI MegaRAID SAS card that I've tried has worked. I don't have all of them but might be able to get some now from LSI. It might be good in the future to convert mfiutil to use the LSI ioctl. It should be fairly trivial to do our in-house tool uses the LSI ioctl. Users need to understand that their SAS cards are different then their RAID cards. That can usually be figured out looking at Linux references. Thanks, Doug A. From owner-freebsd-scsi@FreeBSD.ORG Thu Mar 10 05:47:11 2011 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C6F6E106564A for ; Thu, 10 Mar 2011 05:47:11 +0000 (UTC) (envelope-from martin@mullet.se) Received: from smtp.mullet.se (smtp.mullet.se [94.247.168.122]) by mx1.freebsd.org (Postfix) with ESMTP id 7FD468FC14 for ; Thu, 10 Mar 2011 05:47:11 +0000 (UTC) Received: from mbp.gneto.com (ua-83-227-181-30.cust.bredbandsbolaget.se [83.227.181.30]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by smtp.mullet.se (Postfix) with ESMTPSA id 48A368442F; Thu, 10 Mar 2011 06:47:08 +0100 (CET) Message-ID: <4D7865D4.80304@mullet.se> Date: Thu, 10 Mar 2011 06:47:00 +0100 From: Martin Nilsson User-Agent: Thunderbird 2.0.0.12 (Macintosh/20080213) MIME-Version: 1.0 To: Josh Paetzel References: <20110218164209.GA77903@nargothrond.kdm.org> <20110219044237.GA89045@nargothrond.kdm.org> <4D5FC5E0.5030203@mullet.se> <201103081710.17409.josh@tcbug.org> In-Reply-To: <201103081710.17409.josh@tcbug.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 8bit Cc: freebsd-scsi@freebsd.org Subject: Re: mps(4) driver (LSI 6Gb SAS) commited to stable/8 X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 10 Mar 2011 05:47:11 -0000 Josh Paetzel skrev: > The mfi driver on the LSI web page supports the 9240, whereas the stock > FreeBSD mfi driver does not. The LSI driver breaks mfiutil however, so make > sure you have megacli or the like working. > > Xin Li is looking at merging support for the 9240 into FreeBSD mfi, but the > two have diverged so it's not a slam-dunk. > > Just to let you know there is now a new mfi driver at the LSI site supporting the new dual core 9265 adapters that were released on CeBIT. However the new boards does not seem to be avaialble yet. If you need help with testing, please let me know, I have access to most of the lsi adapters and lots of Supermicro hardware. -- Martin Nilsson, CEO, Mullet Scandinavia AB, Malmö, SWEDEN E-mail: martin@mullet.se, Phone: +46-(0)708-59 99 91, Web: www.mullet.se Our business is well engineered servers optimised for FreeBSD & Linux From owner-freebsd-scsi@FreeBSD.ORG Thu Mar 10 20:20:13 2011 Return-Path: Delivered-To: freebsd-scsi@hub.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 74C261065675 for ; Thu, 10 Mar 2011 20:20:13 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id 4AE088FC1F for ; Thu, 10 Mar 2011 20:20:13 +0000 (UTC) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.4/8.14.4) with ESMTP id p2AKKDqp045680 for ; Thu, 10 Mar 2011 20:20:13 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.4/8.14.4/Submit) id p2AKKC4e045679; Thu, 10 Mar 2011 20:20:13 GMT (envelope-from gnats) Date: Thu, 10 Mar 2011 20:20:13 GMT Message-Id: <201103102020.p2AKKC4e045679@freefall.freebsd.org> To: freebsd-scsi@FreeBSD.org From: =?ISO-8859-1?Q?Lo=EFc_Pefferkorn?= Cc: Subject: Re: kern/152250: [ciss] [patch] Kernel panic when hw.ciss.expose_hidden_physical is set X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: =?ISO-8859-1?Q?Lo=EFc_Pefferkorn?= List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 10 Mar 2011 20:20:13 -0000 The following reply was made to PR kern/152250; it has been noted by GNATS. From: =?ISO-8859-1?Q?Lo=EFc_Pefferkorn?= To: bug-followup@FreeBSD.org Cc: Subject: Re: kern/152250: [ciss] [patch] Kernel panic when hw.ciss.expose_hidden_physical is set Date: Thu, 10 Mar 2011 21:04:24 +0100 Hello, Are some informations missing or not clear enough ? If my report is not usable please tell me how to improve it :) Regards, Loïc