From owner-freebsd-scsi@FreeBSD.ORG Sun Jul 14 17:26:22 2013 Return-Path: Delivered-To: FreeBSD-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by hub.freebsd.org (Postfix) with ESMTP id 2EA46B9B for ; Sun, 14 Jul 2013 17:26:22 +0000 (UTC) (envelope-from sean_bruno@yahoo.com) Received: from nm4-vm1.bullet.mail.ne1.yahoo.com (nm4-vm1.bullet.mail.ne1.yahoo.com [98.138.91.44]) by mx1.freebsd.org (Postfix) with ESMTP id D5CADA6D for ; Sun, 14 Jul 2013 17:26:18 +0000 (UTC) Received: from [98.138.226.179] by nm4.bullet.mail.ne1.yahoo.com with NNFMP; 14 Jul 2013 17:23:42 -0000 Received: from [98.138.84.43] by tm14.bullet.mail.ne1.yahoo.com with NNFMP; 14 Jul 2013 17:23:42 -0000 Received: from [127.0.0.1] by smtp111.mail.ne1.yahoo.com with NNFMP; 14 Jul 2013 17:23:42 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1373822622; bh=oa8r+GQavFFd4vnmR8pQxn9rlD7h4QLli2J8S8SJy68=; h=X-Yahoo-Newman-Id:X-Yahoo-Newman-Property:X-YMail-OSG:X-Yahoo-SMTP:X-Rocket-Received:Subject:From:Reply-To:To:Content-Type:Date:Message-ID:Mime-Version:X-Mailer; b=ddi43Ly2nJ+PhGCr2+qe8fO0TKXMbcpYlCWRUbFcO+qQJom1ypEVpgswE6aFLv32yJHU3LAl7u/EpbnCP2RAraGzcA8pT623LX6EadNhdvvxHDHMdO3tDv2VZDuNiz1xIyuU5paI0u/V1+SiS6cN35VofAe8YPnHFcMSwVDe+O8= X-Yahoo-Newman-Id: 794415.41917.bm@smtp111.mail.ne1.yahoo.com X-Yahoo-Newman-Property: ymail-3 X-YMail-OSG: Z5lcXJQVM1lP3o3JnaSOwYSdoDza5Av0jwwZwHiOfP_U3gi nHET9BOS_DrgOrAV3YTZjCcSnhh1vXblBJt4EJmMBIOZZulhu.ja8YkyPBA8 6r6SlYSWFFv.B2kZuSR.hWxmo25AGRko11YKcz9ib595nErK.mUj5WQG7Z7W jGXacd7mIU3IUdt04OkQsecaaAsWLSxakxYKusHCpZ7UVPfonw4CZOsYJBE9 uqTnffRyHuQCwEYEgRscxYuq8i7hgYY98jA7JcYPLKLq0MSADX0lpCvOhvYl nxE8V2O4b9e0XIay5IUOkFAUsVYjepnTC3AU_v98jeNHzr_jCBmn4hq.Q9Pg Lwa7qq5CWCvbYuQ9ps5QPzZjBz5pSaf2i18.SSz7cpWkpCc8XQADiSddFlT6 bZ04Mbf8bB25E9dqDBVgy4uhhqjUaNx1tLiPy.TdDPwohmt.Bi0G07ADCDvF TwzmEW5gW_SmjNup2UcwhsTQPKb7rIZo2r9CP7iIa1aGQSTwIL5xwss.Io66 HrtaMClNb8_POGNgvv14EXg9Eyirx X-Yahoo-SMTP: u5BKR6OswBC_iZJVfGRoMkTIpc8pEA4- X-Rocket-Received: from [192.168.1.210] (sean_bruno@71.202.40.63 with ) by smtp111.mail.ne1.yahoo.com with SMTP; 14 Jul 2013 10:23:42 -0700 PDT Subject: Dell H310, JBOD mode "hard error" From: Sean Bruno To: "FreeBSD-scsi@freebsd.org" Content-Type: multipart/signed; micalg="pgp-sha1"; protocol="application/pgp-signature"; boundary="=-oikxDMNvAY2qtu2TBW6C" Date: Sun, 14 Jul 2013 10:23:41 -0700 Message-ID: <1373822621.1431.5.camel@localhost> Mime-Version: 1.0 X-Mailer: Evolution 2.32.1 FreeBSD GNOME Team Port X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list Reply-To: sbruno@freebsd.org List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 14 Jul 2013 17:26:22 -0000 --=-oikxDMNvAY2qtu2TBW6C Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Not sure what to make of this. I've tested a lot of svn revisions of the thunderbolt code, but nothing looks obvious. =20 When I use a single drive in "SYSPD" mode on a Dell H310 (falcon or skinny drake) I get a /dev/mfisyspd0 device. The JBOD mode *seems* to work just fine as long as I don't do multiple things at once to it, e.g. single user fsck works, but multiuser things die. I get a failure case that emits errors such as: g_vfs_done():error 27 in callback mfisyspd0p2[READ(offset=3D7176192, length=3D425984)]mfisyspd0: hard error error =3D 5 cmd=3Dread 15360-16383 error 27 in callback g_vfs_done():mfisyspd0: hard error mfisyspd0p2[READ(offset=3D7602176, length=3D524288)]cmd=3Dread error =3D 5 16384-17407 error 27 in callback g_vfs_done():mfisyspd0: hard error mfisyspd0p2[READ(offset=3D8126464, length=3D524288)]cmd=3Dread error =3D 5 14560-15359 error 27 in callback g_vfs_done():mfisyspd0: hard error mfisyspd0p2[READ(offset=3D7192576, length=3D409600)]cmd=3Dread error =3D 5 15360-16383 Sean --=-oikxDMNvAY2qtu2TBW6C Content-Type: application/pgp-signature; name="signature.asc" Content-Description: This is a digitally signed message part -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.13 (FreeBSD) iQEcBAABAgAGBQJR4t6dAAoJEBkJRdwI6BaHmzMH/005dnK0dRDuiNJbb5GIxHuS qc1FDaJHkakGpC2GniVRMpKI6ixcpou0cFRLMVBIKShDr0u5RSaHFqd9x4edhhqx GvmsExb9t2+cE6OJWbmXWORVqpVDRmKC/6y9sfVx5hhj24pQQJjV9NgXZbShxlxk a0NWSLxnb0+/bqpV8X55nV/CgRlm3tOJzI8B5K4ih+/1XrUOQIdmP/kH0gTNsuTo PmWFOEKOySY6Xyp6IxAKu629Ksd6T6fy2FNDBlwEqFXk4PmW4Yx+Cj1vuoza/tad dboIU71vCdjtWX+DIngEFwVIoaGnPlLpvEGcWzablz8Gkda4Jof60HXYNA3X22s= =IXO1 -----END PGP SIGNATURE----- --=-oikxDMNvAY2qtu2TBW6C-- From owner-freebsd-scsi@FreeBSD.ORG Sun Jul 14 17:30:01 2013 Return-Path: Delivered-To: freebsd-scsi@smarthost.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by hub.freebsd.org (Postfix) with ESMTP id A79B7C17 for ; Sun, 14 Jul 2013 17:30:01 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:1900:2254:206c::16:87]) by mx1.freebsd.org (Postfix) with ESMTP id 7FFCCA99 for ; Sun, 14 Jul 2013 17:30:01 +0000 (UTC) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.7/8.14.7) with ESMTP id r6EHU1I7011290 for ; Sun, 14 Jul 2013 17:30:01 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.7/8.14.7/Submit) id r6EHU1Ls011288; Sun, 14 Jul 2013 17:30:01 GMT (envelope-from gnats) Date: Sun, 14 Jul 2013 17:30:01 GMT Message-Id: <201307141730.r6EHU1Ls011288@freefall.freebsd.org> To: freebsd-scsi@FreeBSD.org Cc: From: Sean Bruno Subject: Re: kern/179932: [ciss] ciss i/o stall problem with HP Bl Gen8 (and HP Bl Gen7 + Storage Blade) X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list Reply-To: Sean Bruno List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 14 Jul 2013 17:30:01 -0000 The following reply was made to PR kern/179932; it has been noted by GNATS. From: Sean Bruno To: bug-followup@FreeBSD.org, philipp.maechler@hostpoint.ch Cc: Subject: Re: kern/179932: [ciss] ciss i/o stall problem with HP Bl Gen8 (and HP Bl Gen7 + Storage Blade) Date: Sun, 14 Jul 2013 10:17:29 -0700 I updated stable/9 with most of the changes made to head recently. See if that makes any difference. Sean From owner-freebsd-scsi@FreeBSD.ORG Sun Jul 14 18:00:01 2013 Return-Path: Delivered-To: freebsd-scsi@smarthost.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) by hub.freebsd.org (Postfix) with ESMTP id A39B0F60 for ; Sun, 14 Jul 2013 18:00:01 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:1900:2254:206c::16:87]) by mx1.freebsd.org (Postfix) with ESMTP id 9525CBAA for ; Sun, 14 Jul 2013 18:00:01 +0000 (UTC) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.7/8.14.7) with ESMTP id r6EI01Mm016913 for ; Sun, 14 Jul 2013 18:00:01 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.7/8.14.7/Submit) id r6EI01hp016912; Sun, 14 Jul 2013 18:00:01 GMT (envelope-from gnats) Date: Sun, 14 Jul 2013 18:00:01 GMT Message-Id: <201307141800.r6EI01hp016912@freefall.freebsd.org> To: freebsd-scsi@FreeBSD.org Cc: From: Sean Bruno Subject: Re: kern/179932: [ciss] ciss i/o stall problem with HP Bl Gen8 (and HP Bl Gen7 + Storage Blade) X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list Reply-To: Sean Bruno List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 14 Jul 2013 18:00:01 -0000 The following reply was made to PR kern/179932; it has been noted by GNATS. From: Sean Bruno To: bug-followup@FreeBSD.org, philipp.maechler@hostpoint.ch Cc: Subject: Re: kern/179932: [ciss] ciss i/o stall problem with HP Bl Gen8 (and HP Bl Gen7 + Storage Blade) Date: Sun, 14 Jul 2013 10:54:50 -0700 --=-8GMlK85srlOGW3NNuk3G Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable I've updated the DDB hook to display as many adapters as possible. Can you add this to your tests please? http://people.freebsd.org/~sbruno/ciss_ddb_update.txt Sean --=-8GMlK85srlOGW3NNuk3G Content-Type: application/pgp-signature; name="signature.asc" Content-Description: This is a digitally signed message part -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.13 (FreeBSD) iQEcBAABAgAGBQJR4uXqAAoJEBkJRdwI6BaHNcYH/2FAkpSVW1SbIDcyKgkD9nGd KMzk/LvF7ysJNuNdsgmcBc0EGcLe7fcZOSmnMyGNBCPe95rC4cRhMruJDU9qi6MV pWJav9CEooUBZ+i9foXMn4E/lQ8xYeG0jXs6M0kG+27As85VIjFlbACvGKAw//FU JUh5bFiJw6nFs+Moljdl3nEyQTqF+DS46k18A1vDQC/MitU7qkYhTdn8tMAdWJVE fwCKF7VdQFCP5Vaq2b5XOupjJjluhAXWCmhEl5ZiUaSRfBAet0Cb5iaTqBs3w3rk 7YfW3OVkq3cEAOIXp/TmDjc1sa3xN99wEyzenpogCmGb/xSPuGWm8fp0tTMAR4U= =TvrB -----END PGP SIGNATURE----- --=-8GMlK85srlOGW3NNuk3G-- From owner-freebsd-scsi@FreeBSD.ORG Mon Jul 15 11:06:50 2013 Return-Path: Delivered-To: freebsd-scsi@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) by hub.freebsd.org (Postfix) with ESMTP id 544A5F7F for ; Mon, 15 Jul 2013 11:06:50 +0000 (UTC) (envelope-from owner-bugmaster@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:1900:2254:206c::16:87]) by mx1.freebsd.org (Postfix) with ESMTP id 47919FD0 for ; Mon, 15 Jul 2013 11:06:50 +0000 (UTC) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.7/8.14.7) with ESMTP id r6FB6och084573 for ; Mon, 15 Jul 2013 11:06:50 GMT (envelope-from owner-bugmaster@FreeBSD.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.7/8.14.7/Submit) id r6FB6nXG084571 for freebsd-scsi@FreeBSD.org; Mon, 15 Jul 2013 11:06:49 GMT (envelope-from owner-bugmaster@FreeBSD.org) Date: Mon, 15 Jul 2013 11:06:49 GMT Message-Id: <201307151106.r6FB6nXG084571@freefall.freebsd.org> X-Authentication-Warning: freefall.freebsd.org: gnats set sender to owner-bugmaster@FreeBSD.org using -f From: FreeBSD bugmaster To: freebsd-scsi@FreeBSD.org Subject: Current problem reports assigned to freebsd-scsi@FreeBSD.org X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 15 Jul 2013 11:06:50 -0000 Note: to view an individual PR, use: http://www.freebsd.org/cgi/query-pr.cgi?pr=(number). The following is a listing of current problems submitted by FreeBSD users. These represent problem reports covering all versions including experimental development code and obsolete releases. S Tracker Resp. Description -------------------------------------------------------------------------------- o kern/179932 scsi [ciss] ciss i/o stall problem with HP Bl Gen8 (and HP o kern/178795 scsi [mps] MSI for mps driver doesn't work under vmware o kern/165982 scsi [mpt] mpt instability, drive resets, and losses on Fre o kern/165740 scsi [cam] SCSI code must drain callbacks before free f kern/162256 scsi [mpt] QUEUE FULL EVENT and 'mpt_cam_event: 0x0' o docs/151336 scsi Missing documentation of scsi_ and ata_ functions in c o kern/148083 scsi [aac] Strange device reporting o kern/144648 scsi [aac] Strange values of speed and bus width in dmesg o kern/142351 scsi [mpt] LSILogic driver performance problems o kern/134488 scsi [mpt] MPT SCSI driver probes max. 8 LUNs per device o kern/130621 scsi [mpt] tranfer rate is inscrutable slow when use lsi213 f kern/129602 scsi [ahd] ahd(4) gets confused and wedges SCSI bus f kern/123674 scsi [ahc] ahc driver dumping o sparc/121676 scsi [iscsi] iscontrol do not connect iscsi-target on sparc 14 problems total. From owner-freebsd-scsi@FreeBSD.ORG Mon Jul 15 14:00:02 2013 Return-Path: Delivered-To: freebsd-scsi@smarthost.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by hub.freebsd.org (Postfix) with ESMTP id 9CB271E9 for ; Mon, 15 Jul 2013 14:00:02 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:1900:2254:206c::16:87]) by mx1.freebsd.org (Postfix) with ESMTP id 8E752C44 for ; Mon, 15 Jul 2013 14:00:02 +0000 (UTC) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.7/8.14.7) with ESMTP id r6FE01Kk024340 for ; Mon, 15 Jul 2013 14:00:01 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.7/8.14.7/Submit) id r6FE01qq024339; Mon, 15 Jul 2013 14:00:01 GMT (envelope-from gnats) Date: Mon, 15 Jul 2013 14:00:01 GMT Message-Id: <201307151400.r6FE01qq024339@freefall.freebsd.org> To: freebsd-scsi@FreeBSD.org Cc: From: Markus Gebert Subject: Re: kern/179932: [ciss] ciss i/o stall problem with HP Bl Gen8 (and HP Bl Gen7 + Storage Blade) X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list Reply-To: Markus Gebert List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 15 Jul 2013 14:00:02 -0000 The following reply was made to PR kern/179932; it has been noted by GNATS. From: Markus Gebert To: bug-followup@FreeBSD.org, =?iso-8859-1?Q?Philipp_M=E4chler?= , "sean_bruno@yahoo.com" Cc: Subject: Re: kern/179932: [ciss] ciss i/o stall problem with HP Bl Gen8 (and HP Bl Gen7 + Storage Blade) Date: Mon, 15 Jul 2013 15:51:06 +0200 I checked your MFC and all the fixes are already included in my patch = for 9.1 that we're currently testing with. With that patch, all G8 = blades are still running stable and have not shown any more IO stalls. = The G7 ones still reliably crash with our test load. So I think we can = state that we have already tested wether the changes from head help or = not. Is there another reason you want us to test with a stable/9 kernel, = or should we stick with the patched 9.1 for now? In any case I'll apply your DDB hook patch to our patched 9.1 kernel, so = we'll get out more debug information when a G7 blade stalls next time. Markus From owner-freebsd-scsi@FreeBSD.ORG Mon Jul 15 14:50:10 2013 Return-Path: Delivered-To: freebsd-scsi@smarthost.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) by hub.freebsd.org (Postfix) with ESMTP id 6983B100 for ; Mon, 15 Jul 2013 14:50:10 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:1900:2254:206c::16:87]) by mx1.freebsd.org (Postfix) with ESMTP id 413CE96 for ; Mon, 15 Jul 2013 14:50:10 +0000 (UTC) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.7/8.14.7) with ESMTP id r6FEo90F036941 for ; Mon, 15 Jul 2013 14:50:09 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.7/8.14.7/Submit) id r6FEo9WW036940; Mon, 15 Jul 2013 14:50:09 GMT (envelope-from gnats) Date: Mon, 15 Jul 2013 14:50:09 GMT Message-Id: <201307151450.r6FEo9WW036940@freefall.freebsd.org> To: freebsd-scsi@FreeBSD.org Cc: From: Markus Gebert Subject: Re: kern/179932: [ciss] ciss i/o stall problem with HP Bl Gen8 (and HP Bl Gen7 + Storage Blade) X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list Reply-To: Markus Gebert List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 15 Jul 2013 14:50:10 -0000 The following reply was made to PR kern/179932; it has been noted by GNATS. From: Markus Gebert To: bug-followup@FreeBSD.org, =?iso-8859-1?Q?Philipp_M=E4chler?= , "sean_bruno@yahoo.com" Cc: Subject: Re: kern/179932: [ciss] ciss i/o stall problem with HP Bl Gen8 (and HP Bl Gen7 + Storage Blade) Date: Mon, 15 Jul 2013 16:42:29 +0200 The patch applied but it broke the the kernel build. I've corrected it = as I think it was intended. The version I used is here: https://dl.dropboxusercontent.com/u/10669369/ciss_ddb_update_v2.txt Markus From owner-freebsd-scsi@FreeBSD.ORG Fri Jul 19 09:00:01 2013 Return-Path: Delivered-To: freebsd-scsi@smarthost.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by hub.freebsd.org (Postfix) with ESMTP id 9B612AA3 for ; Fri, 19 Jul 2013 09:00:01 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:1900:2254:206c::16:87]) by mx1.freebsd.org (Postfix) with ESMTP id 8E00DA9C for ; Fri, 19 Jul 2013 09:00:01 +0000 (UTC) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.7/8.14.7) with ESMTP id r6J90167026460 for ; Fri, 19 Jul 2013 09:00:01 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.7/8.14.7/Submit) id r6J901NM026459; Fri, 19 Jul 2013 09:00:01 GMT (envelope-from gnats) Date: Fri, 19 Jul 2013 09:00:01 GMT Message-Id: <201307190900.r6J901NM026459@freefall.freebsd.org> To: freebsd-scsi@FreeBSD.org Cc: From: Markus Gebert Subject: Re: kern/179932: [ciss] ciss i/o stall problem with HP Bl Gen8 (and HP Bl Gen7 + Storage Blade) X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list Reply-To: Markus Gebert List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 19 Jul 2013 09:00:01 -0000 The following reply was made to PR kern/179932; it has been noted by GNATS. From: Markus Gebert To: bug-followup@FreeBSD.org, =?windows-1252?Q?Philipp_M=E4chler?= , "sean_bruno@yahoo.com" Cc: Subject: Re: kern/179932: [ciss] ciss i/o stall problem with HP Bl Gen8 (and HP Bl Gen7 + Storage Blade) Date: Fri, 19 Jul 2013 10:52:27 +0200 We had another G7 IO stall and were able to get ciss debug outpout for = both controllers. ciss0 is the internal one, ciss1 is the one built into = the storage blade. = https://dl.dropboxusercontent.com/u/10669369/fbsd%20ciss%20iostall%20debug= /20130717%20-%20G7%20crash%20%2815s%29/20130717%20-%20alltrace.txt = https://dl.dropboxusercontent.com/u/10669369/fbsd%20ciss%20iostall%20debug= /20130717%20-%20G7%20crash%20%2815s%29/20130717%20-%20cissdebug.txt I hope this helps=85 G8 blades are still running stable with the ciss changes from head. Markus From owner-freebsd-scsi@FreeBSD.ORG Fri Jul 19 11:53:06 2013 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) by hub.freebsd.org (Postfix) with ESMTP id 0F0C922E for ; Fri, 19 Jul 2013 11:53:06 +0000 (UTC) (envelope-from sowmyagowda90@gmail.com) Received: from mail-vb0-x234.google.com (mail-vb0-x234.google.com [IPv6:2607:f8b0:400c:c02::234]) by mx1.freebsd.org (Postfix) with ESMTP id C9683303 for ; Fri, 19 Jul 2013 11:53:05 +0000 (UTC) Received: by mail-vb0-f52.google.com with SMTP id f12so3132658vbg.11 for ; Fri, 19 Jul 2013 04:53:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=YUSz3928zKrOFI2+E4THbft4yCxmg5ftZTM/Id0Bx3g=; b=Er60GgrMwpPt+Z4xetvGhcMnStGcwOfkuyjCluvef5hV4Pa1eTIiHxBP8QNmhVeKLp R4r/8V2Q0GeaypvvyTuAUeZobudrfLJX+0R+WhzHWOPhoaBCEvnhHfvTx9W3hKqQUz74 v3VFUQnhQdNFCmFZhJbWiN0uFPu9Xz1lJoehwTtJ7cJljhZEktZGbdTZZhvlgg0gOI9n eLZk8HiJUt4ItiC/7pmQAbIOBFX9Vp1BqshQehEvoaJi//v8SMS79myphQwX1C0NwLnZ 1UIxWeQakWkQvZADE1p1VRcaoV33fb3ubuJlLty/ePKq70jeGOp7CkEQeVMcdlhLfAe5 9DDg== MIME-Version: 1.0 X-Received: by 10.220.97.134 with SMTP id l6mr5592117vcn.44.1374234785274; Fri, 19 Jul 2013 04:53:05 -0700 (PDT) Received: by 10.58.151.137 with HTTP; Fri, 19 Jul 2013 04:53:05 -0700 (PDT) Date: Fri, 19 Jul 2013 17:23:05 +0530 Message-ID: Subject: Data corruption seen on the pool when an active path is pulled from geom_multipath device while running I/O From: "$owmya gowda" To: freebsd-scsi@freebsd.org Content-Type: text/plain; charset=ISO-8859-1 X-Content-Filtered-By: Mailman/MimeDel 2.1.14 X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 19 Jul 2013 11:53:06 -0000 Hi , Read/Write errors are recorded when an active path of the geom_multipath device is pulled while running the i/o on dataset created for the pool. Running I/o on dataset using dd. Freebsd version* :* 9.0 Patch imported from stable 9* : *r229303, r234916 zpool status: * * pool: poola state: ONLINE scan: none requested config: NAME STATE READ WRITE CKSUM poola ONLINE 0 0 0 mirror-0 ONLINE 0 0 0 multipath/newdisk4 ONLINE 0 0 0 multipath/newdisk2 ONLINE 0 0 0 errors: No known data errors * * * * gmultipath status: * * Name Status Components multipath/newdisk2 OPTIMAL da7 (ACTIVE) da2 (PASSIVE) multipath/newdisk1 OPTIMAL da6 (ACTIVE) da1 (PASSIVE) multipath/newdisk4 OPTIMAL da3 (ACTIVE) da4 (PASSIVE) multipath/newdisk OPTIMAL da0 (ACTIVE) da5 (PASSIVE) multipath/newdisk3 OPTIMAL da8 (ACTIVE) da9 (PASSIVE) * * scsi messages in the log: (da0:mpslsi0:0:136:0): READ(10). CDB: 28 0 0 4a bc d6 0 1 0 0 length 131072 SMID 516 terminated ioc 804b scsi 0 state 0 xfer 0 (da0:mpslsi0:0:136:0): READ(10). CDB: 28 0 0 4a bc d6 0 1 0 0 length 131072 SMID 516 terminated ioc 804b scsi 0 state 0 xfer 0 (da0:mpslsi0:0:136:0): READ(10). CDB: 28 0 0 4a bd d6 0 1 0 0 length 131072 SMID 538 terminated ioc 804b scsi 0 state 0 xfer 0 (da0:mpslsi0:0:136:0): READ(10). CDB: 28 0 0 4a bd d6 0 1 0 0 length 131072 SMID 538 terminated ioc 804b scsi 0 state 0 xfer 0 *zpool status after pulling the active path g_multipath device:* pool: mypool1 state: ONLINE status: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected. action: Determine if the device needs to be replaced, and clear the errors using 'zpool clear' or replace the device with 'zpool replace'. see: http://www.sun.com/msg/ZFS-8000-9P scan: resilvered 27.2M in 0h0m with 0 errors on Thu Jul 4 19:47:44 2013 config: NAME STATE READ WRITE CKSUM mypool1 ONLINE 0 0 0 mirror-0 ONLINE 0 12 0 multipath/newdisk4 ONLINE 0 27 0 multipath/newdisk2 ONLINE 0 12 0 spares multipath/newdisk AVAIL errors: No known data errors Are there any dependencies for the patch that is picked from stable 9 as mentioned above?? will be waiting for your reply. From owner-freebsd-scsi@FreeBSD.ORG Fri Jul 19 16:35:43 2013 Return-Path: Delivered-To: freebsd-scsi@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) by hub.freebsd.org (Postfix) with ESMTP id E308C674 for ; Fri, 19 Jul 2013 16:35:43 +0000 (UTC) (envelope-from avg@FreeBSD.org) Received: from citadel.icyb.net.ua (citadel.icyb.net.ua [212.40.38.140]) by mx1.freebsd.org (Postfix) with ESMTP id 2169C36A for ; Fri, 19 Jul 2013 16:35:42 +0000 (UTC) Received: from porto.starpoint.kiev.ua (porto-e.starpoint.kiev.ua [212.40.38.100]) by citadel.icyb.net.ua (8.8.8p3/ICyb-2.3exp) with ESMTP id TAA11614 for ; Fri, 19 Jul 2013 19:35:41 +0300 (EEST) (envelope-from avg@FreeBSD.org) Received: from localhost ([127.0.0.1]) by porto.starpoint.kiev.ua with esmtp (Exim 4.34 (FreeBSD)) id 1V0Def-000A6O-6b for freebsd-scsi@freebsd.org; Fri, 19 Jul 2013 19:35:41 +0300 Message-ID: <51E96A8C.7050301@FreeBSD.org> Date: Fri, 19 Jul 2013 19:34:20 +0300 From: Andriy Gapon User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:17.0) Gecko/20130708 Thunderbird/17.0.7 MIME-Version: 1.0 To: freebsd-scsi@FreeBSD.org Subject: Re: xpt_schedule_dev: priority of new entry References: <516EC5F7.5030908@FreeBSD.org> In-Reply-To: <516EC5F7.5030908@FreeBSD.org> X-Enigmail-Version: 1.5.1 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 19 Jul 2013 16:35:44 -0000 [ping] on 17/04/2013 18:55 Andriy Gapon said the following: > > I wonder what is the reason for the following code: > /* New entry on the queue */ > if (new_priority < old_priority) > pinfo->priority = new_priority; > > Shouldn't we unconditionally honor new_priority given that this is the new entry case? > -- Andriy Gapon