From owner-freebsd-scsi@FreeBSD.ORG Mon Nov 17 01:13:23 2008 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 8EDC11065670; Mon, 17 Nov 2008 01:13:23 +0000 (UTC) (envelope-from p.christias@noc.ntua.gr) Received: from achilles.noc.ntua.gr (achilles.noc.ntua.gr [IPv6:2001:648:2000:de::210]) by mx1.freebsd.org (Postfix) with ESMTP id F109D8FC17; Mon, 17 Nov 2008 01:13:22 +0000 (UTC) (envelope-from p.christias@noc.ntua.gr) Received: from ajax.noc.ntua.gr (ajax6.noc.ntua.gr [IPv6:2001:648:2000:dc::1]) by achilles.noc.ntua.gr (8.14.3/8.14.3) with ESMTP id mAH1DIRJ013941 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NOT); Mon, 17 Nov 2008 03:13:18 +0200 (EET) (envelope-from p.christias@noc.ntua.gr) Received: from ajax.noc.ntua.gr (localhost.noc.ntua.gr [127.0.0.1]) by ajax.noc.ntua.gr (8.13.8/8.13.8) with ESMTP id mAH1DHqN056431 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NOT); Mon, 17 Nov 2008 03:13:17 +0200 (EET) (envelope-from p.christias@noc.ntua.gr) Received: (from christia@localhost) by ajax.noc.ntua.gr (8.13.8/8.13.8/Submit) id mAH1DHVK056430; Mon, 17 Nov 2008 03:13:17 +0200 (EET) (envelope-from p.christias@noc.ntua.gr) X-Authentication-Warning: ajax.noc.ntua.gr: christia set sender to p.christias@noc.ntua.gr using -f Date: Mon, 17 Nov 2008 03:13:17 +0200 From: Panagiotis Christias To: Oleg Sharoiko Message-ID: <20081117011317.GB52109@noc.ntua.gr> References: <20081014222343.GA8706@noc.ntua.gr> <1224049455.1277.44.camel@brain.cc.rsu.ru> <20081015175453.GA3260@noc.ntua.gr> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20081015175453.GA3260@noc.ntua.gr> User-Agent: Mutt/1.5.16 (2007-06-09) X-Virus-Scanned: ClamAV version 0.94, clamav-milter version 0.94 on achilles.noc.ntua.gr X-Virus-Status: Clean Cc: freebsd-scsi@freebsd.org, freebsd-stable@freebsd.org Subject: Re: FreeBSD 7-STABLE, isp(4), QLE2462: panic & deadlocks X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 17 Nov 2008 01:13:23 -0000 On Wed, Oct 15, 2008 at 08:54:53PM +0300, Panagiotis Christias wrote: > On Wed, Oct 15, 2008 at 09:44:15AM +0400, Oleg Sharoiko wrote: > > Hi! > > > > On Wed, 2008-10-15 at 01:23 +0300, Panagiotis Christias wrote: > > > > > However, when we connect them to the CX3-40, create and mount a new > > > partition and then do something as simple as "tar -C /san -xf ports.tgz" > > > the system panics and deadlocks. We have tried several FreeBSD versions > > > (6.3 i386/adm64, 7.0 i386/adm64, 7.1 i386/adm64 and lastly 7-STABLE i386 > > > - we also tried the latest 8-CURRENT snapshot but it panicked too soon). > > > The result is always the same; panic and deadlock. > > > > Try reducing the number of "tagged openings" with 'camcontrol tags' down > > to 46. If it doesn't work try reducing it further to 2. Also be advised > > that I've seen panics with geom_multipath in FreeBSD-7, unfortunately I > > had no time to test it in -current. > > > Hm.. that would probably explain the fact that I was unable to panic the > system when I had set the hint.isp.0.debug="0x1F" in /boot/device.hints. > > Currently I am stress testing the server with the tagged openings set to > 44 (first value tested). Until now there is no panic or deadlock. I am > trying concurrent tar extractions and rsync copies. The filesystem looks > ok till now according to fsck. I will let it write/copy/delete overnight > and tomorrow I will try different tagged opening values. > > Thank you for the hint! I am wondering what is the performance penalty > with decreased tagged openings. Also, is there anything else I could try > in order to get more useful debug output? I have at least three servers > that I could use for any kind of tests and I am willing to spend as much > time I can get to help solving the problem. > > Finally, the only output in the logs is: > > Expensive timeout(9) function: 0xc06f4210(0xc67e1200) 0.059422635 s > Expensive timeout(9) function: 0xc08d4fd0(0) 0.060676147 s > > I suppose that is related to the CAMDEBUG kernel config options. For the record, I have done many tests using several stressing tools in parallel, different FreeBSD versions (up to 7.1beta2), various filesystem configurations (plain ufs2 with softupdates, ufs2 and gjournal, zfs) and various tag openings values (down to 2). Regardless of the configuration, the system deadlocks, panics or the filesystem gets awfully corrupted within seconds, minutes or a few hours. The only configuration that seems to work without problems(?) but with a unacceptable *severe* performance penalty is when tag openings are set to minimum value of 2 (that is more or less same as disabling tagged command queueing at all). All tests ran using a 500 GB RAID5 LUN on an EMC Clariion CX340: da0 at isp0 bus 0 target 0 lun 0 da0: Fixed Direct Access SCSI-4 device da0: Serial Number CK200083100148 da0: 400.000MB/s transfers da0: Command Queueing Enabled da0: 512000MB (1048576000 512 byte sectors: 255H 63S/T 65270C) Previously, a Sun StorEdge T3 was tested which worked flawlessly but it had a 1 Gbps fibre channel interface, instead of a 4 Gbps that Clariion has, was recognized as a SCSI-3 device and had 2 tags openings (no surprise) by default: da1 at isp1 bus 0 target 0 lun 0 da1: Fixed Direct Access SCSI-3 device da1: 100.000MB/s transfers da1: 241724MB (495050752 512 byte sectors: 255H 63S/T 30815C) As I mentioned before, I am willing to spend time or/and provide access to the system for testing and debugging. Regards, Panagiotis -- Panagiotis J. Christias Network Management Center P.Christias@noc.ntua.gr National Technical Univ. of Athens, GREECE From owner-freebsd-scsi@FreeBSD.ORG Mon Nov 17 09:06:30 2008 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 97EB51065670 for ; Mon, 17 Nov 2008 09:06:30 +0000 (UTC) (envelope-from jcigar@ulb.ac.be) Received: from mxin.vub.ac.be (mxin.vub.ac.be [134.184.129.110]) by mx1.freebsd.org (Postfix) with ESMTP id 1999B8FC12 for ; Mon, 17 Nov 2008 09:06:29 +0000 (UTC) (envelope-from jcigar@ulb.ac.be) X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: Ai0BAHmyIEmkD30E/2dsb2JhbAAIlFmvH4lbgnk Received: from bebif01.ulb.ac.be (HELO [10.0.0.194]) ([164.15.125.4]) by smtp.vub.ac.be with ESMTP; 17 Nov 2008 09:36:53 +0100 From: Julien Cigar To: freebsd-scsi@freebsd.org Content-Type: text/plain; charset=UTF-8 Date: Mon, 17 Nov 2008 09:37:08 +0100 Message-Id: <1226911028.2746.14.camel@frodon.be-bif.ulb.ac.be> Mime-Version: 1.0 X-Mailer: Evolution 2.22.3.1 Content-Transfer-Encoding: 8bit Subject: ahc X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 17 Nov 2008 09:06:30 -0000 Dear FreeBSD users, I'm running FreeBSD 7.0 with Bacula as a backup box. This box has an Adaptec 2940 SCSI card and a Sony SDX700-C tape drive : ahc0: port 0xec00-0xecff mem 0xdffff000-0xdfffffff irq 18 at device 7.0 on pci0 aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs sa0 at ahc0 bus 0 target 5 lun 0 sa0: Removable Sequential Access SCSI-2 device sa0: 20.000MB/s transfers (10.000MHz, offset 8, 16bit) It works more or less in the sense that sometimes Bacula fails to write final EOF to tape. I'm sure that the tapes are OK because I reached the tape rotation cycle and sometimes it fails with a tape that didn't fail in the previous rotation cycle. When I look at the kernel outputs, I have the following : ahc0: Recovery Initiated >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< ahc0: Dumping Card State while idle, at SEQADDR 0x7 Card was paused ACCUM = 0x83, SINDEX = 0x57, DINDEX = 0x26, ARG_2 = 0x3e HCNT = 0xe8 SCBPTR = 0x0 SCSISIGI[0x0] ERROR[0x0] SCSIBUSL[0x0] LASTPHASE[0x1]:(P_BUSFREE) SCSISEQ[0x12]:(ENAUTOATNP|ENRSELI) SBLKCTL[0x2]:(SELWIDE) SCSIRATE[0x0] SEQCTL[0x10]:(FASTMODE) SEQ_FLAGS[0xc0]:(NO_CDB_SENT| NOT_IDENTIFIED) SSTAT0[0x0] SSTAT1[0xa]:(PHASECHG|BUSFREE) SSTAT2[0x0] SSTAT3[0x0] SIMODE0[0x0] SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO) SXFRCTL0[0x80]:(DFON) DFCNTRL[0x0] DFSTATUS[0x2]:(FIFOFULL) STACK: 0x0 0x16a 0x19a 0x3 SCB count = 254 Kernel NEXTQSCB = 238 Card NEXTQSCB = 238 QINFIFO entries: Waiting Queue entries: Disconnected Queue entries: 0:248 QOUTFIFO entries: Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Sequencer SCB Info: 0 SCB_CONTROL[0x44]:(DISCONNECTED|DISCENB) SCB_SCSIID[0x57] SCB_LUN[0x0] SCB_TAG[0xf8] 1 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 2 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 3 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 4 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 5 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 6 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 7 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 8 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 9 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 10 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 11 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 12 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 13 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 14 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 15 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] Pending list: 248 SCB_CONTROL[0x40]:(DISCENB) SCB_SCSIID[0x57] SCB_LUN[0x0] Kernel Free SCB list: 239 240 241 242 243 244 245 246 247 249 250 251 252 253 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 Untagged Q(5): 248 <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> (sa0:ahc0:0:5:0): SCB 0xf8 - timed out sg[0] - Addr 0x375e028 : Length 8152 sg[1] - Addr 0x3700000 : Length 4096 sg[2] - Addr 0x2cb0000 : Length 20480 sg[3] - Addr 0x3701000 : Length 20480 sg[4] - Addr 0x375c000 : Length 4096 sg[5] - Addr 0x4026000 : Length 4096 sg[6] - Addr 0x4700000 : Length 3112 (sa0:ahc0:0:5:0): Queuing a BDR SCB ahc0: Timedout SCBs already complete. Interrupts may not be functioning. (sa0:ahc0:0:5:0): Bus Device Reset Message Sent (sa0:ahc0:0:5:0): no longer in timeout, status = 24b ahc0: Bus Device Reset on A:5. 1 SCBs aborted (sa0:ahc0:0:5:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc0:0:5:0): NO SENSE ILI (length mismatch): -56320 asc:0,0 (sa0:ahc0:0:5:0): No additional sense information (sa0:ahc0:0:5:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc0:0:5:0): NO SENSE ILI (length mismatch): -56320 asc:0,0 (sa0:ahc0:0:5:0): No additional sense information % Do you have an idea what could be wrong ? Should I fill a bug report ? Thanks, (and sorry for my english) Julien ps: as I'm not subscribed on this list, could you include my email address if you reply ? -- Julien Cigar Belgian Biodiversity Platform http://www.biodiversity.be Université Libre de Bruxelles (ULB) Campus de la Plaine CP 257 Bâtiment NO, Bureau 4 N4 115C (Niveau 4) Boulevard du Triomphe, entrée ULB 2 B-1050 Bruxelles Mail: jcigar@ulb.ac.be @biobel: http://biobel.biodiversity.be/person/show/471 Tel : 02 650 57 52 From owner-freebsd-scsi@FreeBSD.ORG Mon Nov 17 11:06:57 2008 Return-Path: Delivered-To: freebsd-scsi@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 231D11065680 for ; Mon, 17 Nov 2008 11:06:57 +0000 (UTC) (envelope-from owner-bugmaster@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id 1263B8FC1A for ; Mon, 17 Nov 2008 11:06:57 +0000 (UTC) (envelope-from owner-bugmaster@FreeBSD.org) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.3/8.14.3) with ESMTP id mAHB6utH082656 for ; Mon, 17 Nov 2008 11:06:56 GMT (envelope-from owner-bugmaster@FreeBSD.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.3/8.14.3/Submit) id mAHB6uWT082652 for freebsd-scsi@FreeBSD.org; Mon, 17 Nov 2008 11:06:56 GMT (envelope-from owner-bugmaster@FreeBSD.org) Date: Mon, 17 Nov 2008 11:06:56 GMT Message-Id: <200811171106.mAHB6uWT082652@freefall.freebsd.org> X-Authentication-Warning: freefall.freebsd.org: gnats set sender to owner-bugmaster@FreeBSD.org using -f From: FreeBSD bugmaster To: freebsd-scsi@FreeBSD.org Cc: Subject: Current problem reports assigned to freebsd-scsi@FreeBSD.org X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 17 Nov 2008 11:06:57 -0000 Note: to view an individual PR, use: http://www.freebsd.org/cgi/query-pr.cgi?pr=(number). The following is a listing of current problems submitted by FreeBSD users. These represent problem reports covering all versions including experimental development code and obsolete releases. S Tracker Resp. Description -------------------------------------------------------------------------------- o kern/128452 scsi [sa] [panic] Accessing SCSI tape drive randomly crashe o kern/128245 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o kern/127927 scsi [isp] isp(4) target driver crashes kernel when set up o kern/127901 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o kern/126866 scsi [isp] [panic] kernel panic on card initialization o kern/124667 scsi [amd] [panic] FreeBSD-7 kernel page faults at amd-scsi o kern/123674 scsi [ahc] ahc driver dumping o kern/123666 scsi [aac] attach fails with Adaptec SAS RAID 3805 controll o sparc/121676 scsi [iscsi] iscontrol do not connect iscsi-target on sparc o kern/120487 scsi [sg] scsi_sg incompatible with scanners o kern/120247 scsi [mpt] FreeBSD 6.3 and LSI Logic 1030 = only 3.300MB/s o kern/119668 scsi [cam] [patch] certain errors are too verbose comparing o kern/114597 scsi [sym] System hangs at SCSI bus reset with dual HBAs o kern/110847 scsi [ahd] Tyan U320 onboard problem with more than 3 disks o kern/99954 scsi [ahc] reading from DVD failes on 6.x [regression] o kern/94838 scsi Kernel panic while mounting SD card with lock switch o o kern/92798 scsi [ahc] SCSI problem with timeouts o kern/90282 scsi [sym] SCSI bus resets cause loss of ch device o kern/76178 scsi [ahd] Problem with ahd and large SCSI Raid system o kern/74627 scsi [ahc] [hang] Adaptec 2940U2W Can't boot 5.3 s kern/61165 scsi [panic] kernel page fault after calling cam_send_ccb o kern/60641 scsi [sym] Sporadic SCSI bus resets with 53C810 under load o kern/60598 scsi wire down of scsi devices conflicts with config s kern/57398 scsi [mly] Current fails to install on mly(4) based RAID di o kern/52638 scsi [panic] SCSI U320 on SMP server won't run faster than o kern/44587 scsi dev/dpt/dpt.h is missing defines required for DPT_HAND o kern/40895 scsi wierd kernel / device driver bug o kern/39388 scsi ncr/sym drivers fail with 53c810 and more than 256MB m o kern/38828 scsi [dpt] [request] DPT PM2012B/90 doesn't work o kern/35234 scsi World access to /dev/pass? (for scanner) requires acce 30 problems total. From owner-freebsd-scsi@FreeBSD.ORG Mon Nov 17 15:40:05 2008 Return-Path: Delivered-To: freebsd-scsi@hub.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id CD5E71065674 for ; Mon, 17 Nov 2008 15:40:05 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id A30808FC23 for ; Mon, 17 Nov 2008 15:40:05 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (gnats@localhost [127.0.0.1]) by freefall.freebsd.org (8.14.3/8.14.3) with ESMTP id mAHFe5tl088059 for ; Mon, 17 Nov 2008 15:40:05 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.3/8.14.3/Submit) id mAHFe5wq088058; Mon, 17 Nov 2008 15:40:05 GMT (envelope-from gnats) Date: Mon, 17 Nov 2008 15:40:05 GMT Message-Id: <200811171540.mAHFe5wq088058@freefall.freebsd.org> To: freebsd-scsi@FreeBSD.org From: Kirk Strauser Cc: Subject: Re: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: Kirk Strauser List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 17 Nov 2008 15:40:05 -0000 The following reply was made to PR kern/128452; it has been noted by GNATS. From: Kirk Strauser To: bug-followup@freebsd.org, kirk@strauser.com Cc: Subject: Re: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system Date: Mon, 17 Nov 2008 09:32:58 -0600 I don't wish to pester, but is anyone actually looking at these? If so, should I continue submitting dumps, or do you already have what you need? Is there anything else I can provide? From owner-freebsd-scsi@FreeBSD.ORG Tue Nov 18 11:37:14 2008 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1AF531065672 for ; Tue, 18 Nov 2008 11:37:14 +0000 (UTC) (envelope-from Andre.Albsmeier@siemens.com) Received: from thoth.sbs.de (thoth.sbs.de [192.35.17.2]) by mx1.freebsd.org (Postfix) with ESMTP id 8F26C8FC0A for ; Tue, 18 Nov 2008 11:37:13 +0000 (UTC) (envelope-from Andre.Albsmeier@siemens.com) Received: from mail3.siemens.de (localhost [127.0.0.1]) by thoth.sbs.de (8.12.11.20060308/8.12.11) with ESMTP id mAIBO38t018086 for ; Tue, 18 Nov 2008 12:24:03 +0100 Received: from curry.mchp.siemens.de (curry.mchp.siemens.de [139.25.40.130]) by mail3.siemens.de (8.12.11.20060308/8.12.11) with ESMTP id mAIBO2ik003188 for ; Tue, 18 Nov 2008 12:24:02 +0100 Received: (from localhost) by curry.mchp.siemens.de (8.14.3/8.14.3) id mAIBO2xl043490; Date: Tue, 18 Nov 2008 12:24:02 +0100 From: Andre Albsmeier To: freebsd-scsi@freebsd.org Message-ID: <20081118112402.GA78188@curry.mchp.siemens.de> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline X-Echelon: X-Advice: Drop that crappy M$-Outlook, I'm tired of your viruses! User-Agent: Mutt/1.5.18 (2008-05-17) Cc: Andre.Albsmeier@siemens.com Subject: Quantum SLDT600 write problems X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 18 Nov 2008 11:37:14 -0000 Hello, for months, I am experiencing occasionally appearing problems using a Quantum SDLT600. What we do is simple: - open() /dev/sa0 - set the blocksize to 64k using ioctl() - write() data in 64k chunks - close() Sometimes, the write() comes back with EIO. This can happen whenever it likes to -- after a few GB, hundreds of GB or never. If it happens, the kernel spits out errors (see below). Otherwise, the machine runs rock solid as a server running quotas, samba, nfsd, dhcpd, NIS, ntpd, ... The complete hardware, apart from the SDLT drive itself, has been replaced a while ago. Earlier it was a 1,4GHz Tualatin on an Asus CUBX-L board using an Adaptec 29160 controller, now it is an 3GHz E8400 on an Asus P5W board using an Adaptec 39320LPE controller. Even the cable from the controller to the drive was changed. OS has always been a recent version of FreeBSD 6.x-STABLE (now 6.4). Any ideas what is happening here? Here are the kernel errors: Nov 18 12:04:13 server kernel: ahd3: Recovery Initiated - Card was not paused Nov 18 12:04:13 server kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< Nov 18 12:04:13 server kernel: ahd3: Dumping Card State at program address 0x32 Mode 0x0 Nov 18 12:04:13 server kernel: INTSTAT[0x0] SELOID[0x0] SELID[0x0] HS_MAILBOX[0x0] Nov 18 12:04:13 server kernel: INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11] DFFSTAT[0x33] Nov 18 12:04:13 server kernel: SCSISIGI[0x0] SCSIPHASE[0x0] SCSIBUS[0x0] LASTPHASE[0x1] Nov 18 12:04:13 server kernel: SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x0] SEQINTCTL[0x0] Nov 18 12:04:13 server kernel: SEQ_FLAGS[0xc0] SEQ_FLAGS2[0x0] QFREEZE_COUNT[0xe1c] Nov 18 12:04:13 server kernel: KERNEL_QFREEZE_COUNT[0xe1c] MK_MESSAGE_SCB[0xff00] Nov 18 12:04:13 server kernel: MK_MESSAGE_SCSIID[0xff] SSTAT0[0x0] SSTAT1[0x8] SSTAT2[0x0] Nov 18 12:04:13 server kernel: SSTAT3[0x0] PERRDIAG[0x0] SIMODE1[0xa4] LQISTAT0[0x0] Nov 18 12:04:13 server kernel: LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0] LQOSTAT1[0x0] Nov 18 12:04:13 server kernel: LQOSTAT2[0x0] Nov 18 12:04:13 server kernel: Nov 18 12:04:13 server kernel: SCB Count = 16 CMDS_PENDING = 1 LASTSCB 0xffff CURRSCB 0x1 NEXTSCB 0x0 Nov 18 12:04:13 server kernel: qinstart = 9444 qinfifonext = 9444 Nov 18 12:04:13 server kernel: QINFIFO: Nov 18 12:04:13 server kernel: WAITING_TID_QUEUES: Nov 18 12:04:13 server kernel: Pending list: Nov 18 12:04:13 server kernel: 1 FIFO_USE[0x0] SCB_CONTROL[0x44] SCB_SCSIID[0x7] Nov 18 12:04:13 server kernel: Total 1 Nov 18 12:04:13 server kernel: Kernel Free SCB list: 2 15 13 12 11 10 9 8 7 6 5 4 3 14 0 Nov 18 12:04:13 server kernel: Sequencer Complete DMA-inprog list: Nov 18 12:04:13 server kernel: Sequencer Complete list: Nov 18 12:04:13 server kernel: Sequencer DMA-Up and Complete list: Nov 18 12:04:13 server kernel: Sequencer On QFreeze and Complete list: Nov 18 12:04:13 server kernel: Nov 18 12:04:13 server kernel: Nov 18 12:04:13 server kernel: ahd3: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0 Nov 18 12:04:13 server kernel: SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89] Nov 18 12:04:13 server kernel: SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] Nov 18 12:04:13 server kernel: SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 Nov 18 12:04:13 server kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] Nov 18 12:04:13 server kernel: Nov 18 12:04:13 server kernel: ahd3: FIFO1 Free, LONGJMP == 0x81fc, SCB 0x1 Nov 18 12:04:13 server kernel: SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x4] DFSTATUS[0x89] Nov 18 12:04:13 server kernel: SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] Nov 18 12:04:13 server kernel: SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 Nov 18 12:04:13 server kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] Nov 18 12:04:13 server kernel: LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 Nov 18 12:04:13 server kernel: ahd3: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42 Nov 18 12:04:13 server kernel: ahd3: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0 Nov 18 12:04:13 server kernel: ahd3: SAVED_SCSIID = 0x0 SAVED_LUN = 0x0 Nov 18 12:04:13 server kernel: SIMODE0[0xc] Nov 18 12:04:13 server kernel: CCSCBCTL[0x4] Nov 18 12:04:13 server kernel: ahd3: REG0 == 0x6f74, SINDEX = 0x1b8, DINDEX = 0x1ba Nov 18 12:04:13 server kernel: ahd3: SCBPTR == 0x0, SCB_NEXT == 0xff00, SCB_NEXT2 == 0x0 Nov 18 12:04:13 server kernel: CDB 0 0 0 0 0 0 Nov 18 12:04:13 server kernel: STACK: 0x23 0x0 0x0 0x0 0x0 0x0 0x0 0x0 Nov 18 12:04:13 server kernel: <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): SCB 1 - timed out Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): Queuing a BDR SCB Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): Bus Device Reset Message Sent Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): no longer in timeout, status = 24b Nov 18 12:04:13 server kernel: ahd3: Bus Device Reset on A:0. 1 SCBs aborted Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): WRITE FILEMARKS(6). CDB: 10 0 0 0 2 0 Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): CAM Status: SCSI Status Error Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): SCSI Status: Check Condition Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): UNIT ATTENTION csi:0,49,cc,1e asc:29,3 Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): Bus device reset function occurred Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): Retries Exhausted Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): failed to write terminating filemark(s) Thanks, -Andre From owner-freebsd-scsi@FreeBSD.ORG Wed Nov 19 15:40:04 2008 Return-Path: Delivered-To: freebsd-scsi@hub.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 6F3521065686 for ; Wed, 19 Nov 2008 15:40:04 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id 42D368FC13 for ; Wed, 19 Nov 2008 15:40:04 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (gnats@localhost [127.0.0.1]) by freefall.freebsd.org (8.14.3/8.14.3) with ESMTP id mAJFe4rB022904 for ; Wed, 19 Nov 2008 15:40:04 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.3/8.14.3/Submit) id mAJFe4e5022903; Wed, 19 Nov 2008 15:40:04 GMT (envelope-from gnats) Date: Wed, 19 Nov 2008 15:40:04 GMT Message-Id: <200811191540.mAJFe4e5022903@freefall.freebsd.org> To: freebsd-scsi@FreeBSD.org From: Kirk Strauser Cc: Subject: Re: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: Kirk Strauser List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 19 Nov 2008 15:40:04 -0000 The following reply was made to PR kern/128452; it has been noted by GNATS. From: Kirk Strauser To: bug-followup@freebsd.org, kirk@strauser.com Cc: Subject: Re: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system Date: Wed, 19 Nov 2008 09:33:33 -0600 You can close this bug. I replaced the card with an Adaptec 29160 and it's been working perfectly ever since. In all fairness, though, the old cards (Tekram DC390F) should be removed from the list of supported hardware since they are no longer functional under FreeBSD 7. From owner-freebsd-scsi@FreeBSD.ORG Fri Nov 21 17:51:45 2008 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1A2A01065670; Fri, 21 Nov 2008 17:51:45 +0000 (UTC) (envelope-from peter@simons-rock.edu) Received: from hedwig.simons-rock.edu (hedwig.simons-rock.edu [208.81.88.14]) by mx1.freebsd.org (Postfix) with ESMTP id D054B8FC0C; Fri, 21 Nov 2008 17:51:44 +0000 (UTC) (envelope-from peter@simons-rock.edu) Received: from cesium.hyperfine.info (c2.8d.5646.static.theplanet.com [70.86.141.194]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by hedwig.simons-rock.edu (Postfix) with ESMTP id 18DEFECFF8; Fri, 21 Nov 2008 12:23:20 -0500 (EST) Date: Fri, 21 Nov 2008 12:20:46 -0500 From: "Peter C. Lai" To: freebsd-stable@FreeBSD.org, freebsd-scsi@freebsd.org Message-ID: <20081121172028.GQ92091@cesium.hyperfine.info> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.5.17 (2007-11-01) Cc: Subject: ahc(4) on aic7899 wedges with a Tandberg LTO-2 sa drive X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 21 Nov 2008 17:51:45 -0000 I've got a Tandberg TS 400 LTO-2 drive in a Dell PE1800 which is wedging ahc(4) after writing about 5 GB of data. I've attached dmesg.boot. uname -a: FreeBSD phoenix.simons-rock.edu 7.1-PRERELEASE FreeBSD 7.1-PRERELEASE #0: Fri Oct 17 23:17:38 EDT 2008 root@phoenix.simons-rock.edu:/usr/obj/usr/src/sys/PHOENIXPCL i386 I put in a tape, and mt -f /dev/sa0 status gives the following kernel message: Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Sending PPR bus_width 1, period 9, offset 7e, ppr_options 2 Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Received PPR width 1, period 9, offset 7e,options 2 Nov 21 12:06:28 phoenix kernel: Filtered to width 1, period 9, offset 7e, options 2 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): error 6 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): Unretryable Error Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Sending PPR bus_width 1, period 9, offset 7e, ppr_options 2 Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Received PPR width 1, period 9, offset 7e,options 2 Nov 21 12:06:28 phoenix kernel: Filtered to width 1, period 9, offset 7e, options 2 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): error 6 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): Unretryable Error Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Sending PPR bus_width 1, period 9, offset 7e, ppr_options 2 Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Received PPR width 1, period 9, offset 7e,options 2 Nov 21 12:06:28 phoenix kernel: Nov 21 12:06:28 phoenix kernel: Filtered to width 1, period 9, offset 7e, options 2 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): error 6 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): Unretryable Error Nov 21 12:07:30 phoenix kernel: (ahc0:A:6:0): Sending PPR bus_width 1, period 9, offset 7e, ppr_options 2 Nov 21 12:07:30 phoenix kernel: (ahc0:A:6:0): Received PPR width 1, period 9, offset 7e,options 2 Nov 21 12:07:30 phoenix kernel: Filtered to width 1, period 9, offset 7e, options 2 Nov 21 12:07:30 phoenix kernel: (sa0:ahc0:0:6:0): Retrying Command but it returns ok at the end: Mode Density Blocksize bpi Compression Current: 0x42 variable 0 0x1 ---------available modes--------- 0: 0x42 variable 0 0x1 1: 0x42 variable 0 0x1 2: 0x42 variable 0 0x1 3: 0x42 variable 0 0x1 --------------------------------- Current Driver State: at rest. --------------------------------- File Number: 0 Record Number: 0 Residual Count 0 I am using star bs=32k -no-fifo to write to the tape. ahc(4) crashes regardless of setting the blocksize on the hardware (to 32k) or not: Nov 21 08:58:15 phoenix kernel: ahc0: Recovery Initiated Nov 21 08:58:15 phoenix kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< Nov 21 08:58:15 phoenix kernel: ahc0: Dumping Card State while idle, at SEQADDR 0x8 Nov 21 08:58:15 phoenix kernel: Card was paused Nov 21 08:58:15 phoenix kernel: ACCUM = 0x4, SINDEX = 0x67, DINDEX = 0x27, ARG_2 = 0x3b Nov 21 08:58:15 phoenix kernel: HCNT = 0x0 SCBPTR = 0x0 Nov 21 08:58:15 phoenix kernel: SCSIPHASE[0x0] SCSISIGI[0x0] ERROR[0x0] SCSIBUSL[0x0] Nov 21 08:58:15 phoenix kernel: LASTPHASE[0x1]:(P_BUSFREE) SCSISEQ[0x12]:(ENAUTOATNP|ENRSELI) Nov 21 08:58:15 phoenix kernel: SBLKCTL[0xa]:(SELWIDE|SELBUSB) SCSIRATE[0x0] SEQCTL[0x10]:(FASTMODE) Nov 21 08:58:15 phoenix kernel: SEQ_FLAGS[0xc0]:(NO_CDB_SENT|NOT_IDENTIFIED) SSTAT0[0x0] Nov 21 08:58:15 phoenix kernel: SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0] SIMODE0[0x8]:(ENSWRAP) Nov 21 08:58:15 phoenix kernel: SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO) SXFRCTL0[0x80]:(DFON) Nov 21 08:58:15 phoenix kernel: DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL) Nov 21 08:58:15 phoenix kernel: STACK: 0x0 0x164 0x179 0x3 Nov 21 08:58:15 phoenix kernel: SCB count = 254 Nov 21 08:58:15 phoenix kernel: Kernel NEXTQSCB = 247 Nov 21 08:58:15 phoenix kernel: Card NEXTQSCB = 247 Nov 21 08:58:15 phoenix kernel: QINFIFO entries: Nov 21 08:58:15 phoenix kernel: Waiting Queue entries: Nov 21 08:58:15 phoenix kernel: Disconnected Queue entries: 0:238 Nov 21 08:58:15 phoenix kernel: QOUTFIFO entries: Nov 21 08:58:15 phoenix kernel: Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Nov 21 08:58:15 phoenix kernel: Sequencer SCB Info: Nov 21 08:58:15 phoenix kernel: 0 SCB_CONTROL[0x44]:(DISCONNECTED|DISCENB) SCB_SCSIID[0x67] Nov 21 08:58:15 phoenix kernel: SCB_LUN[0x0] SCB_TAG[0xee] Nov 21 08:58:15 phoenix kernel: 1 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) Nov 21 08:58:15 phoenix kernel: SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] Nov 21 08:58:15 phoenix kernel: 2 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) Nov 21 08:58:15 phoenix kernel: SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] Nov 21 08:58:15 phoenix kernel: 3 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) Nov 21 08:58:15 phoenix kernel: SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] Nov 21 08:58:15 phoenix kernel: 4 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) ... Nov 21 08:58:15 phoenix kernel: 31 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) Nov 21 08:58:15 phoenix kernel: SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] Nov 21 08:58:15 phoenix kernel: Pending list: Nov 21 08:58:15 phoenix kernel: 238 SCB_CONTROL[0x40]:(DISCENB) SCB_SCSIID[0x67] SCB_LUN[0x0] Nov 21 08:58:15 phoenix kernel: Kernel Free SCB list: 239 240 241 242 243 244 245 246 248 249 250 251 252 253 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 Nov 21 08:58:15 phoenix kernel: Untagged Q(6): 238 Nov 21 08:58:15 phoenix kernel: Nov 21 08:58:15 phoenix kernel: <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> Nov 21 08:58:15 phoenix kernel: (sa0:ahc0:0:6:0): SCB 0xee - timed out Nov 21 08:58:15 phoenix kernel: sg[0] - Addr 0x21706000 : Length 4096 Nov 21 08:58:15 phoenix kernel: sg[1] - Addr 0x217aa000 : Length 4096 Nov 21 08:58:15 phoenix kernel: sg[2] - Addr 0x2183b000 : Length 4096 ... Nov 21 08:58:15 phoenix kernel: sg[7] - Addr 0x221f6000 : Length 4096 Nov 21 08:58:15 phoenix kernel: (sa0:ahc0:0:6:0): Queuing a BDR SCB Nov 21 08:58:15 phoenix kernel: Infinite interrupt loop, INTSTAT = 0ahc0: Timedout SCBs already complete. Interrupts may not be functioning. Nov 21 08:58:17 phoenix kernel: Infinite interrupt loop, INTSTAT = 0ahc0: Recovery Initiated Nov 21 08:58:17 phoenix kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< Nov 21 08:58:17 phoenix kernel: ahc0: Dumping Card State while idle, at SEQADDR 0x18 Nov 21 08:58:17 phoenix kernel: Card was paused ... Nov 21 08:58:17 phoenix kernel: (sa0:ahc0:0:6:0): no longer in timeout, status = 24b Nov 21 08:58:17 phoenix kernel: ahc0: target 6 using 8bit transfers Nov 21 08:58:17 phoenix kernel: ahc0: target 6 using asynchronous transfers Nov 21 08:58:17 phoenix kernel: ahc0: Issued Channel A Bus Reset. 1 SCBs aborted Nov 21 08:58:17 phoenix kernel: (sa0:ahc0:0:6:0): Command timed out Nov 21 08:58:17 phoenix kernel: (sa0:ahc0:0:6:0): error 5 Nov 21 08:58:17 phoenix kernel: (sa0:ahc0:0:6:0): Retries Exausted Nov 21 08:58:17 phoenix kernel: ahc0: Timedout SCBs already complete. Interrupts may not be functioning. Nov 21 08:58:17 phoenix kernel: (ahc0:A:6:0): Sending PPR bus_width 1, period 9, offset 7e, ppr_options 2 Nov 21 08:58:17 phoenix kernel: (ahc0:A:6:0): Received PPR width 1, period 9, offset 7e,options 2 Nov 21 08:58:17 phoenix kernel: Filtered to width 1, period 9, offset 7e, options 2 Nov 21 08:58:17 phoenix kernel: ahc0: target 6 using 16bit transfers Nov 21 08:58:17 phoenix kernel: ahc0: target 6 synchronous at 80.0MHz DT, offset = 0x7e Nov 21 09:02:17 phoenix kernel: ahc0: Recovery Initiated ... until shutdown. A shutdown -r will still leave the controller wedged until a fullblow powercycle is done without the tape in the drive. Reading from tapes work just fine though. Also this identical hardware work just fine under Linux. Please help. Thanks! -- =========================================================== Peter C. Lai | Bard College at Simon's Rock Systems Administrator | 84 Alford Rd. Information Technology Svcs. | Gt. Barrington, MA 01230 USA peter AT simons-rock.edu | (413) 528-7428 =========================================================== From owner-freebsd-scsi@FreeBSD.ORG Sat Nov 22 18:34:02 2008 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 701321065674 for ; Sat, 22 Nov 2008 18:34:02 +0000 (UTC) (envelope-from toasty@dragondata.com) Received: from tokyo01.jp.mail.your.org (tokyo01.jp.mail.your.org [204.9.54.5]) by mx1.freebsd.org (Postfix) with ESMTP id 0F3928FC12 for ; Sat, 22 Nov 2008 18:34:02 +0000 (UTC) (envelope-from toasty@dragondata.com) Received: from tokyo01.jp.mail.your.org (localhost.your.org [127.0.0.1]) by tokyo01.jp.mail.your.org (Postfix) with ESMTP id 193DD2AD58B5 for ; Sat, 22 Nov 2008 18:18:31 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=dragondata.com; h= message-id:from:to:content-type:content-transfer-encoding :mime-version:subject:date; s=selector1; bh=fhmbIGzZ2C0YTD3S+mv0 i1uvXuE=; b=K/G6qpWwInVOOd78D1HkjMCZ3IlaKh9Gbb8B+x4Cc0FRpj6lH5uN GweoxUMtSCIXnekDjNjihvywxvoVUpClEete2pXNUhpGpPRNhOfVZOk134WRHdqF shErTitq3uZUmqzfj6R0xBy08NCCYxpKGTmMJeyNJpaM9fLIeBH9TbM= DomainKey-Signature: a=rsa-sha1; c=nofws; d=dragondata.com; h=message-id:from :to:content-type:content-transfer-encoding:mime-version:subject: date; q=dns; s=selector1; b=I/7Dma2TfP171javyXk+ULERrlXvJJ0IUxkn YUdVoqkBWNGzvFk3lA9OBJSXcM/HAuLCwNCC0iEGEbL7T0FXablrIr8qAD0RpPep WWeM4E+cIuJx1WGjk2dn0Q2spbkwCQdRx+zD+UYbqzHoZKRCSeGbRBoSzCGJ3l6n P8VSQsI= Received: from mail.your.org (server3-a.your.org [64.202.112.67]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by tokyo01.jp.mail.your.org (Postfix) with ESMTPS id AAAB52AD5468 for ; Sat, 22 Nov 2008 18:18:30 +0000 (UTC) Received: from [216.14.99.244] (unknown [216.14.99.244]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) by mail.your.org (Postfix) with ESMTPSA id 0789CA0A406 for ; Sat, 22 Nov 2008 18:17:44 +0000 (UTC) Message-Id: From: Kevin Day To: freebsd-scsi@freebsd.org Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Mime-Version: 1.0 (Apple Message framework v929.2) Date: Sat, 22 Nov 2008 12:18:27 -0600 X-Mailer: Apple Mail (2.929.2) Subject: hpacucli on 7.0/amd64 not working X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 22 Nov 2008 18:34:02 -0000 Has anyone managed to get hpacucli working on amd64 in 7.0? I've got an HP DL185 G5 with HP's E200 RAID card in it. I had hpacucli working okay in 6.3/i386, and it works fine in a 64 bit Linux boot, but not in 7.0/amd64: # /usr/local/sbin/hpacucli .P Array Configuration Utility CLI .2. Detecting Controllers...In AddChild:0x838c180 In AddChild child doesnot exists:0x838c180 Iam inside findDevce The device enumerated now is 0x838c180 In Reenumerate Childern Eoption is 1 This device can discover children Thu Nov 13 15:29:40 2008 Operation Call: OperationSetAllowedControllerDiscovery Thu Nov 13 15:29:40 2008 Operation Call: OperationReadSystemInfo Thu Nov 13 15:29:40 2008 Operation Call: OperationCaptureConfigurationMutex Thu Nov 13 15:29:40 2008 Operation Call: OperationReleaseConfigurationMutex Thu Nov 13 15:29:40 2008 Operation Call: OperationDiscoverHostBusAdapters Thu Nov 13 15:29:40 2008 Operation Call: OperationDiscoverNonFibreHBA __TRACE_CODE*1310* pBmicRequest->wCommandStatus=0 /usr/home/user/ im453_new/im453/.s_/LINUX/src/lxioctlciss.cpp zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 freebsd, Looking for bus 5, device 8, function 0 ioctl: Inappropriate ioctl for device Thu Nov 13 15:29:40 2008 Operation Call: OperationDiscoverInternalArrayControllers zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 freebsd, Looking for bus 5, device 8, function 0 ioctl: Inappropriate ioctl for device Thu Nov 13 15:29:40 2008 Operation Call: OperationDiscoverChildren The device enumerated now is 0x838c180 In Reenumerate Childern Eoption is 2 In Enumerate Child nodes Thu Nov 13 15:29:40 2008 Operation Call: OperationSystemPostProcess Pchild in GenerateXML:0x8386d80 Pchild in GenerateXML:0x0 Thu Nov 13 15:29:40 2008 Operation Call: OperationCaptureConfigurationMutex Thu Nov 13 15:29:40 2008 Thu Nov 13 15:29:40 2008 OperationCaptureConfigurationMutex Thu Nov 13 15:29:40 2008 ModRoot137888128-System137937280 Thu Nov 13 15:29:40 2008 Done. Type "help" for a list of supported commands. Type "exit" to close the console. => controller all show The device enumerated now is 0x838c180 In Reenumerate Childern Eoption is 1 This device can discover children Thu Nov 13 15:30:06 2008 Operation Call: OperationSetAllowedControllerDiscovery Thu Nov 13 15:30:06 2008 Operation Call: OperationReadSystemInfo Thu Nov 13 15:30:06 2008 Operation Call: OperationCaptureConfigurationMutex Thu Nov 13 15:30:06 2008 Operation Call: OperationReleaseConfigurationMutex Thu Nov 13 15:30:06 2008 Operation Call: OperationDiscoverHostBusAdapters Thu Nov 13 15:30:06 2008 Operation Call: OperationDiscoverNonFibreHBA zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 freebsd, Looking for bus 5, device 8, function 0 ioctl: Inappropriate ioctl for device Thu Nov 13 15:30:06 2008 Operation Call: OperationDiscoverInternalArrayControllers zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 freebsd, Looking for bus 5, device 8, function 0 ioctl: Inappropriate ioctl for device Thu Nov 13 15:30:06 2008 Operation Call: OperationDiscoverChildren The device enumerated now is 0x838c180 In Reenumerate Childern Eoption is 2 In Enumerate Child nodes Thu Nov 13 15:30:06 2008 Operation Call: OperationSystemPostProcess Pchild in GenerateXML:0x8386d80 Pchild in GenerateXML:0x0 Error: No controllers detected. Anyone managed to get this to work? If not, any ideas as to what's going on? I'm guessing the "inappropriate ioctl for device" is significant here. I tried emailing the listed HP contact for hpacucli (v.sri.sai.ganesh at hp.com) but didn't get any reply. -- Kevin ciss0: port 0xe800-0xe8ff mem 0xdef80000-0xdeffffff,0xdef78000-0xdef7ffff irq 35 at device 8.0 on pci5 ciss0@pci0:5:8:0: class=0x010400 card=0x3212103c chip=0x3238103c rev=0x00 hdr=0x00 vendor = 'Hewlett-Packard Company' device = 'Smart Array E200/E200i Controller' class = mass storage subclass = RAID cap 01[c0] = powerspec 2 supports D0 D1 D3 current D0 cap 05[cc] = MSI supports 2 messages, 64 bit cap 07[dc] = PCI-X 64-bit supports 133MHz, 4096 burst read, 1 split transaction