From owner-freebsd-scsi@FreeBSD.ORG Mon May 2 11:07:07 2011 Return-Path: Delivered-To: freebsd-scsi@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1A0E5106567C for ; Mon, 2 May 2011 11:07:07 +0000 (UTC) (envelope-from owner-bugmaster@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id F232D8FC1E for ; Mon, 2 May 2011 11:07:06 +0000 (UTC) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.4/8.14.4) with ESMTP id p42B76qO064204 for ; Mon, 2 May 2011 11:07:06 GMT (envelope-from owner-bugmaster@FreeBSD.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.4/8.14.4/Submit) id p42B76UL064202 for freebsd-scsi@FreeBSD.org; Mon, 2 May 2011 11:07:06 GMT (envelope-from owner-bugmaster@FreeBSD.org) Date: Mon, 2 May 2011 11:07:06 GMT Message-Id: <201105021107.p42B76UL064202@freefall.freebsd.org> X-Authentication-Warning: freefall.freebsd.org: gnats set sender to owner-bugmaster@FreeBSD.org using -f From: FreeBSD bugmaster To: freebsd-scsi@FreeBSD.org Cc: Subject: Current problem reports assigned to freebsd-scsi@FreeBSD.org X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 02 May 2011 11:07:07 -0000 Note: to view an individual PR, use: http://www.freebsd.org/cgi/query-pr.cgi?pr=(number). The following is a listing of current problems submitted by FreeBSD users. These represent problem reports covering all versions including experimental development code and obsolete releases. S Tracker Resp. Description -------------------------------------------------------------------------------- o kern/154432 scsi [xpt] run_interrupt_driven_hooks: still waiting after o kern/153361 scsi [ciss] Smart Array 5300 boot/detect drive problem o kern/152250 scsi [ciss] [patch] Kernel panic when hw.ciss.expose_hidden o kern/151564 scsi [ciss] ciss(4) should increase CISS_MAX_LOGICAL to 10 o docs/151336 scsi Missing documentation of scsi_ and ata_ functions in c s kern/149927 scsi [cam] hard drive not stopped before removing power dur o kern/148083 scsi [aac] Strange device reporting o kern/147704 scsi [mpt] sys/dev/mpt: new chip revision, partially unsupp o kern/146287 scsi [ciss] ciss(4) cannot see more than one SmartArray con o kern/145768 scsi [mpt] can't perform I/O on SAS based SAN disk in freeb o kern/144648 scsi [aac] Strange values of speed and bus width in dmesg o kern/144301 scsi [ciss] [hang] HP proliant server locks when using ciss o kern/142351 scsi [mpt] LSILogic driver performance problems o kern/141934 scsi [cam] [patch] add support for SEAGATE DAT Scopion 130 o kern/134488 scsi [mpt] MPT SCSI driver probes max. 8 LUNs per device o kern/132250 scsi [ciss] ciss driver does not support more then 15 drive o kern/132206 scsi [mpt] system panics on boot when mirroring and 2nd dri o kern/130621 scsi [mpt] tranfer rate is inscrutable slow when use lsi213 o kern/129602 scsi [ahd] ahd(4) gets confused and wedges SCSI bus o kern/128452 scsi [sa] [panic] Accessing SCSI tape drive randomly crashe o kern/128245 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o kern/127927 scsi [isp] isp(4) target driver crashes kernel when set up o kern/127717 scsi [ata] [patch] [request] - support write cache toggling o kern/124667 scsi [amd] [panic] FreeBSD-7 kernel page faults at amd-scsi o kern/123674 scsi [ahc] ahc driver dumping o kern/123520 scsi [ahd] unable to boot from net while using ahd o sparc/121676 scsi [iscsi] iscontrol do not connect iscsi-target on sparc o kern/120487 scsi [sg] scsi_sg incompatible with scanners o kern/120247 scsi [mpt] FreeBSD 6.3 and LSI Logic 1030 = only 3.300MB/s o kern/114597 scsi [sym] System hangs at SCSI bus reset with dual HBAs o kern/110847 scsi [ahd] Tyan U320 onboard problem with more than 3 disks o kern/99954 scsi [ahc] reading from DVD failes on 6.x [regression] o kern/92798 scsi [ahc] SCSI problem with timeouts o kern/90282 scsi [sym] SCSI bus resets cause loss of ch device o kern/76178 scsi [ahd] Problem with ahd and large SCSI Raid system o kern/74627 scsi [ahc] [hang] Adaptec 2940U2W Can't boot 5.3 s kern/61165 scsi [panic] kernel page fault after calling cam_send_ccb o kern/60641 scsi [sym] Sporadic SCSI bus resets with 53C810 under load o kern/60598 scsi wire down of scsi devices conflicts with config s kern/57398 scsi [mly] Current fails to install on mly(4) based RAID di o bin/57088 scsi [cam] [patch] for a possible fd leak in libcam.c o kern/52638 scsi [panic] SCSI U320 on SMP server won't run faster than o kern/44587 scsi dev/dpt/dpt.h is missing defines required for DPT_HAND o kern/39388 scsi ncr/sym drivers fail with 53c810 and more than 256MB m o kern/35234 scsi World access to /dev/pass? (for scanner) requires acce 45 problems total. From owner-freebsd-scsi@FreeBSD.ORG Thu May 5 16:22:54 2011 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 6E448106566C for ; Thu, 5 May 2011 16:22:54 +0000 (UTC) (envelope-from mike@summersault.com) Received: from tanagra.summersault.com (tanagra.summersault.com [12.161.105.149]) by mx1.freebsd.org (Postfix) with ESMTP id 0B1068FC18 for ; Thu, 5 May 2011 16:22:53 +0000 (UTC) Received: (qmail 88666 invoked from network); 5 May 2011 15:56:10 -0000 Received: from kosh.summersault.com (HELO ?192.168.97.193?) (mike@192.168.97.193) by tanagra.summersault.com with ESMTPA; 5 May 2011 15:56:10 -0000 Message-ID: <4DC2C899.8070502@summersault.com> Date: Thu, 05 May 2011 11:56:09 -0400 From: Mike Neimoyer User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.17) Gecko/20110424 Thunderbird/3.1.10 MIME-Version: 1.0 To: freebsd-scsi@freebsd.org Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Subject: Making sense of error codes X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 05 May 2011 16:22:54 -0000 Good morning, all. First post to the group, and I'm a bit of a newbie in general, so please be gentle. :) I'll provide as much information as I know how to, and if there's more you need, please let me know and I'll do my best! Thanks. Recently, I've had a tape library's autoloader stop being recognized. It was working just fine, and then the backup software (amanda) began reporting errors that it couldn't communicate. No changes were made to the system and it had run a backup earlier that morning successfully. Multiple attempts to re-establish communications failed, checked all cables, verified that the system saw the device by "camcontrol devlist" (it did, but the unit was listed simply as "pass3" instead of the usual "ch0,pass3"). Was eventually told to shutdown and powercycle both units, but without success. Basics: FreeBSD 6.1-RELEASE Dell Powervault 124t via SCSI Checking the /var/log/messages file, after the I see: May 2 16:44:24 jazz kernel: sa0 at ahc1 bus 0 target 12 lun 0 May 2 16:44:24 jazz kernel: sa0: Removable Sequential Access SCSI-3 device May 2 16:44:24 jazz kernel: sa0: 160.000MB/s transfers (80.000MHz, offset 127, 16bit) May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): MODE SENSE(06). CDB: 1a 20 1d 0 20 0 May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): HARDWARE FAILURE asc:40,84 May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): (null) May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): fatal error, failed to attach to device May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): lost device May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): removing device entry So, there is a hardware failure somewhere, but I am not sure how to determine if it's the internal card or the external tape library unit itself (in a perfect world, I'd have another tape library unit I could just swap out, but alas.... :) ). I've tried looking up the ASC (40, 84), but can't seem to find a agreed-upon meaning for them. I could, quite probably in fact, simply not be looking in the right place. Can any kind soul point me in the right direction for figuring this issue out? Thanks! Best, ~~Mike From owner-freebsd-scsi@FreeBSD.ORG Thu May 5 20:23:07 2011 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0B8A8106567C for ; Thu, 5 May 2011 20:23:07 +0000 (UTC) (envelope-from mavbsd@gmail.com) Received: from mail-fx0-f54.google.com (mail-fx0-f54.google.com [209.85.161.54]) by mx1.freebsd.org (Postfix) with ESMTP id 614448FC19 for ; Thu, 5 May 2011 20:23:05 +0000 (UTC) Received: by fxm11 with SMTP id 11so2491280fxm.13 for ; Thu, 05 May 2011 13:23:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:sender:message-id:date:from:user-agent :mime-version:to:cc:subject:references:in-reply-to :x-enigmail-version:content-type:content-transfer-encoding; bh=edsBr8V3hC72DZtZ3wPVbCPG6ObLce1fzaB4xoX/KeA=; b=gYcrHjDsPSLdODnwV8Xati41aCH47PVoeJFhrJHP7JHzysijDNQJ+LQlqV4YHiSGGw bImtDKjqgRMI6Q+txiM6tWSWqVLBcfVOVBpo2p0pM5et/HVO1AP0eebSCmTQfzqL8F4M bQyzeDfWBjXioWK5wn4BDge3WzecAWN/mHcHA= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=sender:message-id:date:from:user-agent:mime-version:to:cc:subject :references:in-reply-to:x-enigmail-version:content-type :content-transfer-encoding; b=CeiGDpIpklaXWhcEp/eTK81KmySh8BVBGgkE0IblDghdflP/tJ6b6+SU6Smi0i8zqW X4/4CuoG/bRi6nVq2xqoBN7nf64trErGPr4LY6zi48aQmrnR/f0X5pPirSFnJZOfAQTS UaDatzqwt0i8rkkwD7sSvWw2wZKLC+6KBYTyU= Received: by 10.223.144.138 with SMTP id z10mr603331fau.65.1304626985120; Thu, 05 May 2011 13:23:05 -0700 (PDT) Received: from mavbook2.mavhome.dp.ua (pc.mavhome.dp.ua [212.86.226.226]) by mx.google.com with ESMTPS id 9sm863402fat.39.2011.05.05.13.23.03 (version=SSLv3 cipher=OTHER); Thu, 05 May 2011 13:23:04 -0700 (PDT) Sender: Alexander Motin Message-ID: <4DC30709.9050608@FreeBSD.org> Date: Thu, 05 May 2011 23:22:33 +0300 From: Alexander Motin User-Agent: Thunderbird 2.0.0.23 (X11/20091212) MIME-Version: 1.0 To: Mike Neimoyer References: In-Reply-To: X-Enigmail-Version: 0.96.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-scsi@freebsd.org Subject: Re: Making sense of error codes X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 05 May 2011 20:23:07 -0000 Mike Neimoyer wrote: > First post to the group, and I'm a bit of a newbie in general, so please > be gentle. :) I'll provide as much information as I know how to, and if > there's more you need, please let me know and I'll do my best! Thanks. > > Recently, I've had a tape library's autoloader stop being recognized. It > was working just fine, and then the backup software (amanda) began > reporting errors that it couldn't communicate. No changes were made to > the system and it had run a backup earlier that morning successfully. > Multiple attempts to re-establish communications failed, checked all > cables, verified that the system saw the device by "camcontrol devlist" > (it did, but the unit was listed simply as "pass3" instead of the usual > "ch0,pass3"). Was eventually told to shutdown and powercycle both > units, but without success. > > Basics: > FreeBSD 6.1-RELEASE > Dell Powervault 124t via SCSI > > Checking the /var/log/messages file, after the I see: > May 2 16:44:24 jazz kernel: sa0 at ahc1 bus 0 target 12 lun 0 > May 2 16:44:24 jazz kernel: sa0: Removable > Sequential Access SCSI-3 device > May 2 16:44:24 jazz kernel: sa0: 160.000MB/s transfers (80.000MHz, > offset 127, 16bit) > May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): MODE SENSE(06). CDB: 1a > 20 1d 0 20 0 > May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): HARDWARE FAILURE asc:40,84 > May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): (null) > May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): fatal error, failed to > attach to device > May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): lost device > May 2 16:44:24 jazz kernel: (ch0:ahc1:0:12:1): removing device entry > > So, there is a hardware failure somewhere, but I am not sure how to > determine if it's the internal card or the external tape library unit > itself (in a perfect world, I'd have another tape library unit I could > just swap out, but alas.... :) ). > > I've tried looking up the ASC (40, 84), but can't seem to find a > agreed-upon meaning for them. I could, quite probably in fact, simply > not be looking in the right place. > > Can any kind soul point me in the right direction for figuring this > issue out? Thanks! According to SPC-3 specification "asc:40,84" means: "DIAGNOSTIC FAILURE ON COMPONENT NN (80H-FFH)", where NN is 84H. -- Alexander Motin From owner-freebsd-scsi@FreeBSD.ORG Sat May 7 10:09:19 2011 Return-Path: Delivered-To: freebsd-scsi@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 49AD01065670 for ; Sat, 7 May 2011 10:09:19 +0000 (UTC) (envelope-from avg@FreeBSD.org) Received: from citadel.icyb.net.ua (citadel.icyb.net.ua [212.40.38.140]) by mx1.freebsd.org (Postfix) with ESMTP id 933118FC13 for ; Sat, 7 May 2011 10:09:18 +0000 (UTC) Received: from porto.topspin.kiev.ua (porto-e.starpoint.kiev.ua [212.40.38.100]) by citadel.icyb.net.ua (8.8.8p3/ICyb-2.3exp) with ESMTP id NAA06217 for ; Sat, 07 May 2011 13:09:17 +0300 (EEST) (envelope-from avg@FreeBSD.org) Received: from localhost.topspin.kiev.ua ([127.0.0.1]) by porto.topspin.kiev.ua with esmtp (Exim 4.34 (FreeBSD)) id 1QIeRo-0004CO-PZ for freebsd-scsi@freebsd.org; Sat, 07 May 2011 13:09:16 +0300 Message-ID: <4DC51A4C.5070603@FreeBSD.org> Date: Sat, 07 May 2011 13:09:16 +0300 From: Andriy Gapon User-Agent: Mozilla/5.0 (X11; U; FreeBSD amd64; en-US; rv:1.9.2.17) Gecko/20110503 Lightning/1.0b2 Thunderbird/3.1.10 MIME-Version: 1.0 To: freebsd-scsi@FreeBSD.org X-Enigmail-Version: 1.1.2 Content-Type: text/plain; charset=X-VIET-VPS Content-Transfer-Encoding: 7bit Cc: Subject: cam: pull logging of a number of conditions from under bootverbose X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 07 May 2011 10:09:19 -0000 I wonder why those error reports are hidden under bootverbose. They all seem to be sufficiently serious (and abnormal with non-faulty hardware) to be always reported. [test] cam: pull logging of a number of conditions from under bootverbose diff --git a/sys/cam/cam_periph.c b/sys/cam/cam_periph.c index dd51bca..4b4dce6 100644 --- a/sys/cam/cam_periph.c +++ b/sys/cam/cam_periph.c @@ -1717,40 +1717,40 @@ cam_periph_error(union ccb *ccb, cam_flags camflags, error = EIO; /* we have to kill the command */ break; case CAM_ATA_STATUS_ERROR: - if (bootverbose && printed == 0) { + if (printed == 0) { xpt_print(ccb->ccb_h.path, "ATA status error\n"); cam_error_print(ccb, CAM_ESF_ALL, CAM_EPF_ALL); printed++; } /* FALLTHROUGH */ case CAM_REQ_CMP_ERR: - if (bootverbose && printed == 0) { + if (printed == 0) { xpt_print(ccb->ccb_h.path, "Request completed with CAM_REQ_CMP_ERR\n"); printed++; } /* FALLTHROUGH */ case CAM_CMD_TIMEOUT: - if (bootverbose && printed == 0) { + if (printed == 0) { xpt_print(ccb->ccb_h.path, "Command timed out\n"); printed++; } /* FALLTHROUGH */ case CAM_UNEXP_BUSFREE: - if (bootverbose && printed == 0) { + if (printed == 0) { xpt_print(ccb->ccb_h.path, "Unexpected Bus Free\n"); printed++; } /* FALLTHROUGH */ case CAM_UNCOR_PARITY: - if (bootverbose && printed == 0) { + if (printed == 0) { xpt_print(ccb->ccb_h.path, "Uncorrected parity error\n"); printed++; } /* FALLTHROUGH */ case CAM_DATA_RUN_ERR: - if (bootverbose && printed == 0) { + if (printed == 0) { xpt_print(ccb->ccb_h.path, "Data overrun\n"); printed++; } @@ -1779,7 +1779,7 @@ cam_periph_error(union ccb *ccb, cam_flags camflags, ccb->ccb_h.retry_count--; error = ERESTART; - if (bootverbose && printed == 0) { + if (printed == 0) { xpt_print(ccb->ccb_h.path, "Selection timeout\n"); printed++; @@ -1830,7 +1830,7 @@ cam_periph_error(union ccb *ccb, cam_flags camflags, * these events and should be unconditionally * retried. */ - if (bootverbose && printed == 0) { + if (printed == 0) { xpt_print_path(ccb->ccb_h.path); if (status == CAM_BDR_SENT) printf("Bus Device Reset sent\n"); @@ -1863,7 +1863,7 @@ cam_periph_error(union ccb *ccb, cam_flags camflags, if (ccb->ccb_h.retry_count > 0) { ccb->ccb_h.retry_count--; error = ERESTART; - if (bootverbose && printed == 0) { + if (printed == 0) { xpt_print(ccb->ccb_h.path, "CAM status 0x%x\n", status); printed++; -- Andriy Gapon