Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 24 Sep 1999 18:24:15 -0700 (PDT)
From:      jdpf@tislabs.com
To:        freebsd-gnats-submit@freebsd.org
Subject:   kern/13941: ncr0: SCSI phase error on GENERIC kernel using Tekram DC-390F
Message-ID:  <19990925012415.CFBAB14BDC@hub.freebsd.org>

next in thread | raw e-mail | index | archive | help

>Number:         13941
>Category:       kern
>Synopsis:       ncr0: SCSI phase error on GENERIC kernel using Tekram DC-390F
>Confidential:   no
>Severity:       serious
>Priority:       high
>Responsible:    freebsd-bugs
>State:          open
>Quarter:        
>Keywords:       
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Fri Sep 24 18:30:00 PDT 1999
>Closed-Date:
>Last-Modified:
>Originator:     jonathan ferguson
>Release:        3.3-RELEASE
>Organization:
NAI Labs
>Environment:
FreeBSD trinity.somewhere.com 3.3-RELEASE FreeBSD 3.3-RELEASE #0: Thu Sep 16 23:40:35 GMT 1999     jkh@highwing.cdrom.com:/usr/src/sys/compile/GENERIC  i386

HARDWARE:
motherboard: ASUS P3B-F
processor: Intel boxed PII-400MHz
SCSI Controller: Tekram DC-390F
disk: IBM UltraStar 18ES 9.1G UltraHD
ram: corsair cm654s128-bx2 128MB SDRAM DIMM PC100 CAS2
network: Intel EtherExpress Pro/100+ PCI
video: Jaton67 PCI (based on trident 9685)

>Description:
upon reboot [hard reboot, "reboot" or ctrl-alt-del], the machine hangs about 35% of the time with the following error output:

da0 at ncr0 bus 0 target 6 lun 0
da0: <IBM DNES-309170 SA30> Fixed Direct Access SCSI-3 device 
da0: 20.000MB/s transfers (20.000MHz, offset 16), Tagged Queueing Enabled
da0: 8748MB (17916240 512 byte sectors: 255H 63S/T 1115C)

<completely fscks filesystems (clean or dirty makes no difference)>

ncr0: SCSI phase error fixup: CCB already dequeued (0xc0a29400)
timeout nccb=(0xc0a3b400) (skip)
timeout nccb=(0xc0a29000) (skip)
timeout nccb=(0xc0a32400) (skip)
timeout nccb=(0xc0a29400) (skip)
timeout nccb=(0xc0a3b400) (skip)

...repeats above sequence of addresses ad nauseum...

machine will only hard-reset at this point.
very strange error here. i swapped cables with 
some others, and it still behaved in the same way, so 
i suspect the Tekram driver...


>How-To-Repeat:
reboot or power-cycle the machine.
again, it is only repeatable about 35% of the time. 
this happens on two identically configured machines...btw

i ran a trial of reboots so i could id any failure frequencies or patterns:

upon fails i would hard-reset the machine.
upon successes, i would ctrl-alt-del (reboot)

machine:   trinity   neo

1.         scsi err  ok
2.         ok        scsi err
3.         scsi err  ok
4.         ok        scsi err
5.         scsi err  ok
6.         ok        scsi err
7.         ok        ok
8.         ok        ok
9.         ok        scsi err
10.        ok        ok
11.        ok        ok
12.        ok        scsi err
13.        scsi err  ok
14.        ok        scsi err

so, 10 fails and 18 successes out of 28 total, it fails about 35% of the time, but the distribution isn't even.

>Fix:


>Release-Note:
>Audit-Trail:
>Unformatted:


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-bugs" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?19990925012415.CFBAB14BDC>