From owner-freebsd-stable@FreeBSD.ORG Sun Jul 27 13:04:05 2003
Return-Path:
Delivered-To: freebsd-stable@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP
	id 0287B37B401; Sun, 27 Jul 2003 13:04:05 -0700 (PDT)
Received: from prioris.mini.pw.edu.pl (prioris.mini.pw.edu.pl [194.29.178.2])
	by mx1.FreeBSD.org (Postfix) with ESMTP
	id BF63843F85; Sun, 27 Jul 2003 13:04:03 -0700 (PDT)
	(envelope-from G.Czaplinski@prioris.mini.pw.edu.pl)
Received: from localhost (localhost.mini.pw.edu.pl [127.0.0.1])
	by prioris.mini.pw.edu.pl (Postfix) with ESMTP
	id 6542B243C7; Sun, 27 Jul 2003 22:04:02 +0200 (CEST)
Received: by prioris.mini.pw.edu.pl (Postfix, from userid 1368)
	id 9814B243C6; Sun, 27 Jul 2003 22:03:56 +0200 (CEST)
Date: Sun, 27 Jul 2003 22:03:56 +0200
From: Grzegorz Czaplinski
To: "Marc G. Fournier"
Message-ID: <20030727200355.GM82199@prioris.mini.pw.edu.pl>
Mail-Followup-To: "Marc G. Fournier" , freebsd-stable@freebsd.org,
	freebsd-scsi@freebsd.org
References: <20030726115857.M37284@hub.org>
Mime-Version: 1.0
Content-Type: multipart/signed; micalg=pgp-sha1;
	protocol="application/pgp-signature"; boundary="Uwl7UQhJk99r8jnw"
Content-Disposition: inline
In-Reply-To: <20030726115857.M37284@hub.org>
User-Agent: Mutt/1.4.1i
X-PGP: http://prioris.mini.pw.edu.pl/~gregory/pgp.txt
X-3w: http://prioris.mini.pw.edu.pl/~gregory/
X-voice: +48 692 412 424
X-FreeBSD: Running FreeBSD? - Share the server config! -
	http://prioris.mini.pw.edu.pl/~gregory/FreeBSD/
X-Virus-Scanned: by AMaViS (prioris)
cc: freebsd-scsi@freebsd.org
cc: freebsd-stable@freebsd.org
Subject: Re: Dump Card State Begins ...
X-BeenThere: freebsd-stable@freebsd.org
X-Mailman-Version: 2.1.1
Precedence: list
List-Id: Production branch of FreeBSD source code
X-List-Received-Date: Sun, 27 Jul 2003 20:04:05 -0000

--Uwl7UQhJk99r8jnw
Content-Type: text/plain; charset=iso-8859-2
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Sat, Jul 26, 2003 at 12:03:48PM -0300, Marc G. Fournier wrote:
>
> Hi ...
>
> Can someone tell me whether or not this is indicative of a hardware, or
> software, problem? It happened a few times today, on two different
> drives, and it seem to "self-recover", since the server is still purring
> along without any noticeable problems:
>
> neptune# grep "timed out" /var/log/messages
> Jul 25 03:52:51 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x40 - timed out
> Jul 25 03:57:22 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x18 - timed out
> Jul 25 03:58:53 neptune /kernel: (da1:ahd1:0:1:0): SCB 0x1e - timed out
> Jul 26 10:55:46 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x39 - timed out
>
> The drives are all U320 Seagate Cheetah 70G ... no RAID involved, its
> just straight drives using the motherboard's onboard SCSI controller ...
> the motherboard is the Intel SE7501, in the SR2300 chassis ...
>
> It did it back on the 19th as well:
>
> Jul 19 19:37:16 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x46 - timed out
> Jul 19 19:38:46 neptune /kernel: (da1:ahd1:0:1:0): SCB 0x2d - timed out
>

Timeouts can be caused by bad cabling or termination. Check both.

> But again, appears to have recovered with no ill effects ...
>
>
> Jul 25 03:52:51 neptune /kernel: (da2:ahd1:0:2:0): SCB 0x40 - timed out
> Jul 25 03:53:06 neptune /kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
> Jul 25 03:53:06 neptune /kernel: ahd1: Dumping Card State at program address 0x15 Mode 0x22
> Jul 25 03:53:06 neptune /kernel: Card was paused
> Jul 25 03:53:06 neptune /kernel: HS_MAILBOX[0x0] INTCTL[0xc0] SEQINTSTAT[0x0] SAVED_MODE[0x11]
> Jul 25 03:53:06 neptune /kernel: DFFSTAT[0x31] SCSISIGI[0x0] SCSIPHASE[0x0] SCSIBUS[0x0]
> Jul 25 03:53:06 neptune /kernel: LASTPHASE[0x1] SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x10]
> Jul 25 03:53:06 neptune /kernel: SEQINTCTL[0x0] SEQ_FLAGS[0xc0] SEQ_FLAGS2[0x0] SSTAT0[0x0]
> Jul 25 03:53:06 neptune /kernel: SSTAT1[0x8] SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x8]
> Jul 25 03:53:06 neptune /kernel: SIMODE1[0xa4] LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0]
> Jul 25 03:53:06 neptune /kernel: LQOSTAT0[0x0] LQOSTAT1[0x0] LQOSTAT2[0x1]
> Jul 25 03:53:06 neptune /kernel:
> Jul 25 03:53:06 neptune /kernel: SCB Count = 96 CMDS_PENDING = 29 LASTSCB 0x22 CURRSCB 0x22 NEXTSCB 0xff00
> Jul 25 03:53:06 neptune /kernel: qinstart = 65252 qinfifonext = 65252
> Jul 25 03:53:06 neptune /kernel: QINFIFO:
> Jul 25 03:53:06 neptune /kernel: WAITING_TID_QUEUES:
> Jul 25 03:53:06 neptune /kernel: Pending list:
> Jul 25 03:53:06 neptune /kernel: 21 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:06 neptune /kernel: 29 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:06 neptune /kernel: 63 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:06 neptune /kernel: 65 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:06 neptune /kernel: 24 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:06 neptune /kernel: 10 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 15 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 47 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 59 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 26 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 77 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 54 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 42 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 57 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 55 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 92 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 78 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 12 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 27 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 28 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 32 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 95 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 53 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 25 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 52 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 1 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 38 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 93 FIFO_USE[0x0] SCB_CONTROL[0x62] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: 64 FIFO_USE[0x0] SCB_CONTROL[0x60] SCB_SCSIID[0x27]
> Jul 25 03:53:07 neptune /kernel: Total 29
> Jul 25 03:53:07 neptune /kernel: Kernel Free SCB list: 7 56 34 19 4 37 20 46 40 61 11 39 31 45 58 23 73 30 5 62 8 41 18 16 13 66 51 14 44 49 36 50 70 35 9 76 74 2 48 43 3 33 79 71 75 60 67 69 91 94 6 72 0 68 17 22 90 89 88 87 86 85 84 83 82 81 80
> Jul 25 03:53:07 neptune /kernel: Sequencer Complete DMA-inprog list:
> Jul 25 03:53:07 neptune /kernel: Sequencer Complete list:
> Jul 25 03:53:07 neptune /kernel: Sequencer DMA-Up and Complete list:
> Jul 25 03:53:07 neptune /kernel:
> Jul 25 03:53:07 neptune /kernel: ahd1: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x22
> Jul 25 03:53:07 neptune /kernel: SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
> Jul 25 03:53:07 neptune /kernel: SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
> Jul 25 03:53:07 neptune /kernel: SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
> Jul 25 03:53:07 neptune /kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
> Jul 25 03:53:07 neptune /kernel: ahd1: FIFO1 Free, LONGJMP == 0x8277, SCB 0x7
> Jul 25 03:53:07 neptune /kernel: SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x4] DFSTATUS[0x89]
> Jul 25 03:53:07 neptune /kernel: SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0]
> Jul 25 03:53:07 neptune /kernel: SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
> Jul 25 03:53:07 neptune /kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]
> Jul 25 03:53:07 neptune /kernel: LQIN: 0x55 0x0 0x0 0x7 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
> Jul 25 03:53:07 neptune /kernel: ahd1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42
> Jul 25 03:53:07 neptune /kernel: ahd1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x1
> Jul 25 03:53:07 neptune /kernel: SIMODE0[0xc]
> Jul 25 03:53:07 neptune /kernel: CCSCBCTL[0x0]
> Jul 25 03:53:07 neptune /kernel: ahd1: REG0 == 0x22, SINDEX = 0x122, DINDEX = 0x102
> Jul 25 03:53:07 neptune /kernel: ahd1: SCBPTR == 0x7, SCB_NEXT == 0x49, SCB_NEXT2 == 0xfff1
> Jul 25 03:53:07 neptune /kernel: CDB 2a 0 7 80 a0 ca
> Jul 25 03:53:07 neptune /kernel: STACK: 0x125 0x125 0x125 0x257 0x257 0x257 0x29 0x15
> Jul 25 03:53:07 neptune /kernel: <<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
> Jul 25 03:53:07 neptune /kernel: Copied 18 bytes of sense data offset 12: 0x70 0x0 0x6 0x0 0x0 0x0 0x0 0xa 0x0 0x0 0x0 0x0 0x29 0x2 0x2 0x0 0x0 0x0
> Jul 25 03:53:07 neptune /kernel: Copied 18 bytes of sense data offset 12: 0x70 0x0 0x6 0x0 0x0 0x0 0x0 0xa 0x0 0x0 0x0 0x0 0x29 0x2 0x2 0x0 0x0 0x0
> Jul 25 03:53:07 neptune /kernel: Copied 18 bytes of sense data offset 12: 0x70 0x0 0x6 0x0 0x0 0x0 0x0 0xa 0x0 0x0 0x0 0x0 0x29 0x2 0x2 0x0 0x0 0x0
> _______________________________________________

It looks like your drive da2 is dying. I had the same sort of errors a
few weeks ago. You may not be able to unmount the drives cleanly now.
Try booting into single-user mode and working on that drive from there.
If you are lucky, you will still have a chance to get the data back.

Good luck,

gregory
-- 
Grzegorz Czaplinski
"The Power to Serve, Right for the Power Users!" - http://www.FreeBSD.org/
Fingerprint: EB77 E19D CFA2 5736 810F 847C A70F A275 2489 469F

--Uwl7UQhJk99r8jnw
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.2 (FreeBSD)

iD8DBQE/JDArpw+idSSJRp8RAnkBAKDLkNHC+aOwwgJFyxVXCs4rMiLqbQCeLU4f
XiJK8EQyKKvyEK+ZHqYCvAI=
=aiXw
-----END PGP SIGNATURE-----

--Uwl7UQhJk99r8jnw--
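
The three "Copied 18 bytes of sense data" lines at the end of the quoted dump can be decoded by hand. Below is a minimal Python sketch, assuming the logged bytes are SCSI fixed-format sense data (response code 0x70 or 0x71). The helper name decode_sense_line and the two small lookup tables are illustrative only; they are not part of the ahd driver or of any FreeBSD tool, and the tables list just a handful of values.

```python
import re

# Partial sense-key table (illustrative excerpt, not the full SCSI list).
SENSE_KEYS = {
    0x0: "NO SENSE",
    0x1: "RECOVERED ERROR",
    0x2: "NOT READY",
    0x3: "MEDIUM ERROR",
    0x4: "HARDWARE ERROR",
    0x5: "ILLEGAL REQUEST",
    0x6: "UNIT ATTENTION",
    0xB: "ABORTED COMMAND",
}

# Partial ASC/ASCQ table: only the 0x29 (reset) family is included here.
ASC_ASCQ = {
    (0x29, 0x00): "POWER ON, RESET, OR BUS DEVICE RESET OCCURRED",
    (0x29, 0x01): "POWER ON OCCURRED",
    (0x29, 0x02): "SCSI BUS RESET OCCURRED",
    (0x29, 0x03): "BUS DEVICE RESET FUNCTION OCCURRED",
}


def decode_sense_line(line):
    """Decode one 'Copied N bytes of sense data ...' kernel log line.

    Hypothetical helper: assumes fixed-format sense data, where byte 2
    holds the sense key and bytes 12 and 13 hold ASC and ASCQ.
    """
    _, marker, tail = line.partition("sense data")
    if not marker:
        raise ValueError("not a sense data line")
    # Collect every 0x.. token after the marker ("offset 12" has no 0x prefix).
    data = [int(tok, 16) for tok in re.findall(r"0x[0-9a-fA-F]+", tail)]
    if len(data) < 14:
        raise ValueError("too few sense bytes to read ASC/ASCQ")
    if (data[0] & 0x7F) not in (0x70, 0x71):
        raise ValueError("not fixed-format sense data")
    key = data[2] & 0x0F
    asc, ascq = data[12], data[13]
    return {
        "sense_key": SENSE_KEYS.get(key, "key 0x%x (not in table)" % key),
        "asc": asc,
        "ascq": ascq,
        "meaning": ASC_ASCQ.get((asc, ascq), "not in table"),
    }


LOG_LINE = (
    "Jul 25 03:53:07 neptune /kernel: Copied 18 bytes of sense data "
    "offset 12: 0x70 0x0 0x6 0x0 0x0 0x0 0x0 0xa 0x0 0x0 0x0 0x0 "
    "0x29 0x2 0x2 0x0 0x0 0x0"
)

if __name__ == "__main__":
    print(decode_sense_line(LOG_LINE))
```

For the line in the log this gives sense key UNIT ATTENTION with ASC/ASCQ 0x29/0x02, "SCSI bus reset occurred". That is what one would expect to see after a bus reset during timeout recovery, so these three lines on their own probably do not say whether the drive, the cabling, or the termination is at fault.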