Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 19 Aug 2014 17:19:11 +0100
From:      Kaya Saman <kayasaman@gmail.com>
To:        freebsd-questions <freebsd-questions@freebsd.org>
Subject:   CAM Status Error on SuperMicro system board and LSI HBA
Message-ID:  <53F378FF.3080302@gmail.com>

next in thread | raw e-mail | index | archive | help
Hi,

I'm seeing these errors in my logs:


  WARNING:  Kernel Errors Present
     (ada1:ahcich1:0:0:0): ATA status: 51 (DRDY SERV ERR), error: 04 (ABRT ) ...:  1 Time(s)
     (ada1:ahcich1:0:0:0): CAM status: ATA Status Error ...:  1 Time(s)
  
  1 Time(s): (ada1:ahcich1:0:0:0): DSM TRIM. ACB: 06 01 00 00 00 40 00 00 00 00 01 00
  1 Time(s): (ada1:ahcich1:0:0:0): RES: 51 04 01 00 00 e0 00 00 00 00 00
  1 Time(s): (ada1:ahcich1:0:0:0): Retrying command


  WARNING:  Kernel Errors Present
     (ada1:ahcich1:0:0:0): ATA status: 51 (DRDY SERV ERR), error: 04 (ABRT ) ...:  3 Time(s)
     (ada1:ahcich1:0:0:0): CAM status: ATA Status Error ...:  3 Time(s)
  
  3 Time(s): (ada1:ahcich1:0:0:0): DSM TRIM. ACB: 06 01 00 00 00 40 00 00 00 00 01 00
  3 Time(s): (ada1:ahcich1:0:0:0): RES: 51 04 01 00 00 e0 00 00 00 00 00
  3 Time(s): (ada1:ahcich1:0:0:0): Retrying command


  WARNING:  Kernel Errors Present
     (da0:mps0:0:8:0): CAM status: SCSI Status Error ...:  4 Time(s)
     (da0:mps0:0:8:0): Error 5, Unretryable e ...:  4 Time(s)
     (da0:mps0:0:8:0): SCSI sense: MEDIUM ERROR asc:11,0 (Unrecovered read error) ...:  4 Time(s)
     (da1:mps0:0:10:0): CAM status: SCSI Status Error ...:  8 Time(s)
     (da1:mps0:0:10:0): Error 5, Unretryable e ...:  8 Time(s)
     (da1:mps0:0:10:0): SCSI sense: MEDIUM ERROR asc:11,0 (Unrecovered read error) ...:  8 Time(s)
  
  4 Time(s): (da0:mps0:0:8:0): Info: 0x210
  1 Time(s): (da0:mps0:0:8:0): READ(10). CDB: 28 00 e8 e0 84 10 00 00 10 00 length 8192 SMID 929 terminated ioc 804b scsi 0 state 0 xfer 0
  1 Time(s): (da0:mps0:0:8:0): READ(10). CDB: 28 00 e8 e0 84 10 00 00 10 00 length 8192 SMID 930 terminated ioc 804b scsi 0 state 0 xfer 0
  4 Time(s): (da0:mps0:0:8:0): READ(6). CDB: 08 00 02 10 10 00
  4 Time(s): (da0:mps0:0:8:0): SCSI status: Check Condition
  4 Time(s): (da1:mps0:0:10:0): Info: 0x210
  4 Time(s): (da1:mps0:0:10:0): Info: 0xe8e08618
  1 Time(s): (da1:mps0:0:10:0): READ(10). CDB: 28 00 e8 e0 84 10 00 00 10 00 length 8192 SMID 781 terminated ioc 804b scsi 0 state 0 xfer 0
  1 Time(s): (da1:mps0:0:10:0): READ(10). CDB: 28 00 e8 e0 84 10 00 00 10 00 length 8192 SMID 782 terminated ioc 804b scsi 0 state 0 xfer 0
  1 Time(s): (da1:mps0:0:10:0): READ(10). CDB: 28 00 e8 e0 84 10 00 00 10 00 length 8192 SMID 784 terminated ioc 804b scsi 0 state 0 xfer 0
  1 Time(s): (da1:mps0:0:10:0): READ(10). CDB: 28 00 e8 e0 84 10 00 00 10 00 length 8192 SMID 785 terminated ioc 804b scsi 0 state 0 xfer 0
  4 Time(s): (da1:mps0:0:10:0): READ(10). CDB: 28 00 e8 e0 86 10 00 00 10 00
  1 Time(s): (da1:mps0:0:10:0): READ(10). CDB: 28 00 e8 e0 86 10 00 00 10 00 length 8192 SMID 952 terminated ioc 804b scsi 0 state 0 xfer 0
  1 Time(s): (da1:mps0:0:10:0): READ(10). CDB: 28 00 e8 e0 86 10 00 00 10 00 length 8192 SMID 953 terminated ioc 804b scsi 0 state 0 xfer 0
  4 Time(s): (da1:mps0:0:10:0): READ(6). CDB: 08 00 02 10 10 00
  1 Time(s): (da1:mps0:0:10:0): READ(6). CDB: 08 00 02 10 10 00 length 8192 SMID 780 terminated ioc 804b scsi 0 state 0 xfer 0
  1 Time(s): (da1:mps0:0:10:0): READ(6). CDB: 08 00 02 10 10 00 length 8192 SMID 781 terminated ioc 804b scsi 0 state 0 xfer 0
  8 Time(s): (da1:mps0:0:10:0): SCSI status: Check Condition


In fact after a reboot the system is very slow to actually get to the 
"boot" part and the bios 'select option' screen keeps coming up about 5 
times before the system will actually get to the 'boot' phase; as in 
start from the hard drive.


The actual system board is a SuperMicro server board with the latest 
Xeon E5 cpu, and I'm using an LSI 9207-4i4e HBA.

The LSI has firmware revision 15 on it, and after a bit of research many 
people claimed to have a similar issue; though a firmware update seems 
to have solved most peoples problems. The current firmware for the 
device is revision 19 however, I don't seem to be able to update it??

What is odd though is that the system board is also showing the error as 
the root disks ada0 and ada1 - both with ZFS file system on mirror array 
- are connected directly to the SATA ports on the board.

I'm running FreeBSD 10.0 RELEASE x64.


I attempted to boot from a FreeDOS USB stick created by unetbootin, as 
the stick contained an Arch Linux installer I dd'ed the whole thing to 
Zero before installing FreeDOS onto it.

However, the strange thing is that it doesn't seem to boot from the USB 
stick? It's a 2GB Sandisk Cruzer, which works fine as I installed FBSD 
from it on the same system. I think since there is no boot loader on the 
stick even though the bootable flag is 'on' that it doesn't work.


Has anyone got any experience with this type of issue that can offer 
some advice in either what the issue maybe; the system has been up for a 
few months without any issues and this just started recently. Or how to 
boot from FreeDOS as the .bin file for the firmware of the LSI is 
already copied onto it??


Oh and just before I forget, "zpool status" doesn't show any errors, 
though the main storage pool wasn't able to mount at the first recent 
reboot so the system went into single user "emergency" mode. I had to 
export the pool then reboot, then import it; a scrub was performed which 
took about 29 hours - 10TB on the disks, but reported no issues, also 
disk io speeds seemed good at over 800 ops/sec (considering the drives 
are 2GB 2.5" WD Green). - the root drives are both SSD's.


Many thanks.


Kaya



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?53F378FF.3080302>