Date: Sat, 23 Nov 2013 00:08:30 -0500
From: Jim Ohlstein <jim@ohlste.in>
To: stable@freebsd.org
Subject: SSD becomes detached 9.2
Message-ID: <5290384E.30603@ohlste.in>

Hello,

I am setting up a new (remote) server. I initially installed 9.2 RC4 amd64 because that's what the data center put in the drive for me. Shortly thereafter I downloaded 9.2-STABLE sources and compiled world and a generic kernel. While doing so the system became unreachable by SSH. The SSH sessions appeared to connect but there was never any data returned. I could telnet to port 22 but I could not log in from a terminal. I could ping the server as well. I had the server rebooted. I did install an updated kernel and world (9.2-STABLE amd64 r258426) and it happened again just now.

The OS is installed on a 120 GB SSD with root on ZFS. There is also another SSD for L2ARC and there are two 3TB SATA drives in a separate ZFS mirror pool. All drives passed cursory testing with smartmontools. CPU is an AMD-8120 (8 core Zambezi). Very little is running on the server as it is not yet in production (thankfully).

Here is the relevant part of dmesg:

ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000008 cs 00000000 ss 00000000 rs f8000000 tfd 40 serr 00000000 cmd 00047f17
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 20 ff a6 40 01 00 00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Command timeout
(ada0:ahcich0:0:0:0): Retrying command
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ada0 at ahcich0 bus 0 scbus0 target 0 lun 0
ada0: <Samsung SSD 840 EVO 120GB EXT0AB0Q> s/n S1D5NSAD915803Y detached
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000001 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 00000000 cmd 00047f17
(ada0:ahcich0:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 00 00 00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Command timeout
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 3 port 0
ahcich0: is 00000008 cs 00000000 ss 00000000 rs 8000000f tfd 40 serr 00000000 cmd 00046317
(ada0:ahcich0:0:0:0): DSM TRIM. ACB: 06 01 00 00 00 40 00 00 00 00 01 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 20 ff a6 40 01 00 00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Command timeout
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 38 ff a6 40 01 00 00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 88 ff a6 40 01 00 00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 10 ff a6 40 01 00 00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 30 ff a6 40 01 00 00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): Periph destroyed
ahcich0: Timeout on slot 3 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 00000008 tfd 50 serr 00000000 cmd 00046317
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 3 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 00000008 tfd 50 serr 00000000 cmd 00046317
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked

After this event, gpart show lists only ada1, ada2, and ada3. The boot drive is ada0.

The entire dmesg can be seen at http://pastebin.com/RqR8LiSb.

-- 
Jim Ohlstein