From owner-freebsd-questions@freebsd.org Tue May 5 10:33:49 2020 Return-Path: Delivered-To: freebsd-questions@mailman.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.nyi.freebsd.org (Postfix) with ESMTP id F21C22E1378 for ; Tue, 5 May 2020 10:33:49 +0000 (UTC) (envelope-from freebsd@qeng-ho.org) Received: from bede.qeng-ho.org (bede.qeng-ho.org [217.155.128.241]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) server-signature RSA-PSS (4096 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 49Gbg05pQLz3G9f for ; Tue, 5 May 2020 10:33:48 +0000 (UTC) (envelope-from freebsd@qeng-ho.org) Received: from arthur.home.qeng-ho.org (arthur.home.qeng-ho.org [172.23.1.2]) by bede.qeng-ho.org (Postfix) with ESMTP id 9AD4710645; Tue, 5 May 2020 11:33:41 +0100 (BST) Subject: Re: g_vfs_done() ada2p1 error = 5 To: Christoph Kukulies , Polytropon Cc: freebsd-questions@freebsd.org References: <2ED6F7F2-0F70-49F0-AF9F-E6E4CE11B2C3@kukulies.org> <20200505105033.ff69a110.freebsd@edvax.de> From: Arthur Chance Message-ID: <24144a81-1f6e-206b-73ab-85846ce6db50@qeng-ho.org> Date: Tue, 5 May 2020 11:33:41 +0100 User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:68.0) Gecko/20100101 Thunderbird/68.7.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit X-Rspamd-Queue-Id: 49Gbg05pQLz3G9f X-Spamd-Bar: ---- Authentication-Results: mx1.freebsd.org; dkim=none; dmarc=none; spf=pass (mx1.freebsd.org: domain of freebsd@qeng-ho.org designates 217.155.128.241 as permitted sender) smtp.mailfrom=freebsd@qeng-ho.org X-Spamd-Result: default: False [-4.72 / 15.00]; ARC_NA(0.00)[]; NEURAL_HAM_MEDIUM(-1.00)[-1.000,0]; RCPT_COUNT_THREE(0.00)[3]; TO_DN_SOME(0.00)[]; R_SPF_ALLOW(-0.20)[+ip4:217.155.128.240/29]; FROM_HAS_DN(0.00)[]; MIME_GOOD(-0.10)[text/plain]; RCVD_TLS_LAST(0.00)[]; DMARC_NA(0.00)[qeng-ho.org]; NEURAL_HAM_LONG(-1.00)[-1.000,0]; TO_MATCH_ENVRCPT_SOME(0.00)[]; IP_SCORE(-2.42)[ip: (-7.87), ipnet: 217.155.0.0/16(-3.94), asn: 13037(-0.24), country: GB(-0.07)]; FROM_EQ_ENVFROM(0.00)[]; R_DKIM_NA(0.00)[]; MIME_TRACE(0.00)[0:+]; ASN(0.00)[asn:13037, ipnet:217.155.0.0/16, country:GB]; MID_RHS_MATCH_FROM(0.00)[]; RCVD_COUNT_TWO(0.00)[2] X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.30 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 05 May 2020 10:33:50 -0000 On 05/05/2020 10:17, Christoph Kukulies wrote: > > >> Am 05.05.2020 um 10:50 schrieb Polytropon : >> >> On Tue, 5 May 2020 10:42:56 +0200, Christoph Kukulies wrote: >>> I have a GPT partitioned SSD here which was serving as a boot >>> volume of my previously installed >>> FreeBSD 8.0 and now, as I have 12.1 installed and booting from >>> that I have mounted this SD into >>> my running system. >>> >>> In my system log I’m seeing the following error message (and when >>> I try to mount that partition, I’m getting an >>> INPUT/OUTPUT error also): >>> >>> g_vfs_done():ada2p1[READ(offest=262144, length=8192)]error = 5 >>> >>> ada2 at ahcich2 bus 0 scbus2 target 0 lun 0 >>> [...] >>> >>> What does this error message mean? How do I get rid of it? >> >> Could it be that the SSD has reached the end of its lifetime? > > > Hope not. Was trusting that FreeBSD takes care of saving SSDs from degrading over time. It's good, but FreeBSD can't prevent wear and tear. SSDs have a finite lifetime and all an OS can do is not decrease it. >> I've seen similar messages on regular hard disks which were >> about to die... Can you check the SSD with smartctl and see >> if there is something suspicious? >> > > > The output of smartctl is overwhelming :) Happen to know what I should look for? The attributes section is often the place to start. You can see just that by giving the -A flag to smartctl. > smartctl 7.1 2019-12-30 r5022 [FreeBSD 12.1-RELEASE amd64] (local build) > Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org > > === START OF INFORMATION SECTION === > Model Family: Samsung based SSDs > Device Model: Samsung SSD 840 EVO 120GB > Serial Number: S1D5NSAD978704E > LU WWN Device Id: 5 002538 8a002c09c > Firmware Version: EXT0AB0Q > User Capacity: 120,034,123,776 bytes [120 GB] > Sector Size: 512 bytes logical/physical > Rotation Rate: Solid State Device > Device is: In smartctl database [for details use: -P show] > ATA Version is: ACS-2, ATA8-ACS T13/1699-D revision 4c > SATA Version is: SATA 3.1, 6.0 Gb/s (current: 3.0 Gb/s) > Local Time is: Tue May 5 11:11:45 2020 CEST > SMART support is: Available - device has SMART capability. > SMART support is: Enabled > > === START OF READ SMART DATA SECTION === > SMART overall-health self-assessment test result: PASSED > > General SMART Values: > Offline data collection status: (0x00) Offline data collection activity > was never started. > Auto Offline Data Collection: Disabled. > Self-test execution status: ( 0) The previous self-test routine completed > without error or no self-test has ever > been run. > Total time to complete Offline > data collection: ( 4200) seconds. > Offline data collection > capabilities: (0x53) SMART execute Offline immediate. > Auto Offline data collection on/off support. > Suspend Offline collection upon new > command. > No Offline surface scan supported. > Self-test supported. > No Conveyance Self-test supported. > Selective Self-test supported. > SMART capabilities: (0x0003) Saves SMART data before entering > power-saving mode. > Supports SMART auto save timer. > Error logging capability: (0x01) Error logging supported. > General Purpose Logging supported. > Short self-test routine > recommended polling time: ( 2) minutes. > Extended self-test routine > recommended polling time: ( 70) minutes. > SCT capabilities: (0x003d) SCT Status supported. > SCT Error Recovery Control supported. > SCT Feature Control supported. > SCT Data Table supported. > > SMART Attributes Data Structure revision number: 1 > Vendor Specific SMART Attributes with Thresholds: > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE > 5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 0 > 9 Power_On_Hours 0x0032 091 091 000 Old_age Always - 42753 > 12 Power_Cycle_Count 0x0032 099 099 000 Old_age Always - 69 > 177 Wear_Leveling_Count 0x0013 099 099 000 Pre-fail Always - 2 > 179 Used_Rsvd_Blk_Cnt_Tot 0x0013 100 100 010 Pre-fail Always - 0 > 181 Program_Fail_Cnt_Total 0x0032 100 100 010 Old_age Always - 0 > 182 Erase_Fail_Count_Total 0x0032 100 100 010 Old_age Always - 0 > 183 Runtime_Bad_Block 0x0013 100 100 010 Pre-fail Always - 0 > 187 Uncorrectable_Error_Cnt 0x0032 100 100 000 Old_age Always - 0 > 190 Airflow_Temperature_Cel 0x0032 073 057 000 Old_age Always - 27 > 195 ECC_Error_Rate 0x001a 200 200 000 Old_age Always - 0 > 199 CRC_Error_Count 0x003e 100 100 000 Old_age Always - 0 > 235 POR_Recovery_Count 0x0012 099 099 000 Old_age Always - 51 > 241 Total_LBAs_Written 0x0032 099 099 000 Old_age Always - 1078954976 Attributes 177, 179, 181, 182, 183 & 241 are interesting for SSDs. You've not used any of your spare blocks, and you've had ~1 billion writes. For comparison I've got a 840 PRO in one of my machines and that's showing a wear leveling count of 17 and ~4 billion writes. >From Samsung's documentation (found via an online forum) ID # 177 Wear Leveling Count This attribute represents the number of media program and erase operations (the number of times a block has been erased). This value is directly related to the lifetime of the SSD. The raw value of this attribute shows the total count of P/E Cycles. Each MLC NAND cell can be erased ~10,000 times before it stops reliably holding charge. Given your wear leveling count is 2 you should be a long way off end of life. > SMART Error Log Version: 1 > No Errors Logged > > SMART Self-test log structure revision number 1 > No self-tests have been logged. [To run self-tests, use: smartctl -t] > > SMART Selective self-test log data structure revision number 1 > SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS > 1 0 0 Not_testing > 2 0 0 Not_testing > 3 0 0 Not_testing > 4 0 0 Not_testing > 5 0 0 Not_testing > Selective self-test flags (0x0): > After scanning selected spans, do NOT read-scan remainder of disk. > If Selective self-test is pending on power-up, resume after 0 minute delay. > -- Fat Earther: One who believes the world is round but has put on too much weight round the middle.