From owner-freebsd-questions@freebsd.org Tue Feb 16 10:13:09 2021 Return-Path: Delivered-To: freebsd-questions@mailman.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.nyi.freebsd.org (Postfix) with ESMTP id C8385531400 for ; Tue, 16 Feb 2021 10:13:09 +0000 (UTC) (envelope-from web@3dresearch.com) Received: from smtpg.telissant.net (smtpg.telissant.net [104.225.1.73]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 4Dfxcj0qTHz3Mr2 for ; Tue, 16 Feb 2021 10:13:08 +0000 (UTC) (envelope-from web@3dresearch.com) Received: from sacada.3dresearch.com (localhost [127.0.0.1]) by smtpg.telissant.net (Postfix) with ESMTP id 4DfxcZ4D1fz2D1r8 for ; Tue, 16 Feb 2021 05:13:02 -0500 (EST) X-Virus-Scanned: amavisd-new at telissant.net Received: from smtpg.telissant.net ([127.0.0.1]) by sacada.3dresearch.com (sacada.3dresearch.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id iLJmcFKKFEpc for ; Tue, 16 Feb 2021 05:13:01 -0500 (EST) Received: from bufftemp.3dresearch.com (unknown [71.112.244.15]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) (Authenticated sender: bufftemp@sacada.3dresearch.com) by smtpg.telissant.net (Postfix) with ESMTPSA id 4DfxcY63nBz2D1r7 for ; Tue, 16 Feb 2021 05:13:01 -0500 (EST) Received: from bufftemp.3dresearch.com (localhost [127.0.0.1]) by bufftemp.3dresearch.com (Postfix) with SMTP id 5A0F35E979 for ; Tue, 16 Feb 2021 05:13:01 -0500 (EST) Date: Tue, 16 Feb 2021 04:58:38 -0500 From: Janos Dohanics To: FreeBSD Questions Subject: Re: zpool CKSUM 0 --> 1 while resilvering Message-Id: <20210216045838.2771e7bb04e8aab45092b50a@3dresearch.com> In-Reply-To: References: <20210215131139.c3ad9f9c9f907ee5b058fd37@3dresearch.com> X-Mailer: Sylpheed 3.7.0 (GTK+ 2.24.32; amd64-portbld-freebsd12.1) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Rspamd-Queue-Id: 4Dfxcj0qTHz3Mr2 X-Spamd-Bar: / Authentication-Results: mx1.freebsd.org; dkim=none; dmarc=none; spf=pass (mx1.freebsd.org: domain of web@3dresearch.com designates 104.225.1.73 as permitted sender) smtp.mailfrom=web@3dresearch.com X-Spamd-Result: default: False [-0.75 / 15.00]; RCVD_VIA_SMTP_AUTH(0.00)[]; ENVFROM_SERVICE_ACCT(1.00)[]; MV_CASE(0.50)[]; R_SPF_ALLOW(-0.20)[+mx]; TO_DN_ALL(0.00)[]; NEURAL_HAM_SHORT(-0.95)[-0.947]; RECEIVED_SPAMHAUS_PBL(0.00)[71.112.244.15:received]; RCVD_TLS_LAST(0.00)[]; R_DKIM_NA(0.00)[]; RBL_DBL_DONT_QUERY_IPS(0.00)[104.225.1.73:from]; MID_RHS_MATCH_FROM(0.00)[]; ASN(0.00)[asn:36236, ipnet:104.225.1.0/24, country:US]; ARC_NA(0.00)[]; NEURAL_HAM_MEDIUM(-1.00)[-1.000]; RCVD_COUNT_FIVE(0.00)[5]; FROM_EQ_ENVFROM(0.00)[]; MIME_TRACE(0.00)[0:+]; FROM_HAS_DN(0.00)[]; TO_MATCH_ENVRCPT_ALL(0.00)[]; NEURAL_HAM_LONG(-1.00)[-1.000]; MIME_GOOD(-0.10)[text/plain]; PREVIOUSLY_DELIVERED(0.00)[freebsd-questions@freebsd.org]; DMARC_NA(0.00)[3dresearch.com]; RCPT_COUNT_ONE(0.00)[1]; SPAMHAUS_ZRD(0.00)[104.225.1.73:from:127.0.2.255]; FROM_SERVICE_ACCT(1.00)[]; MAILMAN_DEST(0.00)[freebsd-questions] X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 16 Feb 2021 10:13:09 -0000 On Mon, 15 Feb 2021 12:56:31 -0800 David Christensen wrote: > On 2021-02-15 10:11, Janos Dohanics wrote: > > [...] > > > > # zpool status > > pool: zroot > > state: ONLINE > > status: One or more devices has experienced an unrecoverable > > error. An attempt was made to correct the error. Applications are > > unaffected. action: Determine if the device needs to be replaced, > > and clear the errors using 'zpool clear' or replace the device with > > 'zpool replace'. see: http://illumos.org/msg/ZFS-8000-9P > > scan: resilvered 1.97T in 0 days 08:25:27 with 0 errors on Mon > > Feb 15 05:09:47 2021 config: > > > > NAME STATE READ WRITE CKSUM > > zroot ONLINE 0 0 0 > > mirror-0 ONLINE 0 0 0 > > ada0p3 ONLINE 0 0 0 > > ada3p3 ONLINE 0 0 1 > > ada2p3 ONLINE 0 0 0 > > > > errors: No known data errors > > > > No errors reported by smartctl(8) for /dev/ada3. > > > > Can I consider this a harmless error and should I just run "zpool > > clear ada3p3'? > > > > Please advise. > > > STFW 'zpool status cksum': > > https://docs.oracle.com/cd/E19120-01/open.solaris/817-2271/gbcve/index.html > > https://docs.oracle.com/cd/E19120-01/open.solaris/817-2271/gbbzs/index.html > > I would: > > 1. Check for interface, cable, and/or rack errors. These should > generate error messages in dmesg(8) and /var/log/messages. I > especially hate red, non-locking SATA cables without any speed > marking. More than a few people agree that the red dye in the > insulation will corrode copper. I have replaced all of my SATA > cables with new black, locking, 6 Gbps SATA cables (made by Cable > Matters). Red SATA cables? I was certainly ignorant about this! > 2. If and when all of the above is okay, I would run SMART short and > long tests on all three drives. (I used to believe in manufacturer > diagnostic tools, but recent experiences with Seagate and Western > Digital have convinced me otherwise; notably when following up with > technical support. If anyone knows of a brand of HDD with good > drives, good diagnostic tools, and good technical support, please > advise.) > > 3. If and when all of the above is okay, I would scrub the pool. > > 4. If and when all of the above is okay and CKSUM remains (#3 might > clear it?), I would do the 'zpool clear ...'. Thank you for the comprehensive advice, I'm glad I asked... -- Janos Dohanics