From owner-netperf-users@freebsd.org Fri Nov 20 17:47:43 2020 Return-Path: Delivered-To: netperf-users@mailman.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.nyi.freebsd.org (Postfix) with ESMTP id 00CC4468C63 for ; Fri, 20 Nov 2020 17:47:43 +0000 (UTC) (envelope-from mjguzik@gmail.com) Received: from mail-wr1-x429.google.com (mail-wr1-x429.google.com [IPv6:2a00:1450:4864:20::429]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256 client-signature RSA-PSS (2048 bits) client-digest SHA256) (Client CN "smtp.gmail.com", Issuer "GTS CA 1O1" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 4Cd3sp6M2cz3wDl; Fri, 20 Nov 2020 17:47:42 +0000 (UTC) (envelope-from mjguzik@gmail.com) Received: by mail-wr1-x429.google.com with SMTP id m6so10858087wrg.7; Fri, 20 Nov 2020 09:47:42 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=TktrETOQa/K85qoakvIiHOyyI3ZbIL7h4gJ0qZw7FBc=; b=S0MX/InQ4ZHuO1YePc2Tq1q4iImzjf6L/cCZM5xyw0G0N6kJT8789smzNEPfYn3TWe MJ/BP3fFD7feGvhPzpexwGIiVg9EO3ei2FBf1bCqJd3hijJGR14Le7jP1HrUtX42awUy f4tgdmo+7LDh+hyvBROl38Oz/NXoKfSG3+Y7OLlSJN4YM2Y575FRbWOqWisQRzPouxdp B5JG/nzXNS4VES1BGIDKsFASvQelkh+hotFzLKgpFc2ThImMm3Q9kbWmg9TABSPHJSve wSBKfQJzJAAyOCxR1RtarbAQjBvsOk5yDZ/biYVHfxAidBig3MSGE5Ao57tBaUDqhSN3 eQWg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=TktrETOQa/K85qoakvIiHOyyI3ZbIL7h4gJ0qZw7FBc=; b=jeHIqv79TG2ryh3z2O0kBhxhSzPuNF3oRizhl5aXnCojDsapyL1wxiOw+LVvG9/2/D zP9N0lP8w3OTBo3BYYOS+ncRKFpXq766oly4BajKXOskz17RaYAnYnuZLGc7V24co1XM gko3x9m2DRRUaLfopbB0TrHtMJJYRDt5ZfuBOI1ejXcJnsFH/QSI36IOQQaSbjHWRVS2 3mKBoPPUc3MdNjI4UajgtdfkrhFsWcDMLsnZDF2AjAA00YE7QdRBExfl9oNbYmj31Htw jSGcJtDian09XiS7lP49p1e6v072F2sQa6EK59tu0Gbz/MTiLuyZzfxZZ5VGFr0m+kZz 4j+Q== X-Gm-Message-State: AOAM531Lps7etdIZbxV2cDlqkTcrE0UBAPvBM+92Wft5NAHPKEe+8XtH UCIuYXezsehwNH4pqVz0SV5q1krUhEV6Pd/H3d2A9dHhbdg= X-Google-Smtp-Source: ABdhPJwnvkqciqfIKBHMD1vNVqg0aCmmcx0Py+pxc1J+2vS+c6DKEGk4FG+vRpYt18w0VTu6WYS2ocbudAN0I4+m1JU= X-Received: by 2002:adf:9043:: with SMTP id h61mr17811954wrh.237.1605894461073; Fri, 20 Nov 2020 09:47:41 -0800 (PST) MIME-Version: 1.0 Received: by 2002:adf:dec7:0:0:0:0:0 with HTTP; Fri, 20 Nov 2020 09:47:39 -0800 (PST) In-Reply-To: References: <1f8e49ff-e3da-8d24-57f1-11f17389aa84@sentex.net> <2691e1fd-5a27-4dd0-2ef7-b1c06fd4e751@sentex.net> <5A5094BC-D417-4BA6-97E2-7CB522B51368@FreeBSD.org> <4ec6ed6f-b3b4-22ae-e1ec-93a46f3d88ea@sentex.net> <0ddec867-32b5-f667-d617-0ddc71726d09@sentex.net> <5549CA9F-BCF4-4043-BA2F-A2C41D13D955@freebsd.org> From: Mateusz Guzik Date: Fri, 20 Nov 2020 18:47:39 +0100 Message-ID: Subject: Re: zoo reboot Friday Nov 20 14:00 UTC To: mike tancsa Cc: Philip Paeps , "Bjoern A. Zeeb" , netperf-admin@freebsd.org, netperf-users@freebsd.org, Allan Jude Content-Type: text/plain; charset="UTF-8" X-Rspamd-Queue-Id: 4Cd3sp6M2cz3wDl X-Spamd-Bar: ---- Authentication-Results: mx1.freebsd.org; none X-Spamd-Result: default: False [-4.00 / 15.00]; REPLY(-4.00)[] X-BeenThere: netperf-users@freebsd.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: "Announcements and discussions related to the netperf cluster. " List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 20 Nov 2020 17:47:43 -0000 CC'ing Allan Jude So: pool: zroot state: DEGRADED status: One or more devices could not be opened. Sufficient replicas exist for the pool to continue functioning in a degraded state. action: Attach the missing device and online it using 'zpool online'. see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-2Q scan: scrub repaired 0B in 05:17:02 with 0 errors on Tue Aug 18 15:19:00 2020 config: NAME STATE READ WRITE CKSUM zroot DEGRADED 0 0 0 mirror-0 DEGRADED 0 0 0 1517819109053923011 UNAVAIL 0 0 0 was /dev/ada0p3 ada1 ONLINE 0 0 0 mirror-1 ONLINE 0 0 0 ada3p3 ONLINE 0 0 0 ada4p3 ONLINE 0 0 0 mirror-2 ONLINE 0 0 0 ada5p3 ONLINE 0 0 0 ada6p3 ONLINE 0 0 0 special mirror-3 ONLINE 0 0 0 gptid/db15e826-1a9c-11eb-8d25-0cc47a1f2fa0 ONLINE 0 0 0 mfid1p2 ONLINE 0 0 0 errors: No known data errors # dmesg | grep ada0 Trying to mount root from ufs:/dev/ada0p2 [rw]... ada0 at ahcich0 bus 0 scbus0 target 0 lun 0 ada0: ACS-2 ATA SATA 3.x device ada0: Serial Number WD-WCC137TALF5K ada0: 600.000MB/s transfers (SATA 3.x, UDMA6, PIO 8192bytes) ada0: Command Queueing enabled ada0: 2861588MB (5860533168 512 byte sectors) ada0: quirks=0x1<4K> Mounting from ufs:/dev/ada0p2 failed with error 2; retrying for 3 more seconds Mounting from ufs:/dev/ada0p2 failed with error 2. vfs.root.mountfrom=ufs:/dev/ada0p2 GEOM_PART: Partition 'ada0p3' not suitable for kernel dumps (wrong type?) ZFS WARNING: Unable to attach to ada0p3. ZFS WARNING: Unable to attach to ada0p3. ZFS WARNING: Unable to attach to ada0p3. ZFS WARNING: Unable to attach to ada0p3. ZFS WARNING: Unable to attach to ada0p3. ZFS WARNING: Unable to attach to ada0p3. # gpart show ada0 => 34 5860533101 ada0 GPT (2.7T) 34 6 - free - (3.0K) 40 88 1 freebsd-boot (44K) 128 3072000 2 freebsd-swap (1.5G) 3072128 5857461000 3 freebsd-zfs (2.7T) 5860533128 7 - free - (3.5K) Running naive dd if=/dev/ada0p3 works, so I don't know what zfs complains about. On 11/20/20, mike tancsa wrote: > On 11/20/2020 11:40 AM, Philip Paeps wrote: >> On 2020-11-21 00:04:19 (+0800), Mateusz Guzik wrote: >> >>> Oh, that's a bummer. I wonder if there is a regression in the boot >>> loader though. >>> >>> Does the pool mount if you boot the system from a cd/over the >>> network/whatever? >> >> It's worth checking if the freebsd-boot partition is large enough. I >> noticed during the cluster refresh that we often use 108k for >> freebsd-boot but recent head wants 117k. I've been bumping the >> bootblocks to 236k. >> >> So far, all the cluster machines I've upgraded booted though .. so ... >> I might be talking ex recto. :) >> > I put in an ssd drive and booted from it. One of the drives might have > gotten loose or died in the power cycles, but there is still redundancy > and I was able to mount the pool. Not sure why it cant find the file ? > > root@zoo2:~ # diff /boot/lua/loader.lua /mnt/boot/lua/loader.lua > 29c29 > < -- $FreeBSD$ > --- >> -- $FreeBSD: head/stand/lua/loader.lua 359371 2020-03-27 17:37:31Z > freqlabs $ > root@zoo2:~ # > > > % ls -l /mnt/boot/lua/ > total 110 > -r--r--r-- 1 root wheel 4300 Nov 20 08:41 cli.lua > -r--r--r-- 1 root wheel 3288 Nov 20 08:41 color.lua > -r--r--r-- 1 root wheel 18538 Nov 20 08:41 config.lua > -r--r--r-- 1 root wheel 12610 Nov 20 08:41 core.lua > -r--r--r-- 1 root wheel 11707 Nov 20 08:41 drawer.lua > -r--r--r-- 1 root wheel 2456 Nov 20 08:41 gfx-beastie.lua > -r--r--r-- 1 root wheel 2235 Nov 20 08:41 gfx-beastiebw.lua > -r--r--r-- 1 root wheel 1958 Nov 20 08:41 gfx-fbsdbw.lua > -r--r--r-- 1 root wheel 2413 Nov 20 08:41 gfx-orb.lua > -r--r--r-- 1 root wheel 2140 Nov 20 08:41 gfx-orbbw.lua > -r--r--r-- 1 root wheel 3324 Nov 20 08:41 hook.lua > -r--r--r-- 1 root wheel 2395 Nov 20 08:41 loader.lua > -r--r--r-- 1 root wheel 2429 Sep 24 09:09 logo-beastie.lua > -r--r--r-- 1 root wheel 2203 Sep 24 09:09 logo-beastiebw.lua > -r--r--r-- 1 root wheel 1958 Sep 24 09:09 logo-fbsdbw.lua > -r--r--r-- 1 root wheel 2397 Sep 24 09:09 logo-orb.lua > -r--r--r-- 1 root wheel 2119 Sep 24 09:09 logo-orbbw.lua > -r--r--r-- 1 root wheel 14201 Nov 20 08:41 menu.lua > -r--r--r-- 1 root wheel 4299 Nov 20 08:41 password.lua > -r--r--r-- 1 root wheel 2227 Nov 20 08:41 screen.lua > > > -- Mateusz Guzik