From: David Adam <zanchey@ucc.gu.uwa.edu.au>
To: freebsd-fs@freebsd.org
Date: Fri, 29 Jan 2016 22:34:02 +0800 (AWST)
Subject: Poor ZFS+NFSv3 read/write performance and panic

Hi all,

We have a FreeBSD 10.2 server sharing some ZFS datasets over NFSv3. It's
worked well until recently, but has started to routinely perform
exceptionally poorly, eventually panicking in vdev_deadman() (which I
understand is a feature). Initially after booting, things are fine, but
performance rapidly begins to degrade. Both read and write performance
are terrible, with many operations either hanging indefinitely or timing
out.
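For what it's worth, the stall can be watched from the host side with the
base-system tools alone; a minimal sketch (the pool name 'space' is taken
from the status output further down, and vfs.zfs.deadman_enabled /
vfs.zfs.deadman_synctime_ms should be the stock FreeBSD 10.x knobs behind
the vdev_deadman() panic):

  # per-vdev bandwidth and IOPS, sampled every 5 seconds
  zpool iostat -v space 5

  # whether the deadman panics, and how long an I/O may hang before it fires
  sysctl vfs.zfs.deadman_enabled vfs.zfs.deadman_synctime_ms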
When this happens, I can break into DDB and see lots of nfsd processes
stuck waiting for a lock:

  Process 784 (nfsd) thread 0xfffff80234795000 (100455)
   shared lockmgr zfs (zfs) r = 0 (0xfffff8000b91f548) locked @ /usr/src/sys/modules/zfs/../../cddl/contrib/opensolaris/uts/common/fs/zfs/zfs_vfsops.c:2196

and the backtrace looks like this:

  sched_switch() at sched_switch+0x495/frame 0xfffffe04677740b0
  mi_switch() at mi_switch+0x179/frame 0xfffffe04677740f0
  turnstile_wait() at turnstile_wait+0x3b2/frame 0xfffffe0467774140
  __mtx_lock_sleep() at __mtx_lock_sleep+0x2c0/frame 0xfffffe04677741c0
  __mtx_lock_flags() at __mtx_lock_flags+0x102/frame 0xfffffe0467774210
  vmem_size() at vmem_size+0x5a/frame 0xfffffe0467774240
  arc_reclaim_needed() at arc_reclaim_needed+0xd2/frame 0xfffffe0467774260
  arc_get_data_buf() at arc_get_data_buf+0x157/frame 0xfffffe04677742a0
  arc_read() at arc_read+0x68b/frame 0xfffffe0467774350
  dbuf_read() at dbuf_read+0x7ed/frame 0xfffffe04677743f0
  dmu_tx_check_ioerr() at dmu_tx_check_ioerr+0x8b/frame 0xfffffe0467774420
  dmu_tx_count_write() at dmu_tx_count_write+0x17e/frame 0xfffffe0467774540
  dmu_tx_hold_write() at dmu_tx_hold_write+0xba/frame 0xfffffe0467774580
  zfs_freebsd_write() at zfs_freebsd_write+0x55d/frame 0xfffffe04677747b0
  VOP_WRITE_APV() at VOP_WRITE_APV+0x193/frame 0xfffffe04677748c0
  nfsvno_write() at nfsvno_write+0x13e/frame 0xfffffe0467774970
  nfsrvd_write() at nfsrvd_write+0x496/frame 0xfffffe0467774c80
  nfsrvd_dorpc() at nfsrvd_dorpc+0x66b/frame 0xfffffe0467774e40
  nfssvc_program() at nfssvc_program+0x4e6/frame 0xfffffe0467774ff0
  svc_run_internal() at svc_run_internal+0xbb7/frame 0xfffffe0467775180
  svc_run() at svc_run+0x1db/frame 0xfffffe04677751f0
  nfsrvd_nfsd() at nfsrvd_nfsd+0x1f0/frame 0xfffffe0467775350
  nfssvc_nfsd() at nfssvc_nfsd+0x124/frame 0xfffffe0467775970
  sys_nfssvc() at sys_nfssvc+0xb7/frame 0xfffffe04677759a0
  amd64_syscall() at amd64_syscall+0x278/frame 0xfffffe0467775ab0
  Xfast_syscall() at Xfast_syscall+0xfb/frame 0xfffffe0467775ab0

Is this likely to be due to bad hardware? I can't see any problems in the
SMART data, and `camcontrol tags da0 -v` etc. does not reveal any
particularly long queues (the sort of commands involved are sketched in
the P.S. below). Are there other useful things to check?

If not, do you have any other ideas? I can make the full DDB information
available if that would be helpful.

The pool is configured thus:

        NAME                  STATE     READ WRITE CKSUM
        space                 ONLINE       0     0     0
          mirror-0            ONLINE       0     0     0
            da0               ONLINE       0     0     0
            da1               ONLINE       0     0     0
          mirror-1            ONLINE       0     0     0
            da2               ONLINE       0     0     0
            da3               ONLINE       0     0     0
          mirror-2            ONLINE       0     0     0
            da4               ONLINE       0     0     0
            da6               ONLINE       0     0     0
          mirror-3            ONLINE       0     0     0
            da7               ONLINE       0     0     0
            da8               ONLINE       0     0     0
        logs
          mirror-4            ONLINE       0     0     0
            gpt/molmol-slog   ONLINE       0     0     0
            gpt/molmol-slog0  ONLINE       0     0     0

where the da? devices are WD Reds and the SLOG partitions are on Samsung
840s.

Many thanks,

David Adam
zanchey@ucc.gu.uwa.edu.au
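P.S. For completeness, the per-disk checks referred to above amount to
roughly the following (a sketch only; smartctl is assumed to come from
the sysutils/smartmontools port, and da0 stands in for each member disk
in turn):

  # SMART health, attributes and error log for one disk
  smartctl -a /dev/da0

  # outstanding CAM tagged-command queue depth for the same disk
  camcontrol tags da0 -v

  # per-provider latency, to spot a single slow disk dragging the pool down
  gstat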