From owner-freebsd-bugs@freebsd.org Wed May 17 13:48:02 2017 Return-Path: Delivered-To: freebsd-bugs@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 80E1CD70177 for ; Wed, 17 May 2017 13:48:02 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from kenobi.freebsd.org (kenobi.freebsd.org [IPv6:2001:1900:2254:206a::16:76]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 672101121 for ; Wed, 17 May 2017 13:48:02 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from bugs.freebsd.org ([127.0.1.118]) by kenobi.freebsd.org (8.15.2/8.15.2) with ESMTP id v4HDm2MC031451 for ; Wed, 17 May 2017 13:48:02 GMT (envelope-from bugzilla-noreply@freebsd.org) From: bugzilla-noreply@freebsd.org To: freebsd-bugs@FreeBSD.org Subject: [Bug 219355] Heavy disk activity in bhyve deadlocks host Date: Wed, 17 May 2017 13:48:01 +0000 X-Bugzilla-Reason: AssignedTo X-Bugzilla-Type: new X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: Base System X-Bugzilla-Component: kern X-Bugzilla-Version: 11.0-STABLE X-Bugzilla-Keywords: X-Bugzilla-Severity: Affects Only Me X-Bugzilla-Who: freebsd-bugs@joe.mulloy.me X-Bugzilla-Status: New X-Bugzilla-Resolution: X-Bugzilla-Priority: --- X-Bugzilla-Assigned-To: freebsd-bugs@FreeBSD.org X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: bug_id short_desc product version rep_platform op_sys bug_status bug_severity priority component assigned_to reporter Message-ID: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: https://bugs.freebsd.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-BeenThere: freebsd-bugs@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Bug reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 17 May 2017 13:48:02 -0000 https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D219355 Bug ID: 219355 Summary: Heavy disk activity in bhyve deadlocks host Product: Base System Version: 11.0-STABLE Hardware: amd64 OS: Any Status: New Severity: Affects Only Me Priority: --- Component: kern Assignee: freebsd-bugs@FreeBSD.org Reporter: freebsd-bugs@joe.mulloy.me Hello, I have a somewhat complicated server setup that I am trying to run bhyve on= . I did not have this problem on FreeBSD 10, although it may have been due to my zpool not being as full. My main os pool is a small unencrypted mirror. My jails and bhyve VMs are stored on a separate pool named "data" that is made= up of 4x 3TB WD Red drives in a pair of mirrors (RAID 10) with each disk being encrypted via geli and the geli device being passed to ZFS. I am using chyv= es to manage my bhyve VMs. I have found that doing heavy disk activity will reliably deadlock the host system and I will then need to reboot/reset it. = The VM disks are stored in zvols. So far I have been able to trigger this condi= tion by attempting to install Windows 7, where the host crashes when Windows sta= rts copying files. I have also managed to trigger it a couple times by trying to assemble a jail by hand in a FreeBSD guest by copying a template directory holding a FreeBSD installation. The host system has 32GB of RAM and I only = give 2GB to the VM so I should have plenty of memory. Below is the output of top during one of the deadlocks when trying to install Windows. The state of the bhyve process is kqread. I was able to successfully install Windows 7 by storing the guest on a separate geli encrypted pool. At one point my pool w= as 85% full. I cleaned it up to be only 50% full but I'm still having this iss= ue. I think I have somehow got my pool in a state where it's going to keep havi= ng this problem. I would like to fix it by recreating my pool but I would like= to debug it first in case there is some bug that can be fixed. I don't know ho= w to debug this further so if someone could provide me with some instructions or commands to debug this further I would appreciate it. root@server1:~ # chyves win7 get all Checking for newer version of chyves on the master branch from https://github.com/chyves/chyves/raw/master/sbin/chyves. Setting global property 'check_for_updates_last_check' to value: '20170517' On current version, will check again on: 20170524 Setting global property 'check_for_updates_last_check_status' to value: '0' Getting all win7's properties... bargs -H -P -S bhyve_disk_type ahci-hd bhyve_net_type virtio-net bhyveload_flags chyves_guest_version 0300 cpu 1 creation Created on Mon Oct 10 22:06:33 UTC 201= 6 by chyves v0.2.0 2016/09/11 using __create() description - eject_iso_on_n_reboot 2 loader uefi net_ifaces tap55 notes - os default ram 2G rcboot 0 revert_to_snapshot_method off revert_to_snapshot serial nmdm55 template no uefi_console_output vnc uefi_firmware BHYVE_UEFI_20160704_1.fd uefi_vnc_ip 10.2.4.50 uefi_vnc_mouse_type ps2 uefi_vnc_pause_until_client_connect yes uefi_vnc_port 5900 uefi_vnc_res 1024x768 uuid cce87028-8f35-11e6-86a3-94de80a12470 chyves version: chyves v0.2.0 2016/09/11 root@server1:~ # uname -a FreeBSD server1.jdmulloy.net 11.0-RELEASE-p9 FreeBSD 11.0-RELEASE-p9 #0: Tue Apr 11 08:48:40 UTC 2017=20=20=20=20 root@amd64-builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC amd64 483 processes: 2 running, 451 sleeping, 30 waiting CPU: 16.3% user, 0.0% nice, 83.7% system, 0.0% interrupt, 0.0% idle Mem: 2357M Active, 864M Inact, 28G Wired, 4520K Free=20 ARC: 4978M Total, 3836K MFU, 115M MRU, 2456M Anon, 1602M Header, 800M Other Swap: 16G Total, 257M Used, 16G Free, 1% Inuse PID USERNAME THR PRI NICE SIZE RES STATE C TIME WCPU COMMA= ND 10662 root 15 52 0 45776K 17136K uwait 2 2:11 72.10% fileb= eat 10136 root 12 20 0 43344K 16612K uwait 2 3:38 51.58% fileb= eat 11208 root 12 20 0 43344K 16840K tx->tx 2 2:25 35.52% fileb= eat 5373 root 14 20 0 43600K 14824K pfault 1 1:51 27.82% fileb= eat 840 root 11 20 0 39120K 11004K pfault 0 0:39 12.06% fileb= eat 792 zabbix 1 20 0 30596K 4204K RUN 3 0:32 12.00% zabbix_agentd 13401 root 21 20 0 2127M 1963M kqread 1 3:18 3.23% bhyve 8940 root 1 20 0 26264K 5000K CPU3 3 0:01 0.19% top 11613 root 11 47 0 41100K 14368K uwait 3 0:00 0.12% fileb= eat 7411 root 14 20 0 45100K 17612K tx->tx 3 0:00 0.09% fileb= eat 6504 root 10 52 0 39984K 13992K uwait 3 0:01 0.09% fileb= eat 1125 root 1 20 0 22004K 3692K select 3 0:02 0.06% tmux 1105 jdmulloy2 1 20 0 46760K 5688K select 3 0:01 0.06% mosh-server --=20 You are receiving this mail because: You are the assignee for the bug.=