From owner-freebsd-questions@FreeBSD.ORG Sun Dec 7 01:02:02 2014 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 0979C5DE for ; Sun, 7 Dec 2014 01:02:02 +0000 (UTC) Received: from rc1.surewest.net (rc1.surewest.net [66.60.130.50]) by mx1.freebsd.org (Postfix) with ESMTP id DC0A699E for ; Sun, 7 Dec 2014 01:02:01 +0000 (UTC) Received: from smtp3.surewest.net ([66.60.130.145]) by rc1.surewest.net ({dfaaa318-551d-4a0a-8038-7c31cf31c4f6}) via TCP (outbound) with ESMTP id 20141207005840351 for ; Sun, 07 Dec 2014 00:58:40 +0000 X-RC-FROM: X-RC-RCPT: Received: from smtpauth.surewest.net (smtpauth.surewest.net [66.60.130.153]) by smtp3.surewest.net (Postfix) with ESMTP id E27E889662 for ; Sat, 6 Dec 2014 16:58:26 -0800 (PST) Received: from blacklamb.mykitchentable.net (mykitchentable.net [69.62.167.70]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by smtpauth.surewest.net (Postfix) with ESMTPSA id 5B09A9C1CA for ; Sat, 6 Dec 2014 16:58:38 -0800 (PST) Received: from [127.0.0.1] (blacksheep.mykitchentable.net [192.168.1.2]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) (Authenticated sender: drew) by blacklamb.mykitchentable.net (Postfix) with ESMTPSA id C616CDBF02 for ; Sat, 6 Dec 2014 16:58:32 -0800 (PST) Message-ID: <5483A639.2050704@mykitchentable.net> Date: Sat, 06 Dec 2014 16:58:33 -0800 From: Drew Tomlinson User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:31.0) Gecko/20100101 Thunderbird/31.3.0 MIME-Version: 1.0 To: freebsd-questions@FreeBSD.org Subject: Probably Hardware Trouble But What Is It? Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-Antivirus: avast! (VPS 141206-1, 12/06/2014), Outbound message X-Antivirus-Status: Clean X-MAG-OUTBOUND: surewest.redcondor.net@66.60.130.145/32 X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 07 Dec 2014 01:02:02 -0000 I'm running FBS 9.1 RELEASE that I built several years ago. It's mostly a Samba server and has "just worked" so I've never done much more with it. However recently, I find it "locked up" with thousands of these messages on the console: kernel: kern.maxfiles limit exceeded by uid 0, please see tuning(7) I've looked in /var/log/messages and also see lots of messages like these: Dec 6 13:55:53 vm kernel: siisch0: ... waiting for slots 18000000 Dec 6 13:55:53 vm kernel: siisch0: Timeout on slot 28 Dec 6 13:55:53 vm kernel: siisch0: siis_timeout is 00040000 ss 78000000 rs 78000000 es 00000000 sts 801b0000 serr 00000000 Dec 6 13:55:53 vm kernel: siisch0: ... waiting for slots 08000000 Dec 6 13:55:55 vm kernel: siisch0: Timeout on slot 27 Dec 6 13:55:55 vm kernel: siisch0: siis_timeout is 00040000 ss 78000000 rs 78000000 es 00000000 sts 801b0000 serr 00000000 Dec 6 13:55:55 vm kernel: (ada0:siisch0:0:0:0): FLUSHCACHE48. ACB: ea 00 00 00 00 40 00 00 00 00 00 00 Dec 6 13:55:55 vm kernel: (ada0:siisch0:0:0:0): CAM status: Command timeout Dec 6 13:55:55 vm kernel: (ada0:siisch0:0:0:0): Retrying command Dec 6 13:55:55 vm kernel: (ada0:siisch0:0:0:0): READ_FPDMA_QUEUED. ACB: 60 01 fe d8 74 40 39 00 00 00 00 00 Dec 6 13:55:55 vm kernel: (ada0:siisch0:0:0:0): CAM status: Command timeout Dec 6 13:55:55 vm kernel: (ada0:siisch0:0:0:0): Retrying command Dec 6 13:55:55 vm kernel: (ada0:siisch0:0:0:0): READ_FPDMA_QUEUED. ACB: 60 0a a5 7f 00 40 4c 00 00 00 00 00 This machine uses zfs. I have two pools: # zpool list NAME SIZE ALLOC FREE CAP DEDUP HEALTH ALTROOT zback 1.81T 848G 1008G 45% 1.00x ONLINE - zroot 1.81T 1.16T 666G 64% 1.00x ONLINE - Then I tried this and my ssh window is now stuck: # zpool status pool: zback state: ONLINE status: One or more devices are faulted in response to IO failures. action: Make sure the affected devices are connected, then run 'zpool clear'. see: http://illumos.org/msg/ZFS-8000-HC scan: none requested config: NAME STATE READ WRITE CKSUM zback ONLINE 3 0 0 ada0 ONLINE 4 0 0 I opened another ssh window and tried 'zpool clear zback' as suggested but it appears stuck too. I'm sure I haven't provided all the relevant information so please ask and I will do so. I'd appreciate any guidance on how to take a proper backup of ada0 and what I should do next. I think this zback pool is just the one disk which is a 2TB drive. I'd like to know how to confirm that if possible since it seems the zpool commands aren't able to complete. I appreciate any suggestions or guidance. Thanks, Drew -- Like card tricks? Visit The Alchemist's Warehouse to learn card magic secrets for free! http://alchemistswarehouse.com