From owner-freebsd-proliant@FreeBSD.ORG Tue Jul 30 09:45:45 2013 Return-Path: Delivered-To: freebsd-proliant@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTP id 23368EE5 for ; Tue, 30 Jul 2013 09:45:45 +0000 (UTC) (envelope-from prvs=916e580da=a@jenisch.at) Received: from mgaterz1.oekb.co.at (mgaterz1.oekb.co.at [143.245.5.111]) (using TLSv1 with cipher RC4-SHA (128/128 bits)) (No client certificate requested) by mx1.freebsd.org (Postfix) with ESMTPS id 8B8A72204 for ; Tue, 30 Jul 2013 09:45:44 +0000 (UTC) Received: from exchhubcas1.oekb.co.at ([143.245.3.64]) by mgaterz1.oekb.co.at with ESMTP/TLS/AES128-SHA; 30 Jul 2013 11:44:32 +0200 Received: from aurora.oekb.co.at (143.245.9.16) by internal-relay-exchhubcas1.oekb.co.at (143.245.3.65) with Microsoft SMTP Server id 14.2.318.4; Tue, 30 Jul 2013 11:44:32 +0200 Received: from aurora.oekb.co.at (localhost [127.0.0.1]) by aurora.oekb.co.at (8.14.7/8.14.7) with ESMTP id r6U9iWZJ002307; Tue, 30 Jul 2013 11:44:32 +0200 (CEST) (envelope-from a@jenisch.at) Received: (from ej@localhost) by aurora.oekb.co.at (8.14.7/8.14.7/Submit) id r6U9iWZd002306; Tue, 30 Jul 2013 11:44:32 +0200 (CEST) (envelope-from a@jenisch.at) X-Authentication-Warning: aurora.oekb.co.at: ej set sender to a@jenisch.at using -f Date: Tue, 30 Jul 2013 11:44:32 +0200 From: Ewald Jenisch To: Subject: DL585 G5 - enormous delays with disk access Message-ID: <20130730094432.GA2161@aurora.oekb.co.at> MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Disposition: inline User-Agent: Mutt/1.5.21 (2010-09-15) X-BeenThere: freebsd-proliant@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Technical discussion of FreeBSD on HP ProLiant server platforms." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 30 Jul 2013 09:45:45 -0000 Hi, I'm having a really hard time getting a HP DL585 G5 to work. To be specific: When there's any disk io the machine completely freezes, i.e. no console input possible, no screen output - complete lock. After some minutes it comes back to normal again - but sure enough with the next disk io it freezes again. To give you a specific example: On one session (logged in via ssh) I've got a "portsnap fetch extract" running; in a second window I do a "sync". Normally this should complete in a matter of milliseconds or seconds in the worst case - but dig this: # date;time sync;date Tue Jul 30 09:57:38 CEST 2013 0.000u 0.311s 9:54.69 0.0% 4+161k 0+1287io 0pf+0w Tue Jul 30 10:07:38 CEST 2013 # No, this is not a typo - it really took ten minutes (!) for the sync to complete. In the meantime - every windows, all activity (console, screen-output etc.) is completely frozen. ('portsnap fetch extract' was only given as an example here - the lockup occurs whenever there is disk io) We're speaking about a machine with decent hardware here, not an old i386 type of box - here's an excerpt from "dmesg": ------------------------------ < Cut here > ------------------------------ FreeBSD 9.2-BETA2 #0 r253750: Mon Jul 29 11:07:04 CEST 2013 root@sniff-rz2:/usr/obj/usr/src/sys/GENERIC amd64 gcc version 4.2.1 20070831 patched [FreeBSD] CPU: Quad-Core AMD Opteron(tm) Processor 8358 SE (2411.16-MHz K8-class CPU) Origin = "AuthenticAMD" Id = 0x100f23 Family = 0x10 Model = 0x2 Stepping = 3 Features=0x178bfbff Features2=0x802009 AMD Features=0xee400800 AMD Features2=0x7ff TSC: P-state invariant real memory = 137438953472 (131072 MB) avail memory = 132973432832 (126813 MB) Event timer "LAPIC" quality 400 ACPI APIC Table: FreeBSD/SMP: Multiprocessor System Detected: 16 CPUs ... ciss0: port 0x3000-0x30ff mem 0xd9e00000-0xd9efffff,0xd9df0000-0xd9df0fff irq 16 at device 0.0 on pci8 ciss0: PERFORMANT Transport ... da0 at ciss0 bus 0 scbus2 target 0 lun 0 da0: Fixed Direct Access SCSI-5 device da0: 135.168MB/s transfers da0: Command Queueing enabled da0: 139979MB (286677120 512 byte sectors: 255H 32S/T 35132C) da0: quirks=0x1 ------------------------------ < Cut here > ------------------------------ Kernel: Latest kernel as of yesterday (9.2Beta) BIOS: is at the latest level (Support pack as of Spring 2013) installed which updated BIOS, iLO etc. Aside from that I reset BIOS to default values just to be sure. SmartArray P400 - Firmware 7.24 (latest) Harddisks: Two 146GB HDs running in Raid1-mode. Already hot-swapped both of them (i.e. the second one after raid-rebuild was complete for the first swap) to see whether there's a HW-problem - didn't change anything. So my primary question is what's causing this absolutely annoying problem and what can be done against it? Thanks much in advance for any help, -ewald