From owner-freebsd-stable@FreeBSD.ORG Sun Oct 21 13:06:34 2007 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 9DCD716A418 for ; Sun, 21 Oct 2007 13:06:34 +0000 (UTC) (envelope-from oleg@vsi.ru) Received: from serv4.vsi.ru (serv4.vsi.ru [80.82.32.19]) by mx1.freebsd.org (Postfix) with ESMTP id C62F813C494 for ; Sun, 21 Oct 2007 13:05:00 +0000 (UTC) (envelope-from oleg@vsi.ru) Received: from NBOOOD (ws4.oood.vsi.ru [88.83.197.239]) by serv4.vsi.ru (8.13.8+Sun/8.13.8) with SMTP id l9KJvCKn013366; Sat, 20 Oct 2007 23:57:20 +0400 (MSD) Message-ID: <001201c81353$6b17e300$efc55358@NBOOOD> From: "Oleg Derevenetz" To: "Alfred Perlstein" References: <027d01c8125c$73d4db80$c8c55358@delloleg><20071019220501.GL31826@elvis.mu.org><006d01c8133a$674a90b0$eec55358@W2KOOOD> <20071020192601.GW31826@elvis.mu.org> Date: Sat, 20 Oct 2007 23:57:10 +0400 MIME-Version: 1.0 Content-Type: text/plain; format=flowed; charset="iso-8859-1"; reply-type=original Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2900.3138 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3198 X-Antivirus: avast! (VPS 000782-4, 20.10.2007), Outbound message X-Antivirus-Status: Clean Cc: freebsd-stable@freebsd.org Subject: Re: kern/104406: [ufs] Processes get stuck in "ufs" stateunderpersistent CPU load X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 21 Oct 2007 13:06:34 -0000 >> > > After my break to debugger using Ctrl+Alt+Esc sequence and entering a >> > > "panic" command kernel does not wrote a kernel dump but seems to >> > > hang. >> Can >> > > anyone describe how to obtain a kernel dump in this situation, or at >> least >> > > say - which output of show commands need in first place to debug this >> > > ? >> > > Output of all suggested commands is huge and I afraid of making >> > > mistake >> > > when carrying this output from screen to list of paper and back :-) >> > >> > Oleg, one thing you can do to make this less painful is to >> > run your machine's console over serial port. >> > >> > First get a crossover serial cable, make sure it works from one >> > box to another, it should be easy to run "tip com1" on both >> > boxes to ensure that it works. >> > >> > Then you just need to add console=comconsole to /boot/loader.conf >> > and your box's console should come over serial. >> > >> > Then on the machine watching the console, you can just do this: >> > >> > % script >> > Script started, output file is typescript >> > % tip com1 >> > ...do ddb stuff now... >> > ...stop tip >> > % exit >> > >> > now you should have everything logged into a file called "typescript" >> > should save you a big headache. >> >> Thanks, I'll try it in the monday morning. >> >> > As far as getting a dump from ddb, try this: >> > >> > ddb> call doadump >> > >> > I'm completely at a loss why this isn't a base ddb command "dump" >> > but whatever... :) >> >> Unfortunately, this doesn't work too. I called duty personnel in this >> datacenter and asked them to do this, and person on duty tells me that >> after >> he enters this command something like that arrives on monitor: >> >> db> call doadump >> Dumping 3072 MB >> >> Dump aborted error I/O >> Dump failed. (Error 5) > > Hmnmm, that seems like you might be having a hardware problem, It is possible, but unlikely: 1. I have simular symptoms on another AMD64 machine with 6.2 (uname -a from this machine listed in PR kern/104406 in my followup at Wed, 7 Mar 2007 05:10:59 +0300), but they are rare and this machine is in production, so I can't make experiments with it; 2. All these hardware successfully works earlier with FreeBSD 4.6. > what disk device do you have? Dumpdev is swap partition on da0 (single physical disk) that connected to Mylex AcceleRAID 170 RAID controller. The problem arrives when I copy large amount of files from FTP to another disk (da1) that is connected to the same RAID controller. > Have you also enabled kernel dumps via /etc/rc.conf:dumpdev= > ? Yes, I have dumpdev="AUTO" in rc.conf and swap device (4G) listed in /etc/fstab. -- Oleg Derevenetz OOD3-RIPE Phone: +7 4732 539880 Fax: +7 4732 531415 http://www.vsi.ru CenterTelecom Voronezh ISP http://isp.vsi.ru