From owner-freebsd-hackers Sat Apr 6 19:51:36 1996 Return-Path: owner-hackers Received: (from root@localhost) by freefall.freebsd.org (8.7.3/8.7.3) id TAA12684 for hackers-outgoing; Sat, 6 Apr 1996 19:51:36 -0800 (PST) Received: from hemi.com (hemi.com [204.132.158.10]) by freefall.freebsd.org (8.7.3/8.7.3) with SMTP id TAA12677 for ; Sat, 6 Apr 1996 19:51:28 -0800 (PST) Received: (from mbarkah@localhost) by hemi.com (8.6.12/8.6.12) id UAA15621 for hackers@freebsd.org; Sat, 6 Apr 1996 20:52:53 -0700 From: Ade Barkah Message-Id: <199604070352.UAA15621@hemi.com> Subject: HELP! All processes CPU time *stuck* at 0.00% To: hackers@freebsd.org Date: Sat, 6 Apr 1996 20:52:53 -0700 (MST) X-Mailer: ELM [version 2.4 PL24] Content-Type: text Sender: owner-hackers@freebsd.org X-Loop: FreeBSD.org Precedence: bulk I have a really weird problem. All of the process CPU percentage time as reported by ps and top is 0.0% (even if I run a huge cpu-hog test program.) vmstat reports the computer is at 0% idle(!) (top says it about 9.3% idle, usually we're at 85-90%+ idle), although the load average is near 0 (0.08, etc.) The computer seems to be running fine despite these extremely weird stat numbers. Any ideas before I do the "let's reboot and see if this gets fixed" solution ? My environment is FreeBSD 2.1-RELEASE. The machine crashed earlier today (12 hours ago) for unknown reasons (probably dying Adaptec card, could be bad L2 cache.) Some very interesting output: 1. ps | PID %CPU COMMAND | 0 0.0 (swapper) | 1 0.0 /sbin/init -- | 2 0.0 (pagedaemon) | 3 0.0 (vmdaemon) | 4 0.0 (update) | 54 0.0 routed -q | 69 0.0 /etc/annex/erpcd | 74 0.0 /usr/local/sbin/httpd -d /usr/local/www/server | 98 0.0 syslogd | 103 0.0 named -b /etc/named.boot [...rest deleted... all stuck at 0.0] 2. top | load averages: 0.04, 0.14, 0.21 20:34:35 | 112 processes: 1 running, 111 sleeping | | Memory: 44M Active, 3544K Inact, 7900K Wired, 4088K Cache, 2600K Free | Swap: 131M Total, 131M Free | | PID USERNAME PRI NICE SIZE RES STATE TIME WCPU CPU COMMAND | 12590 user1 18 0 668K 1092K sleep 0:00 0.00% 0.00% tcsh | 9867 user2 18 0 660K 1068K sleep 0:01 0.00% 0.00% tcsh | 12617 user1 18 0 668K 1064K sleep 0:00 0.00% 0.00% tcsh [...rest deleted... all %CPU stuck at 0.0, TIME gets incremented] 3. vmstat, 5 second samples |procs memory page disks faults cpu |r b w avm fre flt re pi po fr sr f0 s0 s1 c0 in sy cs us sy id |2 0 0183116 4492 43 0 0 0 32 0 0 1 0 0 257 210 33 3 4 93 |0 0 0183120 4484 1 0 0 0 0 0 0 0 0 0 170 343 91 0 0 0 |0 0 0152696 4180 33 0 0 0 14 0 0 512 0 0 197 391 92 0 0 0 |0 0 0152112 4044 57 0 0 0 44 0 0 0 0 0 176 521 99 0 0 0 |0 0 0173264 3816 102 0 0 0 80 0 0 0 512 0 148 903 258 0 0 0 Any ideas ? These numbers really make me uneasy. Thanks in advance, -Ade Barkah -------------------------------------------------------------------- Inet: mbarkah@hemi.com - HEMISPHERE ONLINE - www: --------------------------------------------------------------------