From owner-freebsd-hackers  Sat Apr  6 19:51:36 1996
Return-Path: owner-hackers
Received: (from root@localhost)
          by freefall.freebsd.org (8.7.3/8.7.3) id TAA12684
          for hackers-outgoing; Sat, 6 Apr 1996 19:51:36 -0800 (PST)
Received: from hemi.com (hemi.com [204.132.158.10])
          by freefall.freebsd.org (8.7.3/8.7.3) with SMTP id TAA12677
          for <hackers@freebsd.org>; Sat, 6 Apr 1996 19:51:28 -0800 (PST)
Received: (from mbarkah@localhost) by hemi.com (8.6.12/8.6.12) id UAA15621 for hackers@freebsd.org; Sat, 6 Apr 1996 20:52:53 -0700
From: Ade Barkah <mbarkah@hemi.com>
Message-Id: <199604070352.UAA15621@hemi.com>
Subject: HELP! All processes CPU time *stuck* at 0.00%
To: hackers@freebsd.org
Date: Sat, 6 Apr 1996 20:52:53 -0700 (MST)
X-Mailer: ELM [version 2.4 PL24]
Content-Type: text
Sender: owner-hackers@freebsd.org
X-Loop: FreeBSD.org
Precedence: bulk


I have a really weird problem. All of the process CPU percentage
time as reported by ps and top is 0.0% (even if I run a huge cpu-hog
test program.) vmstat reports the computer is at 0% idle(!) (top says 
it about 9.3% idle, usually we're at 85-90%+ idle), although the load 
average is near 0 (0.08, etc.) The computer seems to be running fine 
despite these extremely weird stat numbers.

Any ideas before I do the "let's reboot and see if this gets fixed"
solution ?

My environment is FreeBSD 2.1-RELEASE. The machine crashed earlier
today (12 hours ago) for unknown reasons (probably dying Adaptec
card, could be bad L2 cache.)

Some very interesting output:

1. ps

|   PID %CPU COMMAND
|     0  0.0  (swapper)
|     1  0.0 /sbin/init --
|     2  0.0  (pagedaemon)
|     3  0.0  (vmdaemon)
|     4  0.0  (update)
|    54  0.0 routed -q
|    69  0.0 /etc/annex/erpcd
|    74  0.0 /usr/local/sbin/httpd -d /usr/local/www/server
|    98  0.0 syslogd
|   103  0.0 named -b /etc/named.boot
[...rest deleted... all stuck at 0.0]

2. top

| load averages:  0.04,  0.14,  0.21    20:34:35
| 112 processes: 1 running, 111 sleeping
| 
| Memory: 44M Active, 3544K Inact, 7900K Wired, 4088K Cache, 2600K Free
| Swap:   131M Total, 131M Free
| 
|   PID USERNAME PRI NICE   SIZE   RES STATE   TIME   WCPU    CPU COMMAND
| 12590 user1     18    0   668K 1092K sleep   0:00  0.00%  0.00% tcsh
|  9867 user2     18    0   660K 1068K sleep   0:01  0.00%  0.00% tcsh
| 12617 user1     18    0   668K 1064K sleep   0:00  0.00%  0.00% tcsh
[...rest deleted... all %CPU stuck at 0.0, TIME gets incremented]

3. vmstat, 5 second samples

|procs   memory     page                    disks         faults      cpu
|r b w   avm   fre  flt  re  pi  po  fr  sr f0 s0 s1 c0   in   sy  cs us sy id
|2 0 0183116  4492   43   0   0   0  32   0  0  1  0  0  257  210  33  3  4 93
|0 0 0183120  4484    1   0   0   0   0   0  0  0  0  0  170  343  91  0  0  0
|0 0 0152696  4180   33   0   0   0  14   0  0 512  0  0  197  391  92  0  0  0
|0 0 0152112  4044   57   0   0   0  44   0  0  0  0  0  176  521  99  0  0  0
|0 0 0173264  3816  102   0   0   0  80   0  0  0 512  0  148  903 258  0  0  0

Any ideas ? These numbers really make me uneasy. 

Thanks in advance,

-Ade Barkah
--------------------------------------------------------------------
Inet: mbarkah@hemi.com - HEMISPHERE ONLINE - www: <http://hemi.com/>
--------------------------------------------------------------------