Date:      Mon, 26 Jun 2006 23:05:15 +0300
From:      Kostik Belousov <kostikbel@gmail.com>
To:        "Marc G. Fournier" <scrappy@hub.org>
Cc:        freebsd-stable@freebsd.org
Subject:   Re: What denotes a 'blocked' process?
Message-ID:  <20060626200515.GL79678@deviant.kiev.zoral.com.ua>
In-Reply-To: <20060626124226.Y1114@ganymede.hub.org>
References:  <20060626120842.Q1114@ganymede.hub.org> <20060626124226.Y1114@ganymede.hub.org>


On Mon, Jun 26, 2006 at 12:44:17PM -0300, Marc G. Fournier wrote:
> On Mon, 26 Jun 2006, Marc G. Fournier wrote:
>
> >
> >Just upgraded to June 15th sources, started up all the processes, and am
> >already at 29 blocked processes ...
> >
> >I've checked for states D, E and L ... nothing ...
> >
> >Actually, let's go one better ... attached is a complete list of my
> >process table (MWCHAN, STATE, COMMAND) ... right now, vmstat is showing:
> >
> >1 33 0 6381952 177944 1695   0   0   0 1601   0   1   0  416 50012 1657 14 14 72
> >1 33 2 6376440 181744 2013   0   0   0 2172   0   3   0  448 68528 1629 17 15 68
> >4 33 0 6385484 178364 1944   0   3   0 1758   0   8   0  420 57698 1221 17 14 69
> >23 46 0 6463664 149528 5294  29   4   2 4659   0  37   0  505 44758 3040 27 28 45
> >4 34 1 6424904 169660 4216  16   7   0 4047   0 211   0 1002 47502 5769 42 30 28
> >1 35 0 6453992 167388 2414   0   9   0 2265   0  44   0  535 62932 3160 18 18 64
> >7 33 0 6443672 168100 1642   0   0   0 1652   0   5   0  448 51974 2163 15 15 70
> >
> >So, according to this, there should be 33 processes blocked somewhere ...
> >STATEs D/E/L all show nothing ... even state R (long shot) is showing 3-4
> >processes, and that's it ...
> >
> >This kernel is actually worse than the last, in that with the last, on a
> >reboot, I'd see 4-5 blocked, and then it would slowly rise over the course
> >of 24 hours, not start at 33 and rise from there ...
>
> Wow, in less than 1 hour, I'm up to 60 blocked, barely 1 runnable:
>
>  0 60 0 7016076 187424 2527   0   0   0 1722   0   5   0  320 7921 2140 24 19 57
>  0 60 0 7027436 185124  581   0   1   0 428   0   9   0  303 3214 2425  5  9 86
>  0 60 0 7053368 183060  217   4   1   0 130   0  71   0  453 1748 1157  6  4 90
>  1 60 1 7050848 183556    4   0   0   7  27   0  21   0  307  965 857  1  4 94
>  0 60 2 7050860 183652    2   0   0   0   6   0   0   0  256  829 1030  2  3 95
>  0 60 0 7051028 183348   28   1   2   0  11   0   3   0  307  944 855  3  3 95
>  0 60 1 7056876 182248  136   0   0   0  66   0   8   0  285 1190 945  1  4 95
>
> And nada in ps:
>
> pluto# ps ax -O ppid,flags,mwchan | awk '$6 ~ /^D/ || $6 == "STAT"' ; ps aux | wc -l
>   PID  PPID       F MWCHAN  TT  STAT      TIME COMMAND
>     2     0     204 -       ??  DL     0:00.45 [g_event]
>     3     0     204 -       ??  DL     0:04.87 [g_up]
>     4     0     204 -       ??  DL     0:06.19 [g_down]
>     5     0     204 -       ??  DL     0:00.00 [thread taskq]
>     6     0     204 -       ??  DL     0:00.00 [kqueue taskq]
>     7     0     204 -       ??  DL     0:00.00 [acpi_task0]
>     8     0     204 -       ??  DL     0:00.00 [acpi_task1]
>     9     0     204 -       ??  DL     0:00.00 [acpi_task2]
>    10     0     204 ktrace  ??  DL     0:00.00 [ktrace]
>    15     0     204 -       ??  DL     0:00.68 [yarrow]
>    25     0     204 psleep  ??  DL     0:00.70 [pagedaemon]
>    26     0     204 psleep  ??  DL     0:00.00 [vmdaemon]
>    27     0     20c pgzero  ??  DL     0:14.43 [pagezero]
>    28     0     204 psleep  ??  DL     0:00.14 [bufdaemon]
>    29     0     204 vlruwt  ??  DL     0:00.15 [vnlru]
>    30     0     204 syncer  ??  DL     0:10.29 [syncer]
>    31     0     204 sdflus  ??  DL     0:00.68 [softdepflush]
>    32     0     204 -       ??  DL     0:03.28 [schedcpu]
>     1170
> pluto# ps ax -O ppid,flags,mwchan | awk '$6 ~ /^E/ || $6 == "STAT"' ; ps aux | wc -l
>   PID  PPID       F MWCHAN  TT  STAT      TIME COMMAND
>     1174
> pluto# ps ax -O ppid,flags,mwchan | awk '$6 ~ /^L/ || $6 == "STAT"' ; ps aux | wc -l
>   PID  PPID       F MWCHAN  TT  STAT      TIME COMMAND
>    12     0     20c Giant   ??  LL     0:08.16 [swi4: clock]
>     1170
> pluto#
>
> Something *has* to be leaking here somewhere ... :(

Dumb, unmotivated question: do you have NFS exports on this machine?
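
As an aside, the three separate ps runs quoted above (one each for states D, E and L) could be folded into a single pass that tallies processes by the first letter of STAT. This is only a minimal sketch: it assumes the same column layout as the quoted `ps ax -O ppid,flags,mwchan` output (STAT in field 6), and the function name `count_states` is hypothetical, not something from the original thread.

```sh
# Tally processes by the first letter of the STAT column.
# Assumption: STAT is field 6, as in the quoted
# `ps ax -O ppid,flags,mwchan` output; line 1 is the header.
count_states() {
    awk 'NR > 1 { n[substr($6, 1, 1)]++ }
         END { for (s in n) print s, n[s] }' | sort
}

# Usage (reads ps output on stdin):
#   ps ax -O ppid,flags,mwchan | count_states
```

Comparing the per-state totals against the "b" column of vmstat would show at a glance whether any single ps state accounts for the blocked count.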



