Date:      Wed, 20 Aug 2014 08:13:48 -0500
From:      "Polyack, Steve" <Steve.Polyack@intermedix.com>
To:        "Polyack, Steve" <Steve.Polyack@intermedix.com>, Alan Cox <alc@rice.edu>,  "freebsd-stable@freebsd.org" <freebsd-stable@freebsd.org>
Subject:   RE: vmdaemon CPU usage and poor performance in 10.0-RELEASE
Message-ID:  <4D557EC7CC2A544AA7C1A3B9CBA2B3672609CF2F8F@exchange03.epbs.com>
In-Reply-To: <4D557EC7CC2A544AA7C1A3B9CBA2B3672609CF28E5@exchange03.epbs.com>
References:  <4D557EC7CC2A544AA7C1A3B9CBA2B36726098846B4@exchange03.epbs.com> <20140813152522.GI9400@home.opsec.eu> <4D557EC7CC2A544AA7C1A3B9CBA2B36726098847AF@exchange03.epbs.com> <CAJUyCcNoTJ3xqkC_Prz3N%2BApEqYy3Mi2gA%2BuDo33dczaTMONrA@mail.gmail.com> <4D557EC7CC2A544AA7C1A3B9CBA2B3672609BBA3C4@exchange03.epbs.com> <53F24E5B.1010809@rice.edu> <4D557EC7CC2A544AA7C1A3B9CBA2B3672609BBA64F@exchange03.epbs.com> <53F2790C.20703@rice.edu> <4D557EC7CC2A544AA7C1A3B9CBA2B3672609CF28E5@exchange03.epbs.com>


> -----Original Message-----
> From: owner-freebsd-stable@freebsd.org [mailto:owner-freebsd-stable@freebsd.org] On Behalf Of Polyack, Steve
> Sent: Tuesday, August 19, 2014 12:37 PM
> To: Alan Cox; freebsd-stable@freebsd.org
> Subject: RE: vmdaemon CPU usage and poor performance in 10.0-RELEASE
>
> > -----Original Message-----
> > From: owner-freebsd-stable@freebsd.org [mailto:owner-freebsd-stable@freebsd.org] On Behalf Of Alan Cox
> > Sent: Monday, August 18, 2014 6:07 PM
> > To: freebsd-stable@freebsd.org
> > Subject: Re: vmdaemon CPU usage and poor performance in 10.0-RELEASE
> >
> > On 08/18/2014 16:29, Polyack, Steve wrote:
> > >> -----Original Message-----
> > >> From: owner-freebsd-stable@freebsd.org [mailto:owner-freebsd-stable@freebsd.org] On Behalf Of Alan Cox
> > >> Sent: Monday, August 18, 2014 3:05 PM
> > >> To: freebsd-stable@freebsd.org
> > >> Subject: Re: vmdaemon CPU usage and poor performance in 10.0-RELEASE
> > >>
> > >> On 08/18/2014 13:42, Polyack, Steve wrote:
> > >>> Excuse my poorly formatted reply at the moment, but this seems to
> > >>> have fixed our problems.  I'm going to update the bug report with a
> > >>> note.
> > >>> Thanks Alan!
> > >> You're welcome.  And, thanks for letting me know of the outcome.
> > >>
> > > Actually, I may have spoken too soon, as it looks like we're seeing
> > > vmdaemon tying up the system again:
> > > root                6  100.0  0.0        0       16  -  DL   Wed04PM      4:37.95 [vmdaemon]
> > >
> > > Is there anything I can check to help narrow down what may be the
> > > problem?  KTrace/truss on the "process" doesn't give any information, I
> > > suppose because it's actually a kernel thread.
> >
> > Can you provide the full output of top?  Is there anything unusual about
> > the hardware or software configuration?
>
> This may have just been a fluke (maybe NFS caching the old vm_pageout.c
> during the first source build).  We've rebuilt and are monitoring it now.
>=20
> The hardware consists of a few Dell PowerEdge R720xd servers with 256GB
> of RAM and an array of SSDs (no ZFS).  64GB is dedicated to postgres
> shared_buffers right now.  FreeBSD 10, PostgreSQL 9.3, Slony-I v2.2.2, and
> redis-2.8.11 are all in use here.  I can't say that anything is unusual
> about the configuration.
>

We are still seeing the issue.  It seems to manifest once "Free" memory
drops below 10GB (of 256GB on the system), even though ~200GB of the total
is classified as Inactive.  For us, this took about 7 hours of database
activity (initial replication w/ slony).  Right now vmdaemon is consuming
100% CPU and shows 671:34 of CPU time, whereas it showed 0:00 up until the
problem manifested.  The full top output (as much as fits on my screen) is
below:

last pid: 62309;  load averages:  4.05,  4.24,  4.10    up 0+22:34:31  09:08:43
159 processes: 8 running, 145 sleeping, 1 waiting, 5 lock
CPU: 14.5% user,  0.0% nice,  4.9% system,  0.0% interrupt, 80.5% idle
Mem: 26G Active, 216G Inact, 4122M Wired, 1178M Cache, 1632M Buf, 2136M Free
Swap: 32G Total, 32G Free

  PID USERNAME       THR PRI NICE   SIZE    RES STATE   C   TIME    WCPU COMMAND
   11 root            32 155 ki31     0K   512K CPU31  31 669.6H 2934.23% idle
    6 root             1 -16    -     0K    16K CPU19  19 678:57 100.00% vmdaemon
 1963 pgsql            1  45    0 67538M   208M CPU0    0 121:46  17.38% postgres
 2037 pgsql            1  77    0 67536M  2200K *vm ob 14   6:24  15.97% postgres
 1864 pgsql            1  31    0 67536M  1290M semwai  4 174:41  15.19% postgres
 1996 pgsql            1  38    0 67538M   202M semwai 16 120:27  15.09% postgres
 1959 pgsql            1  39    0 67538M   204M CPU27  27 117:30  15.09% postgres
 1849 pgsql            1  32    0 67536M  1272M semwai 23 126:22  13.96% postgres
 1997 pgsql            1  31    0 67538M   206M CPU30  30 122:26  11.77% postgres
 2002 pgsql            1  34    0 67538M   182M sbwait 11  55:20  11.28% postgres
 1961 pgsql            1  32    0 67538M   206M CPU12  12 121:47  10.99% postgres
 1964 pgsql            1  30    0 67538M   206M semwai 28 122:08   9.86% postgres
 1962 pgsql            1  29    0 67538M  1286M sbwait  2  45:49   7.18% postgres
 1752 root             1  22    0 78356K  8688K CPU2    2 175:46   6.88% snmpd
 1965 pgsql            1  25    0 67538M   207M semwai  9 120:55   6.59% postgres
 1960 pgsql            1  23    0 67538M   177M semwai  6  52:42   4.88% postgres
 1863 pgsql            1  25    0 67542M   388M semwai 25   9:12   2.20% postgres
 1859 pgsql            1  22    0 67538M  1453M *vm ob 20   6:13   2.10% postgres
 1860 pgsql            1  22    0 67538M  1454M sbwait  8   6:08   1.95% postgres
 1848 pgsql            1  21    0 67586M 66676M *vm ob 30 517:07   1.66% postgres
 1856 pgsql            1  22    0 67538M   290M *vm ob 15   5:39   1.66% postgres
 1846 pgsql            1  21    0 67538M   163M sbwait 15   5:46   1.46% postgres
 1853 pgsql            1  21    0 67538M   110M sbwait 30   8:54   1.17% postgres
 1989 pgsql            1  23    0 67536M  5180K sbwait 18   1:41   0.98% postgres
    5 root             1 -16    -     0K    16K psleep  6   9:33   0.78% pagedaemon
 1854 pgsql            1  20    0 67538M   338M sbwait 22   5:38   0.78% postgres
 1861 pgsql            1  20    0 67538M   286M sbwait 15   6:13   0.68% postgres
 1857 pgsql            1  20    0 67538M  1454M semwai 10   6:19   0.49% postgres
 1999 pgsql            1  36    0 67538M   156M *vm ob 28 120:56   0.39% postgres
 1851 pgsql            1  20    0 67538M   136M sbwait 22   5:48   0.39% postgres
 1975 pgsql            1  20    0 67536M  5688K sbwait 25   1:40   0.29% postgres
 1858 pgsql            1  20    0 67538M   417M sbwait  3   5:55   0.20% postgres
 2031 pgsql            1  20    0 67536M  5664K sbwait  5   3:26   0.10% postgres
 1834 root            12  20    0 71892K 12848K select 20  34:05   0.00% slon
   12 root            78 -76    -     0K  1248K WAIT    0  25:47   0.00% intr
 2041 pgsql            1  20    0 67536M  5932K sbwait 14  12:50   0.00% postgres
 2039 pgsql            1  20    0 67536M  5960K sbwait 17   9:59   0.00% postgres
 2038 pgsql            1  20    0 67536M  5956K sbwait  6   8:21   0.00% postgres
 2040 pgsql            1  20    0 67536M  5996K sbwait  7   8:20   0.00% postgres
 2032 pgsql            1  20    0 67536M  5800K sbwait 22   7:03   0.00% postgres
 2036 pgsql            1  20    0 67536M  5748K sbwait 23   6:38   0.00% postgres
 1812 pgsql            1  20    0 67538M 59185M select  1   5:46   0.00% postgres
 2005 pgsql            1  20    0 67536M  5788K sbwait 23   5:14   0.00% postgres
 2035 pgsql            1  20    0 67536M  4892K sbwait 18   4:52   0.00% <postgres>
 1852 pgsql            1  21    0 67536M  1230M semwai  7   4:47   0.00% postgres
   13 root             3  -8    -     0K    48K -      28   4:46   0.00% geom
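
To put numbers on the imbalance between Inactive and Free memory in the top
output above, here is a rough Python sketch that parses the "Mem:" line and
compares the two.  The helper name parse_mem_line and the unit table are
made up for illustration and are not part of top; the sketch also assumes
the K/M/G suffixes are binary units.

```python
import re

# Size of each top(1) unit suffix, expressed in GiB.
# Assumption: the suffixes are binary (1G = 1024M = 1024*1024K).
UNIT_IN_GIB = {"K": 1 / (1024 * 1024), "M": 1 / 1024, "G": 1.0}


def parse_mem_line(line):
    """Map each category on a top(1) 'Mem:' line to its size in GiB."""
    sizes = {}
    # Each entry looks like "216G Inact" or "2136M Free".
    for value, unit, name in re.findall(r"(\d+)([KMG]) (\w+)", line):
        sizes[name] = int(value) * UNIT_IN_GIB[unit]
    return sizes


# The "Mem:" line copied from the top output in this message.
mem = parse_mem_line(
    "Mem: 26G Active, 216G Inact, 4122M Wired, 1178M Cache, 1632M Buf, 2136M Free"
)
inact_to_free = mem["Inact"] / mem["Free"]
print(f"Free: {mem['Free']:.1f} GiB, Inact: {mem['Inact']:.0f} GiB, "
      f"Inact/Free ratio: {inact_to_free:.0f}")
```

Run against the line above, this shows that Inactive memory is roughly two
orders of magnitude larger than Free memory at the time of the snapshot.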


Thanks,

Steve


