Date:      Wed, 14 Apr 2010 12:47:31 -0500
From:      Adam Vande More <amvandemore@gmail.com>
To:        Andriy Gapon <avg@freebsd.org>
Cc:        alc@freebsd.org, Maho NAKATA <chat95@mac.com>, alan.l.cox@gmail.com, freebsd-stable@freebsd.org, als@modulus.org
Subject:   Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920
Message-ID:  <n2o6201873e1004141047t97d89cb0o2688fae1875eae08@mail.gmail.com>
In-Reply-To: <4BC5F289.7020408@freebsd.org>
References:  <h2yca3526251004122230l909bc93ey916d7fe0dd24fd33@mail.gmail.com> <4BC402B7.5000400@modulus.org> <v2gca3526251004122322i709c523ct4f93bcf75a778a8e@mail.gmail.com> <20100414.082109.29593248145846106.chat95@mac.com> <4BC5DEB4.1090208@freebsd.org> <x2k6201873e1004140934z6f7518b9j72ffd9e1adc1ad49@mail.gmail.com> <m2y6201873e1004140945n855c8800we9baced2e293f270@mail.gmail.com> <4BC5F289.7020408@freebsd.org>

On Wed, Apr 14, 2010 at 11:51 AM, Andriy Gapon <avg@freebsd.org> wrote:

> on 14/04/2010 19:45 Adam Vande More said the following:
> >
> > also if I run cpuset on the dgemm then the utilization is basically at
> > the theoretical max for one core so at least that part is working.
>
> You can also try procstat -t <pid> to find out thread IDs and cpuset -t to
> pin the threads to the cores.
>

With the threads pinned that way, it gets to around 90%.

time : 103.617271 or 27.140992
Mflops : 47172.925449
n: 4100
time : 113.910669 or 30.520677
Mflops : 45174.496186
n: 4200
time : 121.880695 or 32.068070
Mflops : 46217.711013
n: 4300

I tried a couple of different thread-to-core assignments, but it didn't
seem to make a difference.

galacticdominator% procstat -t 1922
  PID    TID COMM             TDNAME           CPU  PRI STATE   WCHAN
 1922 100092 dgemm            initial thread     0  190 run     -
 1922 100268 dgemm            -                  1  190 run     -
 1922 100270 dgemm            -                  1  191 run     -
 1922 100272 dgemm            -                  3  190 run     -
 1922 100273 dgemm            -                  2  191 run     -
 1922 100274 dgemm            -                  2  191 run     -
 1922 100282 dgemm            -                  0  190 run     -
 1922 100283 dgemm            -                  3  190 run     -

galacticdominator% cpuset -t 100092 -l 0
galacticdominator% cpuset -t 100268 -l 1
galacticdominator% cpuset -t 100270 -l 2
galacticdominator% cpuset -t 100272 -l 3
galacticdominator% cpuset -t 100273 -l 0
galacticdominator% cpuset -t 100274 -l 1
galacticdominator% cpuset -t 100282 -l 2
galacticdominator% cpuset -t 100283 -l 3


galacticdominator% cpuset -t 100092 -l 0
galacticdominator% cpuset -t 100268 -l 0
galacticdominator% cpuset -t 100270 -l 1
galacticdominator% cpuset -t 100272 -l 1
galacticdominator% cpuset -t 100273 -l 2
galacticdominator% cpuset -t 100274 -l 2
galacticdominator% cpuset -t 100282 -l 3
galacticdominator% cpuset -t 100283 -l 3
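The first set of commands above follows a simple round-robin pattern, so it can be generated rather than typed by hand. The following is only a minimal sketch: `plan_pinning` is a hypothetical helper name, and it assumes the `procstat -t` column layout shown above (TID in the second column) and that the core count is passed in by the caller. It only prints `cpuset` commands, so the plan can be reviewed before anything is applied.

```shell
#!/bin/sh
# Sketch: read `procstat -t <pid>` output on stdin and print one
# `cpuset -t <tid> -l <cpu>` command per thread, assigning threads to
# cores round-robin. plan_pinning is a hypothetical name, not a FreeBSD tool.
plan_pinning() {
    ncpu="$1"    # number of cores to spread the threads over (caller-supplied)
    awk -v ncpu="$ncpu" '
        $2 ~ /^[0-9]+$/ {    # skip the header: the TID column must be numeric
            printf "cpuset -t %s -l %d\n", $2, n % ncpu
            n++
        }'
}

# Usage on FreeBSD (PID 1922 and 4 cores are taken from the transcript above):
#   procstat -t 1922 | plan_pinning 4         # print the plan only
#   procstat -t 1922 | plan_pinning 4 | sh    # apply it
```

The second set (two threads per core, in TID order) would need a different mapping, for example integer division of the thread index instead of the modulo.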


These results are from the second set of cpuset assignments:

time : 150.348850 or 40.488350
Mflops : 45022.951141
n: 4600
time : 161.968982 or 43.589618
Mflops : 44669.884500
n: 4700

Since this is a full-fledged desktop environment, 90% utilization seems
pretty good.  I'm no expert, Andriy, but it seems that if gotoblas
implemented some of the FreeBSD optimizations, we'd be in the same
ballpark.


-- 
Adam Vande More


