Re: stream_c results for Mac Pro

From: Stefan Ludwig <stefan@stefanirene.org>
Date: Fri Oct 23 2009 - 08:09:16 CDT

Updated results after a reboot and very few things running. I guess
having an L3 that doesn't get thrashed all the time is useful. :-)

Cheers,

Stefan

---

$ make clean all run
rm -f stream_f stream_c *.o
gcc -O2 stream.c -o stream_c
./stream_c
-------------------------------------------------------------
STREAM version $Revision: 5.9 $
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 3668 microseconds.
    (= 3668 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 8427.5856 0.0039 0.0038 0.0042
Scale: 8054.3524 0.0040 0.0040 0.0041
Add: 8817.3517 0.0056 0.0054 0.0058
Triad: 8953.4195 0.0056 0.0054 0.0061
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------

On 23.10.2009, at 09:53, Stefan Ludwig wrote:

> Hi John
>
> Thanks for maintaining your site and the stream benchmark.
>
> Here are results for my Mac Pro, running SnowLeopard (OS X 10.6.1).
>
> Cheers,
>
> Stefan Ludwig
>
> Model Name: Mac Pro
> Model Identifier: MacPro4,1
> Processor Name: Quad-Core Intel Xeon
> Processor Speed: 2.66 GHz
> Number Of Processors: 1
> Total Number Of Cores: 4
> L2 Cache (per core): 256 KB
> L3 Cache: 8 MB
> Memory: 8 GB (4 x 2GB, DDR3 at 1066 Mbps)
>
> $ make clean all run
> rm -f stream_c *.o
> gcc -O2 stream.c -o stream_c
> ./stream_c
> -------------------------------------------------------------
> STREAM version $Revision: 5.9 $
> -------------------------------------------------------------
> This system uses 8 bytes per DOUBLE PRECISION word.
> -------------------------------------------------------------
> Array size = 2000000, Offset = 0
> Total memory required = 45.8 MB.
> Each test is run 10 times, but only
> the *best* time for each is used.
> -------------------------------------------------------------
> Printing one line per active thread....
> -------------------------------------------------------------
> Your clock granularity/precision appears to be 1 microseconds.
> Each test below will take on the order of 4095 microseconds.
> (= 4095 clock ticks)
> Increase the size of the arrays if this shows that
> you are not getting at least 20 clock ticks per test.
> -------------------------------------------------------------
> WARNING -- The above is only a rough guideline.
> For best results, please be sure you know the
> precision of your system timer.
> -------------------------------------------------------------
> Function Rate (MB/s) Avg time Min time Max time
> Copy: 7396.9539 0.0044 0.0043 0.0046
> Scale: 6988.6867 0.0047 0.0046 0.0048
> Add: 7358.4281 0.0066 0.0065 0.0067
> Triad: 7070.2930 0.0069 0.0068 0.0073
> -------------------------------------------------------------
> Solution Validates
> -------------------------------------------------------------
>
> $ gcc -v
> Using built-in specs.
> Target: i686-apple-darwin10
> Configured with: /var/tmp/gcc/gcc-5646~6/src/configure --disable-
> checking --enable-werror --prefix=/usr --mandir=/share/man --enable-
> languages=c,objc,c++,obj-c++ --program-transform-name=/^[cg][^.-]*$/
> s/$/-4.2/ --with-slibdir=/usr/lib --build=i686-apple-darwin10 --with-
> gxx-include-dir=/include/c++/4.2.1 --program-prefix=i686-apple-
> darwin10- --host=x86_64-apple-darwin10 --target=i686-apple-darwin10
> Thread model: posix
> gcc version 4.2.1 (Apple Inc. build 5646)
>



Received on Sat Oct 24 11:37:14 2009

This archive was generated by hypermail 2.1.8 : Wed Feb 03 2010 - 10:00:31 CST