Stream result for AMD 2GHz "Barcelona" 2P (KFSN4-DRE) system

From: Waldecker, Brian <brian.waldecker@amd.com>
Date: Sun Nov 04 2007 - 08:32:56 CST

Hello,

Included below, please find the Stream result for a two cpu (2.0GHz)
Barcelona system with description. This is an ASUS split-power-plane
motherboard.

thank you,
Brian Waldecker
Advanced Micro Devices



System Description
------------------

ASUS KFSN4-DRE motherboard
Two AMD Opteron 2350 ("Barcelona") cpus (2.0 GHz)
8 x 2GB DDR2-667 CL5 memory ECC, REG, 2Rx4 (Micron)
default bios settings loaded

SUSE Linux Enterprise Server 10 (x86_64)
VERSION = 10
PATCHLEVEL = 1
Linux ghasus2P 2.6.16.46-0.10-smp #1 SMP Mon May 7 13:37:05 UTC 2007
x86_64 x86_64 x86_64 GNU/Linux

QLogic PathScale(TM) Compiler Suite: Version 3.0
GNU gcc version 4.0.2 (PathScale 3.0 driver)

Compiler invocation:
  pathcc -gnu3 -mp -Ofast -static -static-libgcc
-CG:load_exe=2:use_prefetchnta=ON \
         -LNO:blocking=0 -o stream_psc3_omp_c_Ofast stream.c


-------------------------------------------------------------
STREAM version $Revision: 5.6 $
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 384
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Number of Threads requested = 8
-------------------------------------------------------------
Printing one line per active thread....
Printing one line per active thread....
Printing one line per active thread....
Printing one line per active thread....
Printing one line per active thread....
Printing one line per active thread....
Printing one line per active thread....
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 2 microseconds.
Each test below will take on the order of 1631 microseconds.
   (= 815 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 17104.3364 0.0019 0.0019 0.0019
Scale: 16842.4806 0.0019 0.0019 0.0019
Add: 16673.0097 0.0029 0.0029 0.0029
Triad: 16760.4555 0.0029 0.0029 0.0029
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------


Received on Sun Nov 04 09:34:50 2007

This archive was generated by hypermail 2.1.8 : Tue Nov 06 2007 - 14:44:33 CST