STREAM results (Fujitsu SPARC Enterprise M9000 (128 cores))

From: B <hondou@jp.fujitsu.com>
Date: Sat Apr 14 2007 - 01:07:01 CDT

Dear Dr. McCalpin,

Please find our STREAM results for Fujitsu SPARC Enterprise
M9000 system to update the STREAM Web site.

         System Name: Fujitsu SPARC Enterprise M9000
            CPU Name: SPARC64 VI
             CPU MHz: 2400
      CPU(s) enabled: 128 cores, 64 chips, 2 cores/chip, 2 threads/core
       Primary Cache: 128 KB I + 128 KB D on chip per core
     Secondary Cache: 6 MB I+D on chip per chip
            L3 Cache: None
         Other Cache: None
              Memory: 1 TB (512 x 2 GB), 8-way interleaved
    Operating System: Solaris 10 7/07
            Compiler: Sun Studio 12
   Compilation Flags: -fast -xvector=no -m64 -xopenmp
                      -xprefetch=latx:3.4 -Qoption cg
                      -xchip=sparc64vi,-m_arch=sparcfmaf,-fma=fused
                      -xtypemap=integer:64
  STREAM Source Code: Fortran version (v5.0)
         OS Settings: default
   Shell Environment: OMP_NUM_THREADS=128
                      SUNW_MP_PROCBIND=" 0 2 4 6 8 10 12 14 16 18
                      20 22 24 26 28 30 32 34 36 38 40 42 44 46 48
                      50 52 54 56 58 60 62 64 66 68 70 72 74 76 78
                      80 82 84 86 88 90 92 94 96 98 100 102 104 106 108
                      110 112 114 116 118 120 122 124 126 128
                      130 132 134 136 138 140 142 144 146 148
                      150 152 154 156 158 160 162 164 166 168
                      170 172 174 176 178 180 182 184 186 188
                      190 192 194 196 198 200 202 204 206 208
                      210 212 214 216 218 220 222 224 226 228
                      230 232 234 236 238 240 242 244 246 248
                      250 252 254 "
                 Run: ppgsz -o heap=4m,stack=4m <stream>

Outputs:
 ----------------------------------------------------
 Array size = 675000000
 Offset = 524288
 The total memory requirement is 15449 MB
 You are running each test 10 times
 --
 The *best* time for each test is used
 *EXCLUDING* the first and last iterations
 ----------------------------------------------------
 Your clock granularity/precision appears to be 1 microseconds
 ----------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 224400.9987 0.0492 0.0481 0.0500
Scale: 223113.3641 0.0494 0.0484 0.0504
Add: 224271.3817 0.0734 0.0722 0.0743
Triad: 227059.3074 0.0725 0.0713 0.0735
 ----------------------------------------------------
 Solution Validates!
 ----------------------------------------------------

--
Mikio HONDOU
Enterprise Server Development Division
FUJITSU Limited
Phone:  +81 (0)44-754-3233
E-mail: hondou@jp.fujitsu.com
Received on Sat Apr 14 07:07:27 2007

This archive was generated by hypermail 2.1.8 : Tue Apr 17 2007 - 07:28:04 CDT