STREAM results (Fujitsu SPARC Enterprise M8000 (64 cores))

From: Mikio HONDOU <hondou@jp.fujitsu.com>
Date: Mon Oct 12 2009 - 21:49:45 CDT

Dear Dr. McCalpin,

Please find below our STREAM results for the Fujitsu SPARC Enterprise
M8000 system, submitted to update the STREAM Web site.

         System Name: Fujitsu SPARC Enterprise M8000
            CPU Name: SPARC64 VII
             CPU MHz: 2880
      CPU(s) enabled: 64 cores, 16 chips, 4 cores/chip, 2 threads/core
       Primary Cache: 64 KB I + 64 KB D on chip per core
     Secondary Cache: 6 MB I+D on chip per chip
            L3 Cache: None
         Other Cache: None
              Memory: 384 GB (64 x 2 GB + 64 x 4 GB), 8-way interleaved
    Operating System: Solaris 10 5/09 with patches 119963-13, 120753-06, 118683-03
            Compiler: Sun Studio 12 Update 1
   Compilation Flags: -fast -xvector=no -m64 -xopenmp
                      -xprefetch=auto
                      -xtypemap=integer:64
  STREAM Source Code: the v5.6 f90 version with format changes
                      for large arrays.
         OS Settings: default
   Shell Environment: MPSSHEAP=4M
                      MPSSSTACK=4M
                      LD_PRELOAD=mpss.so.1:madv.so.1
                      MADV=access_lwp
                      OMP_NUM_THREADS=16
                      SUNW_MP_PROCBIND="0 8 16 24 32 40 48 56 64
                      72 80 88 96 104 112 120"
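
The SUNW_MP_PROCBIND list above steps through virtual processor IDs in strides of 8, which equals 4 cores/chip x 2 threads/core. Assuming the IDs are numbered consecutively within each chip (an assumption; this report does not state the numbering), the list would place one OpenMP thread on each of the 16 chips. A minimal Python sketch that regenerates the list under that assumption follows; the function name `procbind_list` is hypothetical.

```python
def procbind_list(chips=16, cores_per_chip=4, threads_per_core=2):
    """Return the first virtual processor ID of each chip.

    Assumes virtual processor IDs are consecutive within a chip,
    so each chip owns cores_per_chip * threads_per_core IDs.
    """
    stride = cores_per_chip * threads_per_core  # 8 IDs per chip here
    return [chip * stride for chip in range(chips)]


ids = procbind_list()
# Format the list the way the shell environment above presents it.
print('SUNW_MP_PROCBIND="' + " ".join(str(i) for i in ids) + '"')
```

With the default arguments the result has 16 entries, matching OMP_NUM_THREADS=16.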

Outputs:
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 ----------------------------------------------
 STREAM Version $Revision: 5.6 $
 ----------------------------------------------
 Array size = 1700000032
 Offset = 0
 The total memory requirement is 38909 MB
 You are running each test 10 times
 --
 The *best* time for each test is used
 *EXCLUDING* the first and last iterations
 ----------------------------------------------
 Number of Threads = 16
 ----------------------------------------------
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 Printing one line per active thread....
 ----------------------------------------------------
 Your clock granularity/precision appears to be 1 microseconds
 ----------------------------------------------------
 Function     Rate (MB/s)   Avg time   Min time   Max time
 Copy:        69711.0560     0.3904     0.3902     0.3907
 Scale:       71800.3870     0.3793     0.3788     0.3798
 Add:         82492.7626     0.4952     0.4946     0.4971
 Triad:       81750.7441     0.4997     0.4991     0.5014
 ----------------------------------------------------
 Solution Validates!
 ----------------------------------------------------
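
The figures in the output above can be cross-checked from the array size and the minimum times. The sketch below assumes the usual STREAM accounting: Copy and Scale touch two arrays per iteration, Add and Triad touch three, rates are reported in units of 10^6 bytes/s, and the memory requirement is reported in units of 2^20 bytes. The names `total_memory_mb` and `rate_mb_per_s` are hypothetical helpers, not part of STREAM. Because the printed minimum times are rounded to four decimals, the recomputed rates agree with the reported ones only to roughly four significant digits.

```python
WORD_BYTES = 8            # "Assuming 8 bytes per DOUBLE PRECISION word"
N = 1700000032            # "Array size" from the output above

# Arrays read or written per iteration (assumed STREAM convention).
ARRAYS_MOVED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# "Min time" column from the output above, in seconds.
MIN_TIME = {"Copy": 0.3902, "Scale": 0.3788, "Add": 0.4946, "Triad": 0.4991}


def total_memory_mb(n=N, word_bytes=WORD_BYTES):
    """Memory for the three arrays, in whole units of 2**20 bytes."""
    return (3 * word_bytes * n) // (1024 * 1024)


def rate_mb_per_s(kernel, n=N, word_bytes=WORD_BYTES):
    """Bandwidth in units of 10**6 bytes/s, from the best (minimum) time."""
    bytes_moved = ARRAYS_MOVED[kernel] * word_bytes * n
    return 1.0e-6 * bytes_moved / MIN_TIME[kernel]


print("Total memory (MB):", total_memory_mb())
for kernel in ("Copy", "Scale", "Add", "Triad"):
    print(f"{kernel}: {rate_mb_per_s(kernel):.1f} MB/s")
```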

--
Mikio HONDOU
Next Generation Technical Computing Unit
FUJITSU Limited
E-mail: hondou@jp.fujitsu.com

Received on Wed Oct 14 12:17:56 2009
