tuned STREAM on IBM BladeCenter JS21 (2CPUs 2.7 GHz)

From: Ly Vu (lyvu@us.ibm.com)
Date: Wed Feb 08 2006 - 11:31:51 CST


    Hi John,

    These are tuned STREAM results on an IBM BladeCenter
    JS21 with two 2.7 GHz CPUs.
    Large pages were used in all cases.

    Function    Rate (MB/s)   RMS time   Min time   Max time
    Copy:           7053.44        .08        .08        .08
    Scale:          6321.18        .09        .08        .09
    Add:            6560.83        .13        .12        .13
    Triad:          6521.02        .13        .12        .13
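
The rates above can be cross-checked against the reported minimum times. The following is a minimal Python sketch, not part of the original run. It assumes the tuned code uses the standard STREAM byte counting (Copy and Scale touch two arrays per element, Add and Triad touch three, 8 bytes per element, 1 MB = 10^6 bytes) and takes the array size of 33537024 elements from the output file. All variable names are illustrative.

```python
# Cross-check: implied time = bytes moved / reported rate.
# Assumption: standard STREAM counting rules apply to this tuned build.
N = 33537024          # "Array size" from the output file
BYTES_PER_WORD = 8    # "Assuming 8 bytes per DOUBLE PRECISION word"

# kernel -> (arrays touched per element, reported MB/s, reported min time in s)
kernels = {
    "Copy":  (2, 7053.44, 0.08),
    "Scale": (2, 6321.18, 0.08),
    "Add":   (3, 6560.83, 0.12),
    "Triad": (3, 6521.02, 0.12),
}

implied = {}
for name, (arrays, rate_mb_s, min_time) in kernels.items():
    bytes_moved = arrays * BYTES_PER_WORD * N
    implied[name] = bytes_moved / (rate_mb_s * 1.0e6)
    print(f"{name:6s} implied {implied[name]:.4f} s, reported {min_time:.2f} s")
```

Under these assumptions each implied time agrees with the reported minimum time to the two decimal places printed, which suggests the rates are derived from the best (minimum) time of the five runs.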

    Here is the full output file:
    --------------------------------------------------
     Requesting Large Pages
     Setting up for 2 CPUs per module
     Number of segments per array = 1
     CPU binding list : 0
     Shared Segment Pointer = 504403158265495552
     Shared Segment Pointer = 504403158533931008
     Shared Segment Pointer = 504403158802366464
     Segment Size (B) = 268435456 (MB = 256 )
     Array Size (B) = 268435456 (MB = 256 )
     Array Size (DW) = 33554432
     Num_threads = 2
     Num_threads = 2
     rebind: num_parthds is 2
     Starting Initialization
     Done With Initialization
     a(1) 1.00000000000000000
     b(M) 1.00000000000000000
     c(M) 1.00000000000000000
     Incremental Offset = 2560
     Number of Threads = 2
    ----------------------------------------------
     Double precision appears to have 16 digits of accuracy
     Assuming 8 bytes per DOUBLE PRECISION word
    ----------------------------------------------
     Array size = 33537024
     Offset = 0
     The total memory requirement is 767 MB
     You are running each test 5 times
     The *best* time for each test is used
     ----------------------------------------------------
     Your clock granularity appears to be less than one microsecond
     Your clock granularity/precision appears to be 1 microseconds
     The tests below will each take a time on the order
     of 94282 microseconds
        (= 94282 clock ticks)
     Increase the size of the arrays if this shows that
     you are not getting at least 20 clock ticks per test.
     ----------------------------------------------------
     WARNING -- The above is only a rough guideline.
     For best results, please be sure you know the
     precision of your system timer.
     ----------------------------------------------------
    Function Rate (MB/s) RMS time Min time Max time
    Copy: 7053.44 .08 .08 .08
    Scale: 6321.18 .09 .08 .09
    Add: 6560.83 .13 .12 .13
    Triad: 6521.02 .13 .12 .13
     Sum of a is = 50934355200000.0000
     Sum of b is = 10186871040000.0000
     Sum of c is = 13582494720000.0000
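
The three sums and the memory figure in the output can be reproduced from the array size alone. The following is a minimal Python sketch, not taken from the tuned source, which is not shown in this post. It assumes the usual STREAM kernel sequence (Copy, Scale, Add, Triad) with a scalar of 3.0, every element of a equal to 2.0 when the timed loop starts, and 5 repetitions as stated in the output. The starting value of a and the scalar are inferred assumptions, and the names are illustrative.

```python
# Reproduce the checksums by applying the four kernels to one element.
# Assumptions (not confirmed by the post): scalar = 3.0, a starts at 2.0.
N = 33537024   # "Array size" from the output file
NTIMES = 5     # "You are running each test 5 times"
SCALAR = 3.0   # assumed, as in the reference STREAM code

a, b, c = 2.0, 0.0, 0.0   # b and c are overwritten in the first pass
for _ in range(NTIMES):
    c = a                 # Copy
    b = SCALAR * c        # Scale
    c = a + b             # Add
    a = b + SCALAR * c    # Triad

sum_a, sum_b, sum_c = a * N, b * N, c * N
print(f"Sum of a = {sum_a:.4f}")
print(f"Sum of b = {sum_b:.4f}")
print(f"Sum of c = {sum_c:.4f}")

# Three arrays of N 8-byte words, expressed in units of 2**20 bytes.
total_mb = 3 * 8 * N // 2**20
print(f"Total memory requirement = {total_mb} MB")
```

Under these assumptions the sketch yields the same three sums and the 767 MB figure printed in the output, so the checksums are consistent with five full passes over all 33537024 elements.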

    ______________________________________________
    Ly Vu
    IBM Corp. - Austin, Texas.
    AIX/pSeries Performance
    Phone : (512) 838-8228
    Email : lyvu@us.ibm.com


