tuned STREAM on IBM eServer p5 570 (1900 MHz, 8 cpu)

From: Frank Johnston (fjohn@us.ibm.com)
Date: Mon Jul 12 2004 - 16:10:14 CDT

  • Next message: Frank Johnston: "standard STREAM on IBM eServer p5 570 (1900 MHz, 16 cpu)"

    IBM eServer p5 570 (1900 MHz, 8 cpu, 36MB L3 cache) with DDR2 memory.

     Requesting Large Pages
     Setting up for 2 CPUs per module
     Number of segments per array = 4
     CPU binding list : 0 2 4 6
     Shared Segment Pointer = 504403158265495552
     Shared Segment Pointer = 504403159339237376
     Shared Segment Pointer = 504403160412979200
     Segment Size (B) = 268435456 (MB = 256 )
     Array Size (B) = 1073741824 (MB = 1024 )
     Array Size (DW) = 134217728
     Num_threads = 8
     Num_threads = 8
     Num_threads = 8
     Num_threads = 8
     Num_threads = 8
     Num_threads = 8
     Num_threads = 8
     Num_threads = 8
     rebind: num_parthds is 8
     Starting Initialization
     Done With Initialization
     a(1) 1.00000000000000000
     b(M) 1.00000000000000000
     c(M) 1.00000000000000000
     Incremental Offset = 1536
     Number of Threads = 8
    ----------------------------------------------
     Double precision appears to have 16 digits of accuracy
     Assuming 8 bytes per DOUBLE PRECISION word
    ----------------------------------------------
     Array size = 134149120
     Offset = 0
     The total memory requirement is 3070 MB
     You are running each test 5 times
     The *best* time for each test is used
     ----------------------------------------------------
     Your clock granularity appears to be less than one microsecond
     Your clock granularity/precision appears to be 1 microseconds
     The tests below will each take a time on the order
     of 48264 microseconds
        (= 48264 clock ticks)
     Increase the size of the arrays if this shows that
     you are not getting at least 20 clock ticks per test.
     ----------------------------------------------------
     WARNING -- The above is only a rough guideline.
     For best results, please be sure you know the
     precision of your system timer.
     ----------------------------------------------------
    Function Rate (MB/s) RMS time Min time Max time
    Copy: 38737.1668 .2149 .0554 .0594
    Scale: 38506.3626 .2150 .0557 .0615
    Add: 42548.4284 .2255 .0757 .0761
    Triad: 43037.4145 .2251 .0748 .0760
     Sum of a is = 203738455068750.000
     Sum of b is = 40747691013750.0000
     Sum of c is = 54330254685000.0000



    This archive was generated by hypermail 2.1.4 : Tue Jul 13 2004 - 08:50:46 CDT