standard STREAM on IBM System p5 550Q (8CPUs 1.5 GHz)

From: Ly Vu (lyvu@us.ibm.com)
Date: Mon Oct 03 2005 - 14:03:30 CDT

  • Next message: Ly Vu: "tuned STREAM on IBM System p5 550Q (8CPUs 1.5 GHz)"

    These are standard STREAM results on an IBM System p5 550Q
    with eight 1.5 GHz cpus. This is a POWER5+ SMP machine.
    Large pages were used in all cases.

    Function Rate (MB/s) Avg time Min time Max time
    Copy: 15717.9163 .0682 .0682 .0682
    Scale: 15282.5470 .0702 .0701 .0702
    Add: 16917.2731 .0950 .0950 .0951
    Triad: 17330.8483 .0928 .0928 .0928

    Here is the full output file:
    -----------------------------

     Requesting Large Pages
     Setting up for 8 CPUs per module
     Number of segments per array = 2
     CPU binding list : 0 8
     Shared Segment Pointer = 504403158265495552
     Shared Segment Pointer = 504403158802366464
     Shared Segment Pointer = 504403159339237376
     Segment Size (B) = 268435456 (MB = 256 )
     Array Size (B) = 536870912 (MB = 512 )
     Array Size (DW) = 67108864
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     Num_threads = 16
     rebind: num_parthds is 16
     Starting Initialization
     Done With Initialization
     a(1) 1.00000000000000000
     b(M) 1.00000000000000000
     c(M) 1.00000000000000000

     Incremental Offset = 512
    ----------------------------------------------
     Double precision appears to have 16 digits of accuracy
     Assuming 8 bytes per DOUBLE PRECISION word
    ----------------------------------------------
     Array size = 66980864
     The total memory requirement is 1533 MB
     You are running each test 5 times
     --
     The *best* time for each test is used
     *EXCLUDING* the first and last iterations
     ----------------------------------------------------
     Your clock granularity appears to be less than one microsecond
     Your clock granularity/precision appears to be 1 microseconds
     ----------------------------------------------------
    Function Rate (MB/s) Avg time Min time Max time
    Copy: 15717.9163 .0682 .0682 .0682
    Scale: 15282.5470 .0702 .0701 .0702
    Add: 16917.2731 .0950 .0950 .0951
    Triad: 17330.8483 .0928 .0928 .0928
     ----------------------------------------------------
     Solution Validates!
     ----------------------------------------------------

    ______________________________________________
    Ly Vu
    IBM Corp. - Austin, Texas.
    RS/6000 Performance Analysis.
    Phone : (512) 838-8228
    Email : lyvu@us.ibm.com



    This archive was generated by hypermail 2.1.4 : Mon Oct 03 2005 - 20:46:12 CDT