Superdome 32-way Stream results

From: Kirby Collins (kcollins@rsn.hp.com)
Date: Fri May 18 2001 - 16:21:09 CDT

    Attached are Stream results for a Superdome with:

    8 cells (in a single partition)
    32 cpus (fully populated)
    128GB of memory (fully populated)
    552MHz PA8600 cpus, 500MHz system frequency
    running HP-UX 11i

    compiled with Fortran 90 in 64-bit mode as follows:

    f90 -o stream_d.mp +extend_source +autodbl4 +DA2.0W +noppu +DS2.0 +O3 -Wl,+pd,L -Wl,-aarchive +Oparallel stream_d.f second_wall.o

    and source modified to adjust array size and put arrays in COMMON:

    for 16 and 32 cpu runs:
    63c63
    <       PARAMETER (n=2000000,offset=0,ndim=n+offset,ntimes=10)
    ---
    >       PARAMETER (n=53477800,offset=0,ndim=n+offset,ntimes=10)
    88c88
    < *     COMMON a,b,c
    ---
    >       COMMON a,b,c
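
    As a rough cross-check on the array size in the diff above, the footprint of the three arrays a, b and c can be estimated as below. This is only a sketch: it assumes 8 bytes per DOUBLE PRECISION word and that "MB" in the memory line of the output means 2^20 bytes. The function name is made up for illustration and is not part of stream_d.f.

```python
# Sketch: memory footprint of the three STREAM arrays (a, b, c).
# Assumptions: 8 bytes per DOUBLE PRECISION word; "MB" means 2**20 bytes.
BYTES_PER_WORD = 8
NUM_ARRAYS = 3


def stream_memory_mb(n, offset=0):
    """Return the approximate size of a, b and c together, in units of 2**20 bytes."""
    ndim = n + offset  # mirrors ndim=n+offset in the PARAMETER statement
    return NUM_ARRAYS * BYTES_PER_WORD * ndim / (1024 * 1024)


default_mb = stream_memory_mb(2_000_000)     # n before the change
modified_mb = stream_memory_mb(53_477_800)   # n used for the 16 and 32 cpu runs

print(f"default  n: {default_mb:8.1f} MB")
print(f"modified n: {modified_mb:8.1f} MB")
```

    Under these assumptions the modified n works out to roughly 1224 MB, which agrees with the memory requirement printed in the run output.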
    

    ----------------------------------------------
     Double precision appears to have 16 digits of accuracy
     Assuming 8 bytes per DOUBLE PRECISION word
    ----------------------------------------------
     Array size = 53477800
     Offset     = 0
     The total memory requirement is 1224 MB
     You are running each test 10 times
     --
     The *best* time for each test is used
     *EXCLUDING* the first and last iterations
    ----------------------------------------------------
     Your clock granularity/precision appears to be 2 microseconds
    ----------------------------------------------------
    Function     Rate (MB/s)    Avg time    Min time    Max time
    Copy:        12704.8280      0.0676      0.0673      0.0678
    Scale:       12735.0790      0.0674      0.0672      0.0677
    Add:         13485.4877      0.0954      0.0952      0.0956
    Triad:       13410.1041      0.0968      0.0957      0.1004
    ----------------------------------------------------
     Solution Validates!
    ----------------------------------------------------
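
    For anyone wanting to sanity-check the table, the rates can be approximately reproduced from the printed minimum times. The sketch below assumes the usual Stream byte counting (Copy and Scale touch two arrays per iteration, Add and Triad touch three), 8 bytes per word, and MB/s meaning 10^6 bytes per second. The variable and function names are illustrative only. Because the minimum times are printed to four decimal places, the recomputed rates match the reported ones only to within about 0.1%.

```python
# Sketch: recompute Stream rates from the printed minimum times.
# Assumptions: 8-byte words, MB/s = 1e6 bytes/s, and the usual
# per-kernel array counts (Copy/Scale: 2 arrays, Add/Triad: 3 arrays).
N = 53_477_800
BYTES_PER_WORD = 8

ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# Values copied from the run output above.
MIN_TIME_S = {"Copy": 0.0673, "Scale": 0.0672, "Add": 0.0952, "Triad": 0.0957}
REPORTED_MB_S = {
    "Copy": 12704.8280,
    "Scale": 12735.0790,
    "Add": 13485.4877,
    "Triad": 13410.1041,
}


def rate_mb_per_s(kernel):
    """Bytes moved by one pass of the kernel divided by its best time."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_WORD * N
    return bytes_moved / MIN_TIME_S[kernel] / 1.0e6


for kernel in ARRAYS_TOUCHED:
    recomputed = rate_mb_per_s(kernel)
    rel_diff = abs(recomputed - REPORTED_MB_S[kernel]) / REPORTED_MB_S[kernel]
    print(f"{kernel:6s} recomputed {recomputed:10.1f} MB/s "
          f"(reported {REPORTED_MB_S[kernel]:10.1f}, diff {rel_diff:.2%})")
```

    Dividing the Triad figure by 32 cpus gives a little over 400 MB/s per cpu, again under the same assumptions.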



    This archive was generated by hypermail 2b29 : Mon May 21 2001 - 08:38:48 CDT