stream 128 on IBM 990

From: John McCalpin (mccalpin@grieg.udel.edu)
Date: Fri May 02 1997 - 15:18:25 CDT


xlf -O3 -qarch=pwrx stream_128.f -o stream
"stream_128.f", 1500-036 (I) Optimization level 3 has the potential to alter the semantics of a program. Please refer to documentation on -O3 and the STRICT option for more information.
** stream === End of Compilation 1 ===
** realsize === End of Compilation 2 ===
** confuse === End of Compilation 3 ===
** checktick === End of Compilation 4 ===
** second === End of Compilation 5 ===
1501-51xlf -O3 -qarch=pwrx stream_128.f -o stream
"stream_128.f", 1500-036 (I) Optimization level 3 has the potential to alter the semantics of a program. Please refer to documentation on -O3 and the STRICT option for more information.
** stream === End of Compilation 1 ===
** realsize === End of Compilation 2 ===
** confuse === End of Compilation 3 ===
** checktick === End of Compilation 4 ===
** second === End of Compilation 5 ===
1501-510 Compilation successful for file stream_128.f.
grieg 21% ./stream
 Hmmmm. I am unable to determine the size.
 Please enter the number of Bytes per DOUBLE PRECISION number :
16
 You have manually entered a size of 16 bytes per DOUBLE PRECISION number
----------------------------------------------
 Array size = 2000000
 Offset = 0
 The total memory requirement is 91 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity/precision appears to be 10000 microseconds
 The tests below will each take a time on the order
 of 440000 microseconds
    (= 44 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know tho be 10000 microseconds
 The tests below will each take a time on the order
 of 440000 microseconds
    (= 44 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) Min time Max time Mean time RMS time Median
Copy: 800.00 .0800 .1300 .1050 .1059 .1100
Scale: 130.61 .4900 .5400 .5110 .5113 .5100
Add: 118.52 .8100 .9200 .8850 .8856 .8800
Triad: 68.57 1.4000 1.5700 1.4790 1.4800 1.4100
-----------------------------------------------------------------------------
 All times are
     .1100 .5000 .9200 1.5400
     .1000 .4900 .9000 1.4500
     .1100 .5400 .8800 1.5300
     .0900 .5100 .8700 1.5200
     .1000 .5100 .9100 1.4000
     .1200 .5100 .8500 1.4200
     .1300 .5200 .9100 1.4500
     .0800 .5400 .8800 1.4400
     .1100 .5000 .8100 1.5700
     .1000 .4900 .9200 1.4700
-----------------------------------------------------------------------------
 Sum of a is = 115330078125.000000
 Sum of b is = 23066015625.0000000
 Sum of c is = 30754687500.0000000



This archive was generated by hypermail 2b29 : Tue Apr 18 2000 - 05:23:06 CDT