225QC in L2

From: John McCalpin (mccalpin)
Date: Thu Jul 09 1998 - 18:20:45 CDT


bsw-1 6# f77 -n32 -mips4 -Ofast=ip27 stream.f second.o -o stream.small
bsw-1 7# ./stream.small
            100149D8
            1004F358
            10089CD8
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 30000
 Offset = 0
 The total memory requirement is 0 MB
 You are running each test 20 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity/precision appears to be 4 microseconds
 The tests below will each take a time on the order
 of 306 microseconds
    (= 77 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) Min time Max time Mean time RMS time Median
Copy: 1030.04 0.0005 0.0005 0.0005 0.0005 0.0005
Scale: 1085.97 0.0004 0.0005 0.0004 0.0004 0.0004
Add: 1089.26 0.0007 0.0007 0.0007 0.0007 0.0007
Triad: 1182.27 0.0006 0.0006 0.0006 0.0006 0.0006
-----------------------------------------------------------------------------
 Sum of a is = 3.3252567300810404E+22
 Sum of b is = 6.6505134601593369E+21
 Sum of c is = 8.8673512802117495E+21
 a(1),a(n) = 6.6505134601593012E+23 6.6505134601593012E+23
 b(1),b(n) = 1.3301026920318602E+23 1.3301026920318602E+23
 c(1),c(n) = 1.7734702560424803E+23 1.7734702560424803E+23
bsw-1 8# ^P
^P - Command not found
bsw-1 9# ./stream.small
            100149D8
            1004F358
            10089CD8
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 30000
 Offset = 0
 The total memory requirement is 0 MB
 You are running each test 20 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity/precision appears to be 4 microseconds
 The tests below will each take a time on the order
 of 299 microseconds
    (= 75 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) Min time Max time Mean time RMS time Median
Copy: 1025.64 0.0005 0.0005 0.0005 0.0005 0.0005
Scale: 1085.97 0.0004 0.0005 0.0004 0.0004 0.0004
Add: 1089.26 0.0007 0.0007 0.0007 0.0007 0.0007
Triad: 1184.21 0.0006 0.0006 0.0006 0.0006 0.0006
-----------------------------------------------------------------------------
 Sum of a is = 3.3252567300810404E+22
 Sum of b is = 6.6505134601593369E+21
 Sum of c is = 8.8673512802117495E+21
 a(1),a(n) = 6.6505134601593012E+23 6.6505134601593012E+23
 b(1),b(n) = 1.3301026920318602E+23 1.3301026920318602E+23
 c(1),c(n) = 1.7734702560424803E+23 1.7734702560424803E+23
bsw-1 10# !!
./stream.small
            100149D8
            1004F358
            10089CD8
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 30000
 Offset = 0
 The total memory requirement is 0 MB
 You are running each test 20 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity/precision appears to be 4 microseconds
 The tests below will each take a time on the order
 of 304 microseconds
    (= 76 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) Min time Max time Mean time RMS time Median
Copy: 1032.26 0.0005 0.0005 0.0005 0.0005 0.0005
Scale: 1088.44 0.0004 0.0005 0.0004 0.0004 0.0004
Add: 1089.26 0.0007 0.0007 0.0007 0.0007 0.0007
Triad: 1182.27 0.0006 0.0006 0.0006 0.0006 0.0006
-----------------------------------------------------------------------------
 Sum of a is = 3.3252567300810404E+22
 Sum of b is = 6.6505134601593369E+21
 Sum of c is = 8.8673512802117495E+21
 a(1),a(n) = 6.6505134601593012E+23 6.6505134601593012E+23
 b(1),b(n) = 1.3301026920318602E+23 1.3301026920318602E+23
 c(1),c(n) = 1.7734702560424803E+23 1.7734702560424803E+23

-- 
--
John D. McCalpin, Ph.D.      Server System Architect
Server Platform Engineering  http://reality.sgi.com/mccalpin/
Silicon Graphics, Inc.       mccalpin@sgi.com  650-933-7407



This archive was generated by hypermail 2b29 : Tue Apr 18 2000 - 05:23:07 CDT