225QC/2-cpu

From: John McCalpin (mccalpin)
Date: Thu Jul 09 1998 - 18:16:54 CDT


bsw-1 2# setenv MP_SET_NUMTHREADS 2
bsw-1 3# ./stream.mp.4e6
            10036960
            11EBB160
            13D3FD28
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 4000000
 Offset = 0
 The total memory requirement is 91 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity/precision appears to be 4 microseconds
 The tests below will each take a time on the order
 of 134646 microseconds
    (= 33662 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) Min time Max time Mean time RMS time Median
Copy: 331.95 0.1928 0.2065 0.2039 0.2040 0.2062
Scale: 336.37 0.1903 0.2085 0.2054 0.2055 0.2077
Add: 373.28 0.2572 0.2609 0.2592 0.2592 0.2592
Triad: 364.67 0.2633 0.2649 0.2637 0.2637 0.2634
-----------------------------------------------------------------------------
 Sum of a is = 57665039062.50000
 Sum of b is = 11533007812.50000
 Sum of c is = 15377343750.00000
 a(1),a(n) = 1153300781250.000 1153300781250.000
 b(1),b(n) = 230660156250.0000 230660156250.0000
 c(1),c(n) = 307546875000.0000 307546875000.0000
bsw-1 4# !!
./stream.mp.4e6
            10036960
            11EBB160
            13D3FD28
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 4000000
 Offset = 0
 The total memory requirement is 91 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity/precision appears to be 4 microseconds
 The tests below will each take a time on the order
 of 134403 microseconds
    (= 33601 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) Min time Max time Mean time RMS time Median
Copy: 333.35 0.1920 0.2063 0.2048 0.2048 0.2062
Scale: 337.55 0.1896 0.2078 0.2053 0.2053 0.2069
Add: 373.45 0.2571 0.2591 0.2588 0.2588 0.2590
Triad: 364.63 0.2633 0.2635 0.2633 0.2633 0.2633
-----------------------------------------------------------------------------
 Sum of a is = 57665039062.50000
 Sum of b is = 11533007812.50000
 Sum of c is = 15377343750.00000
 a(1),a(n) = 1153300781250.000 1153300781250.000
 b(1),b(n) = 230660156250.0000 230660156250.0000
 c(1),c(n) = 307546875000.0000 307546875000.0000

-- 
--
John D. McCalpin, Ph.D.      Server System Architect
Server Platform Engineering  http://reality.sgi.com/mccalpin/
Silicon Graphics, Inc.       mccalpin@sgi.com  650-933-7407



This archive was generated by hypermail 2b29 : Tue Apr 18 2000 - 05:23:07 CDT