Re: Stream Results - Sun E10000

From: Brian Whitney (Brian.Whitney@West.Sun.COM)
Date: Fri Jun 18 1999 - 11:05:38 CDT


Dr McCalpin...

I have rerun things in 64-bit mode for 64, 48, 32, and 16 CPUs.
I did 5 runs per CPU config. Attached are the logs. The array size is
300,000,000. This is 4.6M array elements per processor, or 36.8MB
array per processor. This should be large enough (correct?). All
runs are on a full system, just less CPUs.

Results track pretty well from what I submitted before, but these are
a bit higher in general.

As far as the presentation of you table, it would be good to have
translations of the machine name to a bit more detail. I just
need a way to translate the listed name to what a particular company
offerred.

The names of the attached files should be pretty self explanatory, but

pstx2-64.logx16.400x8.64 64 processor results
pstx2-64.logx16.400x8.48 48 processor results
pstx2-64.logx16.400x8.32 32 processor results
pstx2-64.logx16.400x8.16 16 processor results

keeper@sae3:/home/keeper% 201) sys
sae3 06/18/99 09:00:05 SunOS 5.7 Generic_106541-02 sun4u
400/100 MHz 64 CPUs 8 MB E$ 64.00 GB memory SUNW,Ultra-Enterprise-10000
keeper@sae3:/home/keeper% 202)

Brian


----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 596697 microseconds
    (= 596697 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 12141.5503 0.4165 0.3953 0.4814
Scale: 7970.4096 0.6312 0.6022 0.6602
Add: 8599.0450 0.8534 0.8373 0.8617
Triad: 8600.4443 0.8726 0.8372 0.8786
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 595651 microseconds
    (= 595651 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 12130.0444 0.4414 0.3957 0.6169
Scale: 7958.3941 0.6284 0.6031 0.6484
Add: 8599.2201 0.8521 0.8373 0.8739
Triad: 8630.3129 0.8726 0.8343 0.8958
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 579513 microseconds
    (= 579513 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 12167.0028 0.4196 0.3945 0.5000
Scale: 7970.0226 0.6201 0.6023 0.6277
Add: 8605.7956 0.8408 0.8366 0.8469
Triad: 8616.0746 0.8604 0.8356 0.8677
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 597482 microseconds
    (= 597482 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 12132.9098 0.4200 0.3956 0.4814
Scale: 7966.3353 0.6192 0.6025 0.6306
Add: 8472.3587 0.8530 0.8498 0.8620
Triad: 8613.7797 0.8651 0.8359 0.8741
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 581655 microseconds
    (= 581655 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 12159.9570 0.4243 0.3947 0.5513
Scale: 7977.7957 0.6290 0.6017 0.6637
Add: 8610.8937 0.8489 0.8362 0.8539
Triad: 8251.6698 0.8734 0.8726 0.8741
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)


----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 599304 microseconds
    (= 599304 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 11577.5532 0.4209 0.4146 0.4680
Scale: 7353.7291 0.6549 0.6527 0.6694
Add: 7789.0422 0.9253 0.9244 0.9274
Triad: 7201.2731 1.0018 0.9998 1.0027
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 600044 microseconds
    (= 600044 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 11729.9561 0.4144 0.4092 0.4155
Scale: 7395.0202 0.6515 0.6491 0.6678
Add: 7823.1004 0.9209 0.9204 0.9219
Triad: 7229.6332 0.9965 0.9959 0.9975
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 599581 microseconds
    (= 599581 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 11595.9554 0.4250 0.4139 0.5069
Scale: 7415.4925 0.6500 0.6473 0.6657
Add: 7840.7754 0.9189 0.9183 0.9195
Triad: 7238.2820 0.9956 0.9947 0.9964
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 600269 microseconds
    (= 600269 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 11610.4149 0.4328 0.4134 0.5741
Scale: 7479.3262 0.6480 0.6418 0.6954
Add: 7862.3724 0.9164 0.9158 0.9169
Triad: 7274.7876 0.9904 0.9897 0.9909
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 599749 microseconds
    (= 599749 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 11599.4181 0.4229 0.4138 0.4933
Scale: 7477.0253 0.6449 0.6420 0.6606
Add: 7840.6027 0.9189 0.9183 0.9203
Triad: 7306.8919 0.9860 0.9854 0.9868
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)


----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 780404 microseconds
    (= 780404 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 9367.4950 0.5307 0.5124 0.6678
Scale: 5874.4214 0.8187 0.8171 0.8261
Add: 6068.6759 1.1876 1.1864 1.1885
Triad: 5176.2323 1.3918 1.3910 1.3928
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 781327 microseconds
    (= 781327 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 9362.0170 0.5134 0.5127 0.5140
Scale: 5872.6900 0.8187 0.8173 0.8272
Add: 6066.3548 1.1878 1.1869 1.1885
Triad: 5174.4295 1.3922 1.3915 1.3934
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 780310 microseconds
    (= 780310 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 9382.2079 0.5224 0.5116 0.6046
Scale: 5878.0314 0.8178 0.8166 0.8247
Add: 6073.9815 1.1861 1.1854 1.1870
Triad: 5175.0300 1.3920 1.3913 1.3931
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 780756 microseconds
    (= 780756 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 9447.7536 0.5272 0.5081 0.6665
Scale: 5902.1595 0.8144 0.8133 0.8209
Add: 6096.9868 1.1818 1.1809 1.1825
Triad: 5179.7939 1.3907 1.3900 1.3912
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 780661 microseconds
    (= 780661 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 9457.4364 0.5110 0.5075 0.5116
Scale: 5898.7208 0.8154 0.8137 0.8229
Add: 6088.9977 1.1833 1.1825 1.1840
Triad: 5175.8864 1.3916 1.3911 1.3923
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)


----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 1423088 microseconds
    (= 1423088 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 5415.7614 0.8872 0.8863 0.8914
Scale: 3489.8197 1.3760 1.3754 1.3776
Add: 3581.2702 2.0126 2.0105 2.0216
Triad: 2788.4158 2.5837 2.5821 2.5846
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 1420296 microseconds
    (= 1420296 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 5427.8851 0.8868 0.8843 0.8899
Scale: 3479.8422 1.3805 1.3794 1.3834
Add: 3560.7318 2.0231 2.0221 2.0242
Triad: 2786.9876 2.5845 2.5834 2.5853
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 1422022 microseconds
    (= 1422022 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 5417.5998 0.8906 0.8860 0.9264
Scale: 3497.1673 1.3732 1.3725 1.3743
Add: 3593.3089 2.0056 2.0037 2.0117
Triad: 2790.1061 2.5825 2.5805 2.5836
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 1420779 microseconds
    (= 1420779 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 5385.5530 0.8929 0.8913 0.9047
Scale: 3468.2695 1.3848 1.3840 1.3862
Add: 3552.4322 2.0277 2.0268 2.0285
Triad: 2781.6282 2.5892 2.5884 2.5902
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 300000000
 Offset = 0
 The total memory requirement is 6866 MB
 You are running each test 10 times
 The *best* time for each test is used
 ----------------------------------------------------
 Your clock granularity appears to be less than one microsecond
 Your clock granularity/precision appears to be 1 microseconds
 The tests below will each take a time on the order
 of 1422451 microseconds
    (= 1422451 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
 ----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
 ----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 5426.0808 0.8864 0.8846 0.8881
Scale: 3492.3740 1.3755 1.3744 1.3764
Add: 3579.7885 2.0130 2.0113 2.0185
Triad: 2788.6280 2.5829 2.5819 2.5837
 Sum of a is = 3.4599023433963D+20
 Sum of b is = 6.9198046533463D+19
 Sum of c is = 9.2264062741574D+19
 Note: Nonstandard floating-point mode enabled
 See the Numerical Computation Guide, ieee_sun(3M)



This archive was generated by hypermail 2b29 : Tue Apr 18 2000 - 05:23:08 CDT