Below are single-CPU STREAM results for an HP ProLiant BL20pG3, configured as follows:

HP ProLiant BL20pG3
2x3.6GHz/1M Xeon processors
8GB PC3200 memory (4x2GB DIMMs)
Red Hat Enterprise Linux 3 Update 3

----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 Array size = 45000000
 Offset     = 0
 The total memory requirement is 1029 MB
 You are running each test 100 times
 The *best* time for each test is used
----------------------------------------------------
 Your clock granularity/precision appears to be 10000 microseconds
 The tests below will each take a time on the order
 of 220000 microseconds
    (= 22 clock ticks)
 Increase the size of the arrays if this shows that
 you are not getting at least 20 clock ticks per test.
----------------------------------------------------
 WARNING -- The above is only a rough guideline.
 For best results, please be sure you know the
 precision of your system timer.
----------------------------------------------------
Function    Rate (MB/s)   RMS time   Min time   Max time
Copy:        3600.0000     0.2000     0.2000     0.2000
Scale:       3600.0000     0.2031     0.2000     0.2100
Add:         3857.1429     0.2869     0.2800     0.2900
Triad:       3724.1379     0.2934     0.2900     0.3000

 Sum of a is = 3.659050599255510E+125
 Sum of b is = 7.318101194156921E+124
 Sum of c is = 9.757468262585657E+124

STREAM v4.1
Compiled with Intel Compilers 8.1:
ifort -fpp -xP -parallel -ip -auto_ilp32 streamd.o secondwall.o
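
The reported rates can be cross-checked against the array size and the best (minimum) times. The following is a minimal Python sketch, not part of the benchmark itself. It assumes the usual STREAM byte-counting convention (Copy and Scale touch two arrays, Add and Triad touch three, and 1 MB = 10^6 bytes for the rate); the names ARRAYS_TOUCHED, MIN_TIME, rate_mb_per_s and total_memory_mib are made up for this illustration.

```python
# Sketch: recompute the STREAM rates from the figures printed in the output.
ARRAY_SIZE = 45_000_000   # "Array size" from the output
BYTES_PER_WORD = 8        # "8 bytes per DOUBLE PRECISION word"

# Arrays read or written per kernel (assumed STREAM convention).
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# "Min time" column, in seconds.
MIN_TIME = {"Copy": 0.20, "Scale": 0.20, "Add": 0.28, "Triad": 0.29}


def rate_mb_per_s(kernel):
    """Bytes moved by one pass of the kernel divided by its best time, in MB/s."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_WORD * ARRAY_SIZE
    return bytes_moved / MIN_TIME[kernel] / 1e6


def total_memory_mib():
    """Footprint of the three arrays a, b, c in units of 2**20 bytes."""
    return 3 * BYTES_PER_WORD * ARRAY_SIZE / 2**20


for kernel in MIN_TIME:
    print(f"{kernel + ':':<8}{rate_mb_per_s(kernel):12.4f} MB/s")
print(f"Total memory: {int(total_memory_mib())} MB")
```

Under these assumptions the sketch reproduces the Rate column and the 1029 MB memory figure. Note that with a clock granularity of 10000 microseconds the times are resolved only to 0.01 s, so a one-tick difference on a 0.20 s measurement shifts the computed rate by roughly 5%.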