Below are single cpu stream results for an HP ProLiant DL360G4, configured as follows:

HP ProLiant DL360G4
2x3.6GHz/1M Xeon processors
8GB PC2700 memory (4x2GB DIMMs)
Red Hat Enterprise Linux 3 Update 3

----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 5000000
Offset = 0
The total memory requirement is 114 MB
You are running each test 100 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order of 26167 microseconds
   (= 26167 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function     Rate (MB/s)   RMS time   Min time   Max time
Copy:        3420.5576     0.0236     0.0234     0.0270
Scale:       3434.2134     0.0235     0.0233     0.0264
Add:         3369.7453     0.0359     0.0356     0.0413
Triad:       3425.9286     0.0354     0.0350     0.0403
Sum of a is = 4.065611775448443E+124
Sum of b is = 8.131223550519913E+123
Sum of c is = 1.084163140035985E+124
----------------------------------------------------
Solution Validates!
----------------------------------------------------

STREAM Source version 4.1
Compiled with Intel Compilers 8.1:
ifort -fpp -xP -parallel -ip -auto_ilp32 streamd.o secondwall.o
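
As a rough cross-check of the figures above, the reported rates can be recomputed from the array size and the printed minimum times. The sketch below is not part of the STREAM source. It assumes STREAM's usual accounting, in which Copy and Scale touch two arrays and Add and Triad touch three, and in which 1 MB means 10^6 bytes for the rates. The function and dictionary names are hypothetical, chosen only for this illustration.

```python
# Sketch: recompute STREAM rates from the values printed in the output above.
# Assumption: Copy/Scale move 2 arrays per iteration, Add/Triad move 3,
# and the rate column uses 1 MB = 10**6 bytes.

ARRAY_SIZE = 5_000_000   # "Array size = 5000000"
BYTES_PER_WORD = 8       # "Assuming 8 bytes per DOUBLE PRECISION word"

ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# "Min time" column, in seconds, as printed (four decimal places).
MIN_TIME = {"Copy": 0.0234, "Scale": 0.0233, "Add": 0.0356, "Triad": 0.0350}

# "Rate (MB/s)" column as printed.
REPORTED = {
    "Copy": 3420.5576,
    "Scale": 3434.2134,
    "Add": 3369.7453,
    "Triad": 3425.9286,
}


def rate_mb_per_s(kernel):
    """Bytes moved per pass divided by the best time, in 10**6 bytes/s."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_WORD * ARRAY_SIZE
    return 1.0e-6 * bytes_moved / MIN_TIME[kernel]


def total_memory_mib():
    """Footprint of the three arrays a, b, c in units of 2**20 bytes."""
    return 3 * BYTES_PER_WORD * ARRAY_SIZE / 2**20


if __name__ == "__main__":
    print(f"Total memory: {total_memory_mib():.1f} MiB")
    for kernel in ("Copy", "Scale", "Add", "Triad"):
        computed = rate_mb_per_s(kernel)
        print(f"{kernel:6s} computed {computed:10.2f}  reported {REPORTED[kernel]:10.4f}")
```

The recomputed rates differ slightly from the reported ones because the minimum times are printed to only four decimal places, while the program computes the rate from the unrounded timing. The three-array footprint comes to about 114 when counted in units of 2^20 bytes, which matches the "114 MB" line in the output.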