Stream results for an rx3600 with Dual-Core Itanium 2 processors

From: Kirby L. Collins <kcollins@rsn.hp.com>
Date: Wed Sep 20 2006 - 21:33:35 CST

Stream results for HP Integrity rx3600 with the zx2 chipset, and 1.6Ghz/18MB
Dual-Core Intel(R) Itanium(R) 2 processors, running HP-UX 11.23:

1.6GHz/18MB Dual-Core Intel Itanium 2
16GB memory (8x2GB DIMMs), AD124A 8-DIMM memory carrier
HP-UX 11.23.0609, HP f90 11.23.32
---------------------------------------------------------
Function Rate (MB/s) Avg time Min time Max time
Copy: 4817.4451 0.1168 0.1063 0.1706
Scale: 4783.0246 0.1144 0.1070 0.1727
Add: 5573.0127 0.1487 0.1378 0.1918
Triad: 5583.4258 0.1472 0.1376 0.2240

The runs used the v5.6 f90 version of the stream benchmark, with the
following changes:

96,97c96,97
< INTEGER n,offset,ndim,ntimes
< PARAMETER (n=2000000,offset=0,ndim=n+offset,ntimes=10)

---
>       INTEGER*8 n,offset,ndim,ntimes
>       PARAMETER (n=32000480,offset=0,ndim=n+offset,ntimes=10)
106c106
<       INTEGER bytes(4)
---
>       INTEGER*8 bytes(4)
124c124
< *     COMMON a,b,c
---
>       COMMON a,b,c
245c245
<  9020 FORMAT (1x,a,i4,a)
---
>  9020 FORMAT (1x,a,i6,a)
compiled as follows:
f90 -o stream_d.uni +Ofaster +DSitanium2 +DD64 +extend_source +autodbl4
+noppu -Wl,+pd,64M +Onoopenmp stream.uni.f mysecond.o
Here is the output:
stream_d.uni
----------------------------------------------
 Double precision appears to have 16 digits of accuracy
 Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
 ----------------------------------------------
 STREAM Version $Revision: 5.6 $
 ----------------------------------------------
 Array size =   32000480
 Offset     =          0
 The total memory requirement is    732 MB
 You are running each test  10 times
 --
 The *best* time for each test is used
 *EXCLUDING* the first and last iterations
 ----------------------------------------------
 ----------------------------------------------
 Printing one line per active thread....
 ----------------------------------------------------
 Your clock granularity/precision appears to be      1 microseconds
 ----------------------------------------------------
Function     Rate (MB/s)  Avg time   Min time  Max time
Copy:       4817.4451      0.1168      0.1063      0.1706
Scale:      4783.0246      0.1144      0.1070      0.1727
Add:        5573.0127      0.1487      0.1378      0.1918
Triad:      5583.4258      0.1472      0.1376      0.2240
 ----------------------------------------------------
 Solution Validates!
 ----------------------------------------------------
Received on Thu Sep 21 07:34:36 2006

This archive was generated by hypermail 2.1.8 : Thu Sep 21 2006 - 07:53:34 CST