Hi John, Here is the 1-CPU AlphaServer ES45 at 1000 MHz. This run uses the same source file as the 4-cpu run (mailed separately), but it is compiled without "-omp". -john Script started on Tue Jun 5 11:47:52 2001 % /usr/sbin/psrinfo -v Status of processor 0 as of: 06/05/01 11:48:01 Processor has been on-line since 06/05/2001 11:13:05 The alpha EV6.8CB (21264C) processor operates at 1000 MHz, has a cache size of 8388608 bytes, and has an alpha internal floating point processor. % cat build_noomp.csh #!/bin/csh set verbose unlimit f90 -v -source_listing -machine_code \ -o mcc_noomp_`date +%Y%m%d` \ -fast -O5 -unroll 32 -arch ev6 \ mcc_omp.f grep COMPILER: mcc_omp.lis % ./!$ % ./build_noomp.csh unlimit f90 -v -source_listing -machine_code -o mcc_noomp_`date +%Y%m%d` -fast -O5 -unroll 32 -arch ev6 mcc_omp.f /usr/lib/cmplrs/fort90/decfort90 -machine_code -fast -O5 -unroll 32 -arch ev6 -I/usr/lib/cmplrs/hpfrtl -source_listing -o /tmp/forAAAaacrda.o mcc_omp.f /usr/bin/cc -v -o mcc_noomp_20010605 -arch ev6 /usr/lib/cmplrs/fort90/for_main.o -source_listing /tmp/forAAAaacrda.o -O4 -qlshpf -lUfor -lfor -lFutil -lm -lots -lm_c32 -lmld -lexc /usr/lib/cmplrs/cc.dtk/ld -o mcc_noomp_20010605 -g0 -O4 -call_shared /usr/lib/cmplrs/cc.dtk/crt0.o /usr/lib/cmplrs/fort90/for_main.o /tmp/forAAAaacrda.o -qlshpf -lUfor -lfor -lFutil -lm -lots -lm_c32 -lmld -lexc -lc /usr/lib/cmplrs/cc.dtk/ld: 0.01u 0.00s 0:00 33% 0+22k 0+13io 0pf+0w 22stk+2128mem grep COMPILER: mcc_omp.lis COMPILER: Compaq Fortran V5.4A-1472-46B2F % echo "4500110,0,10" | mcc_noomp_20010605 n, offset, ntimes ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 4500110 Offset = 0 The total memory requirement is 103.00 MB You are running each test 10 times The *best* time for each test is used ---------------------------------------------------- Your clock granularity/precision appears to be 900 microseconds The tests below will each take a time on the order of 31200 microseconds (= 35 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ---------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ---------------------------------------------------- Function Rate (MB/s) RMS time Min time Max time Copy: 1945.9935 0.0379 0.0370 0.0390 Scale: 1940.7482 0.0379 0.0371 0.0381 Add: 1978.0703 0.0554 0.0546 0.0557 Triad: 1978.0703 0.0549 0.0546 0.0557 AvgBW: 1960.7206 Sum of a is = 5.189980378317790E+018 Sum of b is = 1.037996075790888E+018 Sum of c is = 1.383994767594204E+018 % exit % script done on Tue Jun 5 11:48:45 2001