From dik@cwi.nl Fri Nov 8 08:01:02 1991
Received: from charon.cwi.nl by perelandra.cms.udel.edu (5.52/890607.SGI)
	(for mccalpin) id AA27310; Fri, 8 Nov 91 08:01:02 EST
Received: by charon.cwi.nl with SMTP; Fri, 8 Nov 1991 14:01:57 +0100
Received: by paring.cwi.nl ; Fri, 8 Nov 91 14:01:55 +0100
Date: Fri, 8 Nov 91 14:01:55 +0100
From: dik@cwi.nl
Message-Id: <9111081301.AA17636@paring.cwi.nl>
To: mccalpin
Subject: streams on NEC SX3
Status: R

The two files in the sharchive below are the stream results for the NEC SX3.

General information:
System:   NEC SX3/14 single processor, four vector pipes
OS:       SX/UX
Compiler: f77sx Rev.012
Option:   -O

The instructions told me to increase the number of array elements.
The number of elements in the timings was 5,000,000 (on the IBM 3090
in my previous mail it was 500,000). That still does not bring the
initial calibration up to 30, but here too the clock has adequate
resolution.

Feel free to ask any questions you have, dik
--
dik t. winter, cwi, amsterdam, nederland
dik@cwi.nl
--
#! /bin/sh
# This is a shell archive, meaning:
# 1. Remove everything above the #! /bin/sh line.
# 2. Save the resulting text in a file.
# 3. Execute the file with /bin/sh (not csh) to create:
#	out.single
#	out.double
# This archive created: Fri Nov 8 12:34:35 1991
export PATH; PATH=/bin:/usr/bin:$PATH
if test -f 'out.single'
then
	echo shar: "will not over-write existing file 'out.single'"
else
cat << \SHAR_EOF > 'out.single'
--------------------------------------
Single precision appears to have 6 digits of accuracy
Assuming 4 bytes per default REAL word
--------------------------------------
Timing calibration ; time = 2.176048 hundredths of a second
Increase the size of the arrays if this is <30
 and your clock precision is =<1/100 second
---------------------------------------------------
Function     Rate (MB/s)  RMS time   Min time  Max time
Assignment:  4151.4609      0.0096     0.0096    0.0096
Scaling   :  3835.5132      0.0104     0.0104    0.0104
Summing   :  4993.7852      0.0120     0.0120    0.0120
SAXPYing  :  4993.8984      0.0120     0.0120    0.0120
SHAR_EOF
fi
if test -f 'out.double'
then
	echo shar: "will not over-write existing file 'out.double'"
else
cat << \SHAR_EOF > 'out.double'
--------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLEPRECISION word
--------------------------------------
Timing calibration ; time = 2.176072797738016 hundredths of a second
Increase the size of the arrays if this is <30
 and your clock precision is =<1/100 second
---------------------------------------------------
Function     Rate (MB/s)  RMS time   Min time  Max time
Assignment:  8302.1617      0.0096     0.0096    0.0096
Scaling   :  7670.8556      0.0104     0.0104    0.0104
Summing   :  9712.7843      0.0124     0.0124    0.0124
SAXPYing  :  9712.9248      0.0124     0.0124    0.0124
SHAR_EOF
fi
exit 0
#	End of shell archive

From dik@cwi.nl Fri Nov 22 04:00:01 1991
Received: from charon.cwi.nl by perelandra.cms.udel.edu (5.52/890607.SGI)
	(for mccalpin) id AA08763; Fri, 22 Nov 91 04:00:01 EST
Received: by charon.cwi.nl with SMTP; Fri, 22 Nov 1991 10:01:49 +0100
Received: by paring.cwi.nl ; Fri, 22 Nov 91 10:01:47 +0100
Date: Fri, 22 Nov 91 10:01:47 +0100
From: dik@cwi.nl
Message-Id: <9111220901.AA00373@paring.cwi.nl>
To: mccalpin
Subject: SX3 benchmark
Status: R

I have been informed that the SX3 I used was not an SX3/14 but an
SX3/12 (that is, a single processor with 2 vector pipes, not 4).
It appears that the measured memory bandwidth is indeed what NEC
representatives calculate.

dik

From aspgmid@cid.aes.doe.CA Wed Feb 12 13:14:16 1992
Received: from cidsv01.cid.aes.doe.ca by perelandra.cms.udel.edu via SMTP
	(911016.SGI/911001.SGI) for mccalpin id AA03120;
	Wed, 12 Feb 92 13:14:16 -0500
Return-Path:
Received: by cidsv01 (5.64-bind 1.5+ida UNSUPPORTED PROTOTYPE-MX)
	id AA06643; Wed, 12 Feb 92 18:13:47 GMT
From: Michel Dansereau
Message-Id: <9202121813.AA06643@cidsv01>
Subject: The stream benchmark results for an SX3-44
To: mccalpin
Date: Wed, 12 Feb 92 18:13:46 GMT
X-Mailer: ELM [version 2.3 PL11]
Status: R

M. McCalpin

I ran the Stream benchmark on our SX3/44 and obtained the following
results:

--------------------------------------
The overhead of calling second = 2.251414E-06 seconds.
Timing calibration ; time = 9.539266917478173 hundredths of a second
Increase the size of the arrays if this is <30
 and your clock precision is =<1/100 second
---------------------------------------------------
Function     Rate (MB/s)  RMS time   Min time  Max time
Assignment: 16940.9591      0.0331     0.0331    0.0331
Scaling   : 15640.6689      0.0358     0.0358    0.0359
Summing   : 22436.5215      0.0376     0.0374    0.0380
SAXPYing  : 21972.1612      0.0383     0.0382    0.0388
--------------------------------------

Modifications:

The time required to get the "time" was measured and subtracted from
the timing of the loops.

The second() function was written to take advantage of the internal
free running "tic counter" ( f = 1 / 2.9E09 ).

HNSX personnel on site have approved these figures and mentioned
that they are conservative.

--
------------------------------------------------------------------------------
Michel Dansereau                  | Environment Canada
Tel:   (514) 421-4696             | Atmospheric Environment Service (AES)
Email: mdansereau@cid.aes.doe.ca  | Dorval Computing Centre (CMIS)
Fax:   (514) 421-4703             | Systems Group (CMISS)
------------------------------------------------------------------------------
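
Editorial note on the rate columns: the MB/s figures in the out.single and out.double tables of the first message can be cross-checked from the quoted array length (5,000,000 elements) and the printed minimum times. The sketch below is only a rough check under stated assumptions, not the benchmark's own code: it assumes the usual STREAM byte counting (two words moved per element for Assignment and Scaling, three for Summing and SAXPYing), that the time columns are in seconds, and that 1 MB means 10**6 bytes. The function and table names are hypothetical.

```python
# Rough cross-check of the SX3 rate columns (assumptions labeled below).

# ASSUMPTION: words moved per array element, following the usual STREAM
# convention of counting every load and every store once.
WORDS_PER_ELEMENT = {
    "Assignment": 2,  # one load, one store
    "Scaling": 2,     # one load, one store
    "Summing": 3,     # two loads, one store
    "SAXPYing": 3,    # two loads, one store
}

N_ELEMENTS = 5_000_000  # array length quoted in the first message


def stream_rate_mb_per_s(kernel, n_elements, bytes_per_word, seconds):
    """Bandwidth in MB/s, taking 1 MB = 10**6 bytes (assumption)."""
    bytes_moved = WORDS_PER_ELEMENT[kernel] * bytes_per_word * n_elements
    return bytes_moved / seconds / 1.0e6


# (kernel, bytes per word, printed minimum time, printed rate), copied
# from the out.single and out.double tables. Times are assumed to be seconds.
REPORTED = [
    ("Assignment", 4, 0.0096, 4151.4609),
    ("Scaling", 4, 0.0104, 3835.5132),
    ("Summing", 4, 0.0120, 4993.7852),
    ("SAXPYing", 4, 0.0120, 4993.8984),
    ("Assignment", 8, 0.0096, 8302.1617),
    ("Scaling", 8, 0.0104, 7670.8556),
    ("Summing", 8, 0.0124, 9712.7843),
    ("SAXPYing", 8, 0.0124, 9712.9248),
]


def relative_differences():
    """Map (kernel, word size) to (recomputed - reported) / reported."""
    result = {}
    for kernel, word_bytes, seconds, reported in REPORTED:
        recomputed = stream_rate_mb_per_s(kernel, N_ELEMENTS, word_bytes, seconds)
        result[(kernel, word_bytes)] = (recomputed - reported) / reported
    return result


if __name__ == "__main__":
    for (kernel, word_bytes), diff in relative_differences().items():
        print(f"{kernel:10s} {word_bytes} bytes/word: {diff:+.2%}")
```

Because the printed times are rounded to four decimal places, the recomputed rates can only be expected to agree with the printed ones to within a fraction of a percent. The array length for the SX3/44 run is not stated in the third message, so the same check is not attempted there.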