STREAM: Sustainable Memory Bandwidth in High Performance Computers


John D. McCalpin, Ph.D.
john@mccalpin.com
"Dr. Bandwidth"
STREAM Logo (Image)
What's New?
McCalpin's Bandwidth Blog
Here are the current RESULTS!
STREAM FAQ
Analyses, Commentary, etc....
Hypermail archives of original contributions
Source Code Directory
Raw Data Files
STREAM2 (experimental) pages.
Shortcut Links to All Tables Bandwidth MFLOPS Balance
Top 20 link link link
Standard link link link
Tuned link link link
ALL Results sorted by Name link link link
ALL Results sorted by Date link link link
PC compatible link link link
Mac Compatible link link link
Experimental/Non-standard link link link
MPI link link link
32 bit link link link
Obsolete link link link




Here are the RESULTS!

Top 20 Results for Shared-Memory Systems!

This set of results includes the top 20 shared-memory systems (either "standard" or "tuned" results), ranked by STREAM TRIAD performance. Like the LINPACK NxN benchmark, this is intended to show off the best possible bandwidth of these large systems. The results are currently presented in the following tables:

Standard Results

The "standard" set of results presents the results of the C or Fortran versions of the STREAM benchmark running with 64-bit data types on production hardware. The "standard" set of results excludes: The results are currently presented in the following tables:

Tuned Results

The "tuned" set of results presents the results of the STREAM benchmark running with 64-bit data types on production hardware, but allows code modification (including assembly language coding). The "tuned" set of results excludes: The results are currently presented in the following tables:

ALL Results sorted by Name

All results are presented in the following tables:

ALL Results sorted by Submission Date

All results are presented in the following tables:

PC-compatible Results

This set of tables summarizes the "standard" test cases, but restricted to IBM PC-compatible computers.

An easy-to-use set of executables for Windows (32-bit and 64-bit) with both single-thread and OpenMP versions is available in StreamWin-32-64_distro.zip.

Users are (of course) free to re-compile the source code (C and FORTRAN are both equally "official"). Older binaries are also available in the FTP Contrib area.
Use of the old DOS binaries is discouraged.

The results are currently presented in the following tables:


Macintosh-Compatible Results

This set of tables summarizes the "standard" test cases, but restricted to Macintosh and compatible computers.

Users are free to re-compile the code or use the contributed binaries.

The results are currently presented in the following tables:


Experimental/Nonstandard Results

These tables include only results that are Note:
A "partially depopulated" system is one in which only a subset of the cpus are used for the benchmark, and for which this subset is spread around the machine to decrease contention. For example on the SGI Origin2000, each node has 2 cpus sharing a single bus and memory subsystem. The results in this table labelled "1 per node" are based on using only one cpu per node board, and are considered a "nonstandard" way of using the machine. Similarly, the Sun Ultra10000 has 4 cpus per node board, so results using 1, 2, or 3 cpus per node also go into this table of "nonstandard" results. NEW!

The results are currently presented in the following tables:


MPI Results

These results are based on the MPI code (in FORTRAN) -- typically applied to clusters. (I don't an MPI version written in C right now.) There is nothing "non-standard" about these numbers -- I just wanted a way to try to keep the SMP and cluster results separated.

32-bit Results

These tables include only results with 32-bit operands or operations. Results using 64-bit operands that move the data in 32-bit "chunks" are not here. The results are currently presented in the following tables:

Obsolete Results

These results have been superceded for one reason or another, but I never throw anything away (unless it was just plain wrong....).
The STREAM benchmark and web site are the responsibility of John D. McCalpin mccalpin@cs.virginia.edu

The STREAM website is hosted as a courtesy by the
Department of Computer Science

School of Engineering and Applied Science
University of Virginia, Charlottesville, Virginia

Valid HTML 4.0!