Difference between revisions of "Rodinia:Accelerating Compute-Intensive Applications with Accelerators"

From Rodinia
Jump to: navigation, search
(The Rodinia Benchmark Suite)
(The Rodinia Benchmark Suite)
Line 36: Line 36:
  
 
<TABLE border="1">
 
<TABLE border="1">
<TR><TH WIDTH="150">Applications<TH WIDTH="150">Dwarves<TH WIDTH="150">Domains<TH WIDTH="120">Parallel Model<TH WIDTH="150">Reason in the "Other" category
+
<TR><TH WIDTH="150">Applications<TH WIDTH="150">Dwarves<TH WIDTH="150">Domains<TH WIDTH="120">Parallel Model<TH WIDTH="350">Reason in the "Other" category
 
<TR><TD>[[SQL Database | SQLite Select]]<TD>Map Reduce<TD>Relational Database<TD ALIGN="center">CUDA<TD>This benchmark needs a multithreaded CPU version
 
<TR><TD>[[SQL Database | SQLite Select]]<TD>Map Reduce<TD>Relational Database<TD ALIGN="center">CUDA<TD>This benchmark needs a multithreaded CPU version
 
<TR><TD>[[Nearest Neighbor]]<TD>Dense Linear Algebra<TD>Data Mining<TD ALIGN="center">CUDA, OMP, OCL<TD>Compute pattern is similar to Kmeans
 
<TR><TD>[[Nearest Neighbor]]<TD>Dense Linear Algebra<TD>Data Mining<TD ALIGN="center">CUDA, OMP, OCL<TD>Compute pattern is similar to Kmeans
 
<TR><TD>[[Gaussian Elimination]]<TD>Dense Linear Algebra<TD>Linear Algebra<TD ALIGN="center">CUDA<TD>This benchmark needs a multithreaded CPU version
 
<TR><TD>[[Gaussian Elimination]]<TD>Dense Linear Algebra<TD>Linear Algebra<TD ALIGN="center">CUDA<TD>This benchmark needs a multithreaded CPU version
 
<TR><TD>[[Cell]]<TD>Structured Grid<TD>Cellular Automation<TD ALIGN="center">CUDA<TD>This benchmark needs a multithreaded CPU version
 
<TR><TD>[[Cell]]<TD>Structured Grid<TD>Cellular Automation<TD ALIGN="center">CUDA<TD>This benchmark needs a multithreaded CPU version
<TR><TD>[[Shortest Path | PathFinder]]<TD>Dynamic Programming<TD>Grid Traversal<TD ALIGN="center">CUDA, OCL<TD>
+
<TR><TD>[[Shortest Path | PathFinder]]<TD>Dynamic Programming<TD>Grid Traversal<TD ALIGN="center">CUDA, OCL<TD>Compute pattern is similar to Needleman Wunsch
 
<TR><TD>[[Hybrid Sort]]<TD>Sorting<TD>Sorting Algorithms<TD ALIGN="center">CUDA<TD>This benchmark needs a multithreaded CPU version
 
<TR><TD>[[Hybrid Sort]]<TD>Sorting<TD>Sorting Algorithms<TD ALIGN="center">CUDA<TD>This benchmark needs a multithreaded CPU version
<TR><TD>[[Myocyte | Myocyte]]<TD>Structured Grid<TD>Biological Simulation<TD ALIGN="center">CUDA, OMP<TD>
+
<TR><TD>[[Myocyte | Myocyte]]<TD>Structured Grid<TD>Biological Simulation<TD ALIGN="center">CUDA, OMP<TD>This benchmark is considered to merge into the main suite
<TR><TD>[[Particle Filter]]<TD>Structured Grid<TD>Medical Imaging<TD ALIGN="center">CUDA, OMP<TD>
+
<TR><TD>[[Particle Filter]]<TD>Structured Grid<TD>Medical Imaging<TD ALIGN="center">CUDA, OMP<TD>This benchmark is considered to merge into the main suite
 
</TABLE>
 
</TABLE>
  

Revision as of 16:54, 14 October 2011

A vision of heterogeneous computer systems that incorporate diverse
accelerators and automatically select the best computational unit for
a particular task is widely shared amongresearchers and many industry
analysts; however, there are no agreed-upon benchmarks to support the
research needed in the development of such a platform. There are many
suites for parallel computing on general-purpose CPU architectures,
but accelerators fall into a gap that is not covered by current benchmark
suites or benchmark development.

The Rodinia Benchmark Suite

Rodinia is designed for heterogeneous computing infrastructures, and,
using OpenMP and CUDA, targets both GPUs and multicore CPUs.
They have been parallelized with OpenMP for multicore CPUs and with
CUDA for GPUs. We have been preparing for the OpenCL release.

Current Rodinia applications:

ApplicationsDwarvesDomainsParallel ModelIncre. Ver.
LeukocyteStructured GridMedical ImagingCUDA, OMP
Heart WallStructured GridMedical ImagingCUDA, OMP
MUMmerGPUGraph TraversalBioinformaticsCUDA, OMP
CFD Solver1Unstructured GridFluid DynamicsCUDA, OMP, OCL
LU Decomposition Dense Linear AlgebraLinear AlgebraCUDA, OMP
HotSpotStructured GridPhysics SimulationCUDA, OMP, OCL
Back PropagationUnstructured GridPattern RecognitionCUDA, OMP, OCL
Needleman-WunschDynamic ProgrammingBioinformaticsCUDA, OMP, OCL
KmeansDense Linear AlgebraData MiningCUDA, OMP, OCL
Breadth-First Search1Graph TraversalGraph AlgorithmsCUDA, OMP, OCL
SRADStructured GridImage ProcessingCUDA, OMP, OCL
Streamcluster1Dense Linear AlgebraData MiningCUDA, OMP, OCL

Other applications (some with CUDA only):

ApplicationsDwarvesDomainsParallel ModelReason in the "Other" category
SQLite SelectMap ReduceRelational DatabaseCUDAThis benchmark needs a multithreaded CPU version
Nearest NeighborDense Linear AlgebraData MiningCUDA, OMP, OCLCompute pattern is similar to Kmeans
Gaussian EliminationDense Linear AlgebraLinear AlgebraCUDAThis benchmark needs a multithreaded CPU version
CellStructured GridCellular AutomationCUDAThis benchmark needs a multithreaded CPU version
PathFinderDynamic ProgrammingGrid TraversalCUDA, OCLCompute pattern is similar to Needleman Wunsch
Hybrid SortSortingSorting AlgorithmsCUDAThis benchmark needs a multithreaded CPU version
MyocyteStructured GridBiological SimulationCUDA, OMPThis benchmark is considered to merge into the main suite
Particle FilterStructured GridMedical ImagingCUDA, OMPThis benchmark is considered to merge into the main suite

New Rodinia applications coming soon:

ApplicationsDwarvesDomainsParallel Model
LavaMD2Structured GridMolecular DynamicsCUDA, OMP


1Ana Lucia Varbanescu and Jianbin Fang, Delft University of Technology contributed the OpenCL version.

2Collaboration with Lawrence Livermore National Laboratory.

Application Download

Please read the license file.

Also, if your use of Rodinia results in a publication, please cite:

[1] S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, S.-H. Lee, and K. Skadron.
Rodinia: A Benchmark Suite for Heterogeneous Computing. In Proceedings of the IEEE
International Symposium on Workload Characterization (IISWC), pp. 44-54, Oct. 2009.

[2] S. Che, J. W. Sheaffer, M. Boyer, L. G. Szafaryn, L. Wang, and K. Skadron.
A Characterization of the Rodinia Benchmark Suite with Comparison to Contemporary
CMP Workloads. In Proceedings of the IEEE International Symposium on Workload
Characterization, Dec. 2010.

This work is supported by NSF grant nos. IIS-0612049, CNS-0916908 and CNS-0615277, a grant from the SRC under task no. 1607, and grants from AMD, NEC labs, and NVIDIA Research.