Advanced search
Start date
Betweenand
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

Autotuning CUDA compiler parameters for heterogeneous applications using the OpenTuner framework

Full text
Author(s):
Bruel, Pedro [1] ; Amaris, Marcos [1] ; Goldman, Alfredo [1]
Total Authors: 3
Affiliation:
[1] Univ Sao Paulo, IME, R Matao 1010, Cidade Univ, Sao Paulo, SP - Brazil
Total Affiliations: 1
Document type: Journal article
Source: CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE; v. 29, n. 22, SI NOV 25 2017.
Web of Science Citations: 4
Abstract

A Graphics Processing Unit (GPU) is a parallel computing coprocessor specialized in accelerating vector operations. The enormous heterogeneity of parallel computing platforms justifies and motivates the development of automated optimization tools and techniques. The Algorithm Selection Problem consists in finding a combination of algorithms, or a configuration of an algorithm, that optimizes the solution of a set of problem instances. An autotuner solves the Algorithm Selection Problem using search and optimization techniques. In this paper, we implement an autotuner for the Compute Unified Device Architecture compiler's parameters using the OpenTuner framework. The autotuner searches for a set of compilation parameters that optimizes the time to solve a problem. We analyze the performance speedups, in comparison with high-level compiler optimizations, achieved in three different GPU devices, for 17 heterogeneous GPU applications, 12 of which are from the Rodinia Benchmark Suite. The autotuner often beats the compiler's high-level optimizations, but underperformed for some problems. We achieved over 2x speedup for Gaussian Elimination and almost 2x speedup for Heart Wall, both problems from the Rodinia Benchmark, and over 4x speedup for a matrix multiplication algorithm. Copyright (c) 2017 John Wiley \& Sons, Ltd. (AU)

FAPESP's process: 12/23300-7 - Bulk Synchronous Parallel Model on Graphic Processing Units
Grantee:Marcos Tulio Amaris González
Support Opportunities: Scholarships in Brazil - Doctorate