Title page for ETD etd-07282011-122302


Type of Document Master's Thesis
Author Martinez Arroyo, Gabriel Ernesto
Author's Email Address mystal@vt.edu
URN etd-07282011-122302
Title Cu2cl: a Cuda-To-Opencl Translator for Multi- and Many-Core Architectures
Degree Master of Science
Department Computer Science and Applications
Advisory Committee
Advisor Name Title
Feng, Wu-Chun Committee Chair
Gardner, Mark Committee Member
Sandu, Adrian Committee Member
Keywords
  • GPU
  • Compilers
  • CUDA
  • OpenCL
  • Source Translation
  • Clang
Date of Defense 2011-07-14
Availability unrestricted
Abstract
The use of graphics processing units (GPUs) in high-performance parallel computing continues to steadily become more prevalent, often as part of a heterogeneous system. For years, CUDA has been the de facto programming environment for nearly all general-purpose GPU (GPGPU) applications. In spite of this, the framework is available only on NVIDIA GPUs, traditionally requiring reimplementation in other frameworks in order to utilize additional multi- or many-core devices. On the other hand, OpenCL provides an open and vendor-neutral programming environment and run-time system. With implementations available for CPUs, GPUs, and other types of accelerators, OpenCL therefore holds the promise of a “write once, run anywhere” ecosystem for heterogeneous computing.

Given the many similarities between CUDA and OpenCL, manually porting a CUDA application to OpenCL is almost straightforward, albeit tedious and error-prone. In response to this issue, we created CU2CL, an automated CUDA-to-OpenCL source-to-source translator that possesses a novel design and clever reuse of the Clang compiler framework. Currently, the CU2CL translator covers the primary constructs found in the CUDA Runtime API, and we have successfully translated several applications from the CUDA SDK and Rodinia benchmark suite. CU2CL’s translation times are reasonable, allowing for many applications to be translated at once. The number of manual changes required after executing our translator on CUDA source is minimal, with some compiling and working with no changes at all. The performance of our automatically translated applications via CU2CL is on par with their manually ported counterparts.

Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  Martinez_GE_T_2011.pdf 597.08 Kb 00:02:45 00:01:25 00:01:14 00:00:37 00:00:03

Browse All Available ETDs by ( Author | Department )

dla home
etds imagebase journals news ereserve special collections
virgnia tech home contact dla university libraries

If you have questions or technical problems, please Contact DLA.