Cyber Center Publications

A hybrid approach of OpenMP for clusters

Okwan Kwon
Fahed Jubair
Rudolf Eigenmann
Samuel Midkiff

Abstract

We present the first fully automated compiler-runtime system that successfully translates and executes OpenMP shared-address-space programs on laboratory-size clusters, for the complete set of regular, repetitive applications in the NAS Parallel Benchmarks. We introduce a hybrid compiler-runtime translation scheme. Compared to previous work, this scheme features a new runtime data flow analysis and new compiler techniques for improving data affinity and reducing communication costs. We present and discuss the performance of our translated programs, and compare them with the performance of the MPI, HPF and UPC versions of the benchmarks. The results show that our translated programs achieve 75% of the hand-coded MPI programs, on average.

Keywords

algorithms, code generation, compilers, mpi, openmp, optimization, runtime data flow analysis, translator

Date of this Version

2012

DOI

10.1145/2145816.2145827

Comments

Proceeding PPoPP '12 Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming

Link to Full Text

Find in your library

COinS

Cyber Center Publications

A hybrid approach of OpenMP for clusters

Abstract

Keywords

Date of this Version

DOI

Comments

Search

Links

Links for Authors

Browse

Cyber Center Publications

A hybrid approach of OpenMP for clusters

Authors

Abstract

Keywords

Date of this Version

DOI

Comments

Share

Search

Links

Links for Authors

Browse