Binding nested OpenMP programs on hierarchical memory architectures.

Keywords

Authors

D. Schmidl
C. Terboven
D. an Mey
H. M. Bücker

Abstract

In this work we discuss the performance problems of nested OpenMP programs concerning thread and data locality particularly on cc-NUMA architectures. We provide a user friendly solution and demonstrate its benefits by comparing the performance of some kernel benchmarks and some real-world applications with and without applying our affinity optimizations.

Advanced Computing

Research

Binding nested OpenMP programs on hierarchical memory architectures.

Keywords

Authors

Abstract

Paper

Links