Wang, P, Gao, W, Fang, J et al. (2 more authors) (2022) Characterizing OpenMP SynchronizationImplementations on ARMv8 Multi-Cores. In: 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys). 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys), 20-22 Dec 2021, Haikou, Hainan, China. IEEE , pp. 669-676. ISBN 978-1-6654-9458-8
Abstract
Synchronization operations like barriers are fre-quently seen in parallel OpenMP programs, where an inefficient implementation can severely limit the application performance. While synchronization optimization has been heavily studied on traditional x86 architectures, there is no consensus on how synchronization can be best implemented on the ARMv8 multi-core CPUs. This paper presents a study of OpenMP synchronization implementation on two representative ARMv8 multi-core architectures, Phytium 2000+ and ThunderX2, by considering various OpenMP synchronization mechanisms offered by two mainstreamed OpenMP compilers, GCC and LLVM. Our evalu-ation compares the performance, overhead and scalability of both compiler implementations. We show that there is no “one-fits-for-all” synchronization mechanism, and the efficiency of a scheme varies across hardware architectures and thread parallelism. We then share our insights and discuss how OpenMP synchronization operations can be better optimized on emerging ARMv8 multi-cores, offering quantified results for future research directions.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | ©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Keywords: | OpenMP; Scalability; Synchronization; Performance |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
Funding Information: | Funder Grant number Royal Society IEC\NSFC\191465 |
Depositing User: | Symplectic Publications |
Date Deposited: | 09 Nov 2021 11:34 |
Last Modified: | 16 Oct 2023 16:00 |
Status: | Published |
Publisher: | IEEE |
Identification Number: | 10.1109/HPCC-DSS-SmartCity-DependSys53884.2021.00111 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:180101 |