Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14279/34668| Title: | Exploiting Very-Wide Vectors on Intel Xeon Phi with Lattice-QCD Kernels | Authors: | Diavastos, Andreas Stylianou, Giannos Koutsou, Giannis |
Major Field of Science: | Engineering and Technology | Field Category: | Computer and Information Sciences | Issue Date: | 31-Mar-2016 | Source: | Proceedings - 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2016 | Abstract: | Our target in this work is to study ways of exploring the parallelism offered by vectorization on accelerators with very wide vector units. To this end, we implemented two kernels that derive from the Wilson Dslash operator and investigate several data layout techniques for increasing the scalability of lattice QCD scientific kernels suitable for the Intel Xeon Phi. In parts of the application where real numbers are used for computation, we see a 6.6x increase in bandwidth compared to scalar code, thanks to the auto-vectorization by the compiler. In other kernels where arithmetic operations on complex numbers dominate, our hand-vectorized code out-performs the auto-vectorization of the compiler. In this paper we find that our proposed Hopping Vector-friendly Ordering allows for more efficient vectorization of complex arithmetic floating point operations. Using this data layout, we manage to increase the sustained bandwidth by approximately 1.8x. | URI: | https://hdl.handle.net/20.500.14279/34668 | ISBN: | [9781467387750] | DOI: | 10.1109/PDP.2016.116 | Type: | Conference Papers | Affiliation : | The Cyprus Institute University of Cyprus |
Publication Type: | Peer Reviewed |
| Appears in Collections: | Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation |
CORE Recommender
Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.

