Exploiting Very-Wide Vectors on Intel Xeon Phi with Lattice-QCD Kernels

Diavastos, Andreas; Stylianou, Giannos; Koutsou, Giannis

doi:10.1109/PDP.2016.116

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/34668

Title:	Exploiting Very-Wide Vectors on Intel Xeon Phi with Lattice-QCD Kernels
Authors:	Diavastos, Andreas Stylianou, Giannos Koutsou, Giannis
Major Field of Science:	Engineering and Technology
Field Category:	Computer and Information Sciences
Issue Date:	31-Mar-2016
Source:	Proceedings - 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2016
Abstract:	Our target in this work is to study ways of exploring the parallelism offered by vectorization on accelerators with very wide vector units. To this end, we implemented two kernels that derive from the Wilson Dslash operator and investigate several data layout techniques for increasing the scalability of lattice QCD scientific kernels suitable for the Intel Xeon Phi. In parts of the application where real numbers are used for computation, we see a 6.6x increase in bandwidth compared to scalar code, thanks to the auto-vectorization by the compiler. In other kernels where arithmetic operations on complex numbers dominate, our hand-vectorized code out-performs the auto-vectorization of the compiler. In this paper we find that our proposed Hopping Vector-friendly Ordering allows for more efficient vectorization of complex arithmetic floating point operations. Using this data layout, we manage to increase the sustained bandwidth by approximately 1.8x.
URI:	https://hdl.handle.net/20.500.14279/34668
ISBN:	[9781467387750]
DOI:	10.1109/PDP.2016.116
Type:	Conference Papers
Affiliation :	The Cyprus Institute University of Cyprus
Publication Type:	Peer Reviewed
Appears in Collections:	Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation

CORE Recommender

Show full item record

Page view(s)

118

Last Week
1

Last month
3

checked on May 15, 2026

Google Scholar^TM

Check

Page view(s)

Google ScholarTM

Altmetric

Google Scholar^TM