Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/34668
Title: Exploiting Very-Wide Vectors on Intel Xeon Phi with Lattice-QCD Kernels
Authors: Diavastos, Andreas 
Stylianou, Giannos 
Koutsou, Giannis 
Major Field of Science: Engineering and Technology
Field Category: Computer and Information Sciences
Issue Date: 31-Mar-2016
Source: Proceedings - 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2016
Abstract: Our target in this work is to study ways of exploring the parallelism offered by vectorization on accelerators with very wide vector units. To this end, we implemented two kernels that derive from the Wilson Dslash operator and investigate several data layout techniques for increasing the scalability of lattice QCD scientific kernels suitable for the Intel Xeon Phi. In parts of the application where real numbers are used for computation, we see a 6.6x increase in bandwidth compared to scalar code, thanks to the auto-vectorization by the compiler. In other kernels where arithmetic operations on complex numbers dominate, our hand-vectorized code out-performs the auto-vectorization of the compiler. In this paper we find that our proposed Hopping Vector-friendly Ordering allows for more efficient vectorization of complex arithmetic floating point operations. Using this data layout, we manage to increase the sustained bandwidth by approximately 1.8x.
URI: https://hdl.handle.net/20.500.14279/34668
ISBN: [9781467387750]
DOI: 10.1109/PDP.2016.116
Type: Conference Papers
Affiliation : The Cyprus Institute 
University of Cyprus 
Publication Type: Peer Reviewed
Appears in Collections:Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation

CORE Recommender
Show full item record

Page view(s)

118
Last Week
1
Last month
3
checked on May 15, 2026

Google ScholarTM

Check

Altmetric


Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.