Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/1640
DC FieldValueLanguage
dc.contributor.authorMichail, Harris-
dc.contributor.authorKelefouras, Vasileios I.-
dc.contributor.authorAthanasiou, George S.-
dc.contributor.authorAlachiotis, Nikolaos-
dc.contributor.authorKritikakou, Angeliki S.-
dc.contributor.authorGoutis, Costas E.-
dc.date.accessioned2013-02-21T13:36:15Zen
dc.date.accessioned2013-05-17T05:22:39Z-
dc.date.accessioned2015-12-02T10:05:07Z-
dc.date.available2013-02-21T13:36:15Zen
dc.date.available2013-05-17T05:22:39Z-
dc.date.available2015-12-02T10:05:07Z-
dc.date.issued2011-12-
dc.identifier.citationIEEE Transactions on Signal Processing, 2011, vol. 59, no. 12, pp. 6217-6226en_US
dc.identifier.issn19410476-
dc.identifier.urihttps://hdl.handle.net/20.500.14279/1640-
dc.description.abstractSeveral SOA (state of the art) self-tuning software libraries exist, such as the Fastest Fourier Transform in the West (FFTW) for fast Fourier transform (FFT). FFT is a highly important kernel and the performance of its software implementations depends on the memory hierarchy's utilization. FFTW minimizes register spills and data cache accesses by finding a schedule that is independent of the number of the registers and of the number of levels and size of the cache, which is a serious drawback. In this paper, a new methodology is presented, achieving improved performance by focusing on memory hierarchy utilization. The proposed methodology has three major advantages. First, the combination of production and consumption of butterflies' results, data reuse, FFT parallelism, symmetries of twiddle factors and also additions by zeros and multiplications by zeros and ones when twiddle factors are zero or one, are fully and simultaneously exploited. Second, the optimal solution is found according to the number of the registers, the data cache sizes, the number of the levels of data cache hierarchy, the main memory page size, the associativity of the data caches and the data cache line sizes, which are also considered simultaneously and not separate. Third, compilation time and source code size are very small compared with FFTW. The proposed methodology achieves performance gain about 40% (speed-up of 1.7) for architectures with small data cache sizes where memory management has a larger effect on performance and 20% (speed-up of 1.25) on average for architectures with large data cache sizes (Pentium) in comparison with FFTW.en_US
dc.formatpdfen_US
dc.language.isoenen_US
dc.relation.ispartofIEEE Transactions on Signal Processingen_US
dc.rights© IEEEen_US
dc.subjectCache memoryen_US
dc.subjectEmbedded computer systemsen_US
dc.subjectMultiplicationen_US
dc.subjectCompilers (Computer programs)en_US
dc.titleA methodology for speeding up fast fourier transform focusing on memory architecture utilizationen_US
dc.typeArticleen_US
dc.affiliationUniversity of Patrasen
dc.collaborationUniversity of Patrasen_US
dc.subject.categoryElectrical Engineering - Electronic Engineering - Information Engineeringen_US
dc.journalsSubscriptionen_US
dc.countryGreeceen_US
dc.subject.fieldEngineering and Technologyen_US
dc.publicationPeer Revieweden_US
dc.identifier.doi10.1109/TSP.2011.2168525en_US
dc.dept.handle123456789/54en
dc.relation.issue12en_US
dc.relation.volume59en_US
cut.common.academicyear2011-2012en_US
dc.identifier.spage6217en_US
dc.identifier.epage6226en_US
item.fulltextNo Fulltext-
item.cerifentitytypePublications-
item.grantfulltextnone-
item.openairecristypehttp://purl.org/coar/resource_type/c_6501-
item.openairetypearticle-
item.languageiso639-1en-
crisitem.author.deptDepartment of Electrical Engineering, Computer Engineering and Informatics-
crisitem.author.facultyFaculty of Engineering and Technology-
crisitem.author.orcid0000-0002-8299-8737-
crisitem.author.parentorgFaculty of Engineering and Technology-
Appears in Collections:Άρθρα/Articles
CORE Recommender
Show simple item record

SCOPUSTM   
Citations

9
checked on Nov 9, 2023

WEB OF SCIENCETM
Citations

9
Last Week
0
Last month
0
checked on Oct 29, 2023

Page view(s) 20

475
Last Week
7
Last month
15
checked on May 11, 2024

Google ScholarTM

Check

Altmetric


Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.