Decoupled processors architecture for accelerating data intensive applications using scratch-pad memory hierarchy

Michail, Harris; Milidonis, Athanasios S.; Alachiotis, Nikolaos

doi:10.1007/s11265-009-0393-9

Παρακαλώ χρησιμοποιήστε αυτό το αναγνωριστικό για να παραπέμψετε ή να δημιουργήσετε σύνδεσμο προς αυτό το τεκμήριο: https://hdl.handle.net/20.500.14279/1644

Τίτλος:	Decoupled processors architecture for accelerating data intensive applications using scratch-pad memory hierarchy
Συγγραφείς:	Michail, Harris Milidonis, Athanasios S. Alachiotis, Nikolaos
Major Field of Science:	Engineering and Technology
Field Category:	Electrical Engineering - Electronic Engineering - Information Engineering
Λέξεις-κλειδιά:	Decoupled;Scratch pad
Ημερομηνία Έκδοσης:	Ιου-2010
Πηγή:	Journal of Signal Processing Systems, 2010, vol. 59, no. 3, pp. 281-296
Volume:	59
Issue:	3
Start page:	281
End page:	296
Περιοδικό:	Journal of Signal Processing Systems
Περίληψη:	We present an architecture of decoupled processors with a memory hierarchy consisting only of scratch-pad memories, and a main memory. This architecture exploits the more efficient pre-fetching of Decoupled processors, that make use of the parallelism between address computation and application data processing, which mainly exists in streaming applications. This benefit combined with the ability of scratch-pad memories to store data with no conflict misses and low energy per access contributes significantly for increasing the system's performance. The application code is split in two parallel programs the first runs on the Access processor and computes the addresses of the data in the memory hierarchy. The second processes the application data and runs on the Execute processor, a processor with a limited address space-just the register file addresses. Each transfer of any block in the memory hierarchy up to the Execute processor's register file is controlled by the Access processor and the DMA units. This strongly differentiates this architecture from traditional uniprocessors and existing decoupled processors with cache memory hierarchies. The architecture is compared in performance with uniprocessor architectures with (a) scratch-pad and (b) cache memory hierarchies and (c) the existing decoupled architectures, showing its higher normalized performance. The reason for this gain is the efficiency of data transferring that the scratch-pad memory hierarchy provides combined with the ability of the Decoupled processors to eliminate memory latency using memory management techniques for transferring data instead of fixed prefetching methods. Experimental results show that the performance is increased up to almost 2 times compared to uniprocessor architectures with scratch-pad and up to 3.7 times compared to the ones with cache. The proposed architecture achieves the above performance without having penalties in energy delay product costs
URI:	https://hdl.handle.net/20.500.14279/1644
ISSN:	19398115
DOI:	10.1007/s11265-009-0393-9
Rights:	© Springer
Type:	Article
Affiliation:	University of Patras
Affiliation:	University of Patras Cyprus University of Technology
Publication Type:	Peer Reviewed
Εμφανίζεται στις συλλογές:	Άρθρα/Articles

CORE Recommender

Δείξε την πλήρη περιγραφή του τεκμηρίου

Page view(s)

497

Last Week
1

Last month
4

checked on 27 Νοε 2024

Google Scholar^TM

Check

Altmetric

Όλα τα τεκμήρια του δικτυακού τόπου προστατεύονται από πνευματικά δικαιώματα

Page view(s)

Google ScholarTM

Altmetric

Google Scholar^TM