Παρακαλώ χρησιμοποιήστε αυτό το αναγνωριστικό για να παραπέμψετε ή να δημιουργήσετε σύνδεσμο προς αυτό το τεκμήριο: https://hdl.handle.net/20.500.14279/23024
Τίτλος: Trident: Task Scheduling over Tiered Storage Systems in Big Data Platforms
Συγγραφείς: Herodotou, Herodotos 
Kakoulli, Elena 
Major Field of Science: Natural Sciences
Field Category: Computer and Information Sciences
Λέξεις-κλειδιά: Tiered storage system;Tiered storage;Pruning algorithms;Storage tiers;Spark;Hadoop
Ημερομηνία Έκδοσης: Μαΐ-2021
Πηγή: Proceedings of the VLDB Endowment, 2021, vol. 14, no. 9, pp. 1570-1582
Volume: 14
Issue: 9
Start page: 1570
End page: 1582
Link: http://vldb.org/pvldb/vol14/p1570-herodotou.pdf
Περιοδικό: Proceedings of the VLDB Endowment 
Περίληψη: The recent advancements in storage technologies have popularized the use of tiered storage systems in data-intensive compute clusters. The Hadoop Distributed File System (HDFS), for example, now supports storing data in memory, SSDs, and HDDs, while OctopusFS and hatS offer fine-grained storage tiering solutions. However, the task schedulers of big data platforms (such as Hadoop and Spark) will assign tasks to available resources only based on data locality information, and completely ignore the fact that local data is now stored on a variety of storage media with different performance characteristics. This paper presents Trident, a principled task scheduling approach that is designed to make optimal task assignment decisions based on both locality and storage tier information. Trident formulates task scheduling as a minimum cost maximum matching problem in a bipartite graph and uses a standard solver for finding the optimal solution. In addition, Trident utilizes two novel pruning algorithms for bounding the size of the graph, while still guaranteeing optimality. Trident is implemented in both Spark and Hadoop, and evaluated extensively using a realistic workload derived from Facebook traces as well as an industry-validated benchmark, demonstrating significant benefits in terms of application performance and cluster efficiency.
URI: https://hdl.handle.net/20.500.14279/23024
ISSN: 21508097
DOI: 10.14778/3461535.3461545
Rights: This work is licensed under the Creative Commons BY-NC-ND 4.0 International License.
Attribution-NonCommercial-NoDerivatives 4.0 International
Type: Article
Affiliation: Cyprus University of Technology 
Publication Type: Peer Reviewed
Εμφανίζεται στις συλλογές:Άρθρα/Articles

Αρχεία σε αυτό το τεκμήριο:
Αρχείο Περιγραφή ΜέγεθοςΜορφότυπος
p1570-herodotou.pdfFulltext5.54 MBAdobe PDFΔείτε/ Ανοίξτε
CORE Recommender
Δείξε την πλήρη περιγραφή του τεκμηρίου

SCOPUSTM   
Citations

4
checked on 9 Νοε 2023

WEB OF SCIENCETM
Citations

1
Last Week
0
Last month
0
checked on 29 Οκτ 2023

Page view(s)

410
Last Week
0
Last month
0
checked on 6 Νοε 2024

Download(s)

532
checked on 6 Νοε 2024

Google ScholarTM

Check

Altmetric


Αυτό το τεκμήριο προστατεύεται από άδεια Άδεια Creative Commons Creative Commons