Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14279/19364
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Herodotou, Herodotos | - |
dc.contributor.author | Kakoulli, Elena | - |
dc.date.accessioned | 2020-11-11T12:45:05Z | - |
dc.date.available | 2020-11-11T12:45:05Z | - |
dc.date.issued | 2020 | - |
dc.identifier.citation | Proceedings of the VLDB Endowment, 2020, vol. 13, no. 1, pp. 43-56 | en_US |
dc.identifier.issn | 21508097 | - |
dc.identifier.uri | https://hdl.handle.net/20.500.14279/19364 | - |
dc.description | Presented at 46th International Conference on Very Large Data Bases, 31 August - 4 September 2020, Japan | en_US |
dc.description.abstract | Data-intensive platforms such as Hadoop and Spark are routinely used to process massive amounts of data residing on distributed le systems like HDFS. Increasing memory sizes and new hardware technologies (e.g., NVRAM, SSDs) have recently led to the introduction of storage tiering in such settings. However, users are now burdened with the additional complexity of managing the multiple storage tiers and the data residing on them while trying to optimize their workloads. In this paper, we develop a general framework for automatically moving data across the available storage tiers in distributed le systems. Moreover, we employ machine learning for tracking and predicting le access patterns, which we use to decide when and which data to move up or down the storage tiers for increasing system performance. Our approach uses incremental learning to dynamically rene the models with new le accesses, allowing them to naturally adjust and adapt to workload changes over time. Our extensive evaluation using realistic workloads derived from Facebook and CMU traces compares our approach with several other policies and showcases signicant bene ts in terms of both workload performance and cluster effciency. | en_US |
dc.format | en_US | |
dc.language.iso | en | en_US |
dc.relation.ispartof | Proceedings of the VLDB Endowment | en_US |
dc.rights | This work is licensed under the Creative Commons AttributionNonCommercial-NoDerivatives 4.0 International License. | en_US |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Hadoop | en_US |
dc.subject | Distributed File System | en_US |
dc.subject | Mapreduce | en_US |
dc.title | Automating distributed tiered storage management in cluster computing | en_US |
dc.type | Article | en_US |
dc.collaboration | Cyprus University of Technology | en_US |
dc.subject.category | Computer and Information Sciences | en_US |
dc.journals | Open Access | en_US |
dc.country | Cyprus | en_US |
dc.subject.field | Natural Sciences | en_US |
dc.publication | Peer Reviewed | en_US |
dc.identifier.doi | 10.14778/3357377.3357381 | en_US |
dc.relation.issue | 1 | en_US |
dc.relation.volume | 13 | en_US |
cut.common.academicyear | 2019-2020 | en_US |
dc.identifier.spage | 43 | en_US |
dc.identifier.epage | 56 | en_US |
item.grantfulltext | open | - |
item.openairecristype | http://purl.org/coar/resource_type/c_6501 | - |
item.fulltext | With Fulltext | - |
item.languageiso639-1 | en | - |
item.cerifentitytype | Publications | - |
item.openairetype | article | - |
crisitem.journal.journalissn | 2150-8097 | - |
crisitem.journal.publisher | ACM | - |
crisitem.author.dept | Department of Electrical Engineering, Computer Engineering and Informatics | - |
crisitem.author.dept | Department of Electrical Engineering, Computer Engineering and Informatics | - |
crisitem.author.faculty | Faculty of Engineering and Technology | - |
crisitem.author.faculty | Faculty of Engineering and Technology | - |
crisitem.author.orcid | 0000-0002-8717-1691 | - |
crisitem.author.orcid | 0000-0003-1489-807X | - |
crisitem.author.parentorg | Faculty of Engineering and Technology | - |
crisitem.author.parentorg | Faculty of Engineering and Technology | - |
Appears in Collections: | Άρθρα/Articles |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
3357377.3357381.pdf | Fulltext | 974.4 kB | Adobe PDF | View/Open |
CORE Recommender
SCOPUSTM
Citations
14
checked on Nov 6, 2023
WEB OF SCIENCETM
Citations
10
Last Week
1
1
Last month
1
1
checked on Oct 29, 2023
Page view(s) 50
318
Last Week
0
0
Last month
2
2
checked on Dec 22, 2024
Download(s)
85
checked on Dec 22, 2024
Google ScholarTM
Check
Altmetric
This item is licensed under a Creative Commons License