Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14279/19036
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Herodotou, Herodotos | - |
dc.date.accessioned | 2020-09-21T05:55:21Z | - |
dc.date.available | 2020-09-21T05:55:21Z | - |
dc.date.issued | 2019-07-01 | - |
dc.identifier.citation | IEEE 35th International Conference on Data Engineering Workshops, 2019, 8-12 April, Macao, China | en_US |
dc.identifier.issn | 978-1-7281-0890-2 | - |
dc.identifier.uri | https://hdl.handle.net/20.500.14279/19036 | - |
dc.description.abstract | The use of computational platforms such as Hadoop and Spark is growing rapidly as a successful paradigm for processing large-scale data residing in distributed file systems like HDFS. Increasing memory sizes have recently led to the introduction of caching and in-memory file systems. However, these systems lack any automated caching mechanisms for storing data in memory. This paper presents AutoCache, a caching framework that automates the decisions for when and which files to store in, or remove from, the cache for increasing system performance. The decisions are based on machine learning models that track and predict file access patterns from evolving data processing workloads. Our evaluation using real-world workload traces from a Facebook production cluster compares our approach with several other policies and showcases significant benefits in terms of both workload performance and cluster efficiency. | en_US |
dc.format | en_US | |
dc.language.iso | en | en_US |
dc.rights | © IEEE | en_US |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Automated caching | en_US |
dc.subject | Distributed file systems | en_US |
dc.title | AutoCache: Employing machine learning to automate caching in distributed file systems | en_US |
dc.type | Conference Papers | en_US |
dc.collaboration | Cyprus University of Technology | en_US |
dc.subject.category | Computer and Information Sciences | en_US |
dc.country | Cyprus | en_US |
dc.subject.field | Natural Sciences | en_US |
dc.publication | Peer Reviewed | en_US |
dc.relation.conference | IEEE International Conference on Data Engineering Workshops | en_US |
cut.common.academicyear | 2018-2019 | en_US |
item.openairecristype | http://purl.org/coar/resource_type/c_c94f | - |
item.openairetype | conferenceObject | - |
item.cerifentitytype | Publications | - |
item.grantfulltext | none | - |
item.languageiso639-1 | en | - |
item.fulltext | No Fulltext | - |
crisitem.author.dept | Department of Electrical Engineering, Computer Engineering and Informatics | - |
crisitem.author.faculty | Faculty of Engineering and Technology | - |
crisitem.author.orcid | 0000-0002-8717-1691 | - |
crisitem.author.parentorg | Faculty of Engineering and Technology | - |
Appears in Collections: | Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation |
CORE Recommender
This item is licensed under a Creative Commons License