Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/19036
DC FieldValueLanguage
dc.contributor.authorHerodotou, Herodotos-
dc.date.accessioned2020-09-21T05:55:21Z-
dc.date.available2020-09-21T05:55:21Z-
dc.date.issued2019-07-01-
dc.identifier.citationIEEE 35th International Conference on Data Engineering Workshops, 2019, 8-12 April, Macao, Chinaen_US
dc.identifier.issn978-1-7281-0890-2-
dc.identifier.urihttps://hdl.handle.net/20.500.14279/19036-
dc.description.abstractThe use of computational platforms such as Hadoop and Spark is growing rapidly as a successful paradigm for processing large-scale data residing in distributed file systems like HDFS. Increasing memory sizes have recently led to the introduction of caching and in-memory file systems. However, these systems lack any automated caching mechanisms for storing data in memory. This paper presents AutoCache, a caching framework that automates the decisions for when and which files to store in, or remove from, the cache for increasing system performance. The decisions are based on machine learning models that track and predict file access patterns from evolving data processing workloads. Our evaluation using real-world workload traces from a Facebook production cluster compares our approach with several other policies and showcases significant benefits in terms of both workload performance and cluster efficiency.en_US
dc.formatpdfen_US
dc.language.isoenen_US
dc.rights© IEEEen_US
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectAutomated cachingen_US
dc.subjectDistributed file systemsen_US
dc.titleAutoCache: Employing machine learning to automate caching in distributed file systemsen_US
dc.typeConference Papersen_US
dc.collaborationCyprus University of Technologyen_US
dc.subject.categoryComputer and Information Sciencesen_US
dc.countryCyprusen_US
dc.subject.fieldNatural Sciencesen_US
dc.publicationPeer Revieweden_US
dc.relation.conferenceIEEE International Conference on Data Engineering Workshopsen_US
cut.common.academicyear2018-2019en_US
item.openairecristypehttp://purl.org/coar/resource_type/c_c94f-
item.openairetypeconferenceObject-
item.cerifentitytypePublications-
item.grantfulltextnone-
item.languageiso639-1en-
item.fulltextNo Fulltext-
crisitem.author.deptDepartment of Electrical Engineering, Computer Engineering and Informatics-
crisitem.author.facultyFaculty of Engineering and Technology-
crisitem.author.orcid0000-0002-8717-1691-
crisitem.author.parentorgFaculty of Engineering and Technology-
Appears in Collections:Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation
CORE Recommender
Show simple item record

Page view(s) 20

307
Last Week
0
Last month
9
checked on Nov 21, 2024

Google ScholarTM

Check


This item is licensed under a Creative Commons License Creative Commons