Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14279/10295
Title: | OctopuSFS: A distributed file system with tiered storage management | Authors: | Kakoulli, Elena Herodotou, Herodotos |
Major Field of Science: | Engineering and Technology | Field Category: | Electrical Engineering - Electronic Engineering - Information Engineering | Keywords: | Cluster computing;Data handling;Digital storage;Distributed computer systems;Distributed database systems;Fault tolerance;File organization;Molluscs;Multiobjective optimization;Multiprocessing systems;Storage management | Issue Date: | May-2017 | Source: | Proceedings of the 2017 ACM International Conference on Management of Data, pp. 65-78 | Volume: | Part F127746 | Conference: | International Conference on Management of Data | Abstract: | The ever-growing data storage and I/O demands of modern large-scale data analytics are challenging the current distributed storage systems. A promising trend is to exploit the recent improvements in memory, storage media, and networks for sustaining high performance and low cost. While past work explores using memory or SSDs as local storage or combine local with network-attached storage in cluster computing, this work focuses on managing multiple storage tiers in a distributed setting. We present OctopusFS, a novel distributed file system that is aware of heterogeneous storage media (e.g., memory, SSDs, HDDs, NAS) with different capacities and performance characteristics. The system offers a variety of pluggable policies for automating data management across the storage tiers and cluster nodes. The policies employ multi-objective optimization techniques for making intelligent data management decisions based on the requirements of fault tolerance, data and load balancing, and throughput maximization. At the same time, the storage media are explicitly exposed to users and applications, allowing them to choose the distribution and placement of replicas in the cluster based on their own performance and fault tolerance requirements. Our extensive evaluation shows the immediate benefits of using OctopusFS with data-intensive processing systems, such as Hadoop and Spark, in terms of both increased performance and better cluster utilization. | URI: | https://hdl.handle.net/20.500.14279/10295 | ISBN: | 9781450341974 | DOI: | 10.1145/3035918.3064023 | Rights: | © ACM | Type: | Conference Papers | Affiliation : | Cyprus University of Technology | Publication Type: | Peer Reviewed |
Appears in Collections: | Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation |
CORE Recommender
SCOPUSTM
Citations
39
checked on Mar 14, 2024
Page view(s)
573
Last Week
0
0
Last month
2
2
checked on Nov 21, 2024
Google ScholarTM
Check
Altmetric
Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.