Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/35643
Title: Employing Streaming Machine Learning for Modeling Workload Patterns in Multi-Tiered Data Storage Systems †
Authors: Lucas Filho, Edson Ramiro 
Savva, George 
Lun, Yang 
Fu, Kebo 
Shen, Jianqiang 
Herodotou, Herodotos 
Major Field of Science: Engineering and Technology
Field Category: Computer and Information Sciences
Keywords: multi-tiered data storage systems;workload patterns;streaming machine learning
Issue Date: 11-Apr-2025
Source: Future Internet, 2025
Volume: 17
Issue: 4
Journal: Future Internet 
Abstract: Modern multi-tiered data storage systems optimize file access by managing data across a hybrid composition of caches and storage tiers while using policies whose decisions can severely impact the storage system’s performance. Recently, different Machine-Learning (ML) algorithms have been used to model access patterns from complex workloads. Yet, current approaches train their models offline in a batch-based approach, even though storage systems are processing a stream of file requests with dynamic workloads. In this manuscript, we advocate the streaming ML paradigm for modeling access patterns in multi-tiered storage systems as it introduces various advantages, including high efficiency, high accuracy, and high adaptability. Moreover, representative file access patterns, including temporal, spatial, length, and frequency patterns, are identified for individual files, directories, and file formats, and used as features. Streaming ML models are developed, trained, and tested on different file system traces for making two types of predictions: the next offset to be read in a file and the future file hotness. An extensive evaluation is performed with production traces provided by Huawei Technologies, showing that the models are practical, with low memory consumption (<1.3 MB) and low training delay (<1.8 ms per training instance), and can make accurate predictions online (0.98 F1 score and 0.07 MAE on average).
URI: https://hdl.handle.net/20.500.14279/35643
ISSN: 19995903
DOI: 10.3390/fi17040170
Type: Article
Affiliation : Cyprus University of Technology 
Huawei Technologies Co. Ltd 
Publication Type: Peer Reviewed
Appears in Collections:Άρθρα/Articles

Files in This Item:
File SizeFormat
futureinternet-2025.pdf2.89 MBAdobe PDFView/Open
CORE Recommender
Show full item record

Page view(s)

37
Last Week
13
Last month
checked on Feb 12, 2026

Download(s) 50

12
checked on Feb 12, 2026

Google ScholarTM

Check

Altmetric


Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.