Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14279/35643| Title: | Employing Streaming Machine Learning for Modeling Workload Patterns in Multi-Tiered Data Storage Systems † | Authors: | Lucas Filho, Edson Ramiro Savva, George Lun, Yang Fu, Kebo Shen, Jianqiang Herodotou, Herodotos |
Major Field of Science: | Engineering and Technology | Field Category: | Computer and Information Sciences | Keywords: | multi-tiered data storage systems;workload patterns;streaming machine learning | Issue Date: | 11-Apr-2025 | Source: | Future Internet, 2025 | Volume: | 17 | Issue: | 4 | Journal: | Future Internet | Abstract: | Modern multi-tiered data storage systems optimize file access by managing data across a hybrid composition of caches and storage tiers while using policies whose decisions can severely impact the storage system’s performance. Recently, different Machine-Learning (ML) algorithms have been used to model access patterns from complex workloads. Yet, current approaches train their models offline in a batch-based approach, even though storage systems are processing a stream of file requests with dynamic workloads. In this manuscript, we advocate the streaming ML paradigm for modeling access patterns in multi-tiered storage systems as it introduces various advantages, including high efficiency, high accuracy, and high adaptability. Moreover, representative file access patterns, including temporal, spatial, length, and frequency patterns, are identified for individual files, directories, and file formats, and used as features. Streaming ML models are developed, trained, and tested on different file system traces for making two types of predictions: the next offset to be read in a file and the future file hotness. An extensive evaluation is performed with production traces provided by Huawei Technologies, showing that the models are practical, with low memory consumption (<1.3 MB) and low training delay (<1.8 ms per training instance), and can make accurate predictions online (0.98 F1 score and 0.07 MAE on average). | URI: | https://hdl.handle.net/20.500.14279/35643 | ISSN: | 19995903 | DOI: | 10.3390/fi17040170 | Type: | Article | Affiliation : | Cyprus University of Technology Huawei Technologies Co. Ltd |
Publication Type: | Peer Reviewed |
| Appears in Collections: | Άρθρα/Articles |
Files in This Item:
| File | Size | Format | |
|---|---|---|---|
| futureinternet-2025.pdf | 2.89 MB | Adobe PDF | View/Open |
CORE Recommender
Page view(s)
37
Last Week
13
13
Last month
checked on Feb 12, 2026
Download(s) 50
12
checked on Feb 12, 2026
Google ScholarTM
Check
Altmetric
Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.

