Systems with multiple servers under heavy-tailed workloads

Prabhakar, Balaji; Papadopoulos, Fragkiskos; Molinero-Fernández, Pablo; Psounis, Konstantinos

doi:10.1016/j.peva.2005.07.030

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/13912

Title:	Systems with multiple servers under heavy-tailed workloads
Authors:	Prabhakar, Balaji Papadopoulos, Fragkiskos Molinero-Fernández, Pablo Psounis, Konstantinos
Major Field of Science:	Engineering and Technology
Field Category:	Electrical Engineering - Electronic Engineering - Information Engineering
Keywords:	Blocking probability;Expected delay;Heavy-tailed size distribution;M/G/K queue;Multi-server computer systems;Practical approximation formula
Issue Date:	1-Oct-2005
Source:	Performance Evaluation, 2005, vol. 62, no. 1–4, pp. 456-474
Volume:	62
Issue:	1-4
Start page:	456
End page:	474
Journal:	Performance Evaluation
Abstract:	The heavy-tailed nature of Internet flow sizes, web pages and computer files can cause non-preemptive scheduling policies to have a large average response time. Since there are numerous communication and distributed processing systems where preempting jobs can be quite expensive, reducing response times under this constraint is a pressing issue. One proposal for tackling non-preemption is through the use of multiple servers: classify jobs according to size and assign a server to each class. Unfortunately, in most systems of interest, job sizes are unknown. An alterative is to queue all jobs together in a central-queue and assign them in a FCFS fashion to the next available server. But, this has been believed to yield large response times. In this paper, we argue that this is not the case, so long as there are enough servers. The question then is: what is the right number of servers, and is this small enough to be practical? Despite the large amount of prior work in analyzing the behavior of a central-queue system, no existing models are accurate for the case of heavy-tailed size distributions. Our main contribution is a simple yet accurate model for a central-queue with multiple servers. This model accurately predicts the right number of servers, and the average and variance of the response time of the system. Hence, it can be used to improve the performance of some real systems, such as multi-server supercomputing centers and multi-channel communication systems. © 2005 Elsevier B.V. All rights reserved.
ISSN:	01665316
DOI:	10.1016/j.peva.2005.07.030
Rights:	© Elsevier 2005
Type:	Article
Affiliation :	University of Southern California Stanford University
Publication Type:	Peer Reviewed
Appears in Collections:	Άρθρα/Articles

CORE Recommender

Show full item record

SCOPUS^TM
Citations

40

checked on Mar 14, 2024

WEB OF SCIENCE^TM
Citations

29

Last Week
0

Last month
0

checked on Oct 29, 2023

Page view(s) 50

406

Last Week
1

Last month
4

checked on Dec 5, 2025

Google Scholar^TM

Check

SCOPUSTM Citations

WEB OF SCIENCETM Citations

Page view(s) 50

Google ScholarTM

Altmetric

SCOPUS^TM
Citations

WEB OF SCIENCE^TM
Citations

Google Scholar^TM