Optimal probabilistic routing in distributed parallel queues

Xin Guo; Yingdong Lu; Mark S. Squillante

Optimal probabilistic routing in distributed parallel queues

Yingdong Lu

2004, ACM SIGMETRICS Performance Evaluation Review

visibility

…

description

11 pages

link

1 file

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

In this paper we consider the fundamental problem of routing customers among multiple distributed parallel queues to minimize an objective function based on equilibrium sojourn times, which arises in a wide variety of distributed computer systems, networks and applications. We derive optimal solutions to this theoretical scheduling problem under general assumptions for the arrival and service processes through stochastic-process limits. Our analysis extends previous studies by providing explicit solutions for the optimal scheduling problem and by considering general single-server queues, including correlated arrivals, under both first-come first-serve and processor-sharing queueing disciplines. In addition, we derive bounds for the variance of customer waiting times and exploit these results in order to obtain optimal solutions to the scheduling problem of interest based on equilibrium sojourn times subject to constraints on the waiting time variance, which have been ignored in previous studies. This collection of results allow us to cover risk factors and incorporate risk management within the context of our optimal scheduling problem. Numerical experiments with data from a real Web server system demonstrate the potential benefits of our theoretical results and methods in practice.

Figures (6)

Figure 1: Queueing Network Model of Distibuted Parallel Queues on IR* has mean E[A] = \~! and variance Var[.A] = o%. Each customer is routed to one oft the queues immediately upon its arrival according to a probability vector P = [pn], <,,< yj, independent of all else; i.e., a customer arrival is independently routed to queue n with probability p,. The router is assumed to be sufficiently fast that customers essentially have no service demands and do not queue at the router. Customer service times are i.i.d. following gen- eral distributions S,, on IR* that depend upon the queue where the customer is served and have mean E[S;,.] = ju,’ and coefficient of variation CZ, = 1,..., N, mutually independent of the arrival and routing processes.

Hence, the added condition based on the variation as a risk factor can be surrogated by the following side constraint

where M;(t) denotes the running maximum of a Brownian mo- tion Xs with drift G5 and variance os, F',,(ds) denotes the density function of the duration of the Markov chain 6(t) at state 6 = 0, 1, and F'x,, (dy) denotes the density function of the value of the diffu- sion process X (6) (t). The second term in Equation (21) can be derived via direct cal- culations on distributions of the running maximum of a Markov- modulated diffusion process. Letting /°(t) be the running maxi- mum process with 6(0) = 6 € {0,1}, and upon conditioning on the time of the first jump 75 of the Markov chain, we then have the following recursion for the distribution of 7° (t).

Consider a system that consists of an arrival stream with unit arrival rate, two servers with mean service time 0.4 and 0.5. The holding costs, or weights, for the two queues are hy = 2 and ho = 3. In the first experiment, we set the coefficient of variation (CV) for the two service time distributions to be 2 and 1.5, respectively, and then we vary the CV for the interarrival distribution from 0.6 to 1.5. In the second experiment, we fix the CV for the interarrival distribution and that of the service time at the first server, and then we vary the CV of the service time at the second server from 0.6 to 1.5. The optimal routing probabilities for the queues with FCFS and PS disciplines are calculated for both experiments. In Figure 2, the optimal routing probability is plotted. We can observe that the differences between the two disciplines are quite visible. The FCFS queues are more sensitive to changes in variance. Moreover, such changes in variance can yield trends in opposite directions for the FCFS and PS queues. 5.2 Comparison with Web Site Data

Table 1 provides the corresponding equilibrium sojoum time re- sults for both peak and off-peak traffic intervals. In particular, we illustrate the performance of the 12 server nodes under our opti- mal routing solution relative to the performance under the routing policy employed at the existing Web site. Negative values imply that our optimal policy provides lower equilibrium sojourn times than those obtained under the existing-system routing policy and quantify such performance improvements, whereas positive values indicate improvements in the equilibrium sojourn time under the existing per-location load-balancing routers.

Mark Van Oyen

Queueing Systems, 1995

We consider the problem of allocating a single server to a system of queues with Poisson arrivals. Each queue represents a class of jobs and possesses a holding cost rate, general service distribution, and a set-up cost. The objective is to minimize the expected cost due to the waiting of jobs and the switching of the server. A set-up cost is required to effect an instantaneous switch from one queue to another. We partially characterize an optimal policy and provide a simple heuristic scheduling policy. The heuristic's performance is evaluated in the cases of two and three queues by comparison with a numerically obtained optimal policy. Simulation results are provided to demonstrate the effectiveness of our heuristic over a wide range of problem instances with four queues.

Log In

Optimal probabilistic routing in distributed parallel queues

Sign up for access to the world's latest research

Abstract

Figures (6)

Related papers

Related papers

Related topics