Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2020, Cybernetics and Systems Analysis
The mathematical model of a queueing system with heterogeneous servers, without queues, and with two types of requests is investigated. High-priority requests are processed in fast servers while low-priority calls are processed in slow servers. If all servers in some group are busy, then reassigning of requests to another group is allowed. Reassigning is based on random schemes and reassignment probability depends on the number of busy servers in appropriate group. Exact and approximate methods are developed for the analysis of characteristics of the system. Explicit approximate formulas to calculate the approximate values of characteristics are proposed.
Theoretical Computer Science, 1992
El-Taha, M. and S. Stidham Jr., Deterministic analysis of queueing systems with heterogeneous servers, Theoretical Computer Science 106 (1992) 243-264.
Journal of Computer and Systems Sciences International, 2019
Markov models of systems with heterogeneous servers, various types of requests, and jump priorities are proposed. It is assumed that there are high and low priority requests; the high priority requests are assigned to the server with the high service rate, and the low priority requests are assigned to the server with the low service rate. Models of two types are studied: with separate queues and the shared queue for requests of different types. Jump priorities determine the rules according to which the type of low priority requests changes depending on the state of the queue. Methods for calculating the distribution of the probability of the system states are developed, formulas for the calculation of the characteristics of the system are derived, and the problem of the optimization of these characteristics is solved. The results of numerical experiments are presented.
Trends in Telecommunications Technologies, 2010
Microelectronics Reliability, 1985
This paper studies the steady state behaviour in discrete time of a limited space queueing problem with random memory arrivals wherein service is accomplished through S-heterogeneous parallel channels. The arrivals at two consecutive time marks depend upon a random variable which takes values one and zero with probabilities p and q respectively according to whether the arrivals at two consecutive time marks are correlated or uncorrelated. The explicit probabilities for the number of units in the system have been worked out. Some special cases of interest have also been derived.
RUDN Journal of Mathematics, Information Sciences and Physics
The mathematical model of the system, that consists of a storage device and several homogeneous servers and operates in a random environment, and provides incoming applications not only services, but also access to resources of the system, is being constructed. The random environment is represented by two independent Markov processes. The first of Markov processes controls the incoming flow of applications to the system and the size of resources required by each application. The incoming flow is a Poisson one, the rate of the flow and the amount of resources required for the application are determined by the state of the external Markov process. The service time for applications on servers is exponential distributed. The service rate and the maximum amount of system resources are determined by the state of the second external Markov process. When the application leaves the system, its resources are returned to the system. In the system under consideration, there may be failures in accepting incoming applications due to a lack of resources, as well as loss of the applications already accepted in the system, when the state of the external Markov process controlling the service and provision of resources changes. A random process describing the functioning of this system is constructed. The system of equations for the stationary probability distribution of the constructed random process is presented in scalar form. The main tasks for further research are formulated.
2007
The probability of losing a customer in M/G/n/0 and GI/M/n/0 loss queuing systems withheterogeneous servers is minimized. The first system uses a queue discipline in which a customer who arriveswhen there are free servers chooses any one of them with equal probability, but is lost otherwise. Providedthat the sum of the servers rates are fixed, loss probability in this system attains minimum value when all theservice rates are equal. The second system uses queue discipline, in which a customer who enters into thesystem is assigned to the server with the lowest number. Loss probability in this system takes the minimumvalue in the case when the fastest server rule is used in which an incoming customer is served by the freeserver with the shortest mean service time. If the mean of the arrival distribution is fixed, then loss probabilityis minimized by deterministic arrival distribution
International Journal of Research in Advent Technology
In this article we consider a single server queueing model in which customers arrive according to Poisson process. At the time of arrival, all customers are classified as ordinary. If the server is busy the arriving customers enter an orbit of infinite capacity. Each customer in the orbit tries, independently of each other, to access the server at a constant rate σ. Each customer in the orbit, independently of others, generate priority with inter-occurrence time exponentially distributed with parameter. A priority generated customer is immediately taken for service if the server is free. Else, such a customer is placed in a waiting space of capacity 1 which is reserved only for priority generated customers. We consider a customer induced interruption while service is going on. The interruption occurs according to a Poisson process. The interrupted customers will enter into a buffer of finite capacity and they will spend a random period of time for completion of interruption. The duration of the interruption of customers in follows an exponential distribution. The service facility consists of one server and the duration of service times of ordinary, priority and interruption completed customers follows an exponential distribution with appropriate representations. Various performance measures are obtained for an appropriate system designing.
European Journal of Operational Research, 2008
We consider a queueing system with disruptive and non-disruptive server interruptions. Both disruptive and non-disruptive interruptions may start when there is a customer in service. The customer repeats its service after a disruptive interruption, and continues its service after a nondisruptive interruption. Using a transform approach, we obtain various performance measures such as the moments of the queue content and waiting times. We illustrate our approach by means of some numerical examples.
This paper highlights a few results of the Poisson queue with two heterogeneous servers studied by Krishnamoorthi (1963) under queue discipline-II that minimizes the violation of the classical ‘First Come First Served (FCFS)’ queue discipline. One special feature of this study is to solve an optimization problem relative to the service rate μ2 of the slow server. A simple way of tracing the optimal service rate μ*2of the slow server is illustrated in a specific numerical exerciseto equalize the average queue length cost with that of the service cost. Further itproposes a new Poisson queue operated bytwo heterogeneous service channels with an m-policy to yield the same steady state probability distribution of queue length and other performance measures. The two service channels provide parallel service in the former Poisson queue with an m-policy to take care of the FCFS discipline while in the proposed Poisson queue those two service channels are serially connectedsuch that each customer is served jointly by both servers according to an equivalent m-policy which perfectly implements the FCFS rule. A numerical illustration is also provided to support the fact of ‘no violation of the FCFS rule’through a comparative method over appropriate measures.
Journal of Statistical Planning and Inference, 2002
Transient probability functions of a single server preemptive (repeat-identical) two priority queueing system are determined using the randomization solution form and lattice path combinatorics.
Distributed Computer and Communication Networks, 2020
In this paper, we propose an approximate method to investigate the Markovian queuing system with two separate pools of heterogeneous servers (HS) under N-policy. It is assumed that fast servers (F-servers) remain awake all the time while slow servers (S-servers) will go to sleep independently when number of calls in the buffer less than some threshold. At the end epoch of a sleep period, if the number of the calls gathered in the system buffer reaches or exceeds a given threshold, the corresponding S-server will wake up independently; otherwise, the S-server will begin another sleep period. An approximate method is applied under the condition that the sleep rates is essentially less than both arrival intensity of calls and their service intensity. The joint probability distribution of the number of calls in system and number of busy S-servers is determined by simple computational procedures. Illustrative numerical examples show the high accuracy of the proposed approximate method.
Mathematics, 2020
The paper studies a controllable multi-server heterogeneous queueing system where servers 1 operate at different service rates without preemption, i.e. the service times are uninterrupted. The 2 optimal control policy allocates the customers between the servers in such a way that the mean 3 number of customers in the system reaches its minimal value. The Markov decision model and the 4 policy-iteration algorithm are used to calculate the optimal allocation policy and corresponding mean 5 performance characteristics. The optimal policy, when neglecting the weak influence of slow servers, 6 is of threshold type defined as a sequence of threshold levels which specifies the queue lengths 7 for the usage of any slower server. To avoid time-consuming calculations for systems with a large 8 number of servers, we focus here on a heuristic evaluation of the optimal thresholds and compare this 9 solution with the real values. We develop also the simple lower and upper bound methods based on 10 approximation by an equivalent heterogeneous queueing system with a preemption to measure the 11 mean number of customers in the system operating under the optimal policy. Finally, the simulation 12 technique is used to provide sensitivity analysis of the heuristic solution to changes in the form of 13 inter-arrival and service time distributions. 14 Keywords: Heterogeneous servers; Markov decision process; policy-iteration algorithm; mean 15 number of customers; decomposable semi-regenerative process 16
Queueing Systems, 2016
Recent computer systems research has proposed using redundant requests to reduce latency. The idea is to run a request on multiple servers and wait for the first completion (discarding all remaining copies of the request). However, there is no exact analysis of systems with redundancy. This paper presents the first exact analysis of systems with redundancy. We allow for any number of classes of redundant requests, any number of classes of non-redundant requests, any degree of redundancy, and any number of heterogeneous servers. In all cases we derive the limiting distribution of the state of the system. In small (two or three server) systems, we derive simple forms for the distribution of response time of both the redundant classes and non-redundant classes, and we quantify the "gain" to redundant classes and "pain" to non-redundant classes caused by redundancy. We find some surprising results. First, the response time of a fully redundant class follows a simple exponential distribution and that of the non-redundant class follows a generalized hyperexponential. Second, fully redundant classes are "immune" to any pain caused by other classes becoming redundant. We also compare redundancy with other approaches for reducing latency, such as optimal probabilistic splitting of a class among servers (Opt-Split) and join-the-shortest-queue (JSQ) routing of a class. We find that, in many cases, redundancy outperforms JSQ and Opt-Split with respect to overall response time, making it an attractive solution.
Mathematics and Statistics, 2023
In this paper, we consider a single server queueing system operating in a random environment subject to disaster, repair and customer impatience. The random environment resides in any one of N + 1 phases 0, 1, 2, • • • , N + 1. The queueing system resides in phase k, k = 1, 2, • • • , N for a random interval of time and the sojourn period ends at the occurrence of a disaster. The sojourn period is exponentially distributed with mean 1/η k. At the end of the sojourn period, all customers in the system are washed out, the server goes for repair/set up and the system moves to phase 0. During the repair time, customers join the system, become impatient and leave the system. The impatience time is exponentially distributed with mean 1 ξ. Immediately after the repair, the server is ready for offering service in phase i with probability q k , k = 1, 2, • • • , N. In the k−level of the environment, customers arrive according to a Poisson process with rate λ k and the service time is exponential with mean 1/µ k. Explicit expressions for time-dependent state probabilities are found and the corresponding steady-state probabilities are deduced. Some new performance measures are also obtained. Choosing arbitrary values of the parameters subject to the stability condition, the behaviour of the system is examined. For the chosen values of the parameters, the performance measures indicated that the system did not exhibit much deviation by the presence of several phases of the environment.
Recent computer systems research has proposed using redundant requests to reduce latency. The idea is to run a single request on multiple servers and only wait for the first completion (discarding all remaining instances of the request). However no exact analysis of systems with redundancy exists. This paper presents the first exact analysis of systems with redundancy. We allow for any number of classes of redundant requests, any number of classes of non-redundant requests, any degree of redundancy, and any number of heterogeneous servers. In all cases we derive the limiting distribution on the state of the system. In small (two or three server) systems, we derive simple forms for the distribution of response time of both the redundant classes and non-redundant classes, and we quantify the "gain" to redundant classes and "pain" to non-redundant classes caused by redundancy. We find some surprising results. First, in many cases the response time of the redundant class follows a simple Exponential distribution and that of the non-redundant class follows a Generalized Hyperexponential. Second, once a class is fully redundant, it becomes "immune" to any pain caused by other classes becoming redundant. We also compare redundancy with other approaches for reducing latency, such as optimal probabilistic splitting of a class among servers (Opt-Split) and Join-the-Shortest-Queue (JSQ) routing of a class. We find that redundancy outperforms JSQ and Opt-Split with respect to overall response time, making it an attractive solution.
Industrial Engineering and Management Systems, 2016
We consider a multi-server queueing system without buffer and with two types of customers as a model of operation of a mobile network cell. Customers arrive at the system in the marked Markovian arrival flow. The service times of customers are exponentially distributed with parameters depending on the type of customer. A part of the available servers is reserved exclusively for service of first type customers. Customers who do not receive service upon arrival, can make repeated attempts. The system operation is influenced by random factors, leading to a change of the system parameters, including the total number of servers and the number of reserved servers. The behavior of the system is described by the multi-dimensional Markov chain. The generator of this Markov chain is constructed and the ergodicity condition is derived. Formulas for computation of the main performance measures of the system based on the stationary distribution of the Markov chain are derived. Numerical examples are presented.
Journal of Applied Probability, 2011
We consider several versions of the job assignment problem for an M/M/m queue with servers of different speeds. When there are two classes of customers, primary and secondary, the number of secondary customers is infinite, and idling is not permitted, we develop an intuitive proof that the optimal policy that minimizes the mean waiting time has a threshold structure. That is, for each server, there is a server-dependent threshold such that a primary customer will be assigned to that server if and only if the queue length of primary customers meets or exceeds the threshold. Our key argument can be generalized to extend the structural result to models with impatient customers, discounted waiting time, batch arrivals and services, geometrically distributed service times, and a random environment. We show how to compute the optimal thresholds, and study the impact of heterogeneity in server speeds on mean waiting times. We also apply the same machinery to the classical slow-server probl...
There are three topics in the thesis. In the first topic, we addressed a control problem for a queueing system, known as the "N-system", under the Halfin-Whitt heavy traffic regime and a static priority policy was proposed and is shown to be asymptotically optimal, using weak convergence techniques. In the second topic, we focused on the hospitals, where faster servers(nurses), though work more efficiently, have the heavier workload, and the Randomized Most-Idle (RMI) routing policy was proposed to tackle this unfairness issue, trying to reward faster servers who serve more with less workload. we extended the existing result to show that this desirable property of the RMI policy holds under a system with multiple customer classes using theoretical exact analysis as well as numerical simulations. In the third topic, the problem was to decide an appropriate number of representatives over time according to the prescribed service quality level in the call center. We examined the stability of two methods which were designed to generate appropriate staffing functions on a simulated data and real call center data from an actual bank.
Mathematics, 2020
A system with heterogeneous servers, Markov Modulated Poisson flow and instantaneous feedback is studied. The primary call is serviced on a high-speed server, and after it is serviced, each call, according to the Bernoulli scheme, either leaves the system or requires re-servicing. After the completion of servicing of a call in a slow server, according to the Bernoulli scheme, it also either leaves the system or requires re-servicing. If upon arrival of a primary call the queue length of such calls exceeds a certain threshold value and the slow server is free, then the incoming primary call, according to the Bernoulli scheme, is either sent to the slow server or joins its own queue. A mathematical model of the studied system is constructed in the form of a three-dimensional Markov chain. Approximate algorithms for calculating the steady-state probabilities of the models with finite and infinite queues are proposed and their high accuracy is shown. The results of numerical experiments...
Mathematics
The paper deals with a finite-source queueing system serving one class of customers and consisting of heterogeneous servers with unequal service intensities and of one common queue. The main model has a non-preemptive service when the customer can not change the server during its service time. The optimal allocation problem is formulated as a Markov-decision one. We show numerically that the optimal policy which minimizes the long-run average number of customers in the system has a threshold structure. We derive the matrix expressions for performance measures of the system and compare the main model with alternative simplified queuing systems which are analysed for the arbitrary number of servers. We observe that the preemptive heterogeneous model operating under a threshold policy is a good approximation for the main model by calculating the mean number of customers in the system. Moreover, using the preemptive and non-preemptive queueing models with the faster server first policy ...
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.