Call Centers with Delay Information: Models and Insights - CiteSeerX

6 downloads 69348 Views 433KB Size Report
In this paper, we analyze a call center with impatient customers. ... call center as an M/M/s+M queue with endogenized customer reactions to announcements,.
Call Centers with Delay Information: Models and Insights Oualid Jouini† • Zeynep Ak¸sin‡ • Yves Dallery† †

Laboratoire Genie Industriel, Ecole Centrale Paris, Grande Voie des Vignes, 92290 Chˆatenay-Malabry, France ‡ College of Administrative Sciences and Economics, Ko¸c University, Rumeli Feneri Yolu, 34450 Sariyer-Istanbul, Turkey [email protected][email protected][email protected] In this paper, we analyze a call center with impatient customers. We study how informing customers about their anticipated delays affects performance. Customers react by balking upon hearing the delay announcement, and may subsequently renege, particularly if the realized waiting time exceeds the delay that has originally been announced to them. The balking and reneging from such a system are a function of the delay announcement. Modeling the call center as an M/M/s+M queue with endogenized customer reactions to announcements, we analytically characterize performance measures for this model. The analysis allows us to explore the role announcing different percentiles of the waiting time distribution, i.e., announcement coverage, plays on subsequent performance in terms of balking and reneging. Through a numerical study we explore when informing customers about delays is beneficial, and what the optimal coverage should be in these announcements. It is shown how managers of a call center with delay announcements can control the tradeoff between balking and reneging, through their choice of announcements to be made. Keywords queues; telephone call centers; customer behavior: impatient customers, balking; state-dependent analysis; predicting and announcing delays.

1.

Introduction

As call centers are growing in numbers and importance, they are also becoming more sophisticated in their use of technology (Gans et al. (2003); Ak¸sin et al. (2007)). Announcing anticipated delays to customers in order to modulate their use of the call center is one such use of technology, which motivates the research in this article. The main reason for informing customers about their queueing delays is to alleviate congestion and to reduce customer dissatisfaction with waiting. Information about anticipated delays is especially important in service systems with invisible queues (tele-queue) of which call centers are a prime example (Bitran et al. (2008), Zohar et al. (2002). In such systems, the uncertainty involved in waiting is higher than 1

that in systems with visible queues. Upon arrival and during their wait, customers have no means to estimate queue lengths or progress rate. ”Uncertain waits are perceived to be longer than known, finite waits” (Maister (1985) p. 118) and have been related with lower satisfaction (Taylor (1994)). Providing delay information is shown to improve satisfaction (see for example Taylor (1994), Katz et al. (1991), Hui and Zhou (1996)). Several reasons have been attributed to this improvement: a sense of better control on the part of the customer (Katz et al. (1991),Taylor (1994), Hui and Tse (1996)), a reduction in perceived waiting time (Hui and Zhou (1996)), or alternatively an improvement in the acceptability of a wait as a result of delay information (Hui and Zhou (1996)). In call center settings, satisfaction with waiting experiences affect customers’ reactions in terms of balking and reneging behavior. Delay announcements, through their effects on customers, may further modulate these customer reactions. When a new arrival thinks that her anticipated delay is too long, she can balk without joining the system. A delay announcement will instigate further balking, leading to a reduced number of customers in queue and shorter waits. For customers who enter the queue, delay announcements may further have the effect of increasing their patience by reducing the uncertainty. However, since perfect announcements are not possible in reality, some customers may experience longer delays than what has been announced to them. This in turn may have a negative effect on satisfaction and resulting patience behavior. This loose discussion indicates a close relationship between delay announcements and resulting customer behavior. We explore this relationship in the subsequent analysis and propose a model to analyze it. We aim to answer the following questions: how can delays be predicted and once the predicted waiting time distribution is obtained, what should be announced to customers? Is it always in the service provider’s best interest to announce delays, and if so should the announcement be as close to reality as possible? Predicting delays for arriving customers is system state-dependent. This is different from estimating stationary performance measures and usually makes the analysis intractable. In the context of prediction and announcement of delays, an extra layer of complexity is introduced. The analysis becomes more difficult since we have to take into account the description of the system in addition to the announcements given to each waiting customer in queue. Existing research often looks for approximations, such as announcing the stationary mean waiting time, or announcing the actual delay of the last customer (motivated by large systems in an overload regime, see Armony et al. (2005)). In this work, we analyze Markovian models. Paralleling Whitt (1999a), we consider a single class multi-server call center model with impatient customers and working under the first come, first served (FCFS) 2

assumption. Customer reactions to delay announcements are modeled as balking and a form of abandonment. Our main insights are derived from the analysis of this simple model. Starting from a model with perfect delay announcements and contrasting it with a model with no delay announcement, we detail and justify the quantitative building of the new model with potentially imperfect delay information. In our model, customers have the opportunity to balk in response to their anticipated delay. Further to the balking reaction, it has been shown using real call center data that the reneging behavior may also change in response to delay information (Feigin (2005)). We model that effect by extending the model of Whitt (1999a), letting already informed customers renege, even after having chosen to join the queue. We propose a method for the modeling of the new reneging experience by relating it to the delay announcement that is made. Once we obtain the predicted waiting time distribution of a new arrival, we investigate how the service manager should profit from that information to make the announcement. The manager has a choice in terms of choosing a particular percentile of the waiting time distribution to announce to the customer. In making that choice, the manager considers the following tradeoff. Informing the customer of short waiting times, which is likely to underestimate the actual waiting, might lead to less balking but excessive reneging and reduce the reliability of the service provider in the eyes of the customers. On the other hand, informing the customer of large waiting times increases the number of balking customers, but as a result leading to a system that might allow to serve customers within shorter and reasonable delays. Through a numerical analysis, we investigate how the ideal percentile should be chosen as a function of costs associated with balking and reneging customers, system size, and different modeling assumptions about customer reaction. The idea of controlling the tradeoff between balking and reneging through appropriate manipulation of customer reactions that we analyze herein, is similar to the idea of selecting the size of a finite waiting space in a queueuing system to control this tradeoff (Kolesar (1984), Koole and Mandelbaum (2002)). Our main contributions can be summarized as follows: We propose a model that explicitly takes customer reactions to delay announcements into account, both in terms of balking and reneging behavior. This distinction in reactions allows us to model reality more closely, where we observe both types of reactions. It further enables us to differentiate between the two types of departure from queue from a customer standpoint, where a customer may prefer an informed balking decision when delays are announced over a misinformed reneging decision when the announced time is exceeded in practice. We analyze the resulting model as a birthdeath process, and characterize its performance measures. The analysis allows us to explore the role announcing different percentiles of the waiting time distribution, i.e., announcement 3

coverage, plays on subsequent performance in terms of balking and reneging. We show under what conditions providing information on delays to customers will be beneficial, and illustrate that more likely delay information is not necessarily better in terms of system performance. We demonstrate that while the service provider will want to announce a delay that is typically different from the expected waiting time, the relationship between performance and different announcement coverages is not monotonic due to the tradeoff between balking and reneging. The idea of controlling this tradeoff through a choice of announcement coverages is new, and constitutes our main managerial contribution. The paper is structured as follows. In Section 2 we provide a review of related literature. Section 3 describes the setting and possibilities regarding types of delay announcements and reactions to these by customers. The models without and with delay announcement are formulated in Section 4. Sections 5 and 6 characterize their performance measures, respectively. We also establish in Section 6 the basic relationship between announcement coverage and customer balking and reneging through a stochastic comparison. The numerical analysis in Section 7 compares the models with and without announcements, and explores the role the announcement coverage plays. The paper ends with concluding remarks. In the appendix, we present support and proofs of some results derived in the core of the paper.

2.

Literature Overview

The literature related to the subject of this paper spans mainly three areas. The first one deals with queueing models incorporating reneging. The second area pertains to the prediction and announcement of delays seen from a queueing perspective. The third area is related to the psychology of waiting and the qualitative impact of announcing delays on customer behavior. In the following, we highlight some of the literature with regard to the first area, without providing a complete overview. The presence of customers that renege subsequent to delay announcements is a feature often found in call centers that makes this study of value in practice. The importance of modeling abandonments in call centers is emphasized by Garnett et al. (2002), Gans et al. (2003) and Mandelbaum and Zeltyn (2006). Empirical evidence regarding abandonments in call centers can be found in Brown et al. (2005) and Feigin (2005). We refer the reader to Ancker and Gafarian (1962), Garnett et al. (2002), and references therein for simple models assuming exponential impatience, as we do in this paper. In Jouini and Dallery (2007a), the authors consider performance evaluation of a Markovian multiserver queue with two types of impatient customers, having different priorities. Other papers have 4

allowed reneging times to follow a general distribution. Related studies include those by Baccelli and Hebuterne (1981), Brandt and Brandt (2002) and Zeltyn and Mandelbaum (2005). As for the second area, we refer the reader to the relevant work of Whitt (1999b). The author focuses on estimating state-dependent delays and presents both accurate methods and approximations of waiting times in different situations. Models under consideration are queueing systems with different customer classes, characterized by different service rates, exponential and non-exponential service times. Nakibly (2002) reviews several classical results, and extends the analysis to some other complex models, with priorities but without abandonments. Announcement of these delay estimates to customers and subsequent reactions of customers are not considered in these studies. As already mentioned, estimating state-dependent delays is known to be difficult. It is related to the transient analysis of birthdeath processes, and in general of Markov chains. The literature on birth-death processes is extensive and growing, see Keilson (1964a), Kijima (1997), Jouini and Dallery (2007b), and references therein. The problem of predicting and announcing delays has recently received a lot of attention in the field of call centers. Whitt (1999a) models and quantifies the effect of giving state information to customers on the performance in a single class Markovian call center model. The analysis shows that average waiting times can be reduced with accurate announcements. Jouini et al. (2007) extend the analysis of Whitt (1999a) to a setting with two customer classes with priorities, incorporating delay announcements. Guo and Zipkin (2007) consider a simple queueing model where three levels of information can be provided to customers, namely no information, partial information and full information (the exact waiting time). The authors investigate how information about delays can enhance system performance. None of these papers allow for reneging subsequent to joining the queue, which is a restriction that is relaxed in Armony et al. (2005) as well as in the current paper. Taking general distributions for service times and times before reneging, Armony et al. (2005) develops methods based on fluid approximations to study the effect of announcing delays. The performance of one particular delay announcement, that of announcing the last customer’s realized delay, is explored. Motivated by this type of delay announcement, Ibrahim and Whitt (2007) explore the performance of different real-time delay estimators based on recent delay experience by customers. Armony and Maglaras (2004), and Armony and Maglaras (2004a) consider a slightly different model. Based on his anticipated delay information, a customer may balk, elect to wait, or leave a message. When a message is left, the service provider calls the customer back within a guaranteed time. The authors show that system 5

performance improves, both in terms of throughput and in terms of average waiting time. Note that although the modeling approaches differ from one work to another, the findings usually confirm the benefits of communicating delays to customers. Guo and Zipkin (2007) is one exception, where conditions are identified under which more information may hurt the customer or the service provider. In a more general service setting, where delay information provided to customers is allowed to take different forms including qualitative and vague statements, Allon et al. (2007) show the possibility of a strategic self-interested firm that might choose to provide intentionally vague information to strategic customers to induce desired behavior from them. Our modeling approach, which allows us to capture the link between the announcement that is made and resulting system performance explicitly, enables us to similarly explore different effects on system performance that different announcements may have and as a result propose that the service provider could choose delay announcements to induce particular balking and reneging reactions by customers. The third area of literature close to this paper is related to the psychology of waiting. The literature on customers influenced by delay begins with Naor (1969). Focusing on customer psychology in waiting situations Maister (1985) proposes a set of hypotheses some of which are tested in the subsequent literature. We rely on the recent review by Bitran et al. (2008) and references therein to highlight results from the literature on which we draw in formulating our model. It is apparent from this review that modeling waiting experiences and generalizing customer reactions are difficult to do due to the presence of many moderating effects including personal differences and service context. Given that call center specific evidence is very limited, we also rely on results that emerge from other contexts. One of the first models is by Osuna (1985) constructing a direct relationship between waiting time and dissatisfaction or stress experienced by customers during the wait. Elsewhere it is argued that it is not time but the perception thereof that drives customer satisfaction (see for example Hornik (1984), Zakay (1989)). It is shown that in settings where customers lack information on the duration of the wait, i.e. in settings with uncertainty concerning queue length and progress as in tele-queues, people tend to overestimate waiting durations (Taylor (1994)). This implies higher dissatisfaction with the wait. Information on delays on the other hand shorten perceived waiting time by customers (Katz et al. (1991), Hui and Tse (1996)). Many have attributed this to the sense of control that the customer feels about the wait once uncertainty about it has been removed through an announcement (Taylor (1994), Katz et al. (1991), Hui and Tse (1996)). But this does not seem to be the only effect: told that they will wait for a certain amount, people are documented to adjust their acceptance level for waiting (Hui and Zhou (1996)). More recent results emphasize the importance of 6

a sense of progress to customers Rafaeli (2002), Munichor and Rafaeli (2007)). In physical queues, people observe that they are getting closer to service and this has an improving effect on their mood while waiting (Rafaeli (2002)). Lab experiments simulating a tele-queue in Munichor and Rafaeli (2007) show that customers prefer queue position announcements over music or apologies, and react by being more persistent in holding the line, providing further evidence to the idea that progress matters to customers. An improvement in satisfaction level as customers approach service is also consistent with the check-in phase of the five phase service encounter profile detailed in Bitran and Lojo (1993), of which an example in a call center context is provided in Bitran et al. (2008). While a delay announcement can act like a time guarantee thereby increasing satisfaction as the wait proceeds to the announced time, exceeding this time will have a negative effect on customers, reducing their satisfaction (Katz et al. (1991), Kumar et al. (1997)). Results from this literature motivate some of our modeling assumptions, as detailed in the following section. We note that many of the results pertain to satisfaction levels in general waiting settings, and links to customer reactions like balking and reneging in call center settings have not been established. Some exceptions are papers that directly analyze call center data (Brown et al. (2005), Feigin (2005)) and the lab experiments in Munichor and Rafaeli (2007).

3.

Announcing Delays in Call Centers

This section lays out basic characteristics of the setting being modeled and describes possible impact of the announcement of delays on customer behavior.

3.1

Preliminaries

Consider a call center with a single group of agents, serving a single class of customers. The model consists of one infinite queue, and a set of s parallel, identical servers representing the set of customer representatives (agents). Customers waiting in queue are served under the FCFS discipline. Interarrival times as well as successive service times are random, and assumed to be independent and identically distributed (i.i.d.). Upon arrival, a customer is addressed by one of the available agents, if any. If not, the customer may decide to immediately leave the system (balking) or to wait, at least for a certain amount of time (waiting). In addition, this decision is likely to depend on whether or not the call center is providing information about anticipated delays in the queue. In this paper, we will study and compare the behavior of a call center without delay announcements 7

and of a call center with delay announcements. In order for the comparison to make sense, we need to specify customer patience in these different environments. To do so, let us consider the following idealized situation with perfect delay announcements first. Consider a “perfect information call center” in which the waiting time of a customer could be predicted exactly and given to the customer upon his arrival as information. Further assume that the customers of this call center are fully aware of such an announcement and its exactness. In this situation, each customer, say customer k, would arrive with a given (deterministic) patience time tk . Note that this patience time could possibly be equal to 0 representing a customer who is not willing to wait (infinitely impatient customer). This time represents an ideal construct which we may call patience threshold under perfect delay information. It would represent the customer’s willingness to wait including all dimensions of the service (like for example the importance attributed to the service by the customer), however excluding the customer’s preferences concerning uncertainty in the duration of the hold. The behavior would then simply be as follows. If the announced delay time, say dk , exceeds the patience time of the customer, i.e., dk > tk , the customer would leave the system immediately (balking); otherwise, the customer would join the queue and stay until entering service. Because of the accuracy of the delay announcement and the definition of tk , the customer would never abandon the queue. For simplicity, we assume here and throughout the paper that the customer’s discovery of a busy system, listening to a delay announcement, and making a decision on balking all happen in zero time. In reality we expect these durations to be very short, nevertheless still positive. Since customers will vary in their preferences, each customer may have a different tk value. We assume that we can statistically characterize the patience behavior of the customers by a distribution of patience times. This distribution will in general consist of an atom at the origin, corresponding to very impatient customers, and a remaining continuous distribution. An atom close to the origin is also present in the Israeli call center data analyzed in Brown et. al. (2005) and the U.S. Bank call center data analyzed in Feigin (2005). Let α0 denote the probability that a customer, arriving to a busy system, will immediately balk. This feature models a non-negligible portion of the customers who call with the idea to hang up immediately once they know that they have to wait for service. On the other hand, customers who find a busy system will accept to join the queue with probability 1 − α0 . For these customers, we assume that their patience thresholds are random and i.i.d. under a given continuous distribution. The patience of a particular customer is thus a random outcome of this distribution and is known upon arrival in the queue. We will assume that these patience times are exponentially distributed with parameter γ. We continue to impose 8

the exponentiality assumption on patience times in the models with (imperfect) delay announcement and no delay announcement that we present below. This is mainly to preserve tractability in our models. Indeed, empirical evidence from real call centers suggests that patience times may not be exponential (Brown et al. (2005), Feigin (2005)).

3.2

The Impact of Announcing Delays

In this section we consider a call center that provides information on delays to its customers. Unlike what is assumed in Section 3.1, delay announcements are not perfect, thus allowing us to model reality where indeed it is impossible to provide exact delay announcements. This brings with it the added complexity that different delay predictions are possible and hence that different durations may be announced to customers. From a modeling perspective, an important feature is that customers are expected to react to such announcements by changing their behavior (Katz et al. (1991), Hui and Tse (1996), Taylor (1994), Hui and Zhou (1996), Munichor and Rafaeli (2007)). Different types of announcements might instigate different types of reactions in terms of balking or reneging. When we inform a customer about her anticipated delay, she will decide right away, either to hang up immediately because she estimates that her delay is too long, or to start waiting in the queue. In the latter case, there are two further possibilities. The first is that customers never abandon thereafter. The second possibility is that they are still impatient, i.e., customers may abandon even if they have chosen to start waiting. It is natural to expect that customers would abandon in a different way than in the model without announcements (Feigin (2005)). Similarly, the balking reaction is expected to depend on the information provided. For customers who find the system busy, the balking with probability α0 is still present. The remaining customers (with probability 1 − α0 ) will get delay information. Thus, their subsequent balking decisions will depend on the delay information. Below, we discuss possible forms of delay information that could be given to customers, and present the different modeling possibilities that arise from these. 3.2.1

What Information Do We Provide?

It is possible to provide different types of information regarding delays. A common one is to announce the number of customers ahead. This is not very meaningful in a setting where the number of servers are unknown to customers, service times are random and where customers may abandon the queue. A more sophisticated version announces the number

9

of customers ahead upon arrival, and updates the announcement each time that number changes. However, in addition to creating operational complexity, this type of information may create customer confusion: due to randomness, equivalent decreases in queue length may take very different lengths of time. A further possibility is to announce some realtime delay estimators based on recent delay experience by customers, as in Armony et al. (2005) or Ibrahim and Whitt (2007). This type of announcement is shown to work well when delays are relatively long, i.e., for highly loaded systems. Since we want to announce state dependent queueing delays, we do not consider this type of announcement any further herein. For call center settings where the state-dependent waiting time of each new arrival can be derived, we distinguish between three additional types of delay announcement. The first, as in Whitt (1999a), is to give the whole distribution of the anticipated waiting time to each new customer. A customer will make use of the entire delay distribution to decide wether to join the queue or not. This is like saying that each customer has her own coverage probability (precision) which may be different from that of the other customers. Then, the customer will pick up the realization from the distribution, that corresponds to her coverage probability, and will make her balking decision in response. This first type of announcement is at odds with reality. In practice it is unrealistic to give a customer the distribution function of her anticipated delay. For that reason, a second type of announcement, which is to communicate the expected value of the delay distribution, has been suggested by Whitt (1999a). This is simple and easy to understand by customers. On the other hand, when the number of customers already in queue is small the state-dependent waiting time can be quite different from its mean, since the variability of state-dependent delays will be high in that case. The expected delay will only be a good estimate of the state-dependent delay when the number of customers ahead in queue is large, since the delay distribution is in that case highly concentrated around its mean (see the remark on page 196 of Whitt (1999a)). In the third type of announcement, the call center manager specifies a unique tail probability for everybody, say β. The parameter β is a coverage probability based on which we determine a time x from the waiting time distribution of a newly arriving customer. The actual waiting time of this customer will be less than that x with probability β. It will be larger than that x with probability 1 − β. Typically, the announcement information would be “You will wait less than x minutes”. This third type of delay announcement is more plausible in a real setting, since a typical customer will understand a given delay announcement, and the service provider is able to control the desired reliability of the announcement by 10

choosing β. Our analysis in this paper is for the third form of announcement with a given coverage probability. 3.2.2

How Does a Customer React?

The first model we can consider, that captures the change in customer behavior with respect to delay information, is the model analyzed by Whitt (1999a). Accordingly, a customer balks with a probability that depends on her anticipated delay. Once she decides to join the queue, she never abandons thereafter. Replacing all abandonments by balking is quite meaningful in an environment with perfect delay announcements, as described in Section 3.1, but when predictions and therefore announcements are not exact and customers know about this, we can envisage many other things happening. In particular, we expect customers to abandon the queue, even after having chosen to join initially, and further expect this abandonment behavior to depend on the announced delay. For example a customer who does not balk, may perceive the subsequent wait as longer than it really is and lose patience as a result. Customers may take the announced time like a time guarantee, or time progressing towards the announced wait as a sign of progress and persist in holding until the announced delay, but if the wait starts exceeding the announced time they will be frustrated and may lose their patience as a result. Other customers may not experience any dissatisfaction from uncertain waits, or may not take the delay announcements by the firm to be credible, and thus may not change their initial patience behavior. These are only some possibilities, motivated by observations from the earlier literature, and we can easily envisage these effects being combined with others to yield customer responses to imperfect delay announcements. What we conclude from these possibilities is that a model that considers customer abandonments subsequent to a decision to join the waiting line seems more realistic. We thus consider a model that incorporates customer reactions both in terms of balking that depends on a customer’s anticipated delay, and subsequent reneging. Given the complexity of reality, we opt for a heuristic approach and model the customer reaction in terms of reneging by making use of several extreme cases. One possibility is that of a customer who ignores the delay information and does not update her patience threshold tk . We call this the no-update case. Among other things, this might be attributed to a form of duration negligence, as discussed and reviewed in Bitran et al. (2008). Another possibility is a customer who completely updates her patience threshold and replaces tk by the announced delay. We call this the update case. One possible explanation for this type of behavior is the presence of an anchoring effect (see for example Kahneman et al. (1982)) on the announced delay. Given the definition of tk in a world 11

with perfect delay information, we do not expect a delay announcement to affect customer patience by prolonging it beyond this value. In other words, the patience prolonging effect of delay announcements that is suggested by the data in Feigin (2005) and by evidence in Hui and Zhou (1996), is already incorporated in tk . Clearly, other external factors may prolong a customer’s patience but we do not model these herein, so these possibilities are not considered. This suggests that the no-update case represents one extreme case. Similarly, some customers may update their patience threshold to a value that is below the announced delay. While we recognize this as a possibility, we think such customers will be limited and can be disregarded in the model. The earlier cited work suggesting an increase in satisfaction as time progresses and approaches the announced delay subsequent to joining the queue (Bitran et al. (2008), Munichor and Rafaeli (2007)), lends support to this view. Thus, we consider the update case as another extreme case. In reality, the most likely scenario is that of customers that combine these effects, resulting in a patience threshold that lies somewhere between the announced delay and tk . We propose to model this case by assuming that each customer has a probability q to be an update type and 1 − q to be a no-update type, and call the resulting situation the mixed case. For each of the three cases that we consider in modeling customer behavior (no-update, update, and mixed), the desired patience behavior is approximated by an exponential distribution. The new reneging experience in the system with delay announcement is characterized in Section 6.1.

4.

Description of the Models

In Sections 4.1 and 4.2, we formulate the models without and with delay information, respectively. In the first model we consider, we assume that no delay information is given to customers. After entering the queue, a customer will wait a random period of time for service to start. If service has not begun by this time she will renege. In the second model, each customer gets information about her anticipated delay and reacts to that. We formulate both models based on comparisons with the perfect delay information call center we introduced in section 3.1. As before, in both models there is a single class of customers and s statistically identical servers in parallel. We also assume, in both models, that arrivals follow a Poisson process with rate λ, and service times are exponentially distributed with rate µ. Another similarity is that in all models, we consider the proportion α0 of customers who are not willing to wait, irrespective of available information.

12

4.1

The Model without Delay Announcement

The model with perfect delay information represents a setting with no uncertainty about waiting duration, once the customer has chosen to wait. The model without delay announcement on the other hand represents the other extreme with the highest level of uncertainty concerning the waiting duration. Given risk-averse behavior exhibited by customers in the domain of time Leclerc et al. (1995) and based on earlier cited evidence, as uncertainty increases, we expect lower customer satisfaction. Due to this, we consider the proportion of customers who are unwilling to wait to be higher in a call center with no delay announcement when compared to an ideal call center with perfect delay information. Among the customers who find all agents busy, a proportion α0 immediately balks. This is the base proportion (customers who are unwilling to wait any amount of time). For the remaining customers (proportion 1−α0 ), there is a proportion α1 who choose to immediately balk too. We attribute this proportion to the frustration a customer may anticipate in case she will join an invisible queue with no information about anticipated delays. Even if these customers were willing to wait some given amount of time, they would feel afraid that their waiting time will be too long. So, they would prefer to immediately leave the system upon their arrival instead of experiencing the uncertainty of waiting in queue. In practice, distinguishing the two proportions (α0 , α1 ) is difficult and we only differentiate between the two proportions to clarify the comparisons. The uncertain wait is also expected to affect the resulting patience of the customers that decide to wait, potentially due to overestimating the delay Katz et al. (1991), or because uncertainty negatively effects the acceptability of a wait Hui and Zhou (1996). In the model with perfect delay announcements customers have been modeled as having patience times that are exponentially distributed with parameter γ. Since delay announcements are exact, there is no reneging by customers in this model, and all reneging is replaced by balking subsequent to delay announcement. We model the setting with no delay announcements as having patience times that are exponentially distributed with parameter γ + γ1 where γ1 ≥ 0. This allows us to consider the possibility of higher impatience in the case with no delay information. While a comparison with a perfect information call center is not possible in practice, the comparison of hazard function estimates for customers who have and have not heard delay announcements in a call center (in Figure 12 of Feigin (2005)) seems to lend support to the idea that initial abandonment (modeled as balking in our setting) is higher and subsequent patience seems to be lower when there is no announcement. 13

The resulting model referred to as Model 1 is shown in Figure 1. λα 0

λ

λ (1 − α 0 )α1

λ (1 − α 0 )

γ + γ1

λ (1 − α 0 )(1 − α1 )

s

Figure 1: Model without delay announcement, Model 1

We note that in Model 1 there is no time difference between the two balking decisions. The only difference is the reason for balking, or in other words the type of balking. Customers of both types of balking will immediately leave the system. So in Model 1, a customer who finds all servers busy will balk with probability α0 + (1 − α0 )α1 . She will join the queue with the complementary probability (1 − α0 )(1 − α1 ). In the latter case, she is willing to wait in queue only a certain amount of time. If service has not begun by this time she will renege and be lost. We assume that her patience threshold is a realization derived from an exponentially distributed random variable with rate γ + γ1 . Finally, we assume that retrials are ignored, and reneging is not allowed once a customer starts being served. Model 1 can be viewed as an M/M/s+M queueing system with balking. The symbol M after the + is to indicate the Markovian assumption for times before reneging. Note that reneging makes the system unconditionally ergodic for any γ > 0.

4.2

The Model with Delay Announcement

Based on the discussion in Section 3.2, we describe in what follows the call center model in which we add delay announcement. Upon arrival, if less than s customers are in system, no action is taken because the new customer gets service immediately. If all agents are busy, we again model the fact that a proportion of new arrivals come being unwilling to wait any duration of time. We assume that each new arrival to a busy system has a probability α0 to immediately balk, before even hearing his anticipated delay. It is reasonable to assume that the proportion α0 is identical to that in the model with perfect information and in Model 1 since these customers do not experience any difference between the two systems. Contrary to Model 1, we believe that balking stemming from waiting uncertainty (no information) has no reason to be present here, i.e., there is no α1 parameter. In addition, we assume that customers have trust in the information given by the call center. All customers who were initially willing to wait (with probability 1 −α0 ) will give the call center the chance to provide them with delay information. This information is assumed to be delivered in zero 14

time. These customers may however balk for another reason, namely, they may decide to balk as a function of the announced delay. Consider a customer who finds all servers busy and n (n ≥ 0) waiting customers ahead of her in queue. With probability 1 − α0 , she will accept to hear the information provided to her about her anticipated delay. If she does so, we derive the distribution of her virtual delay, say Dn . The quantity Dn is a conditional random variable, given the queue state n. It is the waiting time in queue for a customer that will wait until service begins (in case she elects to join the queue). Then, we communicate to her the delay which corresponds to a given coverage probability β. Let gn (t) be the probability density function (pdf) of Dn , and let Gn (t) be its Cumulative Distribution Function (CDF), t ≥ 0. Let T be the random variable measuring the initial random patience threshold of customers, (with probability 1 − α0 : an exponential distribution with rate γ). Let dn be the delay we communicate to our customer, dn = G−1 n (β). It means that the queueing delay of the new customer does not exceed dn with a chance β. For that customer, we stipulate that she decides to balk if her random patience threshold, T , does not exceed her anticipated delay dn . Let pB (n) denote the probability of this event. Next assuming that balking decisions of successive customers are independent, we get pB (n) = P (T < dn ) = 1 − e−γdn , n ≥ 0.

(1)

So, given that a customer does not balk with probability α0 , she may balk with probability pB (n) in response to the delay announcement. Once the new customer elects to wait with probability 1 − pB (n), she may renege within a random delay. This new random threshold is statistically different from the initial one and is characterized in Section 6.1 as a function of the delay announcement made. Again, we assume that retrials are ignored, and reneging is not allowed once a customer starts being served. The resulting model referred to as Model 2 is shown in Figure 2. The parameter of reneging γ 0 in this figure will be defined later in Section 6.1. λα 0

λ

λ (1 − α 0 ) p B (n)

λ (1 − α 0 )

λ (1 − α 0 )(1 − p B (n))

γ'

s

Figure 2: The new model incorporating delay announcement, Model 2

15

5.

Analysis without Delay Announcement

In this section, we derive several performance measures related to Model 1. The approach here is based on system state probabilities seen by a randomly chosen new arrival. From the classical PASTA property (Poisson Arrivals See Time Averages), these probabilities coincide with those seen by an outside random observer, i.e., simply the probabilities that the system is in a given state at a random instant. The PASTA property is based on the memoryless property of the Poisson process, which allows to generate a sequence of arrivals that take a random look at the system. We refer the reader to Kleinrock (1975) for further explanation, and Wolff (1982) for a rigorous proof. While our approach is similar to that of Whitt (1999a), the model we consider as well as details of how we derive the performance expressions are different. To avoid clutter in the notation, we will use the same notation for Models 1 and 2. Also, we derive the performance measures while ignoring the parameter γ1 defined in Section 4.1. In the analysis below, the reneging rate γ might be simply replaced by γ + γ1 if we have more reneging as explained in Section 4.1. We denote the system state by a random variable E(t) taking non-negative integer values representing the number of customers in the system at t ≥ 0. Let p(i) be the steady state probability that i customers are present in Model 1 at a random instant, i ≥ 0. The process {E(t), t ≥ 0} is a birth-death process, see Figure 3. When the number of customers present in system, i, is less than or equal to s−1, all arrivals enter the system. So, the birth rates are λ. In addition, all departures are service completions. Then, the death rates are iµ, i < s. Otherwise if i ≥ s, an arrival has a probability α0 + (1 − α0 )α1 to immediately balk. So, the birth rates are λ(1 − α0 )(1 − α1 ). Departures may be service completions or abandonments. Thus, the death rates are sµ + (i − s)γ. λ

0

1 µ

λ

λ …

(1 −

α0 )

(1 −



λ

(1 −

α0 )

(1 −

s+1

s

s-1

α1 )

sµ+ γ

α1 ) s+2



sµ+2 γ

Figure 3: Birth-death process for Model 1

In the steady state, one has a set of infinite recursive relations relating the steady state probabilities p(i), i ≥ 0. We go on to solve by iteration and get the following solutions p(i) =

λi λi (1 − α0 )i−s (1 − α1 )i−s p(0) for i > s, p(0) for 0 ≤ i ≤ s, and p(i) = Q i! µi s!µs i−s j=1 (s µ + j γ) 16

(2)

where p(0) is the stationary probability to have no customers in system. Using the normalization condition, it follows that !−1 Ã s ∞ X λi X λi (1 − α0 )i−s (1 − α1 )i−s p(0) = . + Q i! µi i=s+1 s! µs i−s j=1 (s µ + j γ) i=0

(3)

Equations (2) and (3) enable us to compute all system states’ stationary probabilities at a random instant. Define now P I as the probability of immediate service for a newly arriving customer. Applying PASTA, we get I

P =

s−1 X

p(i).

(4)

i=0

We denote by Lq the expected number of customers waiting in queue, and by Ls the expected number of customers present in system. They are given by Lq =

∞ X

ip(s + i),

and Ls =

i=1

∞ X

ip(i).

(5)

i=1

Let us now proceed to compute the probability for a new arrival to balk, say P B , that to enter service, say P S , and that to renege, say P R . When a new customer finds less than s customers in the system, she gets service immediately. This is equivalent to finding at least one server idle, and it occurs with the probability P I given in Equation (4). Let us now consider the complementary event, i.e., assume that a new customer finds all servers busy. This customer will balk with probability α0 +(1−α0 )α1 . Applying PASTA we get ¡ ¢ P B = (α0 + (1 − α0 )α1 ) 1 − P I .

(6)

Let us compute the stationary probability to renege, P R . This quantity can be viewed as the stationary proportion of customers who renege, seen at the epoch of a new arrival. Then, it is the fraction of the stationary mean rate of reneging customers (seen by a new arrival) over that of arrivals. Applying PASTA, the latter fraction is equal to that seen at a random instant. Moreover, since times before reneging are i.i.d. and exponentially distributed, the stationary mean rate of reneging customers is simply γLq . So, P R may be computed as PR =

γLq . λ

(7)

A customer who does not balk upon arrival and does not renege thereafter (in case she waits in queue) will necessarily enter service. Thus, the stationary probability for a new arrival to enter service is simply given by P S = 1 − P B − P R . 17

We consider the analysis of queueing delays next. The sojourn time of a customer in queue will end either as a result of reneging or start of service. Here, we focus on the conditional waiting time, given that service is completed. We compute any kth order moment, k ≥ 0, of this conditional random variable, say E(X k | S). To do so, we use a similar approach to that used in Jouini and Dallery (2007a), which is originally inspired from Whitt (1999a). Consider a customer who finds all servers busy and n waiting customers in queue, n ≥ 0. Assume that this customer does not balk, and let us define a pure-death process with statedependent death rates as shown in Figure 4. The process is derived from that in Figure 3. We only consider states ranging from s to s + n + 1 (the n + s customers already in system plus the new arrival). We do not consider birth rates because future arrivals have no priority over the customer of interest (the discipline of service is FCFS). s

s+1



sµ + γ

s+n

sµ + (n+1) γ

s+n+1

Figure 4: Pure death process for the customer s + n + 1

Conditioning on the system state, this new arrival will start her sojourn at position s + n + 1 in system (position n + 1 in queue). There are two possibilities: the customer of interest will wait until service begins (with probability Ψn+1 ) or renege before service begins (with probability 1−Ψn+1 ). The process moves from state i to state i−1, s+1 ≤ i ≤ s+n+1, in the case of a departure event, i.e., in the case of either a service completion with rate sµ, or a reneging event with a rate equal to the number of waiting customers times the reneging rate, (i − s)γ. The memoryless property of service times as well as times before reneging allows us to state the following claim. When being in state i, the probability that the process moves down due to the event of reneging of our customer of interest is given by

γ . sµ+iγ

Next,

the conditional probability, Ψn+1 , that our customer does not renege while waiting in queue, given that she finds n waiting customers upon arrival and does not balk, may be written as Ψn+1 =

n+1 Yµ i=1

γ 1− sµ + iγ

¶ .

(8)

In other words, the latter event means that our customer does not renege in all possible queue positions she will occupy, starting from position n + 1 until position 1 and enters service afterwards. Let us now define the conditional random variable Xn+1 , given that our customer finds the system busy and n waiting customers, n ≥ 0. The random variable Xn+1 denotes the time it takes to empty the queue of n + 1 waiting customers (without considering eventual 18

future arrivals). Our customer will enter service with probability (1 − α0 )(1 − α1 )Ψn+1 in Xn+1 units of time. Conditioning on a state seen by a new arrival and averaging over all possibilities, we state using PASTA that the kth order moment of the sojourn time in P k queue and being served afterwards is given by ∞ n=0 (1 − α0 )(1 − α1 )p(s + n)Ψn+1 E(Xn+1 ), k where E(Xn+1 ) denotes the kth order moment of Xn+1 . Thus, the kth order moment of the

conditional waiting time in queue, given service is completed, is P∞ k ) (1 − α0 )(1 − α1 )p(s + n)Ψn+1 E(Xn+1 k E(X | S) = n=0 . S P

(9)

It remains for us to derive the expression of E(Xnk ), n ≥ 1. The random variable Xn can be viewed as the first passage time to state s starting from state s + n in the pure death process of Figure 4. Then, the distribution of Xn is the convolution of n independent exponential distributions with parameters sµ + γ, sµ + 2γ, ..., and sµ + nγ, which is an hypoexponential distribution, see Ross (1997). So, all moments of Xn may be easily derived in closed form using for example the Laplace transform of the hypoexponential distribution. Here, we only give the first two moments of Xn . They are à n ! à n !2 n X X X 1 1 1 E(Xn ) = , and E(Xn2 ) = + . 2 sµ + iγ (sµ + iγ) sµ + iγ i=1 i=1 i=1

(10)

Finally, the variance and standard deviation of the conditional waiting time given service, are p given by V ar(X | S) = E(X 2 | S) − E(X | S)2 and σ(X | S) = V ar(X | S), respectively. The performance measures we derive in this section will be used later in Section 7 for the numerical comparison between the models with and without announcement.

6.

Analysis with Delay Announcement

In this section, we consider the analysis of the model with delay information, Model 2. We first characterize in Section 6.1 the new patience behavior of customers as a function of the delay announcement made. Performance measures of Model 2 are then derived in Section 6.2. In Section 6.3, some results about the relationship between announcement coverage and customer related performance measures are investigated.

6.1

Characterization of the New Reneging Behavior

For a customer waiting in queue in Model 2, we assume that her random patience threshold is exponentially distributed with rate γ 0 , γ 0 6= γ. While we need the exponentiality assumption (which is an approximation) for an exact analysis of the corresponding queue to be feasible, 19

we argue later in this section (after Equation (16)) that the same procedure can be applied as a good approximation in the case of some general impatience distributions. A plausible alternative to the exponential distribution with rate γ 0 is a delayed exponential distribution where each customer waits for a given delay and then reneges with an exponential rate. In the appendix, we present simulation experiments that give further support regarding the quality of the exponential approximation. In what follows, we compute the distribution of the virtual delay Dn as a function of γ 0 . The time until our new arrival is scheduled to start service is the time it takes for the n customers ahead to leave the queue (either abandon or enter service) plus the time required for a service completion (when all servers are busy). So, the virtual delay Dn can be characterized by the pure-death process as shown in Figure 5. s-1

s



s+n

s+n-1 sµ+



n γ’

Figure 5: The random variable Dn

The random variable Dn is the downcrossing time from state n + s until absorption in state s−1. It represents the time it takes to empty the queue for our customer, plus a service completion. Thus, the distribution of Dn is the convolution of n + 1 independent exponential distributions with parameters sµ, sµ + γ 0 , ..., and sµ + nγ 0 , which is an hypoexponential distribution. From Ross (1997), the pdf and CDF of Dn are given by ! Ã n n X Y sµ + jγ 0 0 · (sµ + iγ 0 ) · e−(sµ+iγ )t , gn (t) = 0 (j − i)γ i=0 j=0, j6=i Gn (t) = 1 −

n X i=0

Ã

n Y

sµ + jγ 0 (j − i)γ 0 j=0, j6=i

(11)

! 0

· e−(sµ+iγ )t ,

t ≥ 0.

(12)

(Note that by convention, an empty product is equal to 1). To answer the question of how γ 0 should be computed, it is natural to relate the new customer patience to her original one and to the announced delay. To do so, we define for each new arrival finding n customers in queue, the conditional probability that this customer will renege, given that she does not balk after hearing her anticipated delay. Recall that we will consider three situations in our analysis: the no-update case where we denote this conditional probability by rnN , and the update case where we denote it by rnU , and the mixed case where we denote it by rnM . The no-update case is one where the patience threshold of our customer (denoted by customer k) does not change, that is, she was willing to wait up to a duration 20

tk and she still has that threshold even after being informed about her anticipated delay. The update case is one where the delay information affects the customer’s initial patience, and the delay value dn we announce to her substitutes her initial patience threshold tk . The mixed case combines the behavior in the other two cases, and is analyzed in the appendix. We show there that the resulting conditional probability that the customer will renege, rnM , will lie between the corresponding probabilities of the update and no-update cases. For the no-update case, the quantity rnN is the conditional probability that the queueing delay, Dn , exceeds the random initial patience, T , given that our customer elects to join the queue, i.e., given that the random patience exceeds the announced delay, dn . It is given by rnN = P (T < Dn | T ≥ dn ).

(13)

For the update case, the quantity rnU is the conditional probability that the queueing delay, Dn , exceeds the announced value, dn , given that our customer elects to join the queue, i.e., given that the random patience exceeds the announced delay, dn . It is given by rnU = P (dn < Dn | T ≥ dn ).

(14)

With a little thought, it should be clear that rnU is equal to 1−β, which is the probability that the call center misinforms the customer. We will present necessary details for calculating both rnN and rnU in Section 6.2, and will prove that rnN is lower than rnU . Intuitively, this can be seen as follows: Whenever we announce dn based on a given coverage probability β, we shall make at worst a mistake with a chance of 1 − β. Consider a new arrival finding n customers in queue. Assume that the customer is willing to wait tk for service to begin. The duration tk is a random realization of the random variable T . If tk > dn , then the customer joins the waiting line. After joining the queue, the probability that the time it takes for a server to become free (for the customer of interest) exceeds dn is 1 − β. Since the customer is initially willing to wait up to tk , hence knowing that tk > dn , the probability that the duration tk passes before a server becomes free for our customer is less than 1 − β. In what follows, we give a method to compute the new reneging rate γ 0 for both cases. For presentation issue, we will differentiate between the notations of the quantities rnN and rnU only if necessary. Otherwise, i.e., if what we are considering holds for both cases, we will denote both of them by simply rn . Assume we reach the stationary regime, and let λR be the mean rate of abandoning customers. Applying PASTA, this quantity equals that seen by a new arrival. We get, by averaging on all possibilities, λR =

∞ X

λ(1 − α0 )(1 − pB (n))p(s + n)rn ,

n=0

21

(15)

where p(n), n ≥ 0, are the stationary probabilities of the number of customers in system seen at a random instant. We will compute these quantities in Section 6.2. On the other hand, if we denote by Lq the expected number of customers in queue, we can write using the exponentiality of customer impatience λR = γ 0 Lq .

(16)

We shall give the expression of Lq in Section 6.2. Note that Equation (16) implies a linear relationship between the reneging probability and average waiting time in queue, which is exact in the case of an exponential patience distribution as assumed here. While this relationship does not hold theoretically for general patience time distributions, Mandelbaum and Zeltyn (2004) show that practically this linearity is preserved for many general distributions, including the delayed exponential, under moderate loads. This suggests that our approach to calculate γ 0 provides a good approximation under some non-exponential patience times. Combining Equations (15) and (16) we get γ0 =

∞ λ X (1 − α0 )(1 − pB (n))p(s + n)rn . Lq n=0

(17)

The quantities Lq , p(s + n), pB (n) and rn are functions of γ 0 . So, denoting the right hand side in Equation (16) by a continuous function f in γ 0 , we may write γ 0 = f (γ 0 ). As a consequence, we state that γ 0 is a point mapped to itself by the function f . In mathematical terms, γ 0 is said to be a fixed point of f . To numerically compute γ 0 , we propose the following fixed point algorithm. Fixed point algorithm() Initialization:

γ 0(0) ← γ, i ← 0, ²

Do i←i+1 P B 0(i−1) )) · p(s + n)(γ 0(i−1) ) · rn (γ 0(i−1) ) λR (i) ← ∞ n=0 λ · (1 − α0 ) · (1 − p (n)(γ P (i) 0(i−1) ) Lq ← ∞ n=1 n · p(s + n)(γ (i)

γ 0(i) ← λR (i) /Lq

While | γ 0(i) − γ 0(i−1) |> ² γ 0 ← γ 0(i) End Algorithm. We note that in the algorithm above, p(s + n)(γ 0(i−1) ) is a functional form with γ 0(i−1) as the argument. We do not prove the convergence and uniqueness of the solution of the fixed

22

point algorithm, but a large numerical experience with the algorithm is encouraging in this respect. Clearly the reneging parameter γ 0 depends on the choice of β. On the one hand if β increases, the announced delays will increase. Then, for both the no-update and update cases, the set of customers that will elect to wait is smaller, by excluding the customers with low patience thresholds. As β increases, the rate γ 0 will thus decrease. This is confirmed later in the numerical experiments in the appendix. In the particular case, β = 0, one may see that Model 2 under the no-update case coincides with Model 1 in terms of the distribution of times before reneging (γ 0 = γ). Consider Model 2 under the no-update case and let β = 0. Then, we will announce dn = 0 for any new arrival (finding a busy system and n waiting customers in queue, n ≥ 0). No customer will balk, i.e. pB (n) = P (T < dn ) = 0. In addition, a new customer who chooses to join the queue has a probability rnN = P (T < Dn | T ≥ dn ) = P (T < Dn ). In this particular case, the expression of the mean abandoning rate λR , see Equation (15), coincides with that in the case of an exponential patience with parameter γ. We then deduce that γ is the fixed point of f (solution of γ 0 = f (γ 0 )). In words, it means that we are accepting all arrivals in Model 2. Since these arrivals will keep their initial patience thresholds, we exactly reproduce the initial reneging distribution (exponential with rate γ).

6.2

Performance Analysis

We start by defining the birth-death process as shown in Figure 6. Birth and death rates are both state-dependent. The new element here is that we have to take into account the state-dependent balking decisions when the process moves from state i to state i + 1, for i ≥ s, i.e., all servers are busy. λ

µ

λ

λ

1

0



(1 − α ) (1 − p 0

s

s-1 sµ

B

λ

(0))

(1 − α ) (1 − 0

B p (1))

s+2

s+1 sµ+ γ’



sµ+2 γ’

Figure 6: Birth-death process for Model 2

In the stationary regime, we get the following steady state probabilities seen at a random instant λi λi (1 − α0 )i−s p(i) = p(0), for 0 ≤ i ≤ s, and p(i) = i! µi s!µs

à i−s ! Y 1 − pB (j − 1) j=1

s µ + j γ0

p(0), for i > s, (18)

23

where p(0) is the stationary probability to have no customers in the system. It is given by à s à i−s !!−1 ∞ X λi X λi (1 − α0 )i−s Y 1 − pB (j − 1) p(0) = + . (19) i s 0 i! µ s! µ s µ + j γ j=1 i=0 i=s+1 Hence, the probability of immediate service, P I , the mean number of customers in queue, Lq , and the mean number of customers in system, Ls , can be calculated as in Relation (20). I

P =

s−1 X

p(i),

Lq =

∞ X

i=0

ip(s + i),

and Ls =

i=1

∞ X

ip(i).

(20)

i=1

Having the expression for Lq , it only remains for us to compute the quantity rn so that we can apply the fixed point algorithm, shown in Section 6.1, to get γ 0 . In what follows, we give closed-form expressions of rn in the update and no-update cases. Let us start by the update case, rnU . It is the simpler case. Equation (14) may be rewritten as

P (Dn < dn and T ≥ dn ) . (21) P (T ≥ dn ) Since the random variables T and Dn are independent, we have P (Dn < dn and T ≥ dn ) = rnU =

P (Dn < dn ) · P (T ≥ dn ). Thus rnU = 1 − β,

(22)

which agrees with intuition as explained in Section 6.1. For the no-update case, we use Equation (13) to rewrite rnN as follows. rnN =

P (dn ≤ T < Dn ) . P (T ≥ dn )

(23)

Since T is exponentially distributed with rate γ, the denominator in the right hand side of Equation (23) is simply P (T ≥ dn ) = e−γdn .

(24)

As for the numerator, it is given by Z ∞ Z P (dn ≤ T < Dn ) = gn (t) · P (dn ≤ T < t) dt = dn



¡ ¢ gn (t) · e−γdn − e−γt dt.

(25)

gn (t) · e−γt dt.

(26)

dn

Calculating further, we get

Z −γdn

P (dn ≤ T < Dn ) = (1 − β) · e



− dn

Next, observing that à n Z ∞ n X Y gn (t) · e−γt dt = dn

! Z ∞ sµ + jγ 0 0 0 · (sµ + iγ ) · e−(sµ+γ+iγ )t dt 0 (j − i)γ dn i=0 j=0, j6=i à ! n n X Y sµ + jγ 0 sµ + iγ 0 −(sµ+γ+iγ 0 )dn = · · e , 0 0 (j − i)γ sµ + γ + iγ i=0 j=0, j6=i 24

(27)

and coming back to Equation (23), we finally state that ! Ã n n X Y sµ + jγ 0 sµ + iγ 0 0 rnN = 1 − β − · · e−(sµ+iγ )dn . 0 0 (j − i)γ sµ + γ + iγ i=0 j=0, j6=i

(28)

Also, we show through Proposition 1 our intuitive claim about the comparison between the probabilities rnN and rnU . All proofs can be found in the Appendix. Proposition 1 For β ∈ [0, 1], the conditional probability to renege in queue given the system state under no-update, rnN , is lower or equal to that under update, rnU . The equality only holds for β = 1. In the latter case, we have rnN = rnU = 0. As a conclusion, a system with announcement in which the original patience times of customers do not change (no-update) leads to less reneging than that where customers substitute their original patience times by the delays we announce to them (update). A mixture of these two behaviors, as modeled by the mixed case leads to an intermediate level of reneging. We want to note here that under the no-update case there is really no change in customer behavior, although the new patience distribution is different from the initial one. Customers who self-select to wait have higher thresholds of patience. These thresholds are identical to the original ones. So, times before reneging in the no-update case are simply sampled from a subset of the original ones. Assuming further that the new times before reneging are exponentially distributed, we conclude that γ 0 < γ. This will be confirmed later in the numerical experiments. The probability to balk, to enter service, and to renege are P B , P S , and P R , respectively. A customer who finds at least one server idle will immediately start service (balks with probability 0). If she finds all servers busy and n waiting customers, n ≥ 0, she balks with probability α0 because she is not willing to wait any duration of time. If not, she will balk with probability pB (n). Then, a customer who finds all servers busy and n waiting customers, will balk with probability α0 + (1 − α0 )pB (n). Averaging on all possibilities and applying PASTA, we get P

B

∞ X = (α0 + (1 − α0 )pB (n))p(s + n).

(29)

n=0

In the same manner as in Section 5, the stationary probability to renege is given by PR =

γ 0 Lq . λ

(30)

Finally, we remark that a customer who does not balk and does not renege will necessarily enter service. Hence, the stationary probability to enter service is P S = 1 − P B − P R . 25

We carry on computing the performance measures of interest, namely the analysis of the conditional waiting time in queue, given that service is completed. We denote by E(X k | S) the kth order moment of that random variable. Following again a similar analysis as that in Section 5, we have P∞ k

E(X | S) =

n=0 (1

k − α0 )(1 − pB (n))p(s + n)Ψn+1 E(Xn+1 ) , S P

(31)

where Ψn+1 is the probability that a new arrival who joins the queue and finds n waiting customers ahead of her in queue does not renege until the start of service. It is given by Ψn+1 =

n+1 Yµ i=1

γ0 1− sµ + iγ 0

¶ ,

(32)

and Xn , n ≥ 1, denotes the time it takes to empty the queue of n waiting customers (without considering eventual future arrivals), and E(Xnk ) denotes its kth order moment, k ≥ 1. As in Model 1, the distribution of Xn is the convolution of n independent exponential distributions with parameters sµ + γ 0 , sµ + 2γ 0 , ..., and sµ + nγ 0 . The expected value and the second moment of Xn are respectively given by the same expressions as in 10 where γ is replaced by γ 0 .

6.3

Stochastic Comparison

Understanding the relationship between announcement coverage and customer related performance measures is important. We next provide a stochastic comparison result, that allows us to state the direction of this relationship. Proposition 2 Consider Model 2 under any case, update or no-update. For two systems a and b having identical parameters but with different announcement coverage βa > βb , we have PaB > PbB and PaR < PbR . In words, increasing the announcement coverage reduces the reneging of customers at the expense of additional balking. The numerical analysis below will help us understand the role the announcement coverage β plays on performance further, and compare to performance under Model 1 when no delay announcements are made.

7.

Numerical Analysis

In this section we analyze the sensitivity of the optimal announcement coverage to various system parameters and compare the performance of the models with and without delay 26

information. The parameters of Model 1 are λ, s, µ, α0 , α1 , γ and γ1 , while those of Model 2 are λ, s, µ, β, α0 and γ. Throughout this section, we set α0 to 0. In other words, we consider numerical examples where the initial balking probability α0 is negligible. We expect this choice to have no effect on the results below in the qualitative sense. As a consequence, we only consider the second type of balking. Also in Model 1, we do not consider the possibility of higher impatience, i.e., γ1 = 0. We choose to capture customers’ dislike for uncertainty, in case of a system with no information about anticipated delays, only by a higher initial abandonment (modeled as balking in our setting). To analyze and compare Models 1 and 2, we build two frameworks of comparison. The first is based on system throughput and is investigated in Section 7.1. The second is based on an economic model and is presented in Section 7.2.

7.1

Flow Framework

In what follows, we compare the two models using a flow framework. We compare the throughput, or equivalently the probability to enter service. A coherent comparison requires computation of the optimal announcement coverage, β ∗ , that maximizes the probability to enter service, P2S in Model 2. Let us first consider Model 2 under the no-update case. Consider two systems. The first is Model 2 with β = 0 (all customers join the system) and the second is Model 2 with β > 0. Then more customers are accepted in the first system because some customers will immediately balk in the second one. In the first system, some of these customers (who have balked in the second system) may not renege (due to uncertainty the system may be available to serve them before they would renege). So, more customers enter service in the first system than in the second. In Model 2, β = 0 maximizes P2S , β ∗ = 0. As already mentioned at the end of Section 6.1, the threshold patience of a customer who elects to wait (in Model 2 under the no-update case and with β = 0) is exponentially distributed with rate γ. As a conclusion, Models 1 and 2 coincide for α1 = 0 and Model 2 performs better than Model 1 for any α1 > 0. For a more interesting comparison, we next focus for Model 2 on the customer reaction following the update case. If β is small, customers tend to join the queue but they will renege very quickly after. So, P2S will be very low. In the extreme case (β approaches 0), only customers who find an idle system will be served. On the other hand if β is large, only a few customers will elect to join the queue. Again in this second extreme case (β approaches 100%), only customers who find an idle system will be served. As a consequence, there

27

should an optimal β in between, β ∗ ∈ (0, 100%). In other words, β ∗ is the tradeoff between two situations. In the first situation, we provide low anticipated delays to the detriment of having a lot of reneging. In the second situation, we decrease the reneging: we ensure high coverage probability by providing high values of anticipated delays to the detriment of having a lot of immediate balking. Under a flow framework, we ask whether we really need to provide information about delays. We consider Model 2 working under the optimal value of β. We vary the balking parameter α1 in Model 1 from 0% to 100%, and compute the associated probability of being served P1S . It is not surprising that P1S is continuous (from Equations (6) and (7), and knowing that P1S = 1 − P1B − P1R ) and strictly decreasing in α1 . The maximum of P1S is reached for α1 = 0, and its minimum is reached for α1 = 100% (all customers finding a busy system will balk, i.e., Model 1 have no queue). Furthermore for α1 = 0, P1S > P2S for any β > 0 because we accept fewer customers in Model 2. For β < 100%, P2S > 0. As a consequence, there exists a unique threshold value α10 belonging to (0, 100%) such that any value lower or equal to that value will imply P1S ≥ P2S , and above that value will lead to P1S < P2S . In other words, α10 is the threshold below which Model 1 has not that much balking. For this range of values, one would prefer not to announce delays because Model 2 performs worse. For Models 1 and 2, we consider six systems with increasing levels of pooling. The common parameters are (s, λ) = (3,3), (5,5), (10,10), (20,20), (50,50), and (100,100). For all cases, µ = 1, γ = 0.5, α0 = 0, and γ1 = 0 (for Model 1). We purposely choose highly loaded systems in order to see the effect of reneging. For each set of parameters, we consider Model 2 working under the optimal announcement coverage β ∗ , found numerically, given in percentages and rounded up to integer values. The results of the comparison are presented in Table 1. For each example, we give for Model 1 the corresponding threshold value α10 , found also numerically, given in percentages and rounded up to integer values. Table 1: Comparison of throughput s, λ = 3 s, λ = 5 s, λ = 10 s, λ = 20 s, λ = 50 s, λ = 100

Model 1 α10 P1S 31% 75.110% 24% 80.630% 17% 86.260% 11% 90.460% 6% 94.140% 4% 95.899%

Model 2 β∗ P2S 83% 75.241% 89% 80.637% 94% 86.360% 97% 90.503% 99% 94.155% 99% 95.939%

From Table 1, we see that α10 is relatively small for large systems. The reason is that large 28

systems have good performance due to the pooling effect. By increasing pooling in Model 2, we accept more customers because waiting times and particulary their variances become shorter and shorter. The latter allows us, indeed, to provide higher coverage probability in our announcements while improving at the same time the probability to enter service. This explains the increase of β ∗ in the system size. The minimum value of the balking probability α10 in Model 1, that makes Model 2 better in terms of throughput, decreases as a consequence in the system size. These examples illustrate that from a flow perspective, managers of small call centers may prefer to have some balking, and if the balking is sufficiently low, may prefer not to announce anticipated delays to customers. On the other hand, they would prefer large call centers with delay information. The latter has less lost customers than that without delay information under realistically achievable non-zero immediate balking probabilities in settings with no announcement.

7.2

Economic Framework

In this section, we use a more robust framework of comparison by considering the loss of goodwill costs. We define an economic framework in which we introduce different penalty costs for lost customers, those who balk and those who renege, and differentiate between these based on whether the customers were being informed about delays or not. The penalty costs can be seen as a valuation of the customer loss of goodwill (and potentially loss of business) generated by balking and reneging customers. Consider a customer in Model 1 who is not immediately served, and let us define the cost parameters that the service provider may incur. This customer may balk because of non immediate service (with probability α0 ). The service provider then incurs cB,0 1 . If not, she may balk because of no delay announcement (with probability α1 ). The service provider R then incurs cB,1 1 . If not, she may renege while queueing. The service provider then incurs c1 .

The second type of balking, i.e., due to no delay announcement in Model 1, should be more costly than the first one due to non immediate service. Lost customers under the second type of balking decide to leave the system because they anticipate that they will experience a long delay of uncertain duration before entering service. At the epoch of their call, they desire to obtain the service and are willing to wait for it, albeit not for long and possibly not for an unknown duration. This is different from lost customers’ perception under the first type of balking. The latter have no idea about the system state with regard to its availability for them, or do not consider this in their decision. They are simply unwilling to

29

wait at the epoch of their call. It is the difference in initial willingness to wait (irrespective of subsequent delays) that distinguishes the two types of balking. We assume cB,0 < cB,1 1 1 since the second type of customer is foregoing more by balking. While this assumption is consistent with our distinction between α0 and α1 , it remains to be tested and verified in practice. A customer who balks (under any type of balking) in Model 1 should have a higher probability to call back than that of a customer who reneges. A reneging customer leaves the system with frustration due to the time lost, potentially also losing trust in the service provider. However, a balking customer leaves the system without waiting. This customer may think that the system will be able to serve her later within a reasonable delay, whereas the customer who leaves the system by reneging has already experienced a delay perceived to be too long. Consistent with these observations, we assume that cB,0 < cB,1 < cR 1 1 1. The second inequality implies that customers attach an opportunity cost to the time spent waiting before reneging, which is lost once they renege. While a consistent valuation of time by customers has been challenged in the literature (as reviewed by Bitran et al. (2008), page 67), our qualitative results which only depend on the direction of the relationship and not on the absolute values of these costs should not be affected by these findings. Similarly for Model 2, we pay cB,0 for a customer who balks because of non immediate 2 service. If she balks based on the delay announcement we provide her, we pay cB,1 2 . If she reneges while queueing, we pay cR 2 . With the same reasoning as that for Model 1, the second type of balking in Model 2 (based on delay information) should be more costly than the first one (with probability α0 ), cB,0 < cB,1 2 2 . Also in Model 2, a reneging customer should cost more than a balking customer (any type of balking). A balking customer leaves the system based on information provided. This information would avoid loss of business because it is perceived as an invitation to call back later. A reneging customer leaves because the wait is perceived as too long, possibly because the announced delay has been exceeded. That such a customer should feel less satisfaction is consistent with evidence in Katz et al. (1991) and Kumar et al. (1997). So, we assume cB,0 < cB,1 < cR 2 2 2 . In Figures 7 and 8, we give a presentation of the cost parameters in Models 1 and 2, respectively. Let us now compare the two models for each type of cost. In a real setting, customers who are unwilling to wait any amount of time (first type of balking with probability α0 in both models) should have very similar loss of goodwill when being lost. Their experience will not lead them to distinguish between Model 1 and 2. Hence, it should be plausible to B,1 choose cB,0 = cB,0 should 1 2 . By focusing on the second type of balking, one may see that c1 B,1 be higher than cB,1 > cB,1 2 , c1 2 . The reason is basically related to a higher uncertainty

30

Immediate service (IS)

Immediate service (IS)

Non IS

Balking because of non IS (α0), c1B,0 Non IS

Balking because of no delay announcement (α1), c1B,1

Balking because of non IS (α0), c2B,0 Balking because of delay announcement, c2B,1

Reneging while queueing, c1R

Reneging while queueing, c2R

Being served

Being served

Figure 7: Costs in Model 1

Figure 8: Costs in Model 2

(or lack of information) in Model 1 when comparing this type of lost customers to that in Model 2 (see for example Katz et al. (1991), Hui and Zhou (1996)). However, consistent with Kumar et al. (1997), a customer who reneges in Model 2 would be unhappier than that in Model 1 because she has been misinformed about her queueing delay. To reflect this, we R assume cR 1 < c2 . In Table 2, we summarize the comparison between the costs parameters

we described above. Table 2: Ordering relations between the cost parameters of Models 1 and 2 Model 1 Model 2

Balking cB,0 < cB,1 1 1W < k B,0 c2 < cB,1 < 2

Reneging R cV 1 cR 2

Recall that we set α0 to 0, see Section 7.1. For both models, costs related to the first type of balking are nonexistent. For presentation issues, we then substitute the notations B by cB and cB,1 cB,1 2 1 1 and c2 , respectively. Also we set α1 to 10%.

In the following analysis, we consider Model 2 under the no-update case and investigate four issues. The first three only concern Model 2: the effect of the announcement coverage β on the total loss of goodwill cost of the call center (balking + reneging costs) leading to the determination of the optimal announcement coverage, β ∗ , which minimizes the total operating cost. The second issue is the effect of the assumed cost parameters on β ∗ . The third one is the effect of pooling on the performances of Model 2 and on β ∗ . The last issue deals with the comparison between Models 1 and 2, where we assume that Model 2 is being operated under the optimal announcement coverage. Effect of β on total cost: This section numerically illustrates the stochastic comparison result in Section 6.3, and introduces the notion of an optimal announcement coverage β ∗ under an economic framework.

31

As we will see, increasing β is not always better. So, there is a need to control this parameter. Let s = λ = 5, the service rate be µ = 1, and the abandonment rate be γ = 0.5. We label this example as the base case. We vary the announcement coverage β from 50% up to 99%. For each β, we build up the model with announcement by computing the new reneging rate γ 0 , as explained in Section 6.1. We assume that the call center incurs cB 2 = 1 for each customer who balks, and three times the latter cost for each customer who reneges cR 2 = 3. We compute for each β the stationary probabilities of balking and reneging. We also compute the stationary cost (per unit of time) of balking, that of reneging and the total B one, say C2B , C2R and C2T , respectively. These quantities are calculated as C2B = cB 2 λP , R C2R = cR and C2T = C2B + C2R . In Figure 9, we plot the curves of the different costs as a 2 λP

function of β. Costs for Model 2

1.40 1.20 1.00 0.80

B

C2 C(BK)

0.60

C(R) C2

R

T C(TOT) C 2

0.40 0.20

beta β

0.00 50%

60%

70%

80%

90%

100%

Figure 9: Costs for Model 2 as a function of β, s = λ = 4.5

As shown in Proposition 2, we see from Figure 9 that P B (P R ) is strictly increasing (decreasing) in β. A better announcement coverage leads to more balking upon arrival and less reneging for customers who elect to join the queue. The total cost is unimodal and its minimum is reached at β ∗ = 83% with C2T ∗ = 0.916. Effect of the costs parameters on the optimal announcement coverage β ∗ : In Figure 10, we plot for Model 2 the total cost as a function of β, for different cost parameters. R We keep the balking cost unchanged, cB 2 = 1, and vary the reneging cost, c2 = 1.25, 1.5, 2,

3, 5 and 10. From Figure 10, we see that the optimal announcement coverage is increasing in the reneging cost, varying between 75% and 95%. The balking cost does not change since cB 2 is held constant. However the reneging cost and its relative weight in the total cost increases in cR 2 . The higher the cost of reneging, the more the announcement coverage should be increased. The analysis illustrates how β controls the tradeoff between balking 32

and reneging, and points out the importance of controlling this parameter as a function of customer characteristics (captured through costs herein). 1,75

Total Cost for Model 2 c 2 =1.25, βbeta*=75% * =75% C'2/C2=1.25, R

c 2 =1.5, βbeta*=80% * =80% C'2/C2=1.5, R

1,50

c 2 =2, βbeta*=81% * =81% C'2/C2=2, R

* =83% c 2 =3, βbeta*=83% C'2/C2=3, R R

β

c 2 =5, beta*=84% * =84% C'2/C2=5,

1,25

R c 2 =10, β * =95%

C'2/C2=10, beta*=95%

1,00

0,75 70%

β beta 80%

90%

100%

Figure 10: Total costs for Model 2 as a function of β, s = λ = 5

Effect of the system size on the optimal announcement coverage β ∗ : We again consider (as in Section 7.1) six systems with increasing size (s, λ) = (3,3), (5,5), (10,10), R (20,20), (50,50), and (100,100). For all cases, µ = 1, γ = 0.5, α0 = 0, cB 2 = 1 and c2 = 3.

Recall that α1 = 10% and γ1 = 0. For each set of parameters, we consider the model corresponding to the optimal announcement coverage β ∗ , found numerically, given in percentages and rounded up to integer values. The results are displayed in Tables 3 and 4. We compute various performance measures for Model 2 but also for the corresponding Model 1, which will be used in the subsequent comparison of the two models. For simplicity, we choose equal costs for the second type of B balking customers, i.e., cB 1 = c2 . Note that this plays in favor of Model 1.

Observe that the optimal announcement coverage is increasing in system size. Even though β ∗ is increasing, the delay that is announced with n customers waiting in queue decreases in system size. This is shown in Figure 11, where we use the virtual delay distribution in Section 6.1 with the corresponding γ 0 and β ∗ from Tables 3 and 4 to obtain the delay announcement for a given queue size n. We observe that as the number of servers increases, β ∗ approaches 100% with both the probability of balking and reneging going to zero. However the relative decrease in the reneging probability (when moving from a system to a larger one) is greater than that in the balking probability. The shorter waiting times coupled with higher coverage probability in the announcements as we pool, ensure that fewer customers exceed the announced delays, thus reducing the probability of reneging. The resulting γ 0 is decreasing. Balking is also affected by shorter delay announcements, but does not benefit from the increased coverage probability since customers who balk do not elect to wait in the first place. In fact as β ∗ increases the decrease in the announced delay is much slower 33

(as seen from the shape of the curves in Figure 11) so that the relative benefit in terms of reduced balking is also slower. Due to this relative difference, the minimum cost is reached for a higher value of β that favors a faster decrease in reneging probability as system size B increases. The increase in β ∗ as a function of system size would be higher for lower cR 2 /c2

ratios. Table 3: (s, λ) = (3,3), (5,5) and (10,10)

PI Lq Ls PS PR PB E(X | S) σ(X | S) CB CR CT Decrease in C T

s = 3, λ = 3 β ∗ = 82%, γ 0 = 0.033 Model 1 Model 2 0.393 0.431 0.884 0.619 3.260 2.935 0.792 0.772 0.147 0.007 0.061 0.221 0.283 0.263 0.444 0.471 0.182 0.664 0.884 0.061 1.066 0.725 — 31.996%

s = 5, λ = 5 β ∗ = 83%, γ 0 = 0.024 Model 1 Model 2 0.419 0.439 1.089 0.881 5.254 5.006 0.833 0.825 0.109 0.004 0.058 0.171 0.214 0.211 0.336 0.359 0.290 0.853 1.089 0.063 1.379 0.916 — 33.552%

s = 10, λ = 10 β ∗ = 83%, γ 0 = 0.020 Model 1 Model 2 0.456 0.443 1.400 1.388 10.156 10.178 0.876 0.879 0.070 0.003 0.054 0.118 0.140 0.157 0.225 0.250 0.544 1.182 1.400 0.083 1.945 1.265 — 34.943%

Table 4: (s, λ) = (20,20), (50,50) and (100,100)

PI Lq Ls PS PR PB E(X | S) σ(X | S) CB CR CT Decrease in C T

s = 20, λ = 20 β ∗ = 84%, γ 0 = 0.018 Model 1 Model 2 0.496 0.446 1.727 2.101 19.855 20.418 0.906 0.916 0.043 0.002 0.050 0.082 0.087 0.114 0.146 0.174 1.009 1.647 1.727 0.111 2.736 1.758 — 35.753%

s = 50, λ = 50 β ∗ = 86%, γ 0 = 0.012 Model 1 Model 2 0.557 0.447 2.098 3.545 48.836 50.931 0.935 0.948 0.021 0.001 0.044 0.051 0.043 0.075 0.078 0.110 2.213 2.572 2.098 0.126 4.311 2.698 — 37.417%

s = 100, λ = 100 β ∗ = 86%, γ 0 = 0.010 Model 1 Model 2 0.612 0.446 2.248 5.215 97.239 101.563 0.950 0.963 0.011 0.0005 0.039 0.036 0.023 0.054 0.046 0.078 3.885 3.604 2.248 0.142 6.133 3.746 — 38.910%

Comparison between Models 1 and 2: We again consider Tables 3 and 4, and compare Models 1 and 2. The latter is assumed to be operating under the optimal announcement coverage. The rows C B , C R and C T are to indicate the costs for balking, reneging and the total one, respectively. The last row gives the relative decrease in the total cost in going from Model 1 to Model 2,

C1T −C2T C1T

. From the numerical experiments, we see that all performance 34

Figure 11: Delay announcements as a function of s, for given n

measures are improving as a function of system size for both models. This is the well known pooling effect in queues. Since the optimal β ∗ for our examples here are quite high and α1 is not that high, we have more balking in Model 2 in all cases except for s = λ = 100. In the latter, waiting times are low because of the pooling effect. Then, balking as a response to delay announcement (in Model 2) is low so that the balking related to α1 (in Model 1) becomes relatively more important. We also see that the probability of reneging is lower for Model 2 throughout. The reason is that Model 2, as it were, “refuses” entry for customers who potentially may renege. If α1 were chosen to be 0, one would see for all system sizes, but especially for small systems, that the probability of service is a bit better in Model 1. The reason is that some of the customers who balk in Model 2 might wait in Model 1 until service begins, without reneging. Furthermore, the conditional waiting time would be better in Model 2 (less customers are served in Model 2). For our numerical examples here (α1 = 10%), we see that for small call centers, the probability of service (P S ) is better for Model 1. For large call centers, the opposite is true. The reason is that balking is high in Model 2 for small call centers. As call centers size grows, waiting times improve. So, balking decreases in Model 2 because the delays we are announcing are decreasing. The conclusions about the comparison of conditional waiting times are the opposite to those of the probability of service. This is expected since when the probability of service is the best in one model, then more customers are present in the latter (in the statistical sense). While having the same capacity, this model will perform the worst in terms of waiting times before being served. For the cost parameters chosen here, we see that the balking cost is lower in Model 1 for all cases except for the case s = λ = 100. The explanation is the same as that for the comparison between balking probabilities (same balking cost for both models). Although the 35

3.00

Costs for Model 2, update

2.50

B

C(BK) C2

R C(R) C2

2.00

T C(TOT) C 2

1.50 1.00 0.50 0.00 50%

beta β

60%

70%

80%

90%

100%

Figure 12: Costs for Model 2, update

reneging cost parameter is higher for Model 2, reneging costs are always better for Model 2 because β ∗ is high (so little reneging). We observe that the total cost is always better for Model 2 than that in Model 1. In addition, increasing the pooling effect makes Model 2 further preferable. The relative difference between the two models’ total costs is indeed increasing in the system size. We conclude that Model 2 is profiting more from the pooling effect than Model 1 is. Numerical Analysis under the update case: The results presented up to this point are still valid in the update case, where customers substitute their patience threshold by the delay we announce to them. Let us again consider Model 2 in the base case with update type customers. In Figure 12, we plot the costs as a function of β. From Figure 12, we see the same qualitative results as those for the no-update case. The difference here is that γ 0 may be greater than γ for low values of β. The conditional probability for a customer to renege given that she elects to wait is 1 − β, which can approach 1 for low β. The new reneging rate γ 0 will converge to 0 as β goes to 100%. From Figure 12, we deduce that the optimal announcement coverage is β ∗ = 98%, higher than the one found under the no-update case. In general the optimal value β ∗ is larger for the update case than that of the no-update case. This is due to the fact that the reneging rate in the update case is higher than that of the no-update case, see Proposition 1.

8.

Concluding Remarks

Our analysis has formulated and characterized the performance measures of a call center model where anticipated delays are announced to customers upon arrival, who may then balk in response. Announcements are made as a chosen percentile of the delay distribution. For

36

customers who opt to wait, if realized waiting times exceed those that have been announced, a customer further reacts by reneging, where this reneging rate is modeled to be different from the one in an equivalent system without delay announcement. This latter feature, as well as the possibility of announcing a percentile of the delay distribution, different than the mean delay, are key distinguishing features of the model. The numerical analysis illustrates tradeoffs between reneging and balking that have to be made in choosing the announcement percentile, and illustrates the role cost and system size parameters play on this choice. It is shown that more coverage is not necessarily better for the service provider and that announcement coverage has to be carefully controlled in the presence of different customer reactions. This additional control, if properly used, provides managers with a means of improving costs and reaping higher benefits from pooling. Considering different performance measures as comparison points, conditions are established under which the model with delay information is preferred. This analysis highlights that in some settings, particularly in smaller call centers, managers may prefer not to provide delay information. The fundamental tradeoff between balking and reneging in a call center with delay information (selection of the announcement percentile) is similar to that in queueing systems with finite waiting space (selection of the queue size). In such systems, less waiting space means more balking customers; but at the same time it also leads to lower waiting time for those who get access (and if reneging is present, it leads to lower reneging customers). In future work, it would be interesting to empirically describe customers’ reactions in response to delay announcements, in order to validate our modeling of that reaction herein. In particular the heuristic modeling of the reneging reaction, through customers of type no-update, update and mixtures of the two, remains to be tested. Lab experiments that control for everything else and allow a direct comparison between the models with perfect, imperfect, and no delay information would be valuable in supporting assumptions pertaining to balking and reneging made in the analysis herein. Providing direct evidence from a call center setting for the cost relationships in the economic framework of Section 7.2 is another issue that remains to be explored. An ambitious extension of the current analysis is to consider non-stationary arrivals which would be an important issue in practice. Another extension worth exploring is one where balking or reneging customers may call back. Rather than capturing different implications of balking and reneging through assumptions on the costs, as was done in the current paper, one could concretely model the differences as different probabilities to call back for customers who have balked or abandoned in a setting with retrials. 37

Appendix Validation of the Exponential Approximation In this section, we investigate the quality of the approximation we consider for new patience times in Model 2. To get a tractable analysis, we assumed in Section 6.1 that new patience times are exponentially distributed with rate γ 0 and we developed a method to compute the parameter γ 0 . To assess the exponential assumption, we compare in what follows the performance measures of Model 2 derived by the numerical method (exponential approximation) with those given by simulation. We conduct this study for both the no-update and update cases. Consider the proposed Model 2 with update or no-update behavior, which we will call the ideal simulation. Under the no-update case, a customer who joins the queue will renege as soon as her waiting time reaches tk (initial random threshold). Under the update case, a customer who joins the queue will renege as soon as her waiting time reaches dn (the delay estimate). We underline a complexity associated with the delay we announce, dn . It is very hard to derive the exact distribution of Dn , based on which we determine the value dn with coverage probability β. We need to compute the convolution of the distributions of the remaining patience times of the customers waiting in queue for each customer. To simplify the analysis, we simulate an intermediate Model 2 in which we assume that Dn is exponentially distributed with rate γ 0 . To determine the value of γ 0 , one would think to apply a fixed point algorithm using simulation at each step. This is again very heavy to do. Instead we use the value of γ 0 derived from the numerical computation using the fixed-point algorithm from Section 6.1. We label the latter as the exponential approximation method in this appendix. In summary, we will simulate Model 2 such that we announce the same delay dn as in the exponential approximation model. Customers who join the queue will then renege exactly as assumed either under the no-update (after tk ), or the update case (after dn ). The resulting Model 2 that we simulate is intermediate between the ideal simulation and the exponential approximate model. We believe that the comparison between this model and the approximate exponential one (developed in Section 6.1) will give good indications about the quality of the exponential approximation made in the latter. We run simulations for several values of the system parameters. For all cases, µ = 1, γ = 0.5, α0 = 0 and γ1 = 0 (for Model 1). We consider (s, λ) = (5,5), (10,10), and (50,50). For each system, we choose the coverage probability β = 50% and β = 80%. For each β, the reneging rate, based on which we compute dn , is derived from the numerical approximation of Section 6.1. In Tables 5 and 6, we give the results of the comparison for the no-update 38

and update cases, respectively. In these tables, we present the performance measures derived by simulation and those derived by the numerical method (approximation) developed in the core of the paper. Table 5: Simulation vs exponential approximation, no-update case β γ0 PS PR PB E(X | S) σ(X | S)

s = 5, 50% 0.258 Sim App 0.833 0.844 0.050 0.030 0.117 0.126 0.211 0.254 0.336 0.372

λ=5 80% 0.033 Sim App 0.827 0.827 0.006 0.008 0.167 0.165 0.217 0.206 0.363 0.339

s = 10, 50% 0.318 Sim App 0.879 0.890 0.043 0.020 0.078 0.090 0.141 0.186 0.221 0.264

λ = 10 80% 0.029 Sim App 0.880 0.880 0.004 0.007 0.116 0.113 0.160 0.156 0.253 0.244

s = 50, 50% 0.454 Sim App 0.941 0.952 0.023 0.020 0.036 0.028 0.051 0.088 0.081 0.119

λ = 50 80% 0.017 Sim App 0.949 0.946 0.001 0.013 0.050 0.042 0.078 0.061 0.112 0.088

Table 6: Simulation vs exponential approximation, update case β γ0 PS PR PB E(X | S) σ(X | S)

s = 5, 50% 3.528 Sim App 0.769 0.775 0.189 0.181 0.043 0.044 0.037 0.033 0.094 0.072

λ=5 80% 0.668 Sim App 0.803 0.808 0.077 0.065 0.121 0.127 0.118 0.114 0.226 0.200

s = 10, 50% 6.762 Sim App 0.820 0.824 0.161 0.156 0.019 0.020 0.015 0.014 0.045 0.036

λ = 10 80% 1.106 Sim App 0.853 0.859 0.079 0.066 0.068 0.075 0.069 0.074 0.133 0.130

s = 50, 50% 32.464 Sim App 0.905 0.906 0.093 0.091 0.002 0.002 0.002 0.002 0.007 0.006

λ = 50 80% 4.170 Sim App 0.922 0.927 0.065 0.056 0.013 0.017 0.014 0.019 0.031 0.037

From Tables 5 and 6, we observe that the exponential approximation gives good results in both the no-update and update cases. This can be explained using the results derived in Zeltyn and Mandelbaum (2005). Under the Quality-Efficiency-Driven regime where delays are short, it is pointed out that the patience distribution near the origin determines the behavior of the system. Similar results are found for patience distributions with a positive density at the origin (Theorem 4.1 in Zeltyn and Mandelbaum (2005)), as well as for patience distributions with an atom at the origin (Theorems 4.2 and 4.3 in Zeltyn and Mandelbaum (2005)). In Theorem 4.3 of Zeltyn and Mandelbaum (2005), the authors consider a scaled balking increasing in the queue state, which is somewhat similar to what we consider in this paper, pB (n). They conclude that approximate performance measures depend only on the patience distribution near the origin. For small systems as those in our experiments, waiting times are also relatively short thus giving the main importance to the behavior of the patience distribution at the origin rather than in the tail. Since in addition to that the balking behavior is the same for the simulated and exponential approximated models, the exponential approximation has a good 39

quality. Recall that for both the no-update and the update cases the simulated model takes the random variable Dn to be the same as that in the approximation (based on the same value of γ 0 ). So, pB (n) is the same in the simulated and approximated models. Then, the patience distributions in both models have the same atom at the origin. Based on the results in Tables 5 and 6 we make some observations about the value of the new reneging rate γ 0 . For the no-update case, γ 0 is always lower than γ (As already explained in Section 6.2). We observe from 6 that for the update case γ 0 > γ in these examples. From Table 5, γ 0 increases in the system size for β = 50%, and decreases in the system size for β = 80%. This example illustrates how beta controls the tradeoff between reneging and balking, and how a low coverage probability choice as represented by the β = 50% case here can lead to an increased rate of reneging from the system despite the pooling effect. In the larger systems, we are accepting more customers with low patience thresholds, and having announced delays with β = 50%, more of these customers will have been misinformed which will thus lead to higher reneging rates from the system. For the update case in Table 6, even for β = 80% the system reneging rate cannot be reduced through pooling. This is because customers who decide to wait are updating their patience to the times we announce, which are lower than their original thresholds, thus leading to a situation resembling that described for the β = 50% case under no-update. Analysis of the Mixed Case In this section, we consider the mixed case. To analyze this case, it suffices to compute for each new arrival finding n customers in queue, the conditional probability that this customer will renege, given that he does not balk after hearing her anticipated delay. We denote this probability by rnM . We then apply the same analysis as that in Section 6 to get the performance analysis of Model 2 under the mixed case. We assume that we have a Bernoulli process. Each customer electing to join the queue has a probability q (q ∈ [0, 1]) to be an update type and 1 − q to be a no-update type, independently of all other events. This leads to rnM = q rnU + (1 − q) rnN ,

(33)

where rnU and rnN are given in Equations (22) and (28), respectively. Using Proposition 1 and Equation (33), one may easily state the following inequality rnN ≤ rnM ≤ rnU .

40

(34)

Proof of Proposition 1 We use the expressions of rnU and rnN given in Equations (22) and (28), respectively. For 0 ≤ i < n + 1, we have

1 1 > . (35) 0 sµ + γ + iγ sµ + γ + (n + 1)γ 0 Then, we derive for β < 1 the following inequalities for the last term in the right hand side of Equation (28). ! Ã n n X Y sµ + jγ 0 sµ + iγ 0 gn (dn ) 0 · · e−(sµ+iγ )dn > > 0. 0 0 (j − i)γ sµ + γ + iγ sµ + γ + (n + 1)γ 0 i=0 j=0, j6=i

(36)

where gn is the pdf of the hypoexponential distribution given in Equation (11). This finishes the first part of the proof about the strict inequality. In the extreme case β = 1, Equations (22) and (28) lead to rnU = rnN = 0. This means that all customers in both situations will not renege. The reason is that no customers enter the systems. All of them balk because we announce to them an infinite anticipated delay, dn = ∞ for n ≥ 0, so, pB (n) = P (T < dn ) = 1 for n ≥ 0.

2

Proof of Proposition 2 Let Ni (t), i = {a, b} be the number of customers in System i at time t. We use uniformization and coupling, and show by induction that Na (t) ≤ Nb (t), implying the desired orderings on the probability of balking and reneging in Systems a and b. All events are generated from the same Poisson process with rate Γ = λ + sµ + Kγ where K is chosen as a very large number, ensuring that the probability of having more than K customers in queue is negligible. The birth death rates provide the corresponding event probabilities. For example, an arrival occurs with probability λ/Γ, or a service completion with kµ/Γ in state k < s. Both systems are assumed to start out empty, Na (0) = Nb (0) = 0. For all states k < s arrivals and service completions are identical in both systems, and there is no balking or reneging from either system. So until the time when k ≥ s, we have Na (t) = Nb (t). Now consider a state k > s: βa > βb implies that da,k−s > db,k−s . Since the probability of balking is given by P (T < dk−s ), we note that every time a customer balks from System b, she will also balk in System a and there will be additional customers who balk in System a, thus leading to Na (t) ≤ Nb (t). Whenever Na (t) ≤ Nb (t) there will be more service completions from System b. Under the no-update case, the probability that the k − sth customer in queue reneges is given by P (T < Dk−s |T > dk−s ), where the delay in queue increases as a function of the length of the queue. Using the fact that reneging is exponential, we can state that whenever 41

a customer reneges from System a, she will also renege in System b with additional reneging occurring in System b. More specifically, all customers in System a will have their identical counterparts in System b. These identical customers will renege in a similar fashion. In addition, System b will have some additional customers, non-identical to any in System a. If all of these non-identical customers do not renege at time t, one has Na (t) < Nb (t), if on the other hand all of them renege the system states can revert back to Na (t) = Nb (t). Under the update case, the probability that the k − sth customer in queue reneges is given by P (dk−s < Dk−s |T > dk−s ). Once again, when Na (t) ≤ Nb (t), there will be a customer in System b corresponding to each customer in System a, plus some additional customers. Since da,k−s > db,k−s for all k > s, for the customers that are common in both systems a renege is more likely in System b. In other words every time one of the common customers reneges in System a she will also renege in System b, but it is possible to have the customer in a stay while the corresponding customer in b departs. The additional customers in System b may stay or renege, ensuring all the time that either Na (t) < Nb (t) or the system states revert back to Na (t) = Nb (t). Whenever Na (t) = Nb (t) either due to extra service completions or extra reneging in System b, the two systems can be coupled again and the same arguments ensuring the sample path ordering Na (t) ≤ Nb (t) will hold. In steady state we can state that E[Na (t)] < E[Nb (t)], implying the desired relationships PaR < PbR and PaB > PbB .

2

References Ak¸sin, O., Armony, M., and Mehrotra, V. 2007.

The Modern Call-Center: A Multi-

Disciplinary Perspective on Operations Management Research.

Production and Op-

erations Management, 16:665–688. Allon, G., Bassamboo, A., and Gurvich, I. 2007. We Will Be Right with You”: Managing Customers with Vague Promises. Working paper. Ancker, C. J. and Gafarian, A. 1962. Queueing with Impatient Customers Who Leave at Random. Journal of Industrial Engineering, 13:84–90. Armony, M. and Maglaras, C. 2004a. Contact Centers with a Call-Back Option and RealTime Delay Information. Operations Research, 52:527–545. Armony, M. and Maglaras, C. 2004.

On Customers Contact Centers with a Call-Back

Option: Customer Decisions, Routing Rules, and System Design. Operations Research, 52:271–292. 42

Armony, M., Shimkin, N., and Whitt, W. 2005. The Impact of Delay Announcements in Many-Server Queues with Abandonment. Operations Research. To appear. Baccelli, F. and Hebuterne, G. 1981.

On Queues With Impatient Customers.

Perfor-

mance’81 North-Holland Publishing Company, pages 159–179. A Framework for Analyzing the Quality of the Customer Interface. European Management Journal, 11:385-396. Bitran, G.R., Ferrer, J-C., and Rocha e Oliviera, P. 2008. Managing Customer Experiences: Perspectives on the Temporal Aspects of Service Encounters. Manufacturing & Service Operations Management, 10:61–83. Brandt, A. and Brandt, M. 2002. Assymptotic Results and a Markovian Approximation for the M(n)/M(n)/C + GI System. Queueing Systems, 41:73–94. Brown, L., Gans, N., Mandelbaum, A., Sakov, A., Shen, H., Zeltyn, S., and Zhao, L. 2005. Statistical Analysis of a Telephone Call Center: A Queueing-Science Perspective. Journal of the American Statistical Association, 100:36-50. Feigin, P. 2005. Analysis of Customer Patience in a Bank Call Center. Working Paper. Gans, N., Koole, G., and Mandelbaum, A. 2003. Telephone Call Centers: Tutorial, Review, and Research Prospects. Manufacturing & Service Operations Management, 5:73–141. Garnett, O., Mandelbaum, A., and Reiman, M. 2002. Designing a Call Center with Impatient Customers. Manufacturing & Service Operations Management, 4:208–227. Guo, P. and Zipkin, P. 2007. Analysis and Comparison of Queues with Different Levels of Delay Information. Management Science, 53:962–970. Hornik, J. 1984. Subjective vs. Objective Time Measures: a Note on the Perception of Time in Consumer Behavior. Journal of Consumer Research, 11:615–618. Hui, M. and Tse, D. 1996.

What to Tell Customer in Waits of Different Lengths: an

Integrative Model of Service Evaluation. Journal of Marketing, 60:81–90. Hui, M. and Zhou, L. 1996. How Does Waiting Duration Information Influence Customers’ Reactions to Waiting for Services? Journal of Applied Social Psychology, 26:1702–1717. Ibrahim, R. and Whitt, W. 2007. Real-Time Delay Estimation Based on Delay History. Manufacturing & Service Operations Management. To appear. Jouini, O. and Dallery, Y. 2007a. Analysis of a Multiple Priority Queue with Impatient Customers. Working paper. Jouini, O. and Dallery, Y. 2007b. Moments of First Passage Times in General Birth-Death 43

Processes. Mathematical Methods of Operational Research, 68:49–76. Jouini, O., Dallery, Y., and Ak¸sin, O. Z. 2007. Queueing Models for Full-Flexible Multi-class Call Centers with Real-Time Anticipated Delays. International Journal of Production Economics. to appear. Kahneman, D., Slovic, P., and Tversky, A., eds. 1982.

Judgement Under Uncertainty:

Heuristics and Biases Cambridge university Press, Cambridge. Katz, K., Larson, B., and Larson, R. 1991.

Prescription for the Waiting-in-Line Blues:

Entertain, Enlighten, and Engage. Sloan Management Review, pages 44–53. Keilson, J. 1964a. A Review of Transient Behavior in Regular Diffusion and Birth-Death Processes. Part I. Journal of Applied Probability, 1:247–266. Kijima, M. 1997. Markov Processes for Stochastic Modeling. Chapman & Hall. 1st Edition. Kleinrock, L. 1975. Queueing Systems, Theory, volume I. A Wiley-Interscience Publication. Kolesar, P. 1984. Stalking the Endangered CAT: A Queueing Analysis of Congestion at Automatic Teller Machines. Interfaces, 14:16–26. Koole, G. and Mandelbaum, A. Queueing Models of Call Centers: An Introduction Annals of Operations Research, 113:41–59. Kumar, P., Kalwani, M.U., and Dada, M. 1997. The Impact of Waiting Time Guarantees on Customers Waiting Experiences. Marketing Science, 16:295-314. Leclerc, F., Schmitt, B., and Dube, L. 1995. Waiting Time and Decision Making: Is Time Like Money? Journal of Consumer Research, 22:110-119. Maister, D. 1985. The Psychology of Waiting Lines. Czepiel, J., Solomon, M., Suprenant, C. eds. The Service Encounter: Managing Employee/Customer Interaction in Service Businesses. Lexington Books, Lexington MA, 113-123. Mandelbaum, A. and Zeltyn, S. 2004. The Impact of Customers Patience on Delay and Abandonment: Some Empirically-Driven Experiments with the M/M/N+G Queue. OR Spectrum, 26:377–411. Mandelbaum, A. and Zeltyn, S. 2006. Staffing Many-Server Queues with Impatient Customers: Constraint Satisfaction in Call Centers. Working paper. Munichor, N. and Rafaeli, A. 2007. Numbers or Apologies? Customer Reactions to Telephone Waiting Time Fillers. Journal of Applied Psychology, 92:511-518. Nakibly, E. 2002. Predicting Waiting Times in Telephone Service Systems. Ph.D. Thesis, Technion. 44

Naor, P. 1969. The Regulation of Queue Size by Levying Tolls. Econometrica, 37:15–24. Osuna, E. 1985. The Psychological Cost of Waiting. Journal of Mathematical Psychology, 29:82–105. Rafaeli, A., Barron, G., and Haber, K. 2002. The Effects of Queue Structure on Attitudes. Journal of Service Research, 5:125–139. Ross, M. 1997. Introduction to Probability Models. Academic Press, San Diego. 7th Edition. Taylor, S. 1994. Waiting for Service: The Relationship Between Delays and Evaluations of Service. Journal of Marketing, 58:56–69. Whitt, W. 1999a. Improving Service by Informing Customers about Anticipated Delays. Management Science, 45:192–207. Whitt, W. 1999b. Predicting Queueing Delays. Management Science, 45:870–888. Wolff, R. 1982. Poisson Arrivals See Time Averages. Operations Research, 30:223–231. Zakay, D. 1989. An Integrated Model of Time Estimation. Times and Human Cognition: A Life Span Perspective. Iris Levin and Dan Zakay, eds, Amsterdam: North Holland. Zeltyn, S. and Mandelbaum, A. 2005. Call Centers with Impatient Customers: Many-Servers Asymptotics of the M/M/n+G Queue. Queueing Systems, 51:361–402. Zohar, E., Mandelbaum, A., and Shimkin, N. 2002. Adaptive Behavior of Impatient Customers in Tele-Queues: Theory and Empirical Support. Management Science, 48: 566– 584.

45

Suggest Documents