This study develops a dynamic bus arrival time prediction model using the data collected by the automatic vehicle location and automatic pas- senger counter ...
Dynamic Prediction Method with Schedule Recovery Impact for Bus Arrival Time Mei Chen, Xiaobo Liu, and Jingxin Xia Among those transit agencies with such information are AC Transit in Almeda County, California (1); the City-University-Energysaver bus system in Fairfax, Virginia (2); the Vail Bus Service in Vail, Colorado (3); the municipal railway system in San Francisco, California (4); the Tri-met Transit Tracker system in Portland, Oregon (5); and King County Metro Transit in the state of Washington (6). Transit agencies have started to use advanced sensing and communication technologies to improve transit service quality. Various technologies such as APC and AVL have been implemented (7 ) nationwide, among which the APC technology is drawing increasing attention. The AVL-APC system is considered an efficient tool in collecting bus operating information (e.g., numbers of passengers boarding and alighting at each stop as well as the corresponding time and location) that is critical to transit operation analysis and service planning. The cost of obtaining such information as provided by the AVL-APC system using traditional data collection methods in the same quantity and quality is prohibitive. Using the rich data made available by such devices installed on buses, a methodology was developed to predict bus arrival time. It is dynamic in that it can incorporate the most recent bus travel information (e.g., current location and time), whenever it becomes available, into the model for arrival time prediction at downstream stops. The algorithm is simple enough to not require intensive computation, which would be desirable in real-time applications. Furthermore, the model takes into consideration the impact of the driver’s schedule recovery effort as a control factor.
This study develops a dynamic bus arrival time prediction model using the data collected by the automatic vehicle location and automatic passenger counter systems. It is based on the Kalman filter algorithm with a two-dimensional state variable in which the prediction error in the most recent observation is used to optimize the arrival time estimate for each downstream stop. The impact of schedule recovery is considered as a control factor in the model to reflect the driver’s schedule recovery behavior. The algorithm performs well when tested with a set of automatic vehicle location–automatic passenger counter data collected from a real-world bus route. The algorithm does not require intensive computation or an excessive data preprocessing effort. It is a promising approach for real-time bus arrival time prediction in practice.
The objective of this study is to develop a dynamic prediction methodology that is capable of providing bus arrival time at downstream major stops listed on the timetable, called time points (TPs), for real-time implementation. This methodology will be able to interface with the data collected by the automatic vehicle location (AVL) and automatic passenger counter (APC) and provide updates on bus arrival time for each downstream stop when the newest information on the bus location and time becomes available. Furthermore, the schedule recovery effort by the bus drivers will be incorporated into the model. Bus transit plays an important role in transportation systems, especially in urban areas. In today’s tough competition with automobiles, transit service needs to improve its quality. Traditionally, passengers rely on published timetables to make decisions about departure time to the bus station and transfer activities. However, due to shared right-of-way, bus services often experience delays that result from unexpected congestion along the routes. Thus, passengers often find themselves waiting a long time at bus stops. This delay may also foil their plans to transfer by incurring additional interruption in their schedules. A more reliable source for bus arrival information is needed to increase the confidence of passengers on bus service, especially in the event of unexpected congestion along the route. More transit agencies are offering bus arrival time information to help transit users make “smarter” decisions about their departure time from home or work to shorten their waiting time. On the other hand, such information can also help transit operators conducting performance evaluation, operational control, and service planning.
PREDICTION METHODOLOGY Various models have been developed to predict bus arrival time. Generally, the techniques used can be categorized as follows: regression (8), artificial neural network (9–11), Kalman filter (9, 11–13), and a combination of the preceding techniques (9). Lin and Zeng (8) developed a set of algorithms to predict bus arrival time based on Global Positioning System (GPS) data. Regression models were built for different combinations of independent variables. The prediction accuracy was limited because of some inherent features of GPS data, such as an inconsistent sample period. Prediction models based on an artificial neural network algorithm were also developed through various studies such as those of Chen et al. (10, 11) and Chien et al. (9). The advantage of such an algorithm is that it does not require an explicit function form or independence among the input variables. However, large amounts of data are required to train the network to achieve a reliable prediction. With its dynamic feature to update the estimation of state variables, a Kalman filter has been used widely in various fields including forecasting traffic parameters. Dailey et al. (12) and Cathey and Dailey (14) developed a Kalman filter-based algorithm for transit
M. Chen and J. Xia, Department of Civil Engineering, University of Kentucky, 267 Raymond Building, Lexington, KY 40506-0281. X. Liu, LS Engineering Associates Corporation, 230 U.S. Highway 206, Flanders, NJ 07836. Transportation Research Record: Journal of the Transportation Research Board, No. 1923, Transportation Research Board of the National Academies, Washington, D.C., 2005, pp. 208–217.
208
Chen, Liu, and Xia
209
arrival time prediction using data collected by onboard AVL units. The prediction was performed at a fixed time interval using observations of vehicle location that were obtained asynchronously. The algorithm has been implemented by King County Metro, a transit agency in the metropolitan Seattle area. A Kalman filter algorithm was also used in studies by Chien et al. (9) and Shalaby and Farhan (13). However, these models were developed based on the assumption that all buses traveling on the route were equipped with APC or AVL devices. In reality, the high initial investment associated with these devices limits their application in real-world practice. In this study, the Kalman filter technique is also used in developing a model to predict bus arrival time. This model will be based on APC data collected from the studied bus route. The data contain not only the information on passenger activities but also information about when and where these activities occurred. A Kalman filter is a powerful mathematical tool that can estimate the future states of variables even without knowing the precise nature of the system modeled. It is a recursive procedure that corrects its estimates whenever new observations become available, with the objective of minimizing the estimated error covariance. The filter procedure developed in this study is designed to predict arrival time at downstream TPs for each bus trip. The starting TP is defined as the origin, and each downstream TP is treated as a destination. Assume there are N TPs numbered sequentially along the route with the origin labeled as 1. The general concept of the prediction algorithm is presented in Figure 1. Let J denote the set of TPs (excluding the origin) along the route; thus, j ∈ (2, N), ∀j ∈ J. Each of these TPs may be a destination to which the travel time will be estimated. Let tk,j denote the travel time from TP k (i.e., the current TP from which arrival time prediction is performed) to destination j (i.e., the downstream TP for which arrival time prediction is performed, j ∈ J and k < j ); τk,k+1 denote the
estimated travel time from TP k to TP k + 1; and sk denote the travel time from the origin (i.e., TP1) to TP k. Then the travel time from TP k + 1 to destination j ( j ∈ J and k + 1 < j ), tk+1,j, can be calculated by Equation 1 as follows: tk +1, j = tk , j − τ k , k +1
(1)
And the travel time from the origin to TP k + 1 can be calculated by Equation 2 as follows: sk +1 = sk + τ k , k +1
(2)
One of the operational goals of a transit agency is to keep buses on schedule. In reality, transit operators tend to constantly adjust vehicle speeds to maintain good schedule adherence. Through examining a set of AVL-APC data collected in 2002 from a reputable transit agency in the northeastern United States, it was found that a schedule recovery effort can be observed on at least half the segments on a transit route. For instance, if a bus is delayed at TP k, around 50% of trips can be observed to have a shorter travel time than scheduled between TPs k and k + 1. Lin and Bertini (15) used the Markov chain concept to model bus operators’ behavior in the schedule recovery process. This study attempted to use the Kalman filter to account for the impact of such an effort. The term τk,k+1 is defined as the estimated travel time between TPs k and k + 1. It consists of two elements; one is the schedule travel time from k to k + 1, Tk,k+1, and the other is the driver adjustment Dk based on the schedule adherence status of the bus. Then τk,k+1 can be estimated as follows: τ k , k +1 = Tk , k +1 + Dk
(3)
s1=0 TP1
t1,2
TP2
TP13
TPk
t1,k
TP14
t1,14
TP1
s2
TP2
TP13 TPk
t2,k
TP14
t2,14
……………………………………..
TP1
TP2 sk
TPk TP13
tk,13 tk,14 FIGURE 1
Arrival time prediction procedure.
TP14
210
Transportation Research Record 1923
The delay experienced by the bus at TP k can be expressed as T1,k − sk. Assume the driver adjustment between k and k + 1, Dk, is proportional to the delay at TP k. Then, if βk represents the adjustment factor for the segment between TPs k and k + 1, one can estimate its value with historical trip information. Therefore, the driver adjustment can be estimated as follows:
tion) for the next segment based on its current state and error covariance estimates. Equation 5 provides a measurement update function that realizes the feedback process—that is, the newly measured travel time (from origin) at the current TP is used to adjust the predicted travel time (to the destination). The overall filtering procedure is a recursive prediction–correction process outlined as follows:
Dk = β k (T1, k − sk )
xk +1, j = Φ k +1 xk , j + uk + wk , j
(5)
• Step 1. Initialize. – Set k = 1 and j = 2 • Step 2. Initialize state variables. Let xˆk,j = (tˆk,j sˆk)T, where tˆk,j is the estimated total travel time from the origin to destination j, and sˆk is the travel time from the origin to TP k. • Step 3. Initialize covariance Pk,j. • Step 4. Extrapolate state variable.
zk = Hk xk , j + vk , j
(6)
xˆ k−+1, j = Φ ′k +1 xˆ k , j + uk′
( 4)
If zk denotes the observed travel time from the origin to TP k, then zk = sk. For each destination j ∈ J, let xk,j = (tk,j sk)T denote a twodimensional state variable, and a Kalman filter can be formulated as follows:
where Φk+1 = Hk = uk = wk, j and vk, j =
⎛ 1 0⎞ ⎜ ⎟, ⎝ 0 1⎠ (0 1), ⎛ −1⎞ ⎜ ⎟ τ k . k +1 , and ⎝ 1⎠ white noise associated with the transition process and measurement, respectively, and are assumed to have zero mean and variances of Qk,j and Rk,j, respectively.
Given Equations 3 and 4, the control input term uk can be converted as follows: ⎛ −1⎞ uk = ⎜ ⎟ τ k , k +1 ⎝ 1⎠
(8)
where Tk,k+1 is the scheduled travel time between TP k and k + 1. • Step 5. Extrapolate covariance. Pk−+1, j = Φ ′k +1 Pk , j Φ ′kT+1 + Qk , j
( 9)
• Step 6. Compute Kalman gain (K). Kk +1, j = Pk−+1, j HkT+1 ( Hk +1 Pk−+1, j H kT+1 + Rk +1, j )
−1
(10)
• Step 7. Update state variable. xˆ k +1, j = xˆ k−+1, j + Kk +1, j ( zk +1 − Hk +1 xˆ k−+1, j )
⎛ −1⎞ = ⎜ ⎟ [Tk , k +1 + β k (T1, k − sk )] ⎝ 1⎠
(11)
Stop if k + 1 = j (i.e., when TP k + 1 is the destination) and j = N. Otherwise, if j < N, go to Step 8; else, if j = N, go to Step 9.
⎛ −1⎞ = ⎜ ⎟ [(Tk , k +1 + β k T1, k ) − β k (0 1) xk , j ] ⎝ 1⎠
• Step 8. Update destination. –j=j+1 – Go to Step 2 • Step 9. Update covariance.
βk⎞ ⎛0 ⎛ −1⎞ = ⎜ ⎟ (Tk , k +1 + β k T1, k ) + ⎜ ⎟ xk , j ⎝ 1⎠ ⎝ 0 −β k ⎠
Pk +1, j = Pk−+1, j − Kk +1, j Hk +1 Pk−+1, j
Let
(12)
– k = k + 1, j = k + 1 – Go to Step 2.
βk⎞ ⎛0 Ak+1 = ⎜ ⎟ ⎝ 0 −β k ⎠ and Equation 5 becomes xk +1, j = Φ ′k +1 xk , j + uk′ + wk , j
β k ⎞ ⎛ tk , j ⎞ ⎛ −1⎞ ⎛1 = ⎜ ⎟ ⎜ ⎟ + ⎜ ⎟ (Tk , k +1 + β k T1, k ) ⎝ 0 1 − β k ⎠ ⎝ sk ⎠ ⎝ 1 ⎠
( 7)
where Φ′k+1 = Φk+1 + Ak+1, and ⎛ −1⎞ u′k = ⎜ ⎟ (Tk , k +1 + β k T1, k ). ⎝ 1⎠ Equations 7 and 5 form a Kalman filter that accounts for the impact of drivers’ schedule recovery effort. It projects the bus progression process using a form of feedback control. Equation 7 describes the time update process in which a bus travels from one TP to its downstream TP. It predicts the state of the variable (e.g., travel time to the destina-
This Kalman filter algorithm starts with a baseline estimate of travel time from the origin to each downstream destination j ( j ∈ J). It uses the most recent observation of travel time between the origin and the last stop k (k < j ) to adjust the predicted travel time from k to each destination j. The predictions are updated whenever the bus reaches the next downstream TP—that is, when a new measurement zk becomes available.
CASE STUDY Data Collection The AVL-APC data collected in 2002 were obtained from a reputable transit agency in the northeastern United States. The pattern
Chen, Liu, and Xia
211
selected for this study has 14 TPs. The main attributes in each APC record used for the model development are summarized in Table 1. The ideal data structure for observing bus operations and developing prediction models is a set of successive TP records. Thus, the actual bus travel times between TPs can be used to update the estimates from the prediction model. However, the APC units record activities on all stops (whether a TP or not). In other words, there will be records on stops made between TPs to pick up or drop off passengers for the stop-on-demand type of operation. On the other hand, there might not be a record for a particular TP if the bus did not stop because no passenger demanded existed there. Therefore, travel time interpolation was performed for those skipped TPs to make the APC data set consistent for prediction model development.
Schedule Recovery Phenomenon It is a common understanding that transit operators tend to actively pursue schedule recovery if the bus is delayed. On the basis of AVLAPC data and the timetable, the delay at each TP was calculated and then correlated with the travel time deviation (from the schedule) on the next segment (from the current TP to the next downstream TP). Schedule recovery was observed on all segments and was particularly frequent on the upstream portions of the route. The passenger boarding–alighting activities along the route, as recorded by the APC devices, can offer some explanation of the pattern of schedule recovery. Generally, fewer stop activities and less passenger demand could make the driver’s schedule recovery effort easier. Further exploration showed that besides traffic impact, the schedule recovery phenomenon usually occurred on segments with fewer stops made between two adjacent TPs and a smaller number of passengers boarding–alighting. For example, because there is no
TABLE 1
Main Index in APC Data Set
Variable
Description
Sched arrival time Transit day Time of day Open time Close time Stop sequence Time point ID Trip status Lat Lon On Off Stop distance Dwell time Leg time Leave psgr load
Scheduled arrival time at each time point Date of the service Time period of bus operation Recorded bus door opening time Recorded bus door closing time A unique number to all intended stops along the route Time point indicator number Trip status (start or end) Latitude Longitude Number of boarding passengers at a stop Number of alighting passengers at a stop Travel distance between two consecutive stops The bus door open time at any stop Inter-stop travel time Number of onboard passengers when the bus leaves a stop Number of onboard passengers when the bus arrives at a stop Unique index associated with a pattern in each pick data Unique index associated with a trip
Arrive psgr load Pattern ID Trip index
intermediate stop between TP4 and −5 (they are two airport terminals), about 85% of the trips recovered a certain loss in travel time on this segment. On the other hand, 73% of the trips experienced additional schedule deviation on the segment between times 10 and 11. The APC data showed that on this segment an average of two stops per mile were made to allow, on average, three passengers to board and four passengers to alight at each stop. Using the historical trip data, one can get the distribution of the adjustment factor (βk) along the route. On most segments, the average adjustment factors are mostly between −0.5 and 0.5. In this study, the average of all historical adjustment factors was used on segment k as the βk. Next, an example is presented to demonstrate implementation of this prediction algorithm with the schedule recovery process on an individual bus trip. This is followed by a discussion of the overall performance of the algorithm and comparative analysis with other bus travel time estimation models.
Individual Trip Analysis Following the steps of the Kalman filter algorithm, travel times from the current stop to every downstream TP are predicted. The predictions are then updated when the bus reaches the next TP. For illustration purposes, an arbitrarily selected trip is used as an example to analyze the algorithm performance. This trip was made on Wednesday, October 2, 2002. It was scheduled to depart TP1 on 1:32 p.m. and arrive at TP14 at 3:19 p.m. Table 2 presents the prediction results using the Kalman filter algorithm on this trip. There are 14 TPs in total along the route, with the first one defined as the origin. When the bus reaches the next TP, 1 is added to the k value until the bus reaches the final destination (i.e., TP14). Each cell in the table represents the estimated state variable xˆk,j = (tˆk,j sˆk)T. For example, when k = 3, the bus has already reached TP3. For destination TP10 (i.e., j = 10), the estimated travel time from TP3 to TP10, tˆ3,10, is 3,851 seconds, and the bus has spent 953 seconds traveling from the origin to TP3 (i.e., sˆ3 = 953 seconds). When k = 1, the filter initializes itself using baseline estimates of travel time between the origin and each of the downstream TPs (destination j ∈ J ). The timetable is a good source for such estimates. For example, when j = 2, the estimated travel time from TP1 to TP2, tˆ1,2, is 480 seconds according to the timetable, and the estimated travel time from the origin (i.e., TP1) to the current TP (i.e., TP1) is 0. When the bus reaches the next TP, k + 1, the travel time predictions to the downstream TPs, k + 2 through N (N = 14), are adjusted using the actual arrival information at the current TP, k + 1. This process is repeated until the bus arrives at the final destination, TP14. One should note that because sk is defined as the travel time from the origin to the current TP k, it always reflects the travel time that has already been recorded and is not related to the downstream destination. Therefore, its estimates for all downstream destinations are the same for a given k. In addition, if one lets Lk,j = tk,j + sk for any j ∈ J and k < j, Lk,j represents the travel time estimate, made when the bus reaches TP k, from the origin to destination j. The prediction error is defined as the difference between the predicted and the actual travel time for each pair of TPs. Figure 2 shows the distribution of the prediction error of the studied trip. Markers on each line represent the prediction errors for corresponding downstream destinations. One can observe that, for a given current location, prediction error tends to be larger at destinations that are farther away from the current bus location. Typically, with the bus proceeding
212
Transportation Research Record 1923
TABLE 2
Travel Time Prediction Output for Individual Trip (seconds)
k\j
1
2
3
4
5
6
7
8
9
10
11
12
13
14
1
— —
480 0 — —
1140 0 675 499 — —
1620 0 1157 499 490 953 — —
1920 0 1458 499 790 953 310 1286 — —
2520 0 2058 499 1391 953 910 1286 609 1497 — —
3540 0 3079 499 2411 953 1930 1286 1629 1497 1027 2030 — —
4380 0 3919 499 3251 953 2770 1286 2470 1497 1867 2030 854 2782 — —
4800 0 4339 499 3671 953 3190 1286 2889 1497 2287 2030 1274 2782 433 3554 — —
4980 0 4519 499 3851 953 3370 1286 3069 1497 2467 2030 1454 2782 613 3554 204 3986 — —
5160 0 4699 499 4031 953 3550 1286 3249 1497 2647 2030 1634 2782 793 3554 384 3986 200 4155 — —
5340 0 4879 499 4211 953 3730 1286 3429 1497 2827 2030 1814 2782 973 3554 564 3986 380 4155 200 4378 — —
6180 0 5719 499 5051 953 4570 1286 4270 1497 3667 2030 2654 2782 1813 3554 1404 3986 1220 4155 1040 4378 862 4564 — —
6420* 0000** 5959 499 5291 953 4810 1286 4509 1497 3907 2030 2893 2782 2053 3554 1643 3986 1460 4155 1280 4378 1102 4564 262 5353 — —
2 3 4 5 6 7 8 9 10 11 12 13 14 *: ˆtk, j **: Sˆ k
1000
Prediction Error (seconds)
800
600
400
200
0 1
2
3
4
5
6
7
8
-200 Time Point FIGURE 2
Error distributions for prediction performed at each TP.
9
10
11
12
13
Chen, Liu, and Xia
213
along the route (i.e., when k approaches the destination j ), the prediction error generally decreases for a given downstream destination. Using TP14 as an example, Figure 3 shows that the prediction error decreases as the bus proceeds and new arrival information becomes available and is applied in correcting the estimates. For the same trip, the predicted arrival times on each TP and actual arrival times are illustrated on a time–space diagram in Figure 4a. Figure 4b is an enlarged local view (from TP9 to TP14) of Figure 4a. The figure shows that the predicted trajectories consistently approach the actual trajectory when the bus proceeds along the route. This results from the continuous adjustment of the predicted travel time. The power of the Kalman filter model is clearly demonstrated in this application.
The smoothing method also depends on historical data. However, it is designed to predict future travel time on a segment by calculating a weighted average between the estimated travel time, based on the latest trip information, and the historical travel time. The weight parameter used in the model is determined by minimizing the mean square error between the projected and actual travel times based on the historical data set. The root-mean-square error (RMSE) is computed as a performance measure to evaluate the average variation between the actual and predicted travel times by each model. The RMSE is defined as follows. 1 N
RMSE =
Performance Analysis
N
∑ (y
i
2 − yˆi )
(13)
i =1
where
The performance of the algorithm is tested on all bus trips for which AVL-APC data are available. Figure 5 shows the distribution of the prediction error, where TP1 is taken as the origin and each of the other TPs along the route is a destination. A comparative analysis is conducted to demonstrate the superior performance of the Kalman filter. Considering the ultimate goal of achieving a real-time application, two additional models, historical average and smoothing, that are relatively easy to implement are built and tested with the same set of data. The historical average approach relies on the analog between past and future traffic conditions. It categorizes the traffic condition by time of day for each segment and then estimates the future travel time on a segment by averaging historical travel times recorded for the same segment during the same time of day.
N = number of test samples, yi = actual travel time of sample i, and yˆi = predicted travel time of sample i. For each travel time prediction method, the RMSEs are computed for travel times from origin to each downstream TP. Their distributions are shown in Figure 6, from which one can conclude that the Kalman filter algorithm provides better predictions of travel times than the timetable as well as the historical average and the smoothing algorithms. The overall RMSEs aggregated from all segments are presented in Figure 7. Compared with the Kalman filter algorithm, the travel times indicated on the bus timetable show a larger deviation from the actual travel time measured, especially for destinations farther away from
800 700
Prediction Error (seconds)
600 500 400 300 200 100 0 1
2
3
4
5
6
7
8
9
10
-100 -200 Time Point FIGURE 3
Prediction error variation for travel time (from TP1 to TP14).
11
12
13
14
214
Transportation Research Record 1923
7000
6000
Travel Time (sec)
5000
4000
3000
2000
1000
0 1
2
3
4
5
6
7
8
9
10
11
12
13
14
Time Point Actual Predicted at TP5 Predicted at TP10
Predicted at TP1 Predicted at TP6 Predicted at TP11
Predicted at TP2 Predicted at TP7 Predicted at TP12
Predicted at TP3 Predicted at TP8 Predicted at TP13
Predicted at TP4 Predicted at TP9
(a) 6500
6250
Travel Time (sec)
6000
5750
5500
5250
5000
4750
4500 12
13
14
Time Point Actual Predicted at TP5 Predicted at TP10
Predicted at TP1 Predicted at TP6 Predicted at TP11
Predicted at TP2 Predicted at TP7 Predicted at TP12
(b) FIGURE 4
Time–space diagram of one sample trip.
Predicted at TP3 Predicted at TP8 Predicted at TP13
Predicted at TP4 Predicted at TP9
Chen, Liu, and Xia
215
1000
Prediction Error (sec)
500
0
-500
-1000
-1500 1
FIGURE 5
2
3
4
5
6
7 8 9 Time Point
10
11
12
13
14
Prediction error distribution.
800 700
RMSE (sec)
600 500 400 300 200 100 0 2
3
4
5
Historical Average FIGURE 6
6
7
8 9 Time Point
Kalman Filter
10
Smoothing
11
12
Schedule
Model performance comparison.
500 450 400 RMSE (sec)
350 300 250 200 150 100 50 0 Historical Average
Kalman Filter
Smoothing
Time Point FIGURE 7
Overall performance comparison.
13
Schedule
14
216
Transportation Research Record 1923
the origin. Because the Kalman filter algorithm is initialized based on the timetable, the prediction shows a larger variation (from the actual travel time) for the first couple of segments, compared with the historical average and smoothing methods. However, with the bus proceeding along the route, the Kalman filter algorithm consistently generates better prediction of travel time than the others. The Kalman filter algorithm also requires minimum effort for data preprocessing when compared with other algorithms. It does not require historical travel time at each step of estimation if it is initialized based on the timetable. However, both historical average and smoothing methods would demand continuous reference to the historical travel time. Other methods for bus travel time prediction, such as that of Chen et al. (11), may involve significant modeling effort (e.g., neural network model development) as part of the algorithm. This certainly increases the difficulty for implementation of such models. Even with the drivers’ schedule recovery effort, the absolute prediction errors still tend to propagate when the bus runs over a longer distance because there are other stochastic factors (such as traffic condition) affecting the bus on time performance. Therefore, the relative prediction error is calculated to evaluate the performance of the prediction model, as indicated in Figure 8. It is observed that the relative prediction error does show a decreasing trend with the bus approaching the final destination.
this algorithm is more straightforward and requires less preprocessing of data. Currently, the APC data are collected along the route and stored at the onboard computer before they are downloaded to the central database after the buses return to the garage. The real-time implementation of the algorithm will become feasible once the live communication mechanism is established to transfer data to the control center where the computation is expected to be carried out. Even though this algorithm does not require a particular trip length, it is shown that, for longer trips, the benefit of the algorithm would be more obvious—that is, the relative prediction errors have a strong tendency of decreasing along the trip.
CONCLUSIONS AVL-APC data contain rich information about various bus operating and service characteristics. In this study, a Kalman filter algorithm is developed to predict bus arrival times based on the time–space information extracted from such data. Using the feature of dynamic prediction adjustment, the algorithm updates estimated arrival times at downstream TPs whenever new observation of travel time becomes available. When it is initialized based on the timetable, this algorithm is not computationally intensive. This is especially beneficial to its potential application in real-world transit systems. In addition, the impact of schedule recovery efforts by bus operators is also taken into consideration in the model. The AVL-APC data greatly facilitate the understanding of the schedule recovery effect as well as its relationship to the number of stops made and passenger on and off counts. The driver adjustment factor estimated from historical trip data is then used as a control input in the prediction model. The overall performance of the Kalman filter model is quite satisfactory; the decreasing trend of relative prediction error along the route demonstrates its dynamic optimization capability. It especially can bring more benefit for longer trips, which cannot be realized by other models such as historical average and smoothing algorithms.
Implementation Issue The Kalman filter algorithm developed in the study does not require intensive calculation. It uses the time and location information at each stop, once they become available, to adjust the prediction dynamically. The computation time is minimal and can be ignored in real-time applications. With a simple initialization based on the bus timetable, the Kalman filter algorithm outperforms the historical average and smoothing models. Compared with these alternatives as well as other existing methods involving the Kalman filter such as that of Chen et al. (11),
60
Relative Error (%)
40
20
0
-20
-40
-60 1
2
3
4
5
6
7
8
9
Time Point FIGURE 8
Distribution of relative prediction error.
10
11
12
13
14
Chen, Liu, and Xia
The low computational intensity of the algorithm certainly ensures its promising potential for real-time implementation.
REFERENCES 1. www.actransit.org/. Accessed July 2004. 2. www.ci.fairfax.va.us/Services/CueBus/NextBus.htm. Accessed July 2004. 3. www.ci.vail.co.us. Accessed July 2004. 4. www.sfmuni.com. Accessed July 2004. 5. www.tri-met.org/its.pdf. Accessed July 2004. 6. www.its.washington.edu/transitwatch/. Accessed July 2004. 7. Okunieff, P. AVL Systems for Bus Transit. TCRP Synthesis of Transit Practice, 24, TRB, National Research Council, Washington, D.C., 1997. trb.org/publications/tcrp/PDF/TSYN24.pdf. 8. Lin, W.-H., and J. Zeng. Experimental Study of Real-Time Bus Arrival Time Prediction with GPS Data. In Transportation Research Record: Journal of the Transportation Research Board, No. 1666, TRB, National Research Council, Washington, D.C., 1999, pp. 101–109. 9. Chien, I.-J., Y. Ding, and C. Wei. Dynamic Bus Arrival Time Prediction with Artificial Neural Networks. Journal of Transportation Engineering, Vol. 128, No. 5, 2002, pp. 429–438.
217
10. Chen, M., J. C. Yaw, S. Chien, and X. Liu. Using AVL and APC Data in Bus Arrival Time Prediction. Presented at 82nd Annual Meeting of the Transportation Research Board, Washington, D.C., 2003. 11. Chen, M., X. Liu, J. Xia, and S. Chien. A Dynamic Bus Arrival Time Prediction Model Based on APC Data. Journal of Computer-Aided Civil and Infrastructure Engineering, Vol. 19, 2004, pp. 364–376. 12. Dailey, D. J., S. D. Maclean, F. W. Cathey, and Z. R. Wall. Transit Vehicle Arrival Prediction: Algorithm and Large-Scale Implementation. In Transportation Research Record: Journal of the Transportation Research Board, No. 1771, TRB, National Research Council, Washington, D.C., 2001, pp. 46–51. 13. Shalaby, A. S., and A. Farhan. Bus Travel Time Prediction Model for Dynamic Operations Control and Passenger Information Systems. Presented at 82nd Annual Meeting of the Transportation Research Board, Washington, D.C., 2003. 14. Cathey, F., and D. Dailey. A Prescription for Transit Arrival/Departure Prediction Using AVL Data. Transportation Research Part C, Vol. 11, 2003, pp. 241–264. 15. Lin, W. H., and R. L. Bertini. Modeling Schedule Recovery Processes in Transit Operations for Bus Arrival Time Prediction. Journal of Advanced Transportation, Vol. 38, No. 3, 2004, pp. 347–365. The Network Transit and Commercial Fleet Subcommittee of the Transportation Network Modeling Committee sponsored publication of this paper.