Imputing Erroneous Data of Single-Station Loop

0 downloads 0 Views 2MB Size Report
Effective use of loop detector data for traffic management requires that errors be ... inaccurate under light traffic conditions, especially when estimates are based on zero flow. 12 ... collection leads to large amounts of data, making manual detection and ... comprehensively using field data in the context of a single station.
Imputing Erroneous Data of Single-Station Loop Detectors for Non-incident Conditions: Comparison between Temporal and Spatial Methods

Weihao Yin Graduate Research Assistant Department of Civil and Environmental Engineering Virginia Tech Tel: (571) – 329 2030 [email protected] Pamela Murray-Tuite* Assistant Professor Department of Civil and Environmental Engineering Virginia Tech 7054 Haycock Road Falls Church, VA, 22043 Tel: (703) 538-3764 Fax: (703) 538-8450 Email: [email protected] Hesham Rakha Professor, Civil & Environmental Engineering - Virginia Tech Director, Center for Sustainable Mobility - VTTI Virginia Tech Transportation Institute 3500 Transportation Research Plaza Blacksburg, VA 24061, USA. Tel: (540) 231-1505 Fax: (540) 231-1555 [email protected]

*Corresponding Author

1 1

ABSTRACT

2

Effective use of loop detector data for traffic management requires that errors be efficiently

3

detected, diagnosed, and corrected. We present two new spatial approaches and compare them to

4

state-of-the-art correction procedures for station flow estimation when detectors within that

5

station malfunction in non-incident conditions. One new method exploits the relationship

6

between individual detector flow and station flow using linear regression. The second

7

incorporates lane use percentages through kernel regression. To comprehensively compare the

8

procedures, systematic and random-error evaluations are conducted for two detector stations with

9

distinct lane configurations. Lane configuration is important for spatial correction methods,

10

which perform well under certain detector failure combinations. The random-error evaluation

11

indicates that temporal correction performs better at all error levels and spatial approaches are

12

inaccurate under light traffic conditions, especially when estimates are based on zero flow

13

readings. When choosing a correction procedure, one should consider facility configurations,

14

error types and magnitudes, and traffic conditions, and calibrate the method for location-specific

15

characteristics.

16 17 18

Keywords: detector, data correction, linear regression, kernel regression

2 1

1. INTRODUCTION

2

In 2010, congestion cost U.S. urban travelers 4.2 billion hours of delay and 2.8 billion gallons of

3

wasted fuel, which resulted in monetary costs of $101 billion (Schrank & Lomax, 2011).

4

Advanced Traveler Information Systems, supported by the Intelligent Transportation System

5

(ITS) infrastructure, help relieve congestion by providing route and decision guidance (FHWA,

6

1998). To provide useful advisory messages, data must be collected, synthesized, and translated

7

(FHWA, 1998). The data's accuracy and reliability impact the user-benefits (Maier et al., 2009;

8

Toppen & Wunderlich, 2003); travelers may encounter worse conditions if they follow advisory

9

messages based on erroneous data. The data often include volume and occupancy, which are

10

readily retrieved from loop detectors (Pignataro, 1973); however, loop detector data is error

11

prone (Peeta & Anastassopoulos, 2002; Vanajakshi & Rilett, 2004). Therefore, it is essential to

12

efficiently and effectively detect, diagnose, and impute erroneous and missing data for ITS

13

applications (Smith & Venkatanarayana, 2005), travel time estimation and prediction, origin-

14

destination demand estimation, and incident detection relying on loop detector data (Fernandez-

15

Moctezuma et al., 1997).

16

Loop detectors are usually installed on each lane and parallel detectors constitute a

17

station. Typical measurements are occupancy, volume, and speed at a pre-defined time resolution.

18

Occupancy is a surrogate for density and volume can be transformed into flow (May, 1990).

19

Therefore, loop detector data essentially provide microscopic (time headways and vehicle speeds)

20

and macroscopic characteristics of traffic flow (May, 1990).

21

Issues associated with loop detector data are erroneous recordings and missing data,

22

caused by improper installation, communication malfunction, and/or wire failures (Bikowitz &

23

Ross, 1985). Data collection is usually continuous, which makes equipment malfunctions more

3 1

likely than if occasional collection techniques are used (Vanajakshi & Rilett, 2004). Continuous

2

collection leads to large amounts of data, making manual detection and correction impractical.

3

Error detection and correction algorithms typically use historical data from the same

4

detector, data from neighboring detectors, or a combination. Depending on the data used, a

5

procedure can be classified as temporal or spatial. "Temporal" procedures rely on historical data

6

of the same detector, while "spatial" approaches use data from detectors within the same station.

7

Spatial correction procedures (e.g., Chen et al. (2003)) use historical data to derive the

8

relationships between in-station detectors for subsequent correction, but are categorized as

9

spatial since correction is based on in-station detectors. Some approaches (e.g., Smith and

10

Conklin (2002)) combine spatial and temporal information.

11

Selection of the best approach for imputing missing and erroneous data is important for

12

the archival of traffic data (Smith & Venkatanarayana, 2005). To aid this selection for non-

13

incident

14

comprehensively using field data in the context of a single station. Among the approaches are

15

two proposed spatial correction procedures. One uses kernel regression to derive station flows

16

based on lane use percentages, and the other modifies Chen et al.'s (2003) approach by using

17

lane flow to estimate station flow (instead of one lane’s flow to predict another lane’s flow).

18

These two new procedures are evaluated along with Smith and Conklin’s (2002) approach and

19

the time-of-day historical mean approach. The comparison is conducted through systematic and

20

random-error evaluations at two detector stations with different lane configurations. The

21

systematic evaluation uses deterministic error configurations while errors are stochastically

22

introduced for the random-error evaluation. The results are analyzed to identify the most accurate

23

method(s) to correct a large dataset and obtain accurate station flow estimates.

conditions,

the performance of four

correction procedures are compared

4 1

The remainder of this paper is divided into six sections. Section 2 reviews loop detector

2

data error detection and correction studies. Section 3 describes the study's dataset. Section 4

3

presents the methods, followed by experimental procedures in Section 5. The results are

4

provided in Section 6. The last section presents conclusions and future directions.

5

2. PREVIOUS STUDIES

6

Error identification is necessary before correction. Though this study does not deal with error

7

detection, these methods are discussed for completeness and some studies jointly addressed error

8

detection and correction.

9

While techniques for error detection and correction generally fall into temporal or spatial

10

categories, some methods detected errors by checking for impossible combinations of traffic

11

flow characteristics. Such combinations were identified by volume-to-occupancy or flow-to-

12

occupancy ratios (Cleghorn et al., 1991; Jacobson et al., 1990). Turner (2004) later incorporated

13

speed. Chen et al. (2003) checked for impossible combinations over an entire day and detected

14

implausible time series. Another widely-used technique compared volumes or flows,

15

occupancies, and speeds with specific thresholds (Payne & Thompson, 2007; Weijermars & Van

16

Berkum, 2006).

17

Temporal detection and correction procedures were implemented by Chen et.al (1987)

18

and many European Intelligent Transportation Systems (Turner, 2004). Park (2003) constructed

19

a multi-variant screening methodology to identify outliers based on a variant of Mahalanobis

20

distance. Ishak (2003) developed an algorithm using fuzzy theory that clusters the input space of

21

the parameters speed, occupancy, and volume into regions based on normalized Euclidian

22

distance. Each observation’s uncertainty was then measured with one parameter, which was used

23

to identify errors. Later, this work was improved using probability theory and applied to real-

5 1

time situations (Ishak et al., 2007; Lee et al., 2010). Nihan (1997) modeled occupancy and flow

2

time-series as Autoregressive Moving Average processes and predicted values in the near future.

3

Other efforts relying on time-series techniques include Maier et al. (2009), who applied robust

4

estimation techniques to flow, occupancy, and speed time-series to estimate corrected

5

measurements. Using a spectral-domain time-series technique, Peeta and Anastassopoulos (2002)

6

developed a Fourier-Transform based algorithm that considered traffic characteristics as a time-

7

dependent function and distinguished abnormal data caused by incidents from detector

8

malfunctions and subsequently corrected erroneous data. One of the major concerns about

9

temporal detection and correction was that traffic flows could deviate from historical values due

10

to special circumstances, such as incidents, instead of malfunctioning detectors (Weijermars &

11

Van Berkum, 2006).

12

Spatial detection and correction procedures exploit the relationships between parallel

13

detectors and/or upstream and downstream detectors. Kwon et al. (2004) developed an algorithm

14

for detecting configuration errors based on the assumption that spatially close detectors should

15

show similar temporal patterns if no major disturbances, such as interchanges, exist in between.

16

Chen et al. (2003) identified strong correlation among neighboring loop detector measurements

17

and modeled this correlation with linear regression models. For a specific detector, a regression

18

model was constructed using the data from this detector and one of its neighbors. The final

19

detector flow estimate was the median of all pair-wise estimates. Chen et al. (2003) reported a

20

mean absolute error of 132 vehicles/hour with a mean hourly flow of 1220 vehicles/hour.

21

Detection and correction methods based on the vehicle conservation principle compare flow

22

measurements between upstream and downstream locations. For example, Vanajakshi et al.

23

(2004) compared cumulative volumes at each location and corrected discrepancies using

6 1

constrained non-linear optimization. Kikuchi et al. (1999) applied fuzzy-optimization to adjust

2

each link's flow to achieve consistency. One potential deficiency of spatial correction procedures

3

was the inability to capture temporal variation of traffic throughout a typical day (Maier et al.,

4

2009).

5

Smith and Conklin's (2002) correction procedure relied on both temporal and spatial

6

information, using lane distribution patterns to derive missing data values. They reported an

7

average error less than 10% for almost all test cases. This method is discussed in detail later.

8

From the previous approaches, Smith and Conklin’s (2002), the time-of-day historical

9

mean, and our modified version of Chen et al.’s (2003) approach were selected for comparison,

10

along with our kernel regression approach, based on implementation ease and reported correction

11

accuracy. The variant of Chen et al.'s (2003) approach constructs a linear relationship between

12

the detector and the station flow instead of between station-mates. The kernel regression

13

technique is inspired by its applications in traffic flow prediction (e.g., (Han et al., 2010; Sun &

14

Chen, 2008)). The main strength is its non-parametric nature, which requires no prior

15

assumptions regarding the model structure (Takezawa, 2006). It models the relationship between

16

lane use percentage and lane flow and subsequently corrects errors. The models' details are

17

presented in Section 4.

18

3. DATASET DESCRIPTION

19

The Virginia Department of Transportation (VDOT) has 91 detectors grouped into 34 stations on

20

eastbound Interstate 66 (I-66E) between Manassas and Falls Church. VDOT provided detector

21

data and incident records from 2008, which are used for this study. From this dataset, we extract

22

the data for two stations. Station 1 is near the Lee Jackson Memorial Highway interchange. This

23

station, which has five detectors, is selected because it relates to ongoing research efforts. Station

7 1

2, with four detectors, is near West Ox Rd. Figure 1 illustrates the detector configurations and

2

station locations. Station 1 has an exit lane while Station 2 has only general-purpose lanes.

3

Insert Figure 1 Here

4

Each detector should report speed, occupancy, and volume every minute; 1440 daily

5

station flow records should exist if no malfunctions occur. The real-time hourly flow is the

6

volume reading multiplied by 60. A detector's working status is recorded by an indicator, which

7

has three possible values. A "0" indicates the detector malfunctioned; a “1” indicates possible

8

erroneous readings. Only readings from a detector with a status of “2” are considered correct.

9

When the status is "0" or "1", the readings have known errors (type 1). In addition to

10

these errors, the status indicator may be incorrectly reported. Type 2 errors include unreasonably

11

high flows; since detectors are lane based, the hourly flow obtained from 1-minute data for a

12

freeway lane cannot exceed 5000 veh/hr (Payne & Thompson, 2007). Hence, the lane's detector

13

status is set to "0" in these situations. Type 3 errors, missing data, are common. Sometimes, an

14

individual detector's readings are missing, and, at others, no detectors report values for an

15

extended duration (more than 10 minutes). Station flow is the sum of the individual detector

16

flows and is correct only when all detectors report correctly. In this study, all error types are

17

converted to missing-data errors since data from detectors with a "0" or "1" status are unreliable.

18

Hence, correction is equivalent to estimating missing data points, and the two terms are used

19

interchangeably.

20

implementation.

Additionally,

this

conversion

simplifies

the

correction

procedures'

21

To ensure fair comparison, three datasets are carefully chosen and constructed for each

22

station. The first is the correction target. The data for Tuesday, April 22, 2008 from 4:00 AM to

23

10:00 PM were selected for technique comparison, similar to previous studies (Payne &

24

Thompson, 2007; Peeta & Anastassopoulos, 2002; Smith & Conklin, 2002). The second (Base 1)

8 1

is the correction base dataset, which contains the data for the three months prior to the target day

2

(01/23/2008 to 04/20/2008), excluding holidays. The third (Base 2) is a subset of Base 1 and

3

contains the data for Tuesdays in Base 1. Base 2 is the correction base dataset for temporal

4

correction, described in the next section. Erroneous data and those for incident durations are

5

eliminated from Base 1 and 2, as in Peeta and Anastassopoulos (2002). Though the data for the

6

incident duration is excluded, potential reporting inaccuracies prohibit a guarantee that the

7

incident impact is completely eliminated.

8

The reasons for selecting April 22, 2008 are:

9



If all detectors report correct readings, 1080 correct station flows (4:00 AM - 10:00

10

PM) would be obtained. The chosen day is among those with the highest number of

11

correct station flow readings - 937 for Station 1 and 930 for Station 2. The 143 and

12

150 incorrect station flows for Stations 1 and 2, respectively, are not used for the

13

accuracy assessment since they are inaccurate/invalid observations.

14



15 16

Base 1 does not involve a seasonal shift, which may cause traffic pattern changes due to school holidays.



No incidents occur on this day within five miles upstream or downstream of both

17

stations. The non-incident condition is explicitly required by Smith and Conklin's

18

(2002) procedure.

19 20

Figure 2 illustrates the station flow profiles for the two stations on the target day. Insert Figure 2 Here

21

The traffic pattern is characterized by light traffic in the early morning and late evening.

22

The morning peak traffic reaches around 8000 veh/hr for both stations. The volume for the

9 1

afternoon peak period is considerably lower since both stations are located on I-66E, on which

2

home-to-work trips concentrate in the morning peak.

3

4. DATA CORRECTION PROCEDURES

4

The four data correction procedures are examined within the context of the target dataset.

5

First, the basic techniques are presented and then their adaptations for this study are discussed.

6

4.1. Temporal Correction (TC)

7

The temporal correction (TC) applied for this study is the Time-of-Day (TOD) historical

8

mean (Daganzo, 1997; Nguyen & Scherer, 2003; Maier et al., 2009). Let d denote the number of

9

different days within Base 2. The corrected flow for detector i at time t, fˆt i is calculated by Eq.(1). d

10

fˆt i =

∑ y ( j) i t

j =1

d

(1)

11

where yti ( j ) denotes the flow reading for detector i at time t on day j. TC takes the arithmetic

12

average of the flows at time t of all days available in Base 2. If a detector is functional, no

13

correction is made to the target dataset and the corrected flow fˆt i is the reported flow ft i . The

14

station flow is the summation of all the estimated detector flows as in Eq.(2).

15

qˆtTC = ∑ fˆt i

(2)

i

16

where qˆtTC denotes the corrected station flow (veh/hr) for time t using TC.

17

4.2. Spatial Correction using Linear Regression (LR)

18

Unlike Chen et al. (2003) who use relationships between flows of individual detectors,

19

we construct a series of linear regression models to estimate the station flow from functional

20

detectors. The target dataset's time span includes the HOV-effective duration (5:30 AM to 9:30

21

AM). To handle traffic pattern shifts due to HOV restriction, we build two sets of regression

10 1

models - one for the HOV period and one for the regular period. For notational simplicity, the

2

two sets of models are written in a unified format. An estimate of the station flow qˆti at time t

3

based on the flow ft i of functional detector i at time t is obtained through the regression model

4

expressed in Eq.(3) qˆti= β1i ft i + β 2i + e, {detector i is functional (status =2)}

5

(3)

6

Only readings from functional detectors are used. The coefficients β1i and β 2i are estimated

7

using the ordinary least squares method. For the models when the HOV lane is activated, only

8

the data points during the HOV time in Base 1 are used to estimate the coefficients; data points

9

for the regular period in Base 1 are used to estimate the coefficients of the regular period's

10

models. The final estimated station flow at time t is the arithmetic average of available station

11

flow estimates using Eq.(4), where n denotes the number of functional detectors at time t.

12

LR t



=

∑ qˆ

i t

i

n

(4)

13

For a given t, one must determine whether this time belongs to the HOV duration or not and

14

choose the set of models accordingly. If three detectors, for example, function at this time, an

15

estimate of station flow can be derived from each detector flow reading. The final estimated

16

station flow is the arithmetic mean of these three station flow estimates. The LR procedure

17

requires at least one functional detector to estimate station flow at a specific time. If no detectors

18

report correctly, the station flow is estimated using the temporal procedure qˆtTC, i.e., TC is the

19

default technique.

20

4.3. Spatial Correction using Kernel Regression (KR)

11 1

Under different congestion levels, drivers use different lanes, creating lane-to-lane flow

2

variability in multi-lane freeway sections (Carter et al, 1999;Amin & Banks, 2005). The lane use

3

pattern depends on the location and flow conditions (TRB, 2000).

4

We propose a new spatial correction procedure that exploits the relationship between lane

5

use patterns and flow conditions. A functional detector i at time t reports flow ft i . The

6

corresponding lane use percentage pti is defined as in Eq.(5). pti =

7

ft i qti

(5)

8

If the lane use percentage pti corresponding to the flow reading is known, the station flow can be

9

estimated by Eq.(6). qˆti =

10

ft i pti

(6)

11

However, no parametric function describes the relationship between lane use percentage and lane

12

flow. Therefore, kernel regression is used to obtain the empirical relationship between lane flow

13

and use percentage.

14

In parametric regression with a single regressor, the equation takes the form y = F(x)+e

15

like Eq.(3), where F(x) is a smooth function with known shape. The m observations (xj, yj) (j =

16

1,2,3,…,m) are used to estimate the unknown parameters and the fitted value yˆ j is given by F(xj).

17

For any xi, one can obtain yˆi using the constructed parametric relationship F. Similarly, the

18

output of kernel regression is a relationship that can be used to estimate yˆ i based on a provided xi.

19

The difference lies in the fact that kernel regression does not make assumptions about the shape

20

of F(x) (Bowman & Azzalini, 1997).

21

Kernel regression's underlying logic is that every observation other than the point of

22

interest (xi, yi) is used to derive the fitted value yˆi and the observations with the most information

12 1

about F(xi), or yˆ i , should be those close to this point of interest (xi, yi) (Bowman & Azzalini,

2

1997; Takezawa, 2006). The closeness of an observation (xj, yj) (j = 1,2,3,…,m and j ≠ i) to the

3

observation in question (xi, yi) is measured by the distance on the horizontal axis |xj – xi|. The

4

immediate neighboring observations, with smaller distance |xj – xi|, contribute more to deciding

5

F(xi) than distant ones. A weight wij measures the magnitude of observation j 's contribution to

6

the fitted value F(xi). Subsequently, F(xi) is a weighted average of all y’s of the observations (xj,

7

yj) (j = 1,2,3,…,m and j ≠ i). Since the observations far from the observation of interest (xi, yi)

8

should receive little or no weight, a decreasing function is desired to assign weights to each

9

observation (xj, yj) based on the distance |xj – xi| (i ≠ j). After the weights for the value yj are

10

determined, F(xi) is computed as the weighted average of all the yj’s. After this process is

11

repeated for each observation, the shape of the function F(x) can be described by a series of

12

derived points (xi, F(xi)) for i = 1,2,3,…,m where m is the total number of observations. The

13

choice of the weighting function should not influence the relationship captured by the kernel

14

regression. Therefore, an additional parameter bandwidth (h), rescales the horizontal distance to

15

minimize the effect of different weighting function choices. The ultimate measure of distance,

16

used to decide an observation's contribution to the calculation of fitted value, is the horizontal

17

distance scaled by bandwidth, i.e. |xj – xi|/h.

18

We treat individual detectors separately by constructing a kernel regression based

19

relationship between lane flow and use percentage for each of them. Suppose m pairs of lane

20

flow fj and use percentage pj (j = 1,2,3,…,m) for one particular detector are available in Base 1.

21

For a specific observation (fi, pi), a series of weights wij (j = 1,2,3,…,m and j ≠ i) are assigned to

22

each lane use percentage pj selected for this detector to determine fitted lane use percentage pˆ i

23

based on the distance measure | f j − fi |. This step is repeated for every pair of lane flow and use

13 1

percentage (fi,pi) for i = 1,2,3,…,m. The weighting function in this study is the one proposed by

2

Nadaraya (1964) and shown in Eq.(7). wij =

3

(f

K h ( fi − f j ) / h 



j =m j =1

  ( f i − f j ) / h 

(7)

− f j ) / h. Then K h ( u ) is a function of u, and h is a positive-

4

To simplify the notation, let= u

5

value bandwidth. The kernel function K h ( u ) used here is the standard Gaussian kernel (Eq.(8)).

Kh (u ) =

6 7 8

i

1 −u2 e 2π

(8)

The equation (Eq.(9)) proposed by Bowman and Azzalini (2005) is used to determine the optimal bandwidth for the standard Gaussian kernel. 1

9

hopt

 4 5 = ,  3m 

(9)

10

where m is the sample size of Base 1 for this particular detector. Using Eqs.(7-9), the fitted data

11

point ( fi , pˆ i ) can be found and the relationship between lane use percentage and flow F can be

12

constructed for this detector. By applying this procedure to all the detectors of a particular station,

13

the corresponding lane use percentage can be estimated for any lane flow through interpolation,

14

based on the relationship F.

15

Once the lane use percentage is determined, station flow is estimated using Eq.(6).

16

Similar to the LR-based spatial correction, the final estimate qˆtKRof station flow at time t, is the

17

arithmetic mean of all the estimates from individual functional detectors. Also, the relationship

18

between the lane flow and use percentage is treated separately for the HOV and regular periods.

19

4.4.Lane Distribution Correction (LD)

14 1

Smith and Conklin's (2002) method exploits both temporal and spatial information. A historical

2

lane use percentage pti is derived from Base 1, computed as the arithmetic mean of all available

3

lane use percentages for lane i at time t as in Eq.(10). d

pti =

4

∑ p ( j) j =1

i t

,

d

(10)

5

where d denotes the number of different days within Base 1 and pti ( j ) is the lane use percentage

6

of lane i at time t of day j, calculated using Eq. (5). This historical lane use percentage is used to

7

estimate the station flow as in Eq.(11).

qˆti =

8 9 10

fti pti

(11)

The final estimate of the station flow for time t is the arithmetic mean of all available estimates based on individual detectors (Eq.(12)).

qˆtLD =

11

∑i qˆti n

,

(12)

12

where n is the number of functional detectors at time t.

13

5. EXPERIMENTAL PROCEDURES

14

The experiment consists of systematic and random-error evaluations. The former assesses the

15

performance of each method under all possible error combinations and serves two purposes. First,

16

it guides the selection of individual detectors for aggregation to derive the final station flow

17

estimate for the kernel regression and lane distribution approaches. Second, it reveals the

18

methods' accuracies under different error configurations and aids the selection of the best method

19

when a specific error configuration is dominant. For n detectors in a station, the number of

20

possible error combinations is (2n – 2). The two excluded cases are (1) all detectors malfunction

21

and (2) all detectors function. Figure 3 visualizes the error combinations.

15 1

Insert Figure 3 Here

2

The systematic evaluation targets two periods (peak and off-peak). The selected peak

3

period is 7:00 to 9:00 AM while the off-peak period is 1:00 to 3:00 PM. For each error

4

combination, the flow readings of the desired detectors are treated as missing for the two-hour

5

duration.

6

The systematic evaluation consists of two stages corresponding to the two purposes

7

aforementioned. In the first (SysEval-I), the KR and LD approaches adopt all available station

8

flow estimates to derive the final estimate for a given time. The results are analyzed by

9

comparing the procedures' performances under different error scenarios to provide insights into

10

each detector's relative importance to the method's accuracy. These insights are used to modify

11

the LD and KR spatial methods by changing the detectors selected for aggregation. The second

12

systematic evaluation (SysEval-II) incorporates these modifications and compares the methods'

13

performances under different error configurations to identify the one that provides the best

14

performance. Detectors often malfunction for some time, resulting in missing readings for this

15

duration. The observations derived from SysEval-II provide practical guidance about the choice

16

of method when prolonged malfunctions occur.

17

Random error evaluation follows the systematic evaluation. The target day contains 5400

18

individual detector flow records (for Station 1) or 1080 station flows. Errors are randomly

19

introduced into the target dataset at different levels: 10, 20, 30, 40, and 50%. The 10% error

20

level means that 10% of the individual flow readings (i.e., 10%×5400 = 540 records) are

21

artificially set to missing values based on random numbers. Since the target dataset contains pre-

22

existing incorrect detector readings, the actual error percentage could be less than the specified

23

error level when the existing incorrect records are flagged as introduced error. For each error

16 1

level, 10 replications are conducted by revising the random number seed. Each replication hosts

2

a particular combination of error configurations defined in the systematic evaluation.

3

The correction methods' performances are compared using two widely-accepted measures

4

of effectiveness (MOEs): Mean Absolute Error (MAE) (Eq. (13)) and Mean Absolute Percentage

5

Error (MAPE) (Eq. (14)), which converts the absolute quantity to a relative one.

6

MAPE measure the average correction accuracy for the target dataset. = MAE

7

1 T ∑ | qt − qˆt |, T t =1

MAE and

(13)

8

where T represents the total number of records in the target dataset and qt is real station flow and

9

qˆt is the estimated station flow. 1 T | qt − qˆt | MAPE = ∑ T t =1 qt

10

(14)

11

Results of the random-error evaluation are analyzed and interpreted from several angles.

12

Each procedure's overall performance is contrasted at different error levels. The corrected value

13

is supplied by TC if the other three procedures (LR, KR, and LD) fail to produce an estimate of

14

the station flow for a given time. Such correction failures are assessed to provide insights to each

15

procedure's robustness. Finally, since the traffic's inherent random variation may not be captured

16

by aggregated MOEs, the overall performance measures (MAPEs) are disaggregated by time of

17

day to uncover temporal performance variations.

18

6. EXPERIMENTAL RESULTS

19

We constructed two sets of linear regression models using Base 1 before conducting the LR

20

correction. Since Base 1 includes readings from functional detectors for the three months prior to

21

the target day, it includes normal traffic fluctuations. The regression models for the HOV and

22

regular periods are shown in Table 1. At Station 1, the linear relationship between detector 609's

17 1

lane flow and the station flow for the HOV period is weak, as shown by the relatively low

2

adjusted R2 metric 0.01. The same findings apply for detectors 607 and 609 for the regular period,

3

which may be explained by detector 609's installation on the exit lane and detector 607 on the

4

adjacent lane. For the HOV period, the final estimate of Station 1's flow is the arithmetic mean of

5

available estimates from detectors 601, 603, 605 and 607. For the regular period, only the

6

available flow estimates based on detectors 601, 603, and 605 are used. All the linear regression

7

models for Station 2 enjoy reasonably good fit and are used in the final estimate.

8 9

Insert Table 1 Here

6.1. Systematic Evaluation

10

The results of the first systematic evaluation (SysEval-I) are shown in Figures 4 and 5 for Station

11

1 and Figures 6 and 7 for Station 2. SysEval-I helps identify the relative importance of lane

12

detectors to LD's and KR's accuracies.

13

Insert Figure 4 Here

14

Insert Figure 5 Here

15

Insert Figure 6 Here

16

Insert Figure 7 Here

17

6.1.1. SysEval-I Results for Station 1

18

The MOEs are not uniform within a scenario group of the same number of malfunctioning

19

detectors, suggesting that the location of malfunctioning detectors impacts the accuracy. (To

20

simplify the narration, detector 601 malfunctioning is represented by “d601-X”.) For the off-

21

peak period, TC (Figure 4's upper left panel) achieves lowest MAPE and MAE for combination

22

5 (d609-X, see Figure 3). For TC, the MAPE for combination 6 (d601-X&d603-X) is

23

significantly higher than that for combination 5. Information from detectors 601 and 603 play

18 1

important roles in TC and for all methods considered in this study. This observation is further

2

substantiated by combinations 26 and 27 exhibiting high MAPE for all methods; for these

3

combinations, detectors 601, 603, and 605 malfunction.

4

Another interesting comparison is among error combinations 4 (d607-X), 5 (d609-X), 15

5

(d607-X & d609-X), and 16 (d601-X, d603-X, & d605-X) in the off-peak period. For KR,

6

combination 15's MAPE is lower than combinations 4 and 5, indicating that including more

7

station flow estimates from individual detectors does not necessarily lead to more accurate final

8

station flow estimates. The addition of information from either detector 607 or 609, in the

9

absence of another detector, decreases KR's effectiveness. Additionally, combination 16's MAPE

10

is higher than that of combinations 4, 5, and 15, any of which has at least one functional detector

11

of the group (d601, d603, and d605). A similar pattern is observed for the peak period. The slight

12

difference lies in KR achieving the lowest MAPE (8.40%) for combination 25 (d605-X, d607-X,

13

& d609-X) for the off-peak period while the lowest MAPE 7.12% is achieved for combination

14

15 (d607-X & d609-X) during the peak period. The practical significance of this finding is that

15

an adaptation to the KR approach is needed. During the off-peak period, only available estimates

16

of station flow based on detectors 601 and 603 are used. During the peak period, detectors 601,

17

603, and 605 are used.

18

A similar pattern exists for LD (lower right panel in Figures 4 and 5). Combination 15's

19

MAPE is much lower than that for combination 16. The estimated station flows based on the two

20

right-most lanes are relatively inaccurate because of the exit lane, whose flow pattern is different

21

from those of its in-station counterparts. Based on this finding, the LD correction method is

22

revised to use only available estimates from detectors 601, 603, and 605 for both peak and off-

23

peak periods.

19 1

6.1.2. SysEval-I Results for Station 2

2

For Station 2, the comparison of interest involves error configurations 4, (d547-X), 10 (d545-X,

3

d547-X), 12 (d541-X, d543-X, d547-X), 13 (d541-X, d545-X, d547-X), and 14 (d543-X, d545-X

4

and d547-X). For the off-peak period (Figure 6's lower left panel), combination 4's MAPE is the

5

lowest and configuration 10's is slightly higher. The common feature of these two combinations

6

is the functional detectors 541 and 543, which are vital to KR's accuracy. The comparison among

7

error configurations 12, 13, and 14 lends additional credence. When both detectors 541 and 543

8

malfunction, the MAPE is higher than when at least one of them works properly (configurations

9

13 and 14). However, the peak period lacks this pattern. Hence, the aggregation step for the KR

10

approach uses station flow estimates from detectors 541, 543, and 545 for off-peak period and all

11

available estimates for peak-period estimation.

12

For LD, in the off-peak period, error configuration 7 (d541-X, d547-X) has the lowest

13

MAPE, suggesting the importance of detectors 543 and 545 for station flow estimation.

14

However, the peak period lacks this pattern. Therefore, the LD approach adopts only the

15

estimates from detectors 543 and 545 for the off-peak period but takes all available estimates for

16

the peak period.

17

The findings of the systematic evaluation for both stations for KR and LD suggest that

18

certain station flow estimates based on individual lane distribution can be inaccurate. The

19

fundamental reason is that the lane distribution varies according to multiple factors, such as

20

traffic composition and the number and location of access points (TRB, 2000). The systematic

21

evaluation highlights the impact of access/egress points and the facility characteristics (number

22

of lanes). Therefore, spatial correction approaches based on lane distribution need calibration and

23

adaptation before application.

20 1

6.1.3. SysEval-II Results for Station 1

2

The modified KR and LD approaches are evaluated in SysEval-II. The intent of SysEval-II is to

3

identify the best method for every error configuration. Tables 2 and 3 present the results for

4

Stations 1 and 2, respectively.

5

The improvement achieved by the modifications is confirmed by the drop of LD's

6

MAPEs for both the peak and off-peak periods and the decrease of KR's MAPEs in the off-peak

7

period.

8

The number of malfunctioning detectors influences TC's and LR's performance relatively

9

more deterministically than it does the other two approaches during the off-peak period.

10

Generally, the MOEs increase when the number of malfunctioning detectors increases from one

11

to four. However, this trend is not present for KR and LD, thus these two approaches are less

12

sensitive to the number of malfunctioning detectors during the off-peak period.

13

Insert Table 2 Here

14

Table 2 highlights the lowest MAPE for each error configuration and shows the MAPE

15

differences for the other three procedures. For example, TC has the lowest MAPE (5.42%) for

16

error combination 1 during the peak period. The MAPE difference is “+1.79%” for LR, or a total

17

MAPE of 7.21% (5.42%+1.79%). For the off-peak period, TC presents dominant performance

18

with a few exceptions. One is error combination 21, in which LD provides the lowest MAPE

19

(8.85%). LR performs nearly as well when TC is the best, with an average MAPE difference of

20

0.76% for the off-peak period. For the peak period, a similar pattern exists for scenarios in which

21

two or fewer detectors malfunction (before error combination 15). TC provides the most

22

accuracy with only one exception (combination 8). When the number of malfunctioning

23

detectors exceeds 2 (beyond error combination 15), KR is competitive for several error

21 1

configurations for the peak period. In contrast to the off-peak period, KR is the second best with

2

an average MAPE difference of 1.26% while the LR approach is third with an average MAPE

3

surplus of 1.59%. Hence, LR exhibits robustness across different error configurations due to its

4

competitive performance for both periods (less than 2% margin).

5

Error combinations exist for which all methods exhibit higher error rates. For the off-

6

peak period, combinations 17 and 18 result in MAPEs exceeding 13% for all procedures because

7

detectors 601 and 603, considered the most important, malfunction. For the peak period, the

8

similar combination is 22 (only detectors 601 and 609 function).

9

6.1.4. SysEval-II Results for Station 2

10

Distinct from the findings for Station 1, TC is not obviously dominant for Station 2. In the off-

11

peak period, when more than two detectors malfunction (after error configuration 4), the spatial

12

correction methods show obvious advantages. LR enjoys the lowest MAPE values for six

13

combinations. For the peak period, KR shows the best performance for six error combinations.

14

Insert Table 3 Here

15

The performance difference for Stations 1 and 2 can be attributed to the lane

16

configurations. Since Station 2 has no exit lane, the traffic is more consistent than Station 1,

17

which has one exit lane. This lane configuration distinction impacts the methods' performances,

18

which will be confirmed by the random-error evaluation.

19

6.2. Random-Error Evaluation

20

KR and LD refer to the modified versions. Figure 8 presents the MAE and MAPE averaged over

21

10 realizations for the four methods. The TC replacements when spatial correction fails due to no

22

functional detectors are not included in the MAPE calculations to guarantee a fair comparison

23

among the spatial correction methods.

22 1

Insert Figure 4 Here

2

6.2.1. Analysis of the Results of Random-Error Evaluation for Station 1

3

TC performs the best at all error levels for Station 1. At error level 10%, TC's MAPE is only

4

6.76% compared to LD's 15.17%, LR's 13.06%, and KR's 11.58%. Even at 50% error for Station

5

1, TC achieves a MAPE of 11.08%. Considering that the error percentage within a dataset

6

usually does not surpass 50%, TC provides reliable correction performance for non-incident

7

conditions. (In 2008, only 33 out of 314 days had error levels exceeding 50%).

8

LR maintains a relatively steady performance at all error levels. The MAPEs range from

9

13.06% to 15.09% while the ranges for the other three approaches each exceed 5%. This steady

10

trend is determined by LR's inherent characteristics. The linear relationship predicts the expected

11

station flow conditioned on an individual lane flow. The station flow can be estimated even if

12

only one lane flow is valid, so, even if the error level is high, LR's performance is stable.

13

However, the standard deviation of LR's MAPE is larger than those of KR and TC because LR

14

performs relatively poorly when traffic is light in early morning and late evening as

15

demonstrated in Section 6.3. The correction accuracy, in descending order, ranks TC, LR, KR,

16

and LD.

17

LD's performance is not as good as anticipated for Station 1. The lowest MAPE for error

18

level 10% is higher than that reported (less than 8%) in Smith and Conklin (2002). The

19

difference between the results can be ascribed to several reasons. The original study used 10-

20

minute aggregate data, which had less noise while our dataset has a higher resolution of one

21

minute. Station 1 has five lanes, whereas the original study had three (without an exit lane).

22

Though only information from the three left-most lanes is used, the traffic pattern can lead to

23

different results. Additionally, incident impacts cannot be ruled out completely due to possible

23 1

documentation inaccuracy. Since being free of incidents is required for the LD approach,

2

potential incident impacts may undermine its performance.

3

6.2.2. Random-Error Evaluation for Station 2

4

Similar to the findings for Station 1, TC outperforms the other three. However, the most obvious

5

distinction lies in LD's competitive performance, which is better than LR and KR. LD's MAPE at

6

the 10% error level is 9.43% and 11.60% for the 50% error level. LR gives MAPEs ranging from

7

10.38% for error level 10% to 13.34% for error level 50%. The rankings for correction accuracy

8

based on MAPE are TC, LD, KR, and LR. This ranking contradicts the one for Station 1 in

9

which LD has the largest MAPEs for all error levels. Traffic patterns due to different numbers of

10

lanes for the two stations plays an important role in the accuracy for the lane distribution

11

methods. LD performs better when the lanes are all general-purpose lanes, which exhibit more

12

homogenous traffic patterns than those with exit lanes. The three spatial correction methods

13

uniformly present better performance for Station 2 than they do for Station 1, which again

14

underscores the significance of lane configuration. This also accounts for the lower standard

15

deviation of LD's MAPE for Station 2 than for Station 1.

16

6.2.3. Correction Failure Rate for Random-Error Evaluation

17

As mentioned in Section 4, LR, KR, and LD require at least one functional detector. Therefore,

18

they are compared in terms of their correction failure rate. Correction failure indicates the

19

procedure fails to produce an estimate and defaults to the TC estimate. Figure 9 shows the 10-

20

replication average for the percentage of correction failure. For error levels 10% to 30%, all three

21

correction procedures enjoy failure rates less than 10% for both stations and approximately 6%

22

for LR and KR for both stations at the 40% error level, suggesting that they are robust under

23

most circumstances, considering that most datasets have less than 50% error.

24 1

LD's relatively high failure rate for Station 2 at the error levels 40% and 50% is due to

2

the station having only four detectors. Therefore, Station 2 is more likely to have a station

3

malfunction than Station 1, which has five detectors.

4

Insert Figure 9 Here

5

6.3. Method Performance by Time-of-Day

6

To examine time-of-day effects, the MAPEs are calculated for each hour of the target day. The

7

hourly MAPEs are averaged across the 10 random realizations. The results are visualized

8

through Figures 10 and 11. One or more detector malfunctions leads to unreliable station flow

9

for a given time, which calls for correction. The maximum possible station corrections made for

10

an hour is 60. (Random errors are distributed among the possible individual detector readings). A

11

general pattern shared by all random error levels for both stations is the relatively high early

12

morning and late evening MAPEs. Station 2's MAPEs fluctuate at a lower level due to overall

13

better performance (see Figure 8 and Section 6.2). Figures for Station 1 at two error levels (10%

14

and 40%) are shown and analyzed. The relationships for the 20 and 30% error levels closely

15

resemble the one for 10% and the 50% error level is similar to the 40% level.

16

Insert Figure 10 Here

17

Insert Figure 11 Here

18

The gap for 7:01 to 8:00 exists because the original dataset does not have correctly

19

reported flows and the performance measurements could not be calculated. The number of

20

corrections made for each hour is relatively steady. Hence, the temporal variation of performance

21

is due to detector malfunction combinations and possibly traffic flow variations instead of error

22

quantity.

23

At all error levels, TC generally outperforms the other procedures for both stations,

24

except for 15:00 to 16:00 for Station 1 at 40% error levels. In both figures for Station 1, LR,

25 1

followed by KR, achieves the lowest MAPE, indicating these approaches outperform TC under

2

certain circumstances. One example is the time 15:35:00 at which the real station flow is 2280

3

veh/hr and the working detectors for this time under one realization of the 40% error level are

4

detectors 605, 607, and 609. The TC estimate is 2979 veh/hr, derived from the individual lane

5

flows for detectors 601 and 603, which are 675 veh/hr and 1224 veh/hr while the observed

6

values are 420 veh/hr and 780 veh/hr, respectively. The predicted values overestimate the

7

readings, possibly because the flow for this particular time deviates from the past average at the

8

same time. The LR estimate is 2411veh/hr and KR's estimate is 2255 veh/hr in this realization,

9

which are good estimates. As the general trend suggests, TC performs the best in terms of

10

estimation accuracy though cases of lower accuracy occur when the flow deviates from the

11

historical value.

12

LR, KR, and LD's performances exhibit relatively high MAPE for 4:00 to 5:00 and

13

deteriorate for the duration after 19:00 for both stations. Selected individual data points between

14

these time durations at the 10% error level are more closely examined. Traffic conditions are

15

light; for example, the real flows at 4:10 AM are 0(d601), 0(d603), 300(d605), 0(d607), and

16

0(d609). In one realization, detector 603 malfunctions and the final station flow is based on at

17

least one zero flow reading (detector 601). LR's estimated station flow is 1221 veh/hr, which is

18

four times the real station flow. Similar cases are found for KR and LD after 20:00, suggesting

19

LR, KR, and LD may be inaccurate under light conditions especially when the estimates are

20

based on zero flow readings.

21

7. CONCLUSIONS

22

In this study, four correction procedures were examined for non-incident conditions. Temporal

23

correction exploited the inherent temporal trend of historical data. The spatial correction based

26 1

on linear regression, a modification of a previous approach, used the relationship between

2

individual detector flows and station flow. The proposed kernel regression spatial method had

3

the unique feature of incorporating lane use percentage into the correction process. Smith and

4

Conklin's (2002) method based on lane distribution was included as a benchmark.

5

To comprehensively compare the procedures, systematic and random-error evaluations

6

were conducted for two stations. After the results of systematic evaluation were analyzed, the

7

KR and LD approaches were modified. Individual lane flows provided by the detectors on

8

particular general purpose lanes produced more accurate estimates. The two procedures were

9

revised and their station flow estimates were compared to those of the TC and LR approaches at

10

five random error levels.

11

The two detector stations had distinct lane configurations. One station had five lanes,

12

including an exit lane and the other station had four general-purpose lanes. The lane

13

configuration had a significant impact on the spatial correction procedures' performances.

14

7.1. Summary of Findings and Practical Recommendations

15

Lane configurations must be considered when choosing a correction procedure. Traffic patterns

16

associated with the number and types of lanes influence the spatial methods' accuracies.

17

Homogenous lane configurations are more suitable for spatial methods, evidenced by improved

18

accuracy under random errors, compared to heterogeneous lane configurations. The spatial

19

methods are also more accurate when more than one detector malfunctions, based on the

20

systematic evaluation. However, when an exit lane is present, TC is generally more accurate.

21

Correction methods should be calibrated according to location-specific characteristics

22

since each detector has a different impact on accuracy. This is especially true for approaches

23

involving lane distribution. Careful selection of aggregation rule is warranted as shown by the

27 1

systematic evaluations. In this study, including the estimates from the exit and adjacent lanes

2

undermine KR's and LD's accuracies.

3

The distribution of error configurations can guide correction procedure selection.

4

Systematic evaluation indicates that TC is the most accurate for most configurations. However,

5

as the number of malfunctioning detectors increases, KR provides better results under certain

6

error combinations for the station with an exit lane. LR is robust under both lane configurations

7

and achieves accuracy levels consistent with the best approaches under various error

8

configurations; thus, it could be a viable choice for a dataset with diverse error configurations. If

9

one error configuration is dominant, the best approach corresponding to this configuration should

10

be chosen.

11

Time-of-day performance assessment confirms TC's superiority. Overall, this approach

12

outperforms the others at all random error levels, although sporadic cases exist in which other

13

methods are better. Associated with time-of-day are traffic conditions, which are important to the

14

spatial correction procedures, especially when valid observations of zero flow are incorporated

15

into an average. Thus, time-of-day and traffic conditions should be considered when selecting a

16

procedure.

17

In addition to performance-based suggestions, consideration should be given to practical

18

implementation. Though TC is generally the most accurate, it requires data archiving. One query

19

is necessary for each lane correction and may be time-consuming. The storage space and

20

processing time might become prohibitively expensive. When the size of the target dataset is

21

small or speed is not the top priority, TC would be ideal out of the procedures examined,

22

provided the dataset is incident free. When accuracy is not strictly demanded, correction speed is

28 1

an issue, or diverse error configurations are present, the linear regression-based method can be

2

used.

3

7.2. Future Directions

4

This study focuses on incident-free data. However, incidents occur frequently and evaluation and

5

adaptation of the methods for incident conditions remains an important research avenue. The

6

temporal correction procedure’s ability to handle flow deviations from the past average is

7

relatively insufficient and should be improved. This might be achieved by accommodating

8

information from temporally adjacent readings. Additionally, the correction of occupancy and

9

speed readings needs to be evaluated. The use of occupancy might improve spatial methods'

10

performances. Finally, the methods' performances will be assessed for datasets with longer

11

aggregation intervals.

12

ACKNOWLEDGEMENT

13

The authors thank MAUTC for funding the project of which this study is a part. The material in

14

this paper is not necessarily MAUTC's view; the authors remain solely responsible for the

15

content.

16

29 1

REFERENCES

2

Amin, M. R., & Banks, J. H. (2005). Variation in freeway lane use patterns with volume, time of

3 4 5 6 7 8 9

day, and location. Transportation Research Record, 1934, 132-139. Bikowitz, E. W., & Ross, S. P. (1985). Evaluation and Improvement of Inductive Loop Traffic Detectors. Transportation Research Record, 1010, 76-80. Bowman, A., & Azzalini, A. (1997). Applied Smoothing Techniques for Data Analysis. Oxford, New York: Claredon Press. Carter, M., Rakha, H., & Van Aerde, M. (1999). Variability of traffic-flow measures across freeway lanes. Canadian Journal of Civil Engineering, 26, 270-281.

10

Chen, C., Kwon, J., Rice, J., & Skabardonis, A. (2003). Detecting Errors and Imputing Missing

11

Data for Single Loop Surveillance Systems. Paper presented at the the 82nd Annual

12

Meeting of Transportation Research Board, Washington D.C.

13 14 15 16 17 18

Chen, L., & May, A. D. (1987). Traffic Detector Errors and Diagnostics. Transportation Research Record, 1132, 82-93. Cleghorn, D., Hall, F. L., & Garbuio, D. (1991). Improved Data Screening Techniques for Freeway Traffic Management System. Transportation Research Record, 1320, 17-23. Daganzo, C. (1997). Fundamentals of Transportation and Traffic Operations. Oxford, New York: Pergamon-Elsevier.

19

Fernandez-Moctezuma, R. J., Tufte, K. A., Maier, D., & Bertini, R. L. (2007). Toward

20

Management and Imputation of Unavailable Data in Online Advanced Traveler

21

Information Systems. Paper presented at the ITSC 2007, Seattle, Washington, USA.

22

FHWA. (1998). Developing Traveler Information Systems Using the National ITS Architecture.

23

Washington D.C: FHWA.

30 1

Han, L., Shuai, M., Xie, K., Song, G., & Ma, X. (2010). Locally kernel regression adapting with

2

data distribution in prediction of traffic flow. Paper presented at the 2010 18th

3

International Conference on Geoinformatics, Piscataway, NJ, USA.

4 5 6

Ishak, S. (2003). Fuzzy-Clustering Approach to Quantify Uncertainties of Freeway Detector Observations. Transportation Research Record, 1856, 6-15. Ishak, S., Kondagari, S., & Alecsandru, C. (2007). Probabilistic Data-Driven Approach for Real-

7

Time Screening of Freeway Traffic Data. Transportation Research Record, 2012, 94-104.

8

Jacobson, L. N., Nihan, N. L., & Bender, J. D. (1990). Detecting Erroneous Loop Detector Data

9

in a Freeway Traffic Management System. Transportation Research Record, 1287, 151-

10

166.

11

Kikuchi, S., & Milkovic, D. (1999). Method to preprocess observed traffic data for consistency.

12

Application of fuzzy optimization concept. Transportation Research Record,1679, 73-80.

13

Kwon, J., Chen, C., & Varaiya, P. (2004). Statistical methods for detecting spatial configuration

14

errors in traffic surveillance sensors. Transportation Research Record, 1870, 124-132.

15

Lee, M.-S., Cheong, S.-J., Choi, O.-J., & Meang, B.-Y. (2010). Implementation of a Data

16

Processing Method to Enhance the Quality and Support the What-If Analysis for Traffic

17

History Data The KIPS Transactions, Part D, D17(2), 87-102.

18 19

Maier, D., Tufte, K., & Fernandez-Moctezuma, R. J. (2009). Improving Travel Information Products via Robust Estimation Techniques. Portland, OR: OTREC.

20

May, A. (1990). Traffic Flow Fundamentals. Englewood Cliffs, New Jersey.

21

Nadaraya, E. A. (1964). On Estimating Regression. Theory of Probability and its Applications,

22

9(1), 141-142.

31 1

Nguyen, L., & Scherer, W. (2003). Imputation Techniques to Account for Missing Data in

2

Support of Intelligent Transportation Systems Applications Charlottesville, VA: Center

3

for Transportation Studies, University of Virginia.

4 5

Nihan, N. L. (1997). Aid to Determining Freeway Metering Rates and Detecting Loop Errors. Journal of Transportation Engineering, 123(6), 454-458.

6

Park, E. S., Turner, S., & Spiegelman, C. H. (2003). Empirical Approaches to Outlier Detection

7

in Intelligent Transportation Systems Data. Transportation Research Record, 1840, 21-30.

8 9

Payne, J. H., & Thompson, S. (2007). Malfunction Detection and Data Repair for InductionLoop Sensors Using I-880 Data Base. Transportation Research Record, 1570, 191-201.

10

Peeta, S., & Anastassopoulos, I. (2002). Automatic real-time detection and correction of

11

erroneous detector data with fourier transforms for online traffic control architectures.

12

Transportation Research Record, 1811, 1-11.

13 14 15 16 17 18

Pignataro, L. J. (1973). Traffic Engineering: Theory and Practice. Englewood Cliffs, N.J.: Prentice Hall, Inc. Schrank, D., Lomax, T., & Eisele, B. (2011). 2011 Urban Mobility Report. College Station, Texas: Texas Transportation Institute. Smith, B., & Conklin, J. (2002). Use of local lane distribution patterns to estimate missing data values from traffic monitoring systems. Transportation Research Record, 1811, 50-56.

19

Smith, B., & Venkatanarayana, R. (2005). Realizing the Promise of Intelligent Transportation

20

Systems (ITS) Data Archives. Journal of Intelligent Transportation Systems, 9(4), 175-

21

185.

32 1

Sun, S., & Chen, Q. (2008). Kernel regression with a Mahalanobis metric for short-term traffic

2

flow forecasting. Paper presented at the Intelligent Data Engineering and Automated

3

Learning. 9th International Conference, IDEAL 2008, Berlin, Germany.

4 5 6 7

Takezawa, K. (2006). Introduction to nonparametric regression Hoboken, N.J.: WileyInterscience. Toppen, A., & Wunderlich, K. (2003). Travel Time Data Collection for Measurement of Advanced Traveler Information Systems Accuracy. Falls Church, VA: FHWA.

8

TRB. (2000). Highway Capacity Manual. Washington D.C: Transportation Research Board.

9

Turner, S. (2004). Defining and measuring traffic data quality: White paper on recommended

10 11 12 13 14 15 16

approaches. Transportation Research Record, 1870, 62-69. Vanajakshi, L., & Rilett, L. R. (2004). Loop detector data diagnostics based on conservation-ofvehicles principle. Transportation Research Record, 1870, 162-169. Weijermars, W. A. M., & Van Berkum, E. C. (2006). Detection of invalid loop detector data in urban areas. Transportation Research Record, 1945, 82-88.

33 1

Figure Caption List

2

Figure 5 Station Location and Detector Configuration

3

Figure 6 Station Flow Profiles for 4:00 AM to 22:00 PM, 04/22/2008

4

Figure 7 Error Configurations for Systematic Evaluation

5

Figure 8 Results of SysEval-I for the Off-peak Period for Station 1

6

Figure 9 Results of SysEval-I for the Peak Period for Station 1

7

Figure 10 Results of SysEval-I for the Off-peak Period for Station 2

8

Figure 11 Results of SysEval-I for the Peak Period for Station 2

9

Figure 12 Comparison between Correction Methods

10

Figure 13 Percentage of Correction Failures (Averaged over 10 Realizations)

11

Figure 14 MAPE by Time-of-Day for Station 1 at Error Level 10%

12

Figure 15 MAPE by Time-of-Day for Station 1 at Error Level 40%

13

34 1

Tables

2

Table 4 Linear Regression Models for Station 1 and Station 2 HOV Period (5:30 AM to 9:30 AM) Station 1

Station 2 2

Detector

Model

R

Detector

Model

R2

601 (HOV)

HOV HOV qˆ601 =1.96 × f 601 + 3670.97

0.45

541 (HOV)

HOV HOV qˆ541 =1.89 × f541 + 3486.23

0.40

603 (GP)

HOV HOV qˆ603 =1.97 × f 601 + 2212.81

0.63

543 (GP)

HOV HOV qˆ543 =1.95 × f543 + 2358.62

0.63

605 (GP)

HOV HOV qˆ605 =2.73 × f 605 + 1878.34

0.69

545 (GP)

HOV HOV qˆ545 =1.95 × f545 + 2358.62

0.71

607 (GP)

HOV HOV qˆ607 =2.67 × f 607 + 3248.15

0.52

547 (GP)

HOV HOV qˆ547 =2.60 × f547 + 2151.15

0.54

609 (GP)

HOV HOV qˆ609 =0.01 × f 609 + 5840.27

0.01

Regular Period (4:00 AM to 5:30 & 9:30 AM to 10:00 PM) Station 1

3 4

Station 2 2

Detector

Model

R

Detector

Model

R2

601 (GP)

REG REG qˆ601 = 2.38 × f 601 + 1543.21

0.75

541 (GP)

REG REG qˆ541 = 2.39 × f541 + 1966.15

0.70

603 (GP)

REG REG qˆ603 = 2.84 × f 603 − 65.14

0.85

543 (GP)

REG REG qˆ543 = 2.67 × f543 + 214.45

0.85

605 (GP)

REG REG =3.15 × f 605 − 47.15 qˆ605

0.75

545 (GP)

REG REG qˆ545 =3.12 × f545 + 190.70

0.73

607 (GP)

REG REG =3.46 × f 607 + 2353.18 qˆ607

0.23

547 (GP)

REG REG qˆ547 =3.40 × f547 + 1320.98

0.53

609 (GP)

REG REG qˆ601 = 4.19 × f 601 + 2006.42

0.38

35 1

Table 5 The Results for SysEval-II for Station 1 Error Config.

2 3

MAPE (%) Peak (7:00 AM - 9:00 AM)

MAPE (%) Off-Peak (1:00 PM - 3:00 PM)

TC LR KR LD TC LR 1 5.42 +1.79 +1.61 +9.52 7.79 +1.19 2 5.63 +2.82 +2.85 +7.75 6.38 +2.17 3 3.12 +5.45 +4.66 +6.87 5.24 +3.16 4 5.62 +2.38 +1.31 +2.19 4.1 +2.88 5 3.9 +3.86 +3.03 +3.91 3.87 +3.11 6 7.2 +0.21 +0.95 +10.86 12.54 +0.68 7 7.01 +1.09 +1.02 +6.29 10.48 +1.43 8 +1.21 +0.34 7.03 +7.91 +0.75 +0.13 9 6.53 +0.68 +0.50 +8.41 +0.37 +0.13 10 7.47 +2.80 +5.59 +20.43 9.76 +2.75 11 5.79 +3.86 +2.69 +7.59 7.62 +0.93 12 6.62 +1.83 +1.86 +6.76 7.86 +0.69 13 5.58 +3.37 +2.20 +4.41 6.41 +1.99 14 4.89 +3.68 +2.89 +5.10 7.01 +1.39 15 +0.65 +1.07 6.93 +0.88 5.64 +1.34 16 9.20 +1.02 9.20(T) 9.20(T) 15.13 15.13(T) 17 8.15 +0.21 8.15(T) +9.91 0.73 13.22 18 +0.9 7.41 +0.74 +10.65 0.37 13.22 19 +0.78 +0.02 8.03 +5.27 11.71 +0.2 20 +0.01 +0.07 8.03 +5.27 11.72 +0.19 21 +2.45 +0.34 7.03 +7.91 +1.77 +0.13 22 7.3 +6.78 +5.76 +20.60 10.24 +2.27 23 8.13 +2.14 +4.93 +19.77 10.9 +1.61 24 7.95 +1.70 +0.53 +5.43 +0.28 8.55 25 7.42 +1.53 +0.36 +2.57 8.00 +0.4 26 9.45(T) 9.45(T) 9.45(T) 9.45(T) 16.19 16.19(T) 27 10.15 +0.07 10.15(T) 10.15(T) 16.31 16.31(T) 28 +2.01 +0.02 8.03 +5.27 +1.03 11.91 29 +1.9 +0.21 8.15 +9.91 +1.6 13.22 30 8.86 +5.22 +4.20 +19.04 11.6 +0.91 (T) indicates the temporal correction was used due to the correction failure

KR +4.42 +9.29 +5.01 +6.15 +6.38 12.54 +1.73 +3.36 +3.36 +5.91 +8.05 +7.81 +3.84 +3.24 +4.61 15.13(T) +0.73(T) +0.37(T) +0.5 +0.49 +3.36 +5.43 +4.77 +7.12 +2.25 16.19(T) 16.31(T) +0.3 +1.6 +4.07

LD +1.06 +4.55 +7.53 +2.55 +2.78 +1.06 +1.71 8.85 8.85 +16.6 +3.31 +3.07 +6.36 +5.76 +1.01 15.13(T) +0.38 +0.38 +0.48 +0.47 8.85 +16.12 +15.46 +2.38 +4.77 16.19(T) 16.31(T) +0.28 +0.38 +14.76

36 1

Table 6 The Results of SysEval-II for Station 2 Error Config.

2 3

MAPE (%) Peak (7:00 AM - 9:00 AM)

MAPE (%) Off-Peak (1:00 PM - 3:00 PM)

TC LR KR LD TC LR 1 +0.22 +2.68 +2.63 4.38 6.55 +0.34 2 +0.11 +1.17 +1.12 5.12 6.75 +0.28 3 4.06 +2.96 +2.76 +0.50 5.28 +0.73 4 4.26 +3.69 +3.48 +0.41 4.77 +0.77 5 +0.56 +0.03 6.55 +0.27 +2.19 9.37 6 7.08 +0.93 +0.88 +0.13 +1.8 8.45 7 +0.06 +2.62 +2.57 7.10 +1.98 +0.23 8 +0.24 +0.14 7.62 +1.28 +2.22 8.28 9 +0.29 +0.08 7.69 +0.79 +1.78 7.81 10 6.71 +2.07 +1.97 +1.02 +1.44 6.55 11 +0.6 +0.06 9.18 +3.45 +1.09 14.38 12 +0.31 +0.05 9.85 +1.75 +2.05 12.1 13 9.31 +3.58 +3.89 +3.13 +3.21 0.06 14 +0.22 +0.44 9.96 +6.98 +2.03 10.92 (T) indicates the temporal correction was used due to the correction failure

KR +0.95 +2.05 +1.67 +1.38 +3.00 +0.58 +0.64 +4.64 +0.99 +0.4 +1.09(T) +0.27 9.03 +2.00

LD +0.31 +5.58 +4.49 +2.09 +2.96 +1.32 6.86 +2.22(T) +4.52 +3.22 +1.09(T) +0.23 +0.74 +2.03(T)

37 1 2 3

4 5 6

Figures Figure 1

Map Data Source: ESRI.

38 1

2 3

Figure 2

39 1

2 3

Figure 3

40 1

2 3

Figure 4

41 1

2 3

Figure 5

42 1

2 3

Figure 6

43 1

2 3

Figure 7

44 1

2 3

Figure 8

45 1

2 3

Figure 9

46 1

2 3

Figure 10

47 1

2 3

Figure 11

Suggest Documents