IMPROVEMENT OF OD ESTIMATION BASED ON

Advanced OR and AI Methods in Transportation

IMPROVEMENT OF OD ESTIMATION BASED ON DISAGGREGATED FLOW INFORMATION Bernhard FRIEDRICH1, Yun-Pang WANG 1

Abstract. In this paper the redundant information issue of OD estimation models, based on information minimization (IM) and entropy maximization (EM) theories, is briefly reviewed and examined. A systematic approach is proposed to eliminate the redundant information from the flow data at intersection level. The result shows that with the proposed elimination method, the IM model provides more accurate OD estimates than the EM model, not only when the available information for the route choice proportions is by 100% correct, but also when these calculated proportions are not precise.

1. Introduction Due to high manpower cost and extensive time consumption for surveys many models with the easily available link flows information for updating travel demand in congested and uncongested networks are proposed in the last thirty years. Most of these models are applied with the flow information at the network level and use the available link flows, not the turning flows, as constraints to estimate the most likely OD matrix. With respect to new developments in the area of data fusion turning flow information becomes available by fusing detector data with data for the signal timing. Matschke et al [2] has, based on the IM model, examined the influence of turning flow information on the quality of OD estimation and concluded that the more information for link flows and turning flows is available, the better the OD estimates will be. The OD estimates based on turning flows are more accurate than those only using link flows, since the flow constraints used in estimation models can be more specifically defined with the disaggregated flow information. The results also show that the estimated error exists even when estimating the OD matrix with the completely correct information of route choice proportions for all links, including turning movements. This indicates that the redundant information among links at intersections and sections reduces the accuracy of OD estimates. Redundant information (R.I.) exists in a set of links, in which some link flows are linearly dependent on the flows 1

Institute for Transport, Road Engineering and Planning, University of Hannover, Appelstr. 9A, 30167 Hannover, Germany, {friedrich, wang}@ivh.uni-hannover.de

Improvement of OD estimation based on disaggregated flow information…

227

of the other links within the set. Therefore some rout choice proportions for link flows are also linearly dependent. Many numerical methods can be applied for searching the linear dependency, such as Gram-Schmidt orthogonalization, which is suggested by Bell [1], Van Zuylen and Branston [3] and Gaussian elimination. However, with the application of more disaggregated traffic data, such as turning flows, the combined information from turning flows is also linearly dependent on the upstream link flows. Thus the R.I. cannot be fully eliminated by only applying the aforementioned numerical methods. The contribution of each flow information to OD estimation is also different, since the information from turning flows can provide more specific conditions than that from link flows. Therefore some priority rules need to be developed to eliminate the R.I. and retaining the important information from the traffic data at intersection level. This paper focuses on the influence of R.I. on OD estimation when applying IM and EM models and the elimination of R.I. by using a systematic approach.

2. Issue of redundant information in the IM model 2.1.

Improvement by Van Zuylen

The IM model, based on the information measure of Brillouin, is originally proposed by Van Zuylen [4]. Its mathematical form is defined as Tij = tij ⋅ ∏ X a a

pija

gij

a , gij = ∑ pij , subject to a

∑T ij

ij

⋅ pija = Vaobs

(1)

Where Tij = the estimated OD flow from origin i to destination j tij = the historic OD flow or a prior guess of the OD flow from origin i to destination j paij= the route choice proportion of OD flow from origin i to destination j passing link a X an+1 = X an ×

Vaobs Vas

, the adjustment factor for link a by using the ratio of the observed

traffic flow Vaobs and estimated traffic flows Vas for link a in iteration n. In equation (1) the paij information is obtained with a given cost function and the incremental assignment assumption, in which the congestion effect in networks is not prominent. Parameter Xa for link a, for which traffic flow data is available, is iteratively estimated with the given historical OD matrix and paij information. The iteration continues until the difference between the estimated and the observed traffic flows converges. By applying this model all paij information should be independent, based on the definition of information measure of Brillouin, and the most likely matrix will be estimated by adding up all information for all links. It can be achieved with the aforementioned numerical methods, if the OD estimation is based on link flows, but not on turning flows. Besides Van Zuylen [5] found out that when the paij information on all links are not completely known, the known information will be overweighted in the estimation process. It is due to the factor gij in the equation (1) and the estimation result is therefore not stable. He proposed an alternative to avoid the overweight influence on the accuracy of OD estimation by revising the original EM model [4]. He has first examined the EM model with a small network and showed that the accuracy of OD estimates is not affected by the redundant information.

228

B. Friedrich, Y.-P. Wang

However, this model is sensitive to the given total number of trips in the a priori matrix and the correct quantity of all trips thus needs to be known as a requirement. To continue applying it without the abovementioned requirement Van Zuylen replaced the assumption of the equal probability of realisation of a trip from i to j with the proportion x0 of the a priori tij to the sum of the historic matrix. The modified model becomes: Tij = tij ⋅ x0 ∏ xa a

where x 0 =

1 ⋅ ∑ T , and x0 = ∑ tij ij ij ij

pija

, subject to

1

∑∑ pija tij a

ij

∑T

⋅ pija = Vaobs

ij

ij

(2)

will be set as the initial value based ⋅ ∑ vobs a a

on the available link flows. In this modified model two parameters x0 and paij will be solved with the satisfaction of the link flow constraints in an iterative process. Due to that, the total number of trips in the prior matrix becomes irrelevant with OD estimates, since it can be adjusted by solving the adjustment parameter x0 from the observed link flows.

2.2.

Drawback of the modified EM model 0

destination

6

In the research from Van Zuylen, the traffic count data is considered at network level and all paij information is assumed to be known with 100% accuracy. However, that assumption is not easy to be achieved in the reality. Besides, it is difficult to obtain the real paij information and thus often estimated by assigning a historical matrix onto the network in practice. The estimated paij information on all links can thus be obtained and the overweight problem in the IM model is solved, but not the redundant information problem. The test network at intersectional level from Matschke et al [2], shown in figure 1, is applied in this study to examine the results from equation (2) when the 3 : node number 300 : Length of the link (m) estimated error of paij exists. To achieve a better result of the assignment the available detected Figure 1. Layout of test network 1 link counts are treated as link capacities in the test. Root mean square error (RMSE) is used as an index for the accuracy of OD estimates. The results in table 1 show that the error of OD estimates, based on equation (1), exists even with correct paij information due to the R.I. The redundant links cannot also be identified. In contrast there is no noise from redundant information by applying equation (2) and it is easy to distinguish the links, having no contribution on OD estimation, such as link 32 in scenarios 4 and 5. However, when paij information is estimated, equation (2) can not estimate as accurate as equation (1) in all scenarios. This is since each paij does not only contribute the proportion paij/gij, used in equation (1), but the amount of paij. The estimated error from each paij will thus be cumulated and worsen the result. In addition the results show that equation (2) is not able to eliminate the R.I. and the redundant links then can no 7

5

4

10

origin

8

11

31

12

16

9

13

17

14

18

15

19

37

28

30

32

29

35

3

25

21

26

22

27

23

24

2

20

36

1

1

33

: linknum be r

34

1 2

3 4 sum

1

-

2

3

4

sum

100 200 150

450

200 150 - 150 50 100 50 300 350 350 450

500 200 1450

50

-

100

150

300


229

longer be distinguished. Furthermore new constraints for the in- and outbound flows for small OD flows are required when applying equation (2), since the factor x0, derived from the detected traffic counts, will be applied as an initial estimated OD flow for each OD pair. If a great difference among the OD flows exists, the small OD flows will be significantly underestimated when the information about their in- and outbound flows is not available. Thus it is apparent that equation (1) can easily provide better OD estimates when redundant information among links does not exist, as compared to equation (2), where it requires relative accurate paij information and the in- and outbound flows for small OD flows.

*

Based on equation (1) exact paij estimated paij

Based on equation (2) exact paij estimated paij

Based on equation (1)** exact paij estimated paij

RMSE OD*** RMSE OD*** RMSE OD*** RMSE OD*** RMSE OD*** RMSE OD*** S1 0.22 0.00% 6.55 -3.40% 0.00 0.00% 19.06 -8.82% 0.00 0.00% 4.90 -4.18% S2 22.14 0.59% 22.72 -0.47% 20.90 0.00% 23.64 -2.16% 20.39 0.00% 18.44 -1.29% S3 0.38 -0.08% 12.93 -4.87% 0.00 0.00% 19.89 -5.85% 0.00 0.00% 11.35 -3.52% S4 15.66 0.49% 17.27 -0.47% 17.72 0.00% 20.66 -2.33% 15.71 0.00% 17.39 -1.34% S5 15.65 0.46% 17.14 -0.62% 17.72 0.00% 20.69 -2.50% 15.71 0.00% 17.21 -1.56% S6 27.62 -0.08% 28.72 -0.40% 20.90 0,00% 23.24 -0,00% 20.39 0.00% 18.25 -0.47% S7 0.23 0.00% 10.83 -3.68% 0.00 0.00% 19.79 -5.81% 0.00 0.00% 11.35 -3.52%

*: S1: all flows detected; S2: only link flows detected; S3: only turning flows detected; S4: S2 with detected turning flow for links 17, 21, 25; S5: S2 with detected turning flow for links 17, 21, 25, 32; S6: S2 without link flow for link 37; S7: S3 with inbound link flows from all origins. **: with the elimination of redundant information; ***: the difference in percent.

Table 1. Comparison of results estimated by equations (1) and (2) in test network 1

3. Method of eliminating redundant information The proposed approach is based on the structure of the network, shown in figure 2, and the availability of detected traffic flows. In networks linear dependent links essentially consist of dummy links and the upstream links of turning links, when turning links are an explicit part of the network structure. Two main groups are therefore first classified, i.e. “dummy links” and “upstream links of turning movements”, when developing the rules. Subsequently the order to eliminate the dependent paij information depends on the availability of detected traffic counts, since they are the constraints in IM model for searching the most likely solution. The paij information of the links, for which traffic counts are available, will be maintained with first priority, since only the links with traffic counts will be considered as constraints and affect the solution of OD estimation. If the traffic counts for turning movements and for their upstream links are all available, only the information of paij and traffic counts for the turning links will be retained, since they can provide more specific boundary constraints. In addition since in equation (1) gij is the sum of all paij for all links, the redundant paij information needs be eliminated, not only for those linear dependent links with traffic counts but also for those without traffic counts. If there is no information of traffic count for both turning links and their upstream links, these links will not be treated as constraints in IM model and therefore either the paij information of turning links or that of their upstream links should be removed.

230

B. Friedrich, Y.-P. Wang

n1

3

n2

a1

n3

a2

n2

a2

n3

n1

a1

Zone

n2

1

2

a 3

a

a 4

n2 5

a3

n6

2

a

n1

a1 n1

a3 n3 n4

a1

2

n3

n5

a

1

Zone

a

a2 n4

n3

a4

n5

4

n3

a1

n1 n2

a2 n4 3

n2

a

a3

n4

a

n2

1

n3

n2

a2

a

n1

a

n1

a1

(a) Structure of dummy links

a

n5

4

n1

1

a

n4

n4

a

a4 1

n5

a5

n6

a2

n2 n3

n4

a

3

a4

n5

a5

n6

n1

(b) Link structure containing duplicate information Figure 2. General link structure in networks

4. Test results The proposed rules are programmed using C language and applied with two test networks, shown in figures 1 and 3. The test network 2 is a part of the real network in the List area in Hannover, Germany. The result in table 1 indicates that equation (1) with the elimination of R.I. can not only distinguish the redundant links, such as link 37 in scenarios 2 and 8, but also provide the correct total OD flows and more accurate OD estimates than equation (2) in all scenarios, when the available paij information is 100% accurate. If the estimated errors of paij for all links exist, the estimation results from equation (1) without R.I. in all scenarios are more accurate than those from equation (2). It is due to the less influence of the estimated paij error and no influence from R.I. When analysing with test network 2, the traffic flows are based on the real detected data and reproduced by the assignment in VISUM, a macroscopic simulation software. The assigned traffic flows from VISUM are used as the detected data for the links without detectors. Since traffic flows on some links are zero, they are treated as no detected flow exists for those links and the capacities, speeds and distances of the respective links are limited in order to obtaining more accurate paij values. Table 2 shows the consistent results to support the findings with test network 1. scenarios Based on equation (2) Based on equation (1)** all flows detected 313,95 306,68 only link flows detected 315,50 306,82 only turning flows detected 116,29 107,14 link and turning flows Partially detected 133,86 128,14

*: the total OD flow in the network is 58,144 vehicles; ** : with the elimination of redundant information

Table 2. Comparison of results estimated by equations (1) and (2) in test network 2


231

Figure 3. Layout of test network 2

5. Conclusions The computational results on both test networks show that without redundant information the IM model can provide more accurate OD estimates than the modified EM model, not only when paij information is 100% correct, but also when the estimated errors of paij exist. For the future work the performance of the modified EM model with the proposed elimination rules should be investigated for the situation when not all Paijs are available in order to take the advantage of utilizing the floating car data for better estimation.

Reference [1] M. G. H. Bell. The estimation of an origin-destination matrix from traffic counts. Transportation Science, 17(2):198-217, May 1983. [2] I. Matschke, K. Heinig, B. Friedrich. Data fusion technique in the context of traffic state estimation. Proceedings of the Triennial Symposium on Transportation Analysis TRISTAN V, Le Gosier, Guadeloupe, June 13-18, 2004 [3] H. J. Van Zuylen, D. M. Branston, Consistent link flow estimation from counts. Transportation Research E, 16B(6):473-176, 1982. [4] H. J. Van Zuylen, L. G. Willumsen. The most likely trip matrix estimated from traffic counts. Transportation Research B, (14): 291-293, 1980. [5] H. J. Van Zuylen. Some improvements in the estimation of an OD matrix from traffic counts. Proceedings of the 8th International Symposium on Transportation and Traffic Theory, Toronto University, June 1981.

IMPROVEMENT OF OD ESTIMATION BASED ON

IMPROVEMENT OF OD ESTIMATION BASED ON

Suggest Documents

ESTIMATION OF OD MATRICES - CiteSeerX

OD Matrix Acquisition Based on Mobile Phone

Sensitivity enhancement of OD- and OD-CNT-based ...

Improvement of Precipitation Estimation Techniques

Improvement of Source Number Estimation

OD' 1D OD

EFFECT OF PROBLEM-BASED LEARNING ON IMPROVEMENT

Improvement of Tsunami Countermeasures Based on ...

Od

Real-Time Estimation of Transit OD Patterns and ...

Wavelet Estimation of THD Based on Hilbert

On Text-Based Estimation of Document Relevance

Parameter Estimation of Kumaraswamy Distribution Based on

Improvement of Young modulus estimation by ... - ScienceDirect.com

Quantitative estimation of bioturbation based on ...

online direction of arrival estimation based on

Estimation of Velocity Profile Based on Chiu's

AN IMPROVEMENT OVER REGRESSION METHOD OF ESTIMATION ...

On Software Maintenance Process Improvement Based ... - CiteSeerX

Improvement course on evidence-based healthcare ...

Improvement On Triangle Features Based Grouping

Dynamic OD-Matrix Estimation from Traffic Counts and Automated ...

Advanced traffic data for dynamic OD demand estimation - UPCommons

Confidence Intervals for OD Demand Estimation - Semantic Scholar