Understanding and Improving a GPS-based Taxi System

Understanding and Improving a GPS-based Taxi System Darshan Santani

Rajesh Krishna Balan

C. Jason Woodard

Singapore Management University {dsantani,rajesh,jwoodard}@smu.edu.sg

1. ABSTRACT In this work, we present an empirical analysis of a realworld taxi fleet system, based on GPS location data. Our current dataset records the movement of 6,230 taxis (3.6 million observations), as well as the time and location of 38,048 booking requests. Both datasets were collected over a 24-hour period in the city-state of Singapore. Each taxi reports its GPS coordinates, vehicle speed, and operating state at regular intervals. The main goals of our initial analysis were (a) to characterize the efficiency of the taxi system, and (b) to explore the sources of inefficiency, with a view to mitigating them.

1.1 Locations and Bookings Figure 1 shows a subset of the observed taxi movements (0.3% of the data set sampled uniformly at random) plotted by their geographic coordinates. We divided the city into four zones based on exogenously specified local parameters, e.g., the nature of households and relative density of business establishments. The points within each zone are shaded according to the density of bookings in that zone, with the highest shaded black.

Figure 1: Taxi Observations by Location and Booking Frequency of Zone

The figure helps to validate the accuracy of our GPS data, as the overall picture closely resembles a map of Singapore. It shows the major roads and highways, which, unlike the road networks typically considered in simulation studies, are not arranged in a grid. It also suggests patterns of traffic congestion that appear as differences in density, providing motivation for using GPS data to estimate realtime traffic flow in the future.

correlated. That is, when passengers are happy because of short waiting times, drivers are unhappy because of low occupancy, and vice versa. Our index of driver satisfaction is the ratio O/(O+A), where O = the number of minutes the taxi was occupied in the given zone and hour, and A = the number of minutes it was available. We call this ratio the occupancy rate. We measure passenger satisfaction by computing the median waiting time for a booking request using the following method: First, we measure the time lag between a booking request and the pickup that satisfies it. Second, we observe that, especially during peak periods, some booking requests may never be satisfied. From the customer’s perspective, this is equivalent to an infinite waiting time. As long as fewer than half of the booking requests in a zone-hour are unsatisfied, the median waiting time will be finite. Otherwise we arbitrarily define it as 20 minutes, which is five minutes longer than the longest waiting time for a successful pickup. Figure 2 shows the graph of occupancy rate and median waiting time in the given zone for each hour in the day. It reveals systematic differences by zone in the efficiency of taxi allocation. It also highlights situations in which either waiting time spikes without a similar rise in occupancy, or occupancy drops without a similar reduction in waiting time. In the former case, customers are worse off but drivers are no happier. In the latter case, drivers are worse off but customers are no happier. Overall, across all zones, waiting time and occupancy are positively correlated; i.e., longer waiting times (unhappy customers) usually indicates higher occupancy (happy drivers) and vice versa. Our initial analysis shows the power of our approach in identifying potential trouble spots in a simple and repeatable way. In the future, we plan to do the following: (a) collect a larger dataset, (b) develop mechanisms to improve system performance from multiple perspectives (operators, drivers, and passengers), and (c) test and refine our mechanisms in real situations.

1.2 Occupancy and Waiting time To reason about the efficiency of the taxi-dispatch system, we say that an efficiently balanced taxi system is one in which the satisfaction of passengers and drivers is inversely Figure 2: Occupancy Rate and Median Waiting Time by Zone and Hour

Understanding and Improving a GPS-based Taxi System Darshan Santani, Rajesh Krishna Balan, and C. Jason Woodard

Location Locationand andBookings Bookings

Objective Objective • To describe the spatio-temporal efficiency of a large GPS-equipped taxicab fleet.

Can GPS data be used to estimate real time traffic conditions?

• To identify potential hotspots characterized by unusually long wait times and/or coldspots with unusually low taxi occupancy. • To explore high level system characteristics from the different, possibly conflicting, perspectives of passengers, taxi drivers and system operators.

Summary SummaryStatistics Statistics Taxi Observations* by Location and Booking Frequency of Zone

Total Taxis

6230

Inactive Taxis

268

Total Taxi Observations

3,658,507

Erroneous Observations

20,039

Booking Calls

38,048

Parameters* GPS Updates per Hour (no.)

Mean 33.35

* Sampled Dataset (~10,000 observations)

Taxi TaxiBookings Bookings

Median 33.70

SD 9.48

Average time spent on the road (hrs)

17.99

18.38

4.18

Average Speed (km/h)

38.94

33.00

39.47

Street Pickups (no.)

21.1

21.0

8.50

Booking Pickups (no.)

4.6

4.0

3.64

Total Pickups (no.)

25.7

26.0

9.68

Average Occupancy Rate (%)

42.46

46.17

17.24

How can we develop mechanisms to disseminate local information to taxi drivers in real time?

* Per Taxi for Full Day

Occupancy Occupancy&&Waiting WaitingTime Time Zone 1: Central

Booking Frequency by second

Cruising CruisingBehavior Behavior Zone 2: Condo

How can the cruising behavior of taxi drivers be optimized to gain higher operating efficiency?

Zone 3: West

Zone 4: East

Occupancy Rate and Median Waiting Time by Zone and Hour

• Reveals systematic differences by zone in the efficiency of taxi allocation, as well as periods in which efficiency deviates positively or negatively from average. • Clearly highlights situations in which either waiting time spikes without a similar rise in occupancy, or occupancy drops without a clear reduction in waiting time. • Overall, waiting time and occupancy are positively correlated; i.e., longer wait times (unhappy customers) usually indicate higher occupancy (happy drivers) and vice-versa.

Image © 2008 DigitalGlobe © 2008 Tele Atlas © MapIt © 2007 Google™

Understanding and Improving a GPS-based Taxi System

Understanding and Improving a GPS-based Taxi System

Suggest Documents

Understanding and Improving Operating System ... - Semantic Scholar

Understanding and Improving Performance

A Understanding and Improving Computational Science Storage ...

A Understanding and Improving Computational ... - Argonne MCS

Understanding and improving communication and

Understanding and Improving a Modern SAT Solver

Understanding and improving textile recycling: a

Understanding and Improving Operating System Effects in Control ...

Investigating and improving student understanding

Understanding and Improving Groundwater ... - Hydrology.nl

reu UNDERSTANDING AND IMPROVING ...

Understanding and Improving Performance ...

centralised taxi service system (ctss) - SPAD [PDF]

centralised taxi service system (ctss) - SPAD

Improving understanding of microclimate heterogeneity within a ...

Spatio-temporal Efficiency in a Taxi Dispatch System - Rajesh Krishna ...

Understanding and improving transitions of older people: a user and ...

A Collaborative Multiagent Taxi-Dispatch System - Semantic Scholar

TACITUS: A Message Understanding System

Talent in the taxi: a model system for exploring expertise

A Geo-Aware Taxi Carrying Management System by Using Location ...

Understanding and Improving Faculty Professional Development in ...

Understanding and Improving Collaborative Management of ... - DOI.gov

reu UNDERSTANDING AND IMPROVING ... - D-Scholarship@Pitt