Extracting activity patterns from taxi trajectory data: a two-layer framework using spatio-temporal clustering, Bayesian probability and Monte Carlo simulation

Shuhui Gong, John Cartlidge, Ruibin Bai, Yang Yue, Qingquan Li, Guoping Qiu

Research output: Journal Publication, Article, peer-review

33 Citations (Scopus)

Abstract

Global positioning system (GPS) data generated from taxi trips is a valuable source of information that offers an insight into travel behaviours of urban populations with high spatio-temporal resolution. However, in its raw form, GPS taxi data does not offer information on the purpose (or intended activity) of travel. In this context, to enhance the utility of taxi GPS data sets, we propose a two-layer framework to identify the related activities of each taxi trip automatically and estimate the return trips and successive activities after the trip, by using geographic point-of-interest (POI) data and a combination of spatio-temporal clustering, Bayesian inference and Monte Carlo simulation. Two million taxi trips in New York, the United States of America, and ten million taxi trips in Shenzhen, China, are used as inputs for the two-layer framework. To validate each layer of the framework, we collect 6,003 trip diaries in New York and 712 questionnaire surveys in Shenzhen. The results show that the first layer of the framework performs better than comparable methods published in the literature, while the second layer has high accuracy when inferring return trips.
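The abstract does not give the model itself, so the following is only a minimal sketch of how a Bayesian step and a Monte Carlo step could be combined to infer a trip's activity from nearby POI data. Everything in it is an assumption made for illustration: the function names `activity_posterior` and `simulate_activities`, the activity labels, the use of POI category shares as the likelihood, and the time-of-day prior values. It is not the authors' implementation.

```python
import random
from collections import Counter


def activity_posterior(nearby_poi_categories, time_prior):
    """Return P(activity | drop-off) for one trip.

    Assumed model (illustrative only):
      prior      = probability of each activity at the drop-off hour
      likelihood = share of nearby POIs belonging to that activity category
    """
    counts = Counter(nearby_poi_categories)
    total = sum(counts.values())
    if total == 0:
        # No POIs near the drop-off point: fall back to the prior.
        return dict(time_prior)

    unnormalised = {
        activity: prior * (counts.get(activity, 0) / total)
        for activity, prior in time_prior.items()
    }
    evidence = sum(unnormalised.values())
    if evidence == 0:
        # None of the nearby POIs match an activity with non-zero prior.
        return dict(time_prior)
    return {a: p / evidence for a, p in unnormalised.items()}


def simulate_activities(posterior, n_draws, seed=0):
    """Monte Carlo step: draw activities from the posterior n_draws times."""
    rng = random.Random(seed)
    activities = list(posterior)
    weights = [posterior[a] for a in activities]
    return Counter(rng.choices(activities, weights=weights, k=n_draws))


# Hypothetical inputs: POI categories found near one drop-off point,
# and an assumed prior over activities for that hour of the day.
pois = ["dining", "dining", "shopping", "work"]
prior = {"dining": 0.5, "shopping": 0.3, "work": 0.2}

posterior = activity_posterior(pois, prior)
draws = simulate_activities(posterior, n_draws=1000)
```

With these made-up inputs, "dining" receives the largest posterior probability because it is favoured both by the prior and by the POI mix. Sampling from the posterior, rather than always taking the most likely activity, keeps the uncertainty of each trip when many trips are aggregated.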

Original language: English
Pages (from-to): 1210-1234
Number of pages: 25
Journal: International Journal of Geographical Information Science
Volume: 34
Issue number: 6
Publication status: Published - 2 Jun 2020

Keywords

  • Bayesian probabilities
  • Monte Carlo simulation
  • Spatio-temporal clustering
  • travel behaviours

ASJC Scopus subject areas

  • Information Systems
  • Geography, Planning and Development
  • Library and Information Sciences
