TY - JOUR
T1 - Extracting activity patterns from taxi trajectory data
T2 - a two-layer framework using spatio-temporal clustering, Bayesian probability and Monte Carlo simulation
AU - Gong, Shuhui
AU - Cartlidge, John
AU - Bai, Ruibin
AU - Yue, Yang
AU - Li, Qingquan
AU - Qiu, Guoping
N1 - Publisher Copyright:
© 2019, © 2019 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2020/6/2
Y1 - 2020/6/2
N2 - Global positioning system (GPS) data generated from taxi trips is a valuable source of information that offers an insight into travel behaviours of urban populations with high spatio-temporal resolution. However, in its raw form, GPS taxi data does not offer information on the purpose (or intended activity) of travel. In this context, to enhance the utility of taxi GPS data sets, we propose a two-layer framework to identify the related activities of each taxi trip automatically and estimate the return trips and successive activities after the trip, by using geographic point-of-interest (POI) data and a combination of spatio-temporal clustering, Bayesian inference and Monte Carlo simulation. Two million taxi trips in New York, the United States of America, and ten million taxi trips in Shenzhen, China, are used as inputs for the two-layer framework. To validate each layer of the framework, we collect 6,003 trip diaries in New York and 712 questionnaire surveys in Shenzhen. The results show that the first layer of the framework performs better than comparable methods published in the literature, while the second layer has high accuracy when inferring return trips.
AB - Global positioning system (GPS) data generated from taxi trips is a valuable source of information that offers an insight into travel behaviours of urban populations with high spatio-temporal resolution. However, in its raw form, GPS taxi data does not offer information on the purpose (or intended activity) of travel. In this context, to enhance the utility of taxi GPS data sets, we propose a two-layer framework to identify the related activities of each taxi trip automatically and estimate the return trips and successive activities after the trip, by using geographic point-of-interest (POI) data and a combination of spatio-temporal clustering, Bayesian inference and Monte Carlo simulation. Two million taxi trips in New York, the United States of America, and ten million taxi trips in Shenzhen, China, are used as inputs for the two-layer framework. To validate each layer of the framework, we collect 6,003 trip diaries in New York and 712 questionnaire surveys in Shenzhen. The results show that the first layer of the framework performs better than comparable methods published in the literature, while the second layer has high accuracy when inferring return trips.
KW - Bayesian probabilities
KW - Monte Carlo simulation
KW - Spatio-temporal clustering
KW - travel behaviours
UR - http://www.scopus.com/inward/record.url?scp=85076096116&partnerID=8YFLogxK
U2 - 10.1080/13658816.2019.1641715
DO - 10.1080/13658816.2019.1641715
M3 - Article
AN - SCOPUS:85076096116
SN - 1365-8816
VL - 34
SP - 1210
EP - 1234
JO - International Journal of Geographical Information Science
JF - International Journal of Geographical Information Science
IS - 6
ER -