Abstract
With the widespread use of smart phones and mobile Internet, social network users have generated massive geo-tagged tweets, photos and videos to form lots of informative trajectories which reveal not only their spatio-temporal dynamics, but also their activities in the physical world. Existing spatial trajectory query studies mainly focus on analyzing the spatio-temporal properties of the users’ trajectories, while leaving the understanding of their activities largely untouched. In this paper, we incorporate the semantics of the activity information embedded in trajectories into query modelling and processing, with the aim of providing end users more informative and meaningful results. To this end, we propose a novel trajectory query that not only considers the spatio-temporal closeness but also, more importantly, leverages a proven technique in text mining field, probabilistic topic modelling, to capture the semantic relatedness of the activities between the data and query. To support efficient query processing, we design a hierarchical grid-based index by integrating the probabilistic topic distribution on the substructures of trajectories and their spatio-temporal extent at the corresponding level of the index hierarchy. This specialized structure enables a top-down search algorithm to traverse the index while pruning unqualified trajectories in spatial and topical dimensions simultaneously. The experimental results on real-world datasets demonstrate the good efficiency and scalability performance of the proposed indices and trajectory search methods.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Xiao X, Zheng Y, Luo Q, Xie X. Finding similar users using category-based location history. In Proc. the 18th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems, November 2010, pp.442-445.
Zheng Y, Xie X. Learning location correlation from GPS trajectories. In Proc. the 11th Int. Conference on Mobile Data Management, May 2010, pp.27-32.
Cao X, Cong G, Jensen C S. Mining significant semantic locations from GPS data. Proceedings of the VLDB Endowment, 2010, 3(1): 1009-1020.
Zheng Y, Zhang L, Xie X, Ma W Y. Mining interesting locations and travel sequences from GPS trajectories. In Proc. the 18th Int. Conference on World Wide Web, April 2009, pp.791-800.
Chen Z, Shen H T, Zhou X, Zheng Y, Xie X. Searching trajectories by locations: An efficiency study. In Proc. the 2010 ACM SIGMOD Int. Conference on Management of Data, June 2010, pp.255-266.
Xu J, Gao Y, Liu C, Zhao L, Ding Z. Efficient route search on hierarchical dynamic road networks. Distributed and Parallel Databases, 2015, 33(2): 227-252.
Dai J, Liu C, Xu J, Ding Z. On personalized and sequenced route planning. World Wide Web: Internet and Web Information Systems, 2016, 19(4): 679-705.
Xue A Y, Zhang R, Zheng Y, Xie X, Huang J, Xu Z. Destination prediction by sub-trajectory synthesis and privacy protection against such prediction. In Proc. the 29th Int. Conference on Data Engineering, April 2013, pp.254-265.
Zheng K, Shang S, Yuan N J, Yang Y. Towards efficient search for activity trajectories. In Proc. the 29th Int. Conference on Data Engineering, April 2013, pp.230-241.
Liu H, Xu J, Zheng K, Liu C, Du L, Wu X. Semantic-aware query processing for activity trajectories. In Proc. the 10th ACM Int. Conference on Web Search and Data Mining, February 2017, pp.283-292.
Blei D M, Ng A Y, Jordan M I. Latent Dirichlet allocation. Journal of Machine Learning Research, 2003, 3: 993-1022.
Jagadish H V, Ooi B C, Tan K L, Yu C, Zhang R. iDistance: An adaptive B+-tree based indexing method for nearest neighbor search. ACM Transactions on Database Systems, 2005, 30(2): 364-397.
Blei D M. Probabilistic topic models. Communications of the ACM, 2012, 55(4): 77-84.
Li J, Liu C, Yu J X, Chen Y, Sellis T, Culpepper J S. Personalized influential topic search via social network summarization. IEEE Transactions on Knowledge and Data Engineering, 2016, 28(7): 1820-1834.
Blei D M, Lafferty J D. Dynamic topic models. In Proc. the 23rd Int. Conference on Machine Learning, June 2006, pp.113-120.
Kim S, Smyth P. Hierarchical Dirichlet processes with random effects. In Proc. the 20th Annual Conference on Neural Information Processing Systems, December 2007, pp.697-704.
Du L, Buntine W L, Jin H. Sequential latent Dirichlet allocation: Discover underlying topic structures within a document. In Proc. the 10th IEEE International Conference on Data Mining, December 2010, pp.148-157.
Jiang H, Zhou R, Zhang L, Wang H, Zhang Y. A topic model based on Poisson decomposition. In Proc. the 2017 ACM Conference on Information and Knowledge Management, November 2017, pp.1489-1498.
Li B, Yang X, Zhou R, Wang B, Liu C, Zhang Y. An efficient method for high quality and cohesive topical phrase mining. IEEE Transactions on Knowledge and Data Engineering, 2019, 31(1): 120-137.
Li B, Yang X, Zhou R, Wang B, Liu C, Zhang Y. Sentence level topic models for associated topics extraction. World Wide Web: Internet and Web Information Systems: Special Issue on Web and Big Data, 2018, Article No. 7.
Liu Q, Ge Y, Li Z, Chen E, Xiong H. Personalized travel package recommendation. In Proc. the 11th IEEE Int. Conference on Data Mining, December 2011, pp.407-416.
Hu B, Jamali M, Ester M. Spatio-temporal topic modeling in mobile social media for location recommendation. In Proc. the 13th IEEE Int. Conference on Data Mining, December 2013, pp.1073-1078.
Yuan N J, Zheng Y, Xie X, Wang Y, Zheng K, Xiong H. Discovering urban functional zones using latent activity trajectories. IEEE Trans. Knowledge and Data Engineering, 2015, 27(3): 712-725.
Cong G, Jensen C S, Wu D. Efficient retrieval of the top-k most relevant spatial web objects. Proceedings of the VLDB Endowment, 2009, 2(1): 337-348.
Rocha-Junior J B, Gkorgkas O, Jonassen S, Nørvåg K. Efficient processing of top-k spatial keyword queries. In Proc. the 12th International Symposium on Spatial and Temporal Databases, August 2011, pp.205-222.
Zhang D, Chan C Y, Tan K L. Processing spatial keyword query as a top-k aggregation query. In Proc. the 37th Int. ACM SIGIR Conference on Research and Development in Information Retrieval, July 2014, pp.355-364.
de Felipe I, Hristidis V, Rishe N. Keyword search on spatial databases. In Proc. the 24th Int. Conference on Data Engineering, April 2008, pp.656-665.
Tao Y, Sheng C. Fast nearest neighbor search with keywords. IEEE Trans. Knowledge and Data Engineering, 2014, 26(4): 878-888.
Chen Y Y, Suel T, Markowetz A. Efficient query processing in geographic Web search engines. In Proc. the 2006 ACM SIGMOD International Conference on Management of Data, June 2006, pp.277-288.
Zhang C, Zhang Y, Zhang W, Lin X, Cheema M A, Wang X. Diversified spatial keyword search on road networks. In Proc. the 17th International Conference on Extending Database Technology, March 2014, pp.367-378.
Gao Y, Qin X, Zheng B, Chen G. Efficient reverse top-k Boolean spatial keyword queries on road networks. IEEE Trans. Knowledge and Data Engineering, 2015, 27(5): 1205-1218.
Luo S, Luo Y, Zhou S, Cong G, Guan J, Yong Z. Distributed spatial keyword querying on road networks. In Proc. the 17th International Conference on Extending Database Technology, March 2014, pp.235-246.
Zheng K, Zheng B, Xu J, Liu G, Liu A, Li Z. Popularity-aware spatial keyword search on activity trajectories. World Wide Web: Internet and Web Information Systems, 2017, 20(4): 749-773.
Cao X, Cong G, Jensen C S, Ooi B C. Collective spatial keyword querying. In Proc. the 2011 ACM SIGMOD International Conference on Management of Data, June 2011, pp.373-384.
Chen Q, Hu H, Xu J. Authenticating top-k queries in location-based services with confidentiality. Proceedings of the VLDB Endowment, 2013, 7(1): 49-60.
Li J, Liu C, Islam M S. Keyword-based correlated network computation over large social media. In Proc. the 30th IEEE International Conference on Data Engineering, March 2014, pp.268-279.
Wu D, Choi B, Xu J, Jensen C S. Authentication of moving top-k spatial keyword queries. IEEE Transactions on Knowledge and Data Engineering, 2015, 27(4): 922-935.
Guo L, Shao J, Aung H H, Tan K L. Efficient continuous top-k spatial keyword queries on road networks. GeoInformatica, 2015, 19(1): 29-60.
Qian Z, Xu J, Zheng K, Sun W, Li Z, Guo H. On efficient spatial keyword querying with semantics. In Proc. the 21st International Conference on Database Systems for Advanced Applications, April 2016, pp.149-164.
Qian Z, Xu J, Zheng K, Zhao P, Zhou X. Semantic-aware top-k spatial keyword queries. World Wide Web: Internet and Web Information Systems, 2018, 21(3): 573-594.
Zheng Y, Liu Y, Yuan J, Xie X. Urban computing with taxicabs. In Proc. the 13th Int. Conference on Ubiquitous Computing, September 2011, pp.89-98.
Xie M. EDS: A segment-based distance measure for subtrajectory similarity search. In Proc. the 2014 ACM SIGMOD International Conference on Management of Data, June 2014, pp.1609-1610.
Su H, Zheng K, Wang H, Huang J, Zhou X. Calibrating trajectory data for similarity-based analysis. In Proc. the 2013 ACM SIGMOD International Conference on Management of Data, June 2013, pp.833-844.
Xie X, Yiu M L, Cheng R, Lu H. Scalable evaluation of trajectory queries over imprecise location data. IEEE Transactions on Knowledge and Data Engineering, 2014, 26(8): 2029-2044.
Jiang W, Zhu J, Xu J, Li Z, Zhao P, Zhao L. A feature based method for trajectory dataset segmentation and profiling. World Wide Web: Internet and Web Information Systems, 2017, 20(1): 5-22.
Bogorny V, Kuijpers B, Alvares L O. ST-DMQL: A semantic trajectory data mining query language. International Journal of Geographical Information Science, 2009, 23(10): 1245-1276.
Alvares L O, Bogorny V, Kuijpers B, de Macêdo J A F, Moelans B, Vaisman A. A model for enriching trajectories with semantic geographical information. In Proc. the 15th ACM International Symposium on Geographic Information Systems, November 2007, Article No. 22.
Ying J J C, Lee W C, Weng T C, Tseng V S. Semantic trajectory mining for location prediction. In Proc. the 19th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems, November 2011, pp.34-43.
Leung K W T, Lee D L, Lee W C. CLR: A collaborative location recommendation framework based on co-clustering. In Proc. the 34th ACM SIGIR Conference on Research and Development in Information Retrieval, July 2011, pp.305-314.
Zheng V W, Zheng Y, Xie X, Yang Q. Collaborative location and activity recommendations with GPS history data. In Proc. the 19th Int. Conference on World Wide Web, April 2010, pp.1029-1038.
Shang S, Ding R, Yuan B, Xie K, Zheng K, Kalnis P. User oriented trajectory search for trip recommendation. In Proc. the 15th Int. Conference on Extending Database Technology, March 2012, pp.156-167.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
ESM 1
(PDF 552 kb)
Rights and permissions
About this article
Cite this article
Yin, LH., Liu, H. Searching Activity Trajectories with Semantics. J. Comput. Sci. Technol. 34, 775–794 (2019). https://doi.org/10.1007/s11390-019-1942-8
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-019-1942-8