Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3379174.3392315acmconferencesArticle/Chapter ViewAbstractPublication PagesicdarConference Proceedingsconference-collections
research-article

Residence and Workplace Recovery: User Privacy Risk in Mobility Data

Published: 08 June 2020 Publication History

Abstract

Mobility data has been collected through mobile devices and cellular networks used in academic research and commercial purposes for the last decade. Since releasing individual's mobility records or trajectories gives rise to privacy issues, datasets owners tend to only publish encrypted mobility data, which doesn't contains users' identification symbol like telephone number. However, we argue and prove that even publishing encrypted mobility data could lead to privacy problem, of which the critical problem is users' residence and workplace identification. We develop an attack system that is able to identify users' important locations by a semi-supervised learning model. In addition to traditional time features, our system takes the users' mobility and living patterns into consideration, which are important and affect each other. Our model demands for less ground truth labels and produces considerable improvement in learning accuracy. With large-scale factual mobile data and long-time tracking ground truth data captured from a big city, we reveal that our attack system is able to identify users' residence and workplace with accuracy about 98%, which indicates severe privacy leakage in such datasets. And we provide advice for this kind of privacy-preserving problem.

References

[1]
Daniel Ashbrook and Thad Starner. 2003. Using GPS to learn significant locations and predict movement across multiple users. Personal and Ubiquitous Computing, Vol. 7, 5 (2003), 275--286. https://doi.org/10.1007/s00779-003-0240-0
[2]
H. Cao, F. Xu, J. Sankaranarayanan, Y. Li, and H. Samet. 2019. Habit2vec: Trajectory Semantic Embedding for Living Pattern Recognition in Population. IEEE Transactions on Mobile Computing (2019), 1--1. https://doi.org/10.1109/TMC.2019.2902403
[3]
Xin Cao, Gao Cong, and Christian S. Jensen. 2010. Mining Significant Semantic Locations From GPS Data. PVLDB, Vol. 3, 1 (2010), 1009--1020. https://doi.org/10.14778/1920841.1920968
[4]
Deborah Falcone, Cecilia Mascolo, Carmela Comito, Domenico Talia, and Jon Crowcroft. 2014. What is this place? Inferring place categories through user patterns identification in geo-tagged tweets. In 6th International Conference on Mobile Computing, Applications and Services, MobiCASE 2014, Austin, TX, USA, November 6--7, 2014. 10--19. https://doi.org/10.4108/icst.mobicase.2014.257683
[5]
F.A. Gers, J. Schmidhuber, and F. Cummins. 1999. Learning to forget: continual prediction with LSTM. IET Conference Proceedings (January 1999), 850--855(5). https://digital-library.theiet.org/content/conferences/10.1049/cp_19991218
[6]
Geoffrey Hinton, Li Deng, Dong Yu, George Dahl, Abdel-rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Brian Kingsbury, and Tara Sainath. 2012. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups. IEEE Signal Process. Mag., Vol. 29, 6 (2012), 82--97. https://doi.org/10.1109/MSP.2012.2205597
[7]
Anil K. Jain. 2008. Data Clustering: 50 Years Beyond K-means. In Machine Learning and Knowledge Discovery in Databases, European Conference, ECML/PKDD 2008, Antwerp, Belgium, September 15--19, 2008, Proceedings, Part I. 3--4. https://doi.org/10.1007/978--3--540--87479--9_3
[8]
Jong Hee Kang, William Welbourne, Benjamin Stewart, and Gaetano Borriello. 2004. Extracting places from traces of locations. In Proceedings of the 2nd ACM International Workshop on Wireless Mobile Applications and Services on WLAN Hotspots, WMASH 2004, Philadelphia, PA, USA, October 1, 2004. 110--118. https://doi.org/10.1145/1024733.1024748
[9]
John Krumm. 2007. Inference Attacks on Location Tracks. In Pervasive Computing, 5th International Conference, PERVASIVE 2007, Toronto, Canada, May 13--16, 2007, Proceedings . 127--143. https://doi.org/10.1007/978--3--540--72037--9_8
[10]
John Krumm and Dany Rouhana. 2013. Placer: semantic place labels from diary data. In The 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing, UbiComp '13, Zurich, Switzerland, September 8--12, 2013. 163--172. https://doi.org/10.1145/2493432.2493504
[11]
Kevin S. Kung, Stanislav Sobolevsky, and Carlo Ratti. 2013. Exploring universal patterns in human home/work commuting from mobile phone data. CoRR, Vol. abs/1311.2911 (2013). arxiv: 1311.2911 http://arxiv.org/abs/1311.2911
[12]
Nicholas D. Lane, Emiliano Miluzzo, Hong Lu, Daniel Peebles, Tanzeem Choudhury, and Andrew T. Campbell. 2010. A survey of mobile phone sensing. IEEE Communications Magazine, Vol. 48, 9 (2010), 140--150. https://doi.org/10.1109/MCOM.2010.5560598
[13]
Lin Liao, Dieter Fox, and Henry A. Kautz. 2005. Location-Based Activity Recognition using Relational Markov Networks. In IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30 - August 5, 2005. 773--778. http://ijcai.org/Proceedings/05/Papers/1572.pdf
[14]
Ling Qi, Yuanyuan Qiao, Fehmi Ben Abdesslem, Zhanyu Ma, and Jie Yang. 2016. Oscillation Resolution for Massive Cell Phone Traffic Data. In Proceedings of the First Workshop on Mobile Data, MobiData@MobiSys 2016, Singapore, June 30, 2016. 25--30. https://doi.org/10.1145/2935755.2935759
[15]
Yuan Tian, Stephan Winter, and Jian Wang. 2019. Identifying residential and workplace locations from transit smart card data. Journal of Transport and Land Use, Vol. 12, 1 (2019). https://www.jtlu.org/index.php/jtlu/article/view/1247
[16]
Yequan Wang, Minlie Huang, Xiaoyan Zhu, and Li Zhao. 2016. Attention-based LS™ for Aspect-level Sentiment Classification. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, EMNLP 2016, Austin, Texas, USA, November 1--4, 2016. 606--615. http://aclweb.org/anthology/D/D16/D16--1058.pdf
[17]
Qiujun Wei, Jiangfeng She, Shuhua Zhang, and Jinsong Ma. 2018. Using Individual GPS Trajectories to Explore Foodscape Exposure: A Case Study in Beijing Metropolitan Area. International Journal of Environmental Research and Public Health, Vol. 15, 3 (2018). https://doi.org/10.3390/ijerph15030405
[18]
Di Yao, Chao Zhang, Zhihua Zhu, Jian-Hui Huang, and Jingping Bi. 2017. Trajectory clustering via deep representation learning. In 2017 International Joint Conference on Neural Networks, IJCNN 2017, Anchorage, AK, USA, May 14--19, 2017. 3880--3887. https://doi.org/10.1109/IJCNN.2017.7966345
[19]
Guan Yuan, Penghui Sun, Jie Zhao, Daxing Li, and Canwei Wang. 2017. A review of moving object trajectory clustering algorithms. Artif. Intell. Rev., Vol. 47, 1 (2017), 123--144. https://doi.org/10.1007/s10462-016--9477--7
[20]
Jing Yuan, Yu Zheng, and Xing Xie. 2012. Discovering regions of different functions in a city using human mobility and POIs. In The 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '12, Beijing, China, August 12--16, 2012. 186--194. https://doi.org/10.1145/2339530.2339561
[21]
Yu Zheng, Lizhu Zhang, Xing Xie, and Wei-Ying Ma. 2009. Mining interesting locations and travel sequences from GPS trajectories. In Proceedings of the 18th International Conference on World Wide Web, WWW 2009, Madrid, Spain, April 20--24, 2009. 791--800. https://doi.org/10.1145/1526709.1526816

Cited By

View all
  • (2021)A Multi-Task Sequential State Model for the Human Trajectory Data UnderstandingBig Data Research10.1016/j.bdr.2021.100220(100220)Online publication date: Mar-2021

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ICDAR '20: Proceedings of the 2020 ACM Workshop on Intelligent Cross-Data Analysis and Retrieval
June 2020
44 pages
ISBN:9781450375092
DOI:10.1145/3379174
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 June 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. location extract
  2. mobility mining
  3. privacy preserving
  4. residence identification
  5. semi-supervised learning

Qualifiers

  • Research-article

Conference

ICMR '20
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2021)A Multi-Task Sequential State Model for the Human Trajectory Data UnderstandingBig Data Research10.1016/j.bdr.2021.100220(100220)Online publication date: Mar-2021

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media