Abstract
Mobile sensing is becoming a popular approach for inferring patterns of activity and behavior to determine how they affect health and wellbeing. This data-driven approach has the potential to become a major tool in the field of epidemiology, aimed at determining the causes of disease in populations, as well as motivating behavior change. These sensing technologies are generating large datasets that demand significant processing and data management resources. Studies in mobile sensing for healthcare have motivated the creation of large, complex datasets with information opportunistically gathered from distributed sensors in mobile devices. In this chapter, we discuss some of the architectural challenges regarding data gathering in this distributed data-intensive environment such as the healthcare industry, as well as issues regarding the organization and sharing of the large amounts of data collected. Some of these issues include the heterogeneity of the devices, diversity of sensors used, and the need for data provenance when integrating datasets from diverse studies. We highlight that assessing data quality is of paramount importance for conducting longitudinal studies and building on historical knowledge as new data become available. Finally, we identify future research topics in the growing field of mobile sensing and its application to healthcare and wellbeing. We discuss aspects of data curation, data quality, and data provenance, and we provide suggestions on how these challenges could be addressed in the near future.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
A sensing campaign is a data collection campaign following a research protocol. A sensing campaign can involve tens or thousands of participants carrying one or more mobile devices, which opportunistically collect data from the user or her surroundings, or ask the user to carry out certain task (e.g., take a picture when feeling sad or answer a question).
- 2.
- 3.
References
Calatroni, A., Roggen, D., Tröster, G. (2011). Collection and curation of a large reference dataset for activity recognition. In Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on (pp. 30–35). IEEE.
Lichen, M. (2013). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.
Alcalá, J., Fernández, A., Luengo, J., Derrac, J., García, S., Sánchez, L., and Herrera, F. (2010). Keel data-mining software tool: Data set repository, integration of algorithms and experimental analysis framework. Journal of Multiple-Valued Logic and Soft Computing, 17(255–287).
Rücknagel, J., et al. Metadata Schema for the Description of Research Data Repositories: version 3.0, 29 p, DOI:http://doi.org/10.2312/re3.008.
White, Hollie C., et al. “The Dryad data repository: A Singapore framework metadata architecture in a DSpace environment.” Universitätsverlag Göttingen (2008): 157.
Data Observation Network for Earth (DataONE). Data Citation and Attribution; https://www.dataone.org/citing-dataone (accessed Sep. 2016).
Boose, E, A Ellison, L Osterweil, L Clarke, R Podorozhny, J Hadley, A Wise, and D Foster (2007) Ensuring reliable datasets for environmental models and forecasts. Ecological Informatics, 2(3):237–247.
Voss, EA., Makadia, R., Matcho, A., Ma, Q., Knoll, C., Scheme, M., DeFalco, FJ., Lonche, A., Zhu, V., Ryan, PB. (2015). Feasibility and utility of applications of the common data model to multiple, disparate observational health databases. Journal of the American Medical Informatics Association. 22(3):553–64.
Estrin D, Sim I. Health care delivery. Open mHealth architecture: an engine for health care innovation. Science 2010 Nov 5;330(6005):759–760.
Kumar, S., Abowd, G. D., Abraham, W. T., al’Absi, M., Beck, J. G., Chau, D. H., Genevan, D. (2015). Center of excellence for mobile sensor Data-to-Knowledge (MD2K). Journal of the American Medical Informatics Association, 22(6), 1137–1142.
Netzahualcóyotl Hernández, and Jesús Favela. Estimating the Perception of Physical Fatigue Among Older Adults Using Mobile Phones. Human Behavior Understanding. Volume 9277 of the series Lecture Notes in Computer Science. 978-3-319-24194-4.
Darien Miranda, Jesus Favela, Catalina Ibarra, Netzahualcóyotl Cruz. Naturalistic Enactment to Elicit and Recognize Caregiver State Anxiety. Journal of Medical Systems, 2016.
Tactiohealth. Retrieved at November 14th 2016, from http://www.tactiohealth.com/
Garmin. Retrieved at November 14th 2016, from http://www.garmin.com/en-US
Fitbit. Retrieved at November 14th 2016, from https://www.fitbit.com/mx
Owletcare. Retrieved at November 14th 2016, from http://www.owletcare.com/
Zephyrhealth. Retrieved at November 14th 2016, from https://zephyrhealth.com/
Withings. Retrieved at November 14th 2016, from http://www.withings.com/us/en/
Neumitra. Retrieved at November 14th 2016, from https://www.neumitra.com/
Jins-meme. Retrieved at November 14th 2016, from https://jins-meme.com/en/
Proteus. Retrieved at November 14th 2016, from http://www.proteus.com/
Ihealthlabs. Retrieved at November 14th 2016, from https://ihealthlabs.com/
ResearchKit framework. Retrieved 22/09/2016, from http://researchkit.org/
Aharon, N., Gardner, A., Sumter, C., Peatland, A.: Funf: Open Sensing Framework. http://funf.media.mit.edu (2011).
Hernández, N., Arnrich, B., Favela, J., Yavuz, GR., Demiray, B., Fontecha, J., Ersoy, C. mk-sense: An extensible platform to conduct multi-institutional mobile sensing campaigns. Ubiquitous Computing & Ambient Intelligence. Health, AAL, HCI, IoT, Smart Cities, Sensors & Security. UCAmI 2016 (Accepted for publication).
Gersch, M., Lindert, R., Hewing, M.: AAL-business models: Different Prospects for the Successful Implementation of Innovative Services in the Primary and Secondary Healthcare Market. In: AALIANCE Conference, Malaga, Spain (2010).
Walkonomics. Retrieved at November 14th 2016, from http://walkonomics.com
Walkscore. Retrieved at November 14th 2016, from http://walkscore.com
Daniele Quercia, Luca Maria Aiello, Rossano Schifanella, and Adam Davies. 2015. The Digital Life of Walkable Streets. In Proceedings of the 24th International Conference on World Wide Web (WWW ‘15). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, 875–884.
The Library. Data Management. Retrieved 22/09/2016, from http://libraries.ucsd.edu/services/data-curation/data-management
Van den Lynden, V., Corti, L., Willard, M. & Bishop, L. (2009). Managing and Sharing Data: A Best Practice Guide for Researchers. Retrieved 22/09/2016, from http://www.data-archive.ac.uk/media/2894/managingsharing.pdf
DCC Curation Lifecycle Model. Retrieved 22/08/2016, from http://www.dcc.ac.uk/resources/curation-lifecycle-model
Radebaugh, J. Understanding Metadata. National Information Standards Organization, 2004. 20 pp. ISBN: 978-1-880124-62-8.
Sweeney L, Crosas M, Bar-Sinai M. Sharing Sensitive Data with Confidence: The Datatags System. Technology Science. 2015101601. October 16, 2015. http://techscience.org/a/2015101601
Reiner Schlitzer. Oceanographic quality flag schemes and mappings between them. Alfred Wegener Institute for Polar and Marine Research. Retrieved 22/09/2016, from http://odv.awi.de/fileadmin/user_upload/odv/misc/ODV4_QualityFlagSets.pdf
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
Hernández, N., Castro, L.A., Favela, J., Michán, L., Arnrich, B. (2017). Data Quality in Mobile Sensing Datasets for Pervasive Healthcare. In: Khan, S., Zomaya, A., Abbas, A. (eds) Handbook of Large-Scale Distributed Computing in Smart Healthcare. Scalable Computing and Communications. Springer, Cham. https://doi.org/10.1007/978-3-319-58280-1_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-58280-1_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-58279-5
Online ISBN: 978-3-319-58280-1
eBook Packages: Computer ScienceComputer Science (R0)