Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2531602.2531689acmconferencesArticle/Chapter ViewAbstractPublication PagescscwConference Proceedingsconference-collections
research-article

Capturing quality: retaining provenance for curated volunteer monitoring data

Published: 15 February 2014 Publication History

Abstract

The "real world" nature of field-based citizen science involves unique data management challenges that distinguish it from projects that involve only Internet-mediated activities. In particular, many data contribution and review practices are often accomplished "offline' via paper or general-purpose software like Excel. This can lead to integration challenges when attempting to implement project-specific ICT with full revision and provenance tracking. In this work, we explore some of the current challenges and opportunities in implementing ICT for managing volunteer monitoring data. Our two main contributions are: a general outline of the workflow tasks common to field-based data collection, and a novel data model for preserving provenance metadata that allows for ongoing data exchange between disparate technical systems and participant skill levels. We conclude with applications for other domains, such as hydrologic forecasting and crisis informatics, as well as directions for future research.

References

[1]
Cifelli, R., Doesken, N., Kennedy, P., Carey, L. D., Rutledge, S. A., Gimmestad, C., and Depue, T. The community collaborative rain, hail, and snow network: Informal education for scientists and citizens. Bulletin of the American Meteorological Society 86, 8 (2005), 1069--1077.
[2]
Federal Geographic Data Committee and others. FGDC-STD-001--1998. Content standard for digital geospatial metadata (1998).
[3]
Fegraus, E. H., Andelman, S., Jones, M. B., and Schildhauer, M. Maximizing the value of ecological data with structured metadata: An introduction to ecological metadata language (EML) and principles for metadata creation. Bulletin of the Ecological Society of America 86, 3 (2005), 158--168.
[4]
Firehock, K., and West, J. A brief history of volunteer biological water monitoring using macroinvertebrates. Journal of the North American Benthological Society 14, 1 (1995), 197--202.
[5]
Gil, Y., Miles, S., Belhajjame, K., Deus, H., Garijo, D., Klyne, G., Missier, P., Soiland-Reyes, S., and Zednik, S. A Primer for the PROV Provenance Model. W3C, 2012. http://www.w3.org/TR/prov-primer/.
[6]
Halfaker, A., Geiger, R. S., Morgan, J. T., and Riedl, J. The rise and decline of an open collaboration system: How Wikipedia's reaction to popularity is causing its decline. American Behavioral Scientist 57, 5 (2013), 664--688.
[7]
Hartung, C., Lerer, A., Anokwa, Y., Tseng, C., Brunette, W., and Borriello, G. Open Data Kit: Tools to build information services for developing regions. In Proceedings of the 4th ACM/IEEE International Conference on Information and Communication Technologies and Development, ACM (2010), 18.
[8]
Howe, J. The Rise of Crowdsourcing. Wired Magazine 14, 6 (2006), 1--4.
[9]
Juran, J. M. Quality control handbook. McGraw-Hill, 1962.
[10]
Kelling, S., Yu, J., Gerbracht, J., and Wong, W.-K. Emergent filters: Automated data verification in a large-scale citizen science project. In Proceedings of Workshops at the Seventh International Conference on eScience, IEEE (2011), 20--27.
[11]
Kim, S., Mankoff, J., and Paulos, E. Sensr: Evaluating a flexible framework for authoring mobile data-collection tools for citizen science. In Proceedings of the 2013 conference on Computer supported cooperative work, ACM (2013), 1453--1462.
[12]
Liu, J., and Ram, S. Who does what: Collaboration patterns in the Wikipedia and their impact on article quality. ACM Transactions on Management Information Systems (TMIS) 2, 2 (2011), 11.
[13]
Lukyanenko, R., Parsons, J., and Wiersma, Y. Citizen science 2.0: Data management principles to harness the power of the crowd. In Service-Oriented Perspectives in Design Science Research. Springer, 2011, 465--473.
[14]
Newman, G., Graham, J., Crall, A., and Laituri, M. The art and science of multi-scale citizen science support. Ecological Informatics 6, 3 (2011), 217--227.
[15]
Okolloh, O. Ushahidi, or 'testimony': Web 2.0 tools for crowdsourcing crisis information. Participatory Learning and Action 59, 1 (2009), 65--70.
[16]
Orlandi, F., and Passant, A. Modelling provenance of DBpedia resources using Wikipedia contributions. Web Semantics: Science, Services and Agents on the World Wide Web 9, 2 (2011), 149--164.
[17]
Priedhorsky, R., and Terveen, L. Wiki grows up: Arbitrary data models, access control, and beyond. In Proceedings of the Seventh International Symposium on Wikis and Open Collaboration, ACM (2011), 63--71.
[18]
Raddick, J., Lintott, C., Schawinski, K., Thomas, D., Nichol, R., Andreescu, D., Bamford, S., Land, K., Murray, P., Slosar, A., et al. Galaxy Zoo: An experiment in public science participation. In Bulletin of the American Astronomical Society, vol. 39 (2007), 892.
[19]
Ram, S., and Liu, J. Understanding the semantics of data provenance to support active conceptual modeling. In Active conceptual modeling of learning. Springer, 2007, 17--29.
[20]
Ribes, D., and Finholt, T. A. Representing community: Knowing users in the face of changing constituencies. In Proceedings of the 2008 ACM conference on Computer supported cooperative work, ACM (2008), 107--116.
[21]
Roth, M., and Tan, W.-C. Data integration and data exchange: It's really about time. In Proceedings of the 6th Biennial Conference on Innovative Data Systems Research, CIDR (2013).
[22]
Sheppard, S. A. wq: A modular framework for collecting, storing, and utilizing experiential VGI. In Proceedings of the 1st ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic Information, ACM (2012), 62--69.
[23]
Sheppard, S. A., and Terveen, L. Quality is a verb: The operationalization of data quality in a citizen science community. In Proceedings of the Seventh International Symposium on Wikis and Open Collaboration, ACM (2011), 29--38.
[24]
Smith, A. Smartphone adoption and usage. Tech. rep., Pew Internet & American Life Project, Washington, DC, 2011.
[25]
Stvilia, B., Twidale, M. B., Smith, L. C., and Gasser, L. Information quality work organization in Wikipedia. Journal of the American society for information science and technology 59, 6 (2008), 983--1001.
[26]
Sullivan, B. L., Wood, C. L., Iliff, M. J., Bonney, R. E., Fink, D., and Kelling, S. eBird: a citizen-based bird observation network in the biological sciences. Biological Conservation 142, 10 (Oct. 2009), 2282--2292.
[27]
Vrandečić, D., Ratnakar, V., Krötzsch, M., and Gil, Y. Shortipedia: Aggregating and curating Semantic Web data. Web Semantics: Science, Services and Agents on the World Wide Web 9, 3 (2011), 334--338.
[28]
Wang, Z., Dong, H., Kelly, M., Macklin, J. A., Morris, P. J., and Morris, R. A. Filtered-Push: A Map-Reduce platform for collaborative taxonomic data management. In Computer Science and Information Engineering, 2009 WRI World Congress on, vol. 3, IEEE (2009), 731--735.
[29]
Wieczorek, J., Bloom, D., Guralnick, R., Blum, S., Döring, M., Giovanni, R., Robertson, T., and Vieglais, D. Darwin Core: An evolving community-developed biodiversity data standard. PLoS One 7, 1 (2012), e29715.
[30]
Wiggins, A. Free as in puppies: Compensating for ICT constraints in citizen science. In Proceedings of the 2013 conference on Computer supported cooperative work, ACM (2013), 1469--1480.
[31]
Wiggins, A., Bonney, R., Graham, E., Henderson, S., Kelling, S., Littauer, R., LeBuhn, G., Lotts, K., Michener, W., Newman, G., Russell, E., Stevenson, R., and Weltzin, J. Data management guide for public participation in scientific research. DataONE, 2013.
[32]
Wiggins, A., and Crowston, K. From conservation to crowdsourcing: A typology of citizen science. In HICSS'11, IEEE Computer Society (2011), 1--10.
[33]
Wiggins, A., Newman, G., Stevenson, R. D., and Crowston, K. Mechanisms for data quality and validation in citizen science. In Proceedings of Workshops at the Seventh International Conference on eScience, IEEE (2011), 14--19.
[34]
Wilderman, C. C. Models of community science: Design lessons from the field. In Citizen Science Toolkit Conference (Cornell Laboratory of Ornithology, Ithaca, NY, 2007).

Cited By

View all
  • (2024)“I never realized sidewalks were a big deal”: A Case Study of a Community-Driven Sidewalk Accessibility Assessment using Project SidewalkProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642003(1-18)Online publication date: 11-May-2024
  • (2021)An Approach to Improve the Quality of User-Generated Content of Citizen Science PlatformsISPRS International Journal of Geo-Information10.3390/ijgi1007043410:7(434)Online publication date: 25-Jun-2021
  • (2021)CrowdsourcingSpringer Handbook of Atmospheric Measurements10.1007/978-3-030-52171-4_44(1207-1239)Online publication date: 2021
  • Show More Cited By

Index Terms

  1. Capturing quality: retaining provenance for curated volunteer monitoring data

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CSCW '14: Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing
    February 2014
    1600 pages
    ISBN:9781450325400
    DOI:10.1145/2531602
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 February 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. citizen science
    2. data exchange
    3. data models
    4. eav
    5. ict
    6. mobile applications
    7. provenance
    8. spreadsheets
    9. vgi
    10. volunteer monitoring

    Qualifiers

    • Research-article

    Conference

    CSCW'14
    Sponsor:
    CSCW'14: Computer Supported Cooperative Work
    February 15 - 19, 2014
    Maryland, Baltimore, USA

    Acceptance Rates

    CSCW '14 Paper Acceptance Rate 134 of 497 submissions, 27%;
    Overall Acceptance Rate 170 of 696 submissions, 24%

    Upcoming Conference

    CSCW '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)16
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 16 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)“I never realized sidewalks were a big deal”: A Case Study of a Community-Driven Sidewalk Accessibility Assessment using Project SidewalkProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642003(1-18)Online publication date: 11-May-2024
    • (2021)An Approach to Improve the Quality of User-Generated Content of Citizen Science PlatformsISPRS International Journal of Geo-Information10.3390/ijgi1007043410:7(434)Online publication date: 25-Jun-2021
    • (2021)CrowdsourcingSpringer Handbook of Atmospheric Measurements10.1007/978-3-030-52171-4_44(1207-1239)Online publication date: 2021
    • (2019)Beyond Micro-TasksCrowdsourcing10.4018/978-1-5225-8362-2.ch076(1510-1535)Online publication date: 2019
    • (2019)Beyond Micro-TasksSocial Entrepreneurship10.4018/978-1-5225-8182-6.ch072(1403-1428)Online publication date: 2019
    • (2019)Citizen Science: An Information Quality Research FrontierInformation Systems Frontiers10.1007/s10796-019-09915-zOnline publication date: 10-Apr-2019
    • (2018)Beyond Micro-TasksJournal of Database Management10.4018/JDM.201801010129:1(1-22)Online publication date: 1-Jan-2018
    • (2018)Using Citizen Science Projects to Develop Cases for Teaching Digital CurationTransforming Digital Worlds10.1007/978-3-319-78105-1_69(615-619)Online publication date: 15-Mar-2018
    • (2018)Data Provenance in Citizen Science DatabasesNew Trends in Databases and Information Systems10.1007/978-3-030-00063-9_23(242-253)Online publication date: 31-Aug-2018
    • (2018)Overview of Data Storing Techniques in Citizen Science ApplicationsNew Trends in Databases and Information Systems10.1007/978-3-030-00063-9_22(231-241)Online publication date: 31-Aug-2018
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media