Abstract
In this paper we argue that understanding the provenance of social media datasets and their analysis is critical to addressing challenges faced by the social science research community in terms of the reliability and reproducibility of research utilising such data. Based on analysis of existing projects that use social media data, we present a number of research questions for the provenance community, which if addressed would help increase the transparency of the research process, aid reproducibility, and facilitate data reuse in the social sciences.
The work described here was funded by a grant from the United Kingdom’s Economic and Social Research Council Social Media - Developing Understanding, Infrastructure & Engagement (ES/M001628/1).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
References
Batrinca, B., Treleaven, P.C.: Social media analytics: a survey of techniques, tools and platforms. AI & Soc. 30(1), 89–116 (2014)
Cheney, J., Chiticariu, L., Tan, W.-C.: Provenance in databases: why, how, and where. Found. Trends Databases 1(4), 379–474 (2009)
Cheney, J., Finkelstein, A., Ludascher, B., Vansummeren, S.: Principles of provenance. Dagstuhl Rep. 2(2), 84–113 (2012)
Cottrill, C., Yeboah, G., Gault, P., Nelson, J.D., Anable, J., Budd, T.: Tweeting transport: examining the use of twitter in transport events. In: Proceedings of the 47th Annual UTSG Conference (2015)
Edwards, P., Pignotti, E., Eckhardt, A., Ponnamperuma, K., Mellish, C., Bouttaz, T.: ourSpaces – design and deployment of a semantic virtual research environment. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012, Part II. LNCS, vol. 7650, pp. 50–65. Springer, Heidelberg (2012)
Organisation for Economic Co-operation and Development: New data for understanding the human condition. Technical report, February 2013
Moreau, L., Groth, P., Cheney, J., Lebo, T., Miles, S.: The rationale of PROV. Web Semant. Sci. Serv. Agents World Wide Web 35(Part 4), 235–257 (2015)
Moreau, L.: Provenance-based reproducibility in the semantic web. Web Semant. Sci. Serv. Agents World Wide Web 9(2), 202–221 (2011)
Tufekci, Z.: Big data: Pitfalls, methods and concepts for an emergent field. Technical report, March 2013
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Corsar, D., Markovic, M., Edwards, P. (2016). Social Media Data in Research: Provenance Challenges. In: Mattoso, M., Glavic, B. (eds) Provenance and Annotation of Data and Processes. IPAW 2016. Lecture Notes in Computer Science(), vol 9672. Springer, Cham. https://doi.org/10.1007/978-3-319-40593-3_20
Download citation
DOI: https://doi.org/10.1007/978-3-319-40593-3_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40592-6
Online ISBN: 978-3-319-40593-3
eBook Packages: Computer ScienceComputer Science (R0)