Transparency, Fairness, Data Protection, Neutrality: Data Management Challenges in the Face of New Regulation

Published: 26 June 2019

Abstract

The data revolution continues to transform every sector of science, industry, and government. Due to the incredible impact of data-driven technology on society, we are becoming increasingly aware of the imperative to use data and algorithms responsibly—in accordance with laws and ethical norms. In this article, we discuss three recent regulatory frameworks: the European Union’s General Data Protection Regulation (GDPR), the New York City Automated Decisions Systems (ADS) Law, and the Net Neutrality principle, which aim to protect the rights of individuals who are impacted by data collection and analysis. These frameworks are prominent examples of a global trend: Governments are starting to recognize the need to regulate data-driven algorithmic technology.
Our goal in this article is to bring these regulatory frameworks to the attention of the data management community and to underscore the technical challenges they raise, which we, as a community, are well equipped to address. The main takeaway of this article is that legal and ethical norms cannot be incorporated into data-driven systems as an afterthought. Rather, we must think in terms of responsibility by design, viewing it as a systems requirement.


Published In

Journal of Data and Information Quality, Volume 11, Issue 3
Special Issue on Combating Digital Misinformation and Disinformation and On the Horizon
September 2019
160 pages
ISSN:1936-1955
EISSN:1936-1963
DOI:10.1145/3331015

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 June 2019
Accepted: 01 December 2018
Received: 01 December 2018
Published in JDIQ Volume 11, Issue 3


Author Tags

  1. Transparency
  2. data protection
  3. fairness
  4. neutrality
  5. responsible data science

Qualifiers

  • Research-article
  • Research
  • Refereed
