Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3657054.3659119acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesdg-oConference Proceedingsconference-collections
poster

Incorporating Citizen-Generated Data into Large Language Models

Published: 11 June 2024 Publication History

Abstract

This study investigates the use of citizen-generated data to optimize a large language model (LLM) chatbot that gives nutrition advice. By actively participating in the data collection and annotation process from FDA-approved websites, citizens provided insightful information that was essential for improving the model and addressing biases. The study highlights the difficulties in gathering and annotating data, especially in situations where nuances matter, such as pregnancy nutrition. The results show that the use of citizen-generated data improves the efficacy and efficiency of data collection procedures, providing a practical viewpoint and encouraging community involvement. In addition to guaranteeing data quality, the iterative process raises stakeholders’ awareness of and proficiency with data. Thus, citizen-generated data becomes an essential tool for creating information systems that are more reliable and inclusive.

References

[1]
Su Lin Blodgett, Solon Barocas, Hal Daumé III, and Hanna Wallach. 2020. Language (Technology) is Power: A Critical Survey of “Bias” in NLP. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 5454–5476.
[2]
John M Carroll, Jordan Beck, Shipi Dhanorkar, Jomara Binda, Srishti Gupta, and Haining Zhu. 2018. Strengthening community data: towards pervasive participation. In Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data Age. 1–9.
[3]
Stefan Jungcurt. 2022. Citizen-generated data: Data by people, for people. https://www.iisd.org/articles/insight/citizen-generated-data-people
[4]
Mitchell Linegar, Rafal Kocielnik, and R Michael Alvarez. 2023. Large language models and political science. Frontiers in Political Science 5 (2023), 1257092.
[5]
Jonathan Silvertown. 2009. A new dawn for citizen science. Trends in ecology & evolution 24, 9 (2009), 467–471.

Cited By

View all

Index Terms

  1. Incorporating Citizen-Generated Data into Large Language Models

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    dg.o '24: Proceedings of the 25th Annual International Conference on Digital Government Research
    June 2024
    1089 pages
    ISBN:9798400709883
    DOI:10.1145/3657054
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 June 2024

    Check for updates

    Author Tags

    1. Citizen Science
    2. Fine-tuning
    3. Retrieval-Augmented Generation

    Qualifiers

    • Poster
    • Research
    • Refereed limited

    Funding Sources

    Conference

    dg.o 2024

    Acceptance Rates

    Overall Acceptance Rate 150 of 271 submissions, 55%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 22
      Total Downloads
    • Downloads (Last 12 months)22
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 18 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media