Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3277104.3278310acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccbdConference Proceedingsconference-collections
research-article

Twitter Query Expansion via Word2Vec-Urban Dictionary Model

Published: 08 September 2018 Publication History

Abstract

Query expansion has been a field of interest within the field of information retrieval for quite some time. We propose a novel approach for expanding queries on microblogs, using a word2vec-Urban Dictionary hybrid model built on a prior collection of documents, to expand the search set by adding additional slang terms. In our case, we will focus on the social network site Twitter and tweets related to the topic of marijuana. The result of this approach indicated that we were able to collect several tweets that would not otherwise be collected. We also increased the average degree of the social network of contributors as the expanded list of terms also resulted in marijuana-related tweet contributors who would not have been collected as well.

References

[1]
J. Bai, D. Song, P. Bruza, J.-Y. Nie, and G. Cao. Query expansion using term relationships in language models for information retrieval. In Proceedings of the 14th ACM international conference on Information and knowledge management, 2005.
[2]
S. Kuzi, A. Shtok, and O. Kurland. Query expansion using word embeddings. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management - CIKM 2016.
[3]
C. Latiri, H. Haddad, and T. Hamrouni. Towards an effective automatic query expansion process using an association rule mining approach. Journal of Intelligent Information Systems, 39(1), 2012.
[4]
S. Li, H. Ning, Z. Han, and H. Qi. A method for microblog search by adjusting the language model with time. In Eighth International Conference on Internet Computing for Science and Engineering, 2015.
[5]
T. Miyanishi, K. Seki, and K. Uehara. Improving pseudo-relevance feedback via tweet selection. In Proceedings of the 22nd ACM international conference on Conference on information & knowledge management, 2013.
[6]
P. Rayson and R. Garside. Comparing corpora using frequency profiling. In Proceedings of the workshop on Comparing corpora-, 9, 2000.
[7]
J. Turner and M. Kantardzic. Geo-social analytics based on spatio-temporal dynamics of marijuana-related tweets. In Proceedings of the 2017 International Conference on Information System and Data Mining, 2017
[8]
M. A. Zingla, L. Chiraz, and Y. Slimani. Short query expansion for microblog retrieval. Procedia Computer Science, 96, 2016.

Cited By

View all
  • (2021)Term and Sentiment Analysis of Cannabidiol: An Infodemiological Examination of Personal and Commercial Tweets (Preprint)Journal of Medical Internet Research10.2196/27307Online publication date: 20-Jan-2021

Index Terms

  1. Twitter Query Expansion via Word2Vec-Urban Dictionary Model

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICCBD '18: Proceedings of the 2018 International Conference on Computing and Big Data
    September 2018
    103 pages
    ISBN:9781450365406
    DOI:10.1145/3277104
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 September 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Social network analysis
    2. document classification
    3. natural language processing
    4. policy research
    5. query expansion

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICCBD '18

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 14 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Term and Sentiment Analysis of Cannabidiol: An Infodemiological Examination of Personal and Commercial Tweets (Preprint)Journal of Medical Internet Research10.2196/27307Online publication date: 20-Jan-2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media