Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2806416.2806519acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Interactive User Group Analysis

Published: 17 October 2015 Publication History

Abstract

User data is becoming increasingly available in multiple domains ranging from phone usage traces to data on the social Web. The analysis of user data is appealing to scientists who work on population studies, recommendations, and large-scale data analytics. We argue for the need for an interactive analysis to understand the multiple facets of user data and address different analytics scenarios. Since user data is often sparse and noisy, we propose to produce labeled groups that describe users with common properties and develop IUGA, an interactive framework based on group discovery primitives to explore the user space. At each step of IUGA, an analyst visualizes group members and may take an action on the group (add/remove members) and choose an operation (exploit/explore) to discover more groups and hence more users. Each discovery operation results in k most relevant and diverse groups. We formulate group exploitation and exploration as optimization problems and devise greedy algorithms to enable efficient group discovery. Finally, we design a principled validation methodology and run extensive experiments that validate the effectiveness of IUGA on large datasets for different user space analysis scenarios.

References

[1]
R. Agrawal, J. Gehrke, D. Gunopulos, and P. Raghavan. Automatic subspace clustering of high dimensional data for data mining applications, volume 27. ACM, 1998.
[2]
R. Agrawal, T. Imielinski, and A. N. Swami. Mining association rules between sets of items in large databases. In SIGMOD, pages 207--216, 1993.
[3]
M. Bhuiyan, S. Mukhopadhyay, and M. A. Hasan. Interactive pattern mining on hidden data: a sampling-based solution. In CIKM, pages 95--104, 2012.
[4]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. the Journal of machine Learning research, 3:993--1022, 2003.
[5]
M. Boley, B. Kang, P. Tokmakov, M. Mampaey, and S. Wrobel. One click mining: Interactive local pattern discovery through implicit preference and performance learning. IDEAS (ACM SIGKDD Workshop), 2013.
[6]
F. Bonchi, F. Giannotti, A. Mazzanti, and D. Pedreschi. Exante: Anticipated data reduction in constrained pattern mining. In PKDD, pages 59--70, 2003.
[7]
C. Bucila, J. Gehrke, D. Kifer, and W. M. White. Dualminer: a dual-pruning algorithm for itemsets with constraints. In Knowledge Discovery and Data Mining, pages 42--51, 2002.
[8]
C. C. Cao, J. She, Y. Tong, and L. Chen. Whom to ask?: jury selection for decision making tasks on micro-blog services. VLDB, 2012.
[9]
J. G. Carbonell and J. Goldstein. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Research and Development in Information Retrieval, pages 335--336, 1998.
[10]
U. Cetintemel, M. Cherniack, J. DeBrabant, Y. Diao, K. Dimitriadou, A. Kalinin, O. Papaemmanouil, and S. B. Zdonik. Query steering for interactive data exploration. In CIDR, 2013.
[11]
O. Chapelle, S. Ji, C. Liao, E. Velipasaoglu, L. Lai, and S.-L. Wu. Intent-based diversification of web search results: metrics and algorithms. Information Retrieval, 14(6):572--592, 2011.
[12]
U. Feige, G. Kortsarz, and D. Peleg. The dense k-subgraph problem. Algorithmica, 29(3):410--421, 2001.
[13]
N. Friedman, M. Goldszmidt, et al. Discretizing continuous attributes while learning bayesian networks. In ICML, pages 157--165, 1996.
[14]
L. Geng and H. J. Hamilton. Interestingness measures for data mining: A survey. ACM Computing Surveys (CSUR), 38(3):9, 2006.
[15]
B. Goethals, S. Moens, and J. Vreeken. Mime: A framework for interactive visual pattern mining. In PKDD, 2011.
[16]
P. Indyk, S. Mahabadi, M. Mahdian, and V. S. Mirrokni. Composable core-sets for diversity and coverage maximization. In ACM SIGMOD SIGART, pages 100--108. ACM, 2014.
[17]
D. S. Johnson. Approximation algorithms for combinatorial problems. In Proceedings of the fifth annual ACM symposium on Theory of computing, pages 38--49. ACM, 1973.
[18]
A. Leuski and J. Allan. Strategy-based interactive cluster visualization for information retrieval. International Journal on Digital Libraries, 3:170--184, 2000.
[19]
B. Omidvar-Tehrani, S. Amer-Yahia, and A. Termier. Interactive user group analysis. Research Report RR-LIG-048, LIG, Grenoble, France, 2015.
[20]
B. Omidvar-Tehrani, S. Amer-Yahia, A. Termier, A. Bertaux, E. Gaussier, and M.-C. Rousset. Towards a framework for semantic exploration of frequent patterns. IMMoA, 2013.
[21]
L. Parida. Redescription mining: Structure theory and algorithms. In In Proc. AAAI'05, pages 837--844, 2005.
[22]
C. K. sang Leung, P. P. Irani, and C. L. Carmichael. WiFIsViz: Effective Visualization of Frequent Itemsets. In ICDM, 2008.
[23]
A. Siebes, J. Vreeken, and M. van Leeuwen. Item sets that compress. In SDM, volume 6, pages 393--404. SIAM, 2006.
[24]
T. Uno, M. Kiyomi, and H. Arimura. Lcm ver. 2: Efficient mining algorithms for frequent/closed/maximal itemsets. In FIMI, 2004.
[25]
R. West and J. Leskovec. Automatic versus human navigation in information networks. In ICWSM, 2012.

Cited By

View all
  • (2022)Enthusiasts, Pragmatists, and Skeptics: Investigating Users’ Attitudes Towards Emotion- and Personality-Aware Voice Assistants across CulturesProceedings of Mensch und Computer 202210.1145/3543758.3543776(308-322)Online publication date: 4-Sep-2022
  • (2022)Guided Text-based Item ExplorationProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557141(3410-3420)Online publication date: 17-Oct-2022
  • (2021)Balancing Familiarity and Curiosity in Data Exploration with Deep Reinforcement LearningFourth Workshop in Exploiting AI Techniques for Data Management10.1145/3464509.3464884(16-23)Online publication date: 20-Jun-2021
  • Show More Cited By

Index Terms

  1. Interactive User Group Analysis

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management
    October 2015
    1998 pages
    ISBN:9781450337946
    DOI:10.1145/2806416
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. interactive analysis
    2. user data
    3. validation

    Qualifiers

    • Research-article

    Funding Sources

    • PEPS FAAAWS
    • ANR

    Conference

    CIKM'15
    Sponsor:

    Acceptance Rates

    CIKM '15 Paper Acceptance Rate 165 of 646 submissions, 26%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 08 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Enthusiasts, Pragmatists, and Skeptics: Investigating Users’ Attitudes Towards Emotion- and Personality-Aware Voice Assistants across CulturesProceedings of Mensch und Computer 202210.1145/3543758.3543776(308-322)Online publication date: 4-Sep-2022
    • (2022)Guided Text-based Item ExplorationProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557141(3410-3420)Online publication date: 17-Oct-2022
    • (2021)Balancing Familiarity and Curiosity in Data Exploration with Deep Reinforcement LearningFourth Workshop in Exploiting AI Techniques for Data Management10.1145/3464509.3464884(16-23)Online publication date: 20-Jun-2021
    • (2021)DORA THE EXPLORERProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3481967(4769-4773)Online publication date: 26-Oct-2021
    • (2020)Guided exploration of user groupsProceedings of the VLDB Endowment10.14778/3397230.339724213:9(1469-1482)Online publication date: 1-May-2020
    • (2020)Interactive and Explainable Point-of-Interest Recommendation using Look-alike GroupsProceedings of the 28th International Conference on Advances in Geographic Information Systems10.1145/3397536.3422238(389-392)Online publication date: 3-Nov-2020
    • (2020)User Group Analytics Survey and Research OpportunitiesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.291365132:10(2040-2059)Online publication date: 1-Oct-2020
    • (2020)Cohort analytics: efficiency and applicabilityThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-020-00625-629:6(1527-1550)Online publication date: 27-Aug-2020
    • (2020)Data Pipelines for Personalized Exploration of Rated DatasetsBias and Social Aspects in Search and Recommendation10.1007/978-3-030-52485-2_8(72-78)Online publication date: 12-Jul-2020
    • (2019)UserDEVProceedings of the Workshop on Human-In-the-Loop Data Analytics10.1145/3328519.3329128(1-8)Online publication date: 5-Jul-2019
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media