DOI: 10.1145/2339530.2339584
Research article

Active learning for online Bayesian matrix factorization

Published: 12 August 2012

Abstract

The problem of large-scale online matrix completion is addressed via a Bayesian approach. The proposed method learns a factor analysis (FA) model for large matrices from a small number of observed matrix elements, and uses the statistical model to actively select which new matrix entries would be most informative for improving the model if they could be acquired. Both model inference and active learning are performed in an online setting. In the context of online learning, a greedy, fast and provably near-optimal algorithm is employed to sequentially maximize the mutual information between past and future observations, taking advantage of submodularity properties. Additionally, a simpler procedure, which directly uses the posterior parameters learned by the Bayesian approach, is shown to achieve slightly lower estimation quality with far less computational effort. Inference is performed using a computationally efficient online variational Bayes (VB) procedure. Competitive results are obtained on a very large collaborative filtering problem, namely the Yahoo! Music ratings dataset.
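
The two selection strategies mentioned in the abstract can be illustrated with a small sketch. This is not the authors' algorithm or code. It assumes a toy setting in which the entries of one matrix row are jointly Gaussian with a factor-analysis covariance W W^T + sigma^2 I. The loadings W, the noise level, and all function names are hypothetical. The first function greedily picks the entry with the largest mutual-information gain, which for Gaussians reduces to a ratio of conditional variances. The second is a cheaper heuristic that picks the entry with the largest posterior variance given what has already been selected.

```python
import numpy as np


def conditional_variance(K, y, S):
    """Variance of entry y given the entries in S (zero-mean Gaussian, covariance K)."""
    S = list(S)
    if not S:
        return K[y, y]
    k_yS = K[y, S]
    K_SS = K[np.ix_(S, S)]
    return K[y, y] - k_yS @ np.linalg.solve(K_SS, k_yS)


def greedy_mutual_information(K, budget):
    """Greedily add the entry whose selection most increases the mutual
    information between selected and unselected entries.
    The gain for y is 0.5 * log(var(y | selected) / var(y | rest)),
    so maximizing the variance ratio is equivalent."""
    n = K.shape[0]
    selected = []
    for _ in range(min(budget, n)):
        best, best_score = None, -np.inf
        for y in range(n):
            if y in selected:
                continue
            rest = [j for j in range(n) if j != y and j not in selected]
            score = conditional_variance(K, y, selected) / conditional_variance(K, y, rest)
            if score > best_score:
                best, best_score = y, score
        selected.append(best)
    return selected


def greedy_max_variance(K, budget):
    """Cheaper heuristic: pick the entry with the largest variance given
    the entries selected so far."""
    n = K.shape[0]
    selected = []
    for _ in range(min(budget, n)):
        scores = [
            -np.inf if y in selected else conditional_variance(K, y, selected)
            for y in range(n)
        ]
        selected.append(int(np.argmax(scores)))
    return selected


# Toy factor-analysis covariance for one row: x = W s + noise, s ~ N(0, I).
rng = np.random.default_rng(0)
W = rng.normal(size=(8, 2))        # hypothetical loadings: 8 entries, 2 factors
K = W @ W.T + 0.1 * np.eye(8)      # W W^T + sigma^2 I, positive definite
print(greedy_mutual_information(K, 3))
print(greedy_max_variance(K, 3))
```

The mutual-information variant recomputes a conditional variance against all unselected entries for every candidate, which is why it costs far more than the variance heuristic. That mirrors the quality-versus-effort trade-off the abstract describes, though only in this simplified setting.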

    Published In

    KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2012
    1616 pages
    ISBN:9781450314626
    DOI:10.1145/2339530

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. matrix factorization
    2. online learning

    Qualifiers

    • Research-article

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%
