Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Learning from multi-label data with interactivity constraints

Published: 01 August 2015 Publication History

Abstract

Extensive study of 12 multi-label learning methods with interactivity constraints.Focus on the beginning of the classification task where few examples are available.Experimental evaluation with a protocol independent of any implementation environment.Classifier performances are evaluated for 7 quality and time criteria on 12 datasets.RF-PCT obtains the best predictive performance while being computationally efficient. Interactive classification aims at introducing user preferences in the learning process to produce individualized outcomes more adapted to each user's behavior than the fully automatic approaches. The current interactive classification systems generally adopt a single-label classification paradigm that constrains items to span one label at a time and consequently limit the user's expressiveness while he/she interacts with data that are inherently multi-label. Moreover, the experimental evaluations are mainly subjective and closely depend on the targeted use cases and the interface characteristics. This paper presents the first extensive study of the impact of the interactivity constraints on the performances of a large set of twelve well-established multi-label learning methods. We restrict ourselves to the evaluation of the classifier predictive and time-computation performances while the number of training examples regularly increases and we focus on the beginning of the classification task where few examples are available. The classifier performances are evaluated with an experimental protocol independent of any implementation environment on a set of twelve multi-label benchmarks of various sizes from different domains. Our comparison shows that four classifiers can be distinguished for the prediction quality: RF-PCT (Random Forest of Predictive Clustering Trees, (Kocev, 2011)), EBR (Ensemble of Binary Relevance, (Read et al., 2011)), CLR (Calibrated Label Ranking, (Fürnkranz et al., 2008)) and MLkNN (Multi-label kNN, (Zhang and Zhou, 2007)) with an advantage for the first two ensemble classifiers. Moreover, only RF-PCT competes with the fastest classifiers and is therefore considered as the most promising classifier for an interactive multi-label learning system.

References

[1]
S. Amershi, Designing for effective end-user interaction with machine learning, in: Proceedings of the 24th annual ACM symposium adjunct on User interface software and technology, ACM, 2011, pp. 47-50.
[2]
S. Amershi, M. Cakmak, W.B. Knox, T. Kulesza, Power to the people: The role of humans in interactive machine learning, AI Magazine, 35 (2014) 105-120.
[3]
S. Amershi, J. Fogarty, D. Weld, Regroup: Interactive machine learning for on-demand group creation in social networks, in: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, ACM, 2012, pp. 21-30.
[4]
S. Amershi, B. Lee, A. Kapoor, R. Mahajan, B. Christian, Cuet: human-guided fast and accurate network alarm triage, in: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, ACM, 2011, pp. 157-166.
[5]
Basu, S., Fisher, D., Drucker, S. M., & Lu, H. (2010). Assisting users with clustering tasks by combining metric learning and classification. In AAAI.
[6]
M. Bilenko, S. Basu, R.J. Mooney, Integrating constraints and metric learning in semi-supervised clustering, in: Proceedings of the twenty-first international conference on Machine learning, ACM, 2004, pp. 11.
[7]
M.R. Boutell, J. Luo, X. Shen, C.M. Brown, Learning multi-label scene classification, Pattern recognition, 37 (2004) 1757-1771.
[8]
F. Briggs, Y. Huang, R. Raich, K. Eftaxias, Z. Lei, W. Cukierski, The 9th annual mlsp competition: New methods for acoustic classification of multiple simultaneous bird species in a noisy environment, in: IEEE International Workshop on Machine Learning for Signal Processing (MLSP), 2013, IEEE, 2013, pp. 1-8.
[9]
O. Chapelle, B. Schölkopf, A. Zien, MIT press, Cambridge, 2006.
[10]
Cheng, W., Hüllermeier, E., & Dembczynski, K. J. (2010). Graded multilabel classification: The ordinal case. In Proceedings of the 27th international conference on machine learning (ICML-10) (pp. 223-230).
[11]
W. Cheng, E. Hüllermeier, Combining instance-based learning and logistic regression for multilabel classification, Machine Learning, 76 (2009) 211-225.
[12]
Y.-W. Chen, C.-J. Lin, Combining svms with various feature selection strategies, in: Feature extraction, Springer, 2006, pp. 315-324.
[13]
A. Clare, R.D. King, Knowledge discovery in multi-label phenotype data, in: Principles of data mining and knowledge discovery, Springer, 2001, pp. 42-53.
[14]
J.R. Dabrowski, E.V. Munson, Is 100 milliseconds too fast?, in: CHI'01 Extended Abstracts on Human Factors in Computing Systems, ACM, 2001, pp. 317-318.
[15]
S. Diplaris, G. Tsoumakas, P.A. Mitkas, I. Vlahavas, Protein classification with multiple algorithms, in: Advances in Informatics, Springer, 2005, pp. 448-456.
[16]
S.M. Drucker, D. Fisher, S. Basu, Helping users sort faster with adaptive machine learning recommendations, in: Human-Computer Interaction-INTERACT 2011, Springer, 2011, pp. 187-203.
[17]
Elisseeff, A. & Weston, J. (2001). A kernel method for multi-labelled classification. In Advances in neural information processing systems (pp. 681-687).
[18]
J.A. Fails, D.R. Olsen, Interactive machine learning, in: Proceedings of the 8th international conference on Intelligent user interfaces, ACM, 2003, pp. 39-45.
[19]
Fiebrink, R., Trueman, D., & Cook, P. R. (2009). A metainstrument for interactive, on-the-fly machine learning. In Proc. NIME (Vol. 2, 3p).
[20]
J. Fogarty, D. Tan, A. Kapoor, S. Winder, Cueflik: interactive concept learning in image search, in: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, ACM, 2008, pp. 29-38.
[21]
J. Fürnkranz, E. Hüllermeier, E.L. Mencía, K. Brinker, Multilabel classification via calibrated label ranking, Machine Learning, 73 (2008) 133-153.
[22]
S. Godbole, S. Sarawagi, Discriminative methods for multi-labeled classification, in: Advances in knowledge discovery and data mining, Springer, 2004, pp. 22-30.
[23]
E. Hüllermeier, J. Fürnkranz, W. Cheng, K. Brinker, Label ranking by learning pairwise preferences, Artificial Intelligence, 172 (2008) 1897-1916.
[24]
Katakis, I., Tsoumakas, G., & Vlahavas, I. (2008). Multilabel text classification for automated tag suggestion. In Proceedings of the ECML/PKDD.
[25]
D. Kocev, Ensembles for predicting structured outputs, IPS Jo¿ef Stefan, Ljubljana, Slovenia, 2011.
[26]
D. Kocev, C. Vens, J. Struyf, S. D¿eroski, Ensembles of multi-objective decision trees, Springer, 2007.
[27]
X. Kong, M.K. Ng, Z.-H. Zhou, Transductive multilabel learning via label set propagation, IEEE Transactions on Knowledge and Data Engineering, 25 (2013) 704-719.
[28]
Li, T., Zhang, C., & Zhu, S. (2006). Empirical studies on multi-label classification. In IcTAI (Vol. 6, pp. 86-92).
[29]
C.J. Lintott, K. Schawinski, A. Slosar, K. Land, S. Bamford, D. Thomas, Galaxy zoo: Morphologies derived from visual inspection of galaxies from the sloan digital sky survey, Monthly Notices of the Royal Astronomical Society, 389 (2008) 1179-1189.
[30]
Y. Liu, R. Jin, L. Yang, Semi-supervised multi-label learning by constrained non-negative matrix factorization, in: Proceedings of the National Conference on Artificial Intelligence, Vol. 21, AAAI Press; MIT Press, Menlo Park, CA; Cambridge, MA; London, 2006, pp. 421.
[31]
H.-Y. Lo, J.-C. Wang, H.-M. Wang, S.-D. Lin, Cost-sensitive multi-label learning for audio tag annotation and retrieval, IEEE Transactions on Multimedia, 13 (2011) 518-529.
[32]
G. Madjarov, D. Kocev, D. Gjorgjevikj, S. D¿eroski, An extensive experimental comparison of methods for multi-label learning, Pattern Recognition, 45 (2012) 3084-3104.
[33]
G. Nasierding, A.Z. Kouzani, Comparative evaluation of multi-label classification methods, in: 9th International conference on fuzzy systems and knowledge discovery (FSKD), 2012, IEEE, 2012, pp. 679-683.
[34]
K. Ozonat, D. Young, Towards a universal marketplace over the web: Statistical multi-label classification of service provider forms with simulated annealing, in: Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining, ACM, 2009, pp. 1295-1304.
[35]
J.C. Platt, Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods, in: Advances in large margin classifiers, Citeseer, 1999.
[36]
R. Porter, J. Theiler, D. Hush, Interactive machine learning in data exploitation, Computing in Science & Engineering, 15 (2013) 12-20.
[37]
J.R. Quinlan, Morgan kaufmann, 1993.
[38]
R. Rak, L. Kurgan, M. Reformat, Multi-label associative classification of medical documents from medline, in: Proceedings. Fourth international conference on machine learning and applications, 2005, IEEE, 2005, pp. 8-pp.
[39]
Read, J. (2010). Scalable multi-label classification (PhD thesis). University of Waikato.
[40]
J. Read, B. Pfahringer, G. Holmes, E. Frank, Classifier chains for multi-label classification, Machine Learning, 85 (2011) 333-359.
[41]
A. Ritter, S. Basu, Learning to generalize for complex selection tasks, in: Proceedings of the 14th international conference on intelligent user interfaces, ACM, 2009, pp. 167-176.
[42]
G. Salton, C. Buckley, Improving retrieval performance by relevance feedback, Readings in Information Retrieval, 24 (1997).
[43]
R.E. Schapire, Y. Singer, Boostexter: A boosting-based system for text categorization, Machine Learning, 39 (2000) 135-168.
[44]
H.A. Simon, A behavioral model of rational choice, The Quarterly Journal of Economics (1955) 99-118.
[45]
C.G. Snoek, M. Worring, J.C. Van Gemert, J.-M. Geusebroek, A.W. Smeulders, The challenge problem for automated detection of 101 semantic concepts in multimedia, in: Proceedings of the 14th annual ACM international conference on multimedia, ACM, 2006, pp. 421-430.
[46]
M.S. Sorower, A literature survey on algorithms for multi-label learning, Oregon State University, Corvallis, 2010.
[47]
E. Spyromitros, G. Tsoumakas, I. Vlahavas, An empirical study of lazy multilabel classification algorithms, in: Artificial intelligence: Theories, models and applications, Springer, 2008, pp. 401-406.
[48]
A.N. Srivastava, B. Zane-Ulman, Discovering recurring anomalies in text reports regarding complex space systems, in: Aerospace conference, 2005 IEEE, IEEE, 2005, pp. 3853-3862.
[49]
S. Stumpf, V. Rajaram, L. Li, M. Burnett, T. Dietterich, E. Sullivan, Toward harnessing user feedback for machine learning, in: Proceedings of the 12th international conference on intelligent user interfaces, ACM, 2007, pp. 82-91.
[50]
C.A. Tawiah, V.S. Sheng, A study on multi-label classification, in: Advances in data mining. Applications and theoretical aspects, Springer, 2013, pp. 137-150.
[51]
Trohidis, K., Tsoumakas, G., Kalliris, G., & Vlahavas, I. P. (2008). Multi-label classification of music into emotions. In ISMIR (Vol. 8, pp. 325-330).
[52]
Tsoumakas, G., Katakis, I., & Vlahavas, I. (2008). Effective and efficient multilabel classification in domains with large number of labels. In Proc. ECML/PKDD 2008 workshop on mining multidimensional data (MMD'08) (pp. 30-44).
[53]
G. Tsoumakas, I. Katakis, Multi-label classification: An overview, International Journal of Data Warehousing and Mining (IJDWM), 3 (2007) 1-13.
[54]
G. Tsoumakas, I. Katakis, I. Vlahavas, Mining multi-label data, in: Data mining and knowledge discovery handbook, Springer, 2010, pp. 667-685.
[55]
G. Tsoumakas, I. Vlahavas, Random k-labelsets: An ensemble method for multilabel classification, in: Machine learning: ECML 2007, Springer, 2007, pp. 406-417.
[56]
Wagstaff, K., Cardie, C., Rogers, S., Schrödl, S., et al. (2001). Constrained k-means clustering with background knowledge. In ICML (Vol. 1, pp. 577-584).
[57]
M. Ware, E. Frank, G. Holmes, M. Hall, I.H. Witten, Interactive machine learning: Letting users build classifiers, International Journal of Human-Computer Studies, 55 (2001) 281-292.
[58]
K. Yu, S. Yu, V. Tresp, Multi-label informed latent semantic indexing, in: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, ACM, 2005, pp. 258-265.
[59]
Zhang, M. & Zhou, Z. (2013). A review on multi-label learning algorithms.
[60]
M.-L. Zhang, Z.-H. Zhou, Ml-knn: A lazy learning approach to multi-label learning, Pattern Recognition, 40 (2007) 2038-2048.

Cited By

View all
  • (2018)Smart-U: Smart Utensils Know what You EatIEEE INFOCOM 2018 - IEEE Conference on Computer Communications10.1109/INFOCOM.2018.8486266(1439-1447)Online publication date: 16-Apr-2018
  • (2017)Optimization approach for feature selection in multi-label classificationPattern Recognition Letters10.1016/j.patrec.2017.02.00489:C(25-30)Online publication date: 1-Apr-2017
  • (2017)A Classifier Chain Algorithm with K-means for Multi-label Classification on CloudsJournal of Signal Processing Systems10.1007/s11265-016-1137-286:2-3(337-346)Online publication date: 1-Mar-2017
  1. Learning from multi-label data with interactivity constraints

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Expert Systems with Applications: An International Journal
    Expert Systems with Applications: An International Journal  Volume 42, Issue 13
    August 2015
    387 pages

    Publisher

    Pergamon Press, Inc.

    United States

    Publication History

    Published: 01 August 2015

    Author Tags

    1. Comparative study
    2. Interactive learning
    3. Multi-label learning

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 14 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2018)Smart-U: Smart Utensils Know what You EatIEEE INFOCOM 2018 - IEEE Conference on Computer Communications10.1109/INFOCOM.2018.8486266(1439-1447)Online publication date: 16-Apr-2018
    • (2017)Optimization approach for feature selection in multi-label classificationPattern Recognition Letters10.1016/j.patrec.2017.02.00489:C(25-30)Online publication date: 1-Apr-2017
    • (2017)A Classifier Chain Algorithm with K-means for Multi-label Classification on CloudsJournal of Signal Processing Systems10.1007/s11265-016-1137-286:2-3(337-346)Online publication date: 1-Mar-2017

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media