Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2695664.2695904acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
research-article

A supervised learning approach to detect subsumption relations between tags in folksonomies

Published: 13 April 2015 Publication History

Abstract

The lack of hierarchical relations in the tag space of social tagging systems may diminish the ability of users to find relevant resources. Many research works propose to overcome this problem by constructing hierarchies of tags automatically by means of heuristic algorithms. These hierarchies encode subsumption relations between pairs of tags and can be used for improving browsing and retrieval of resources. In this paper, we cast the problem of subsumption detection between pairs of tags as a pairwise classification problem. From the literature, we identified several similarity measures that are good indicators of subsumption, which are used as learning features. Under this setting, we observed severe class imbalance and class overlapping which motivated us to investigate and employ class imbalance techniques to overcome these problems. We conducted a comprehensive set of experiments on a large real-world dataset, showing that our approach outperforms the best performing heuristic-based baseline.

References

[1]
F. Almoqhim, D. Millard, and N. Shadbolt. An approach to building high-quality tag hierarchies from crowdsourced taxonomic tag pairs. In Social Informatics, volume 8238 of Lecture Notes in Computer Science, pages 129--138. Springer International Publishing, 2013.
[2]
L. Balby Marinho, K. Buza, and L. Schmidt-Thieme. Folksonomy-based collabulary learning. In Proceedings of the 7th International Conference on The Semantic Web, ISWC '08, pages 261--276, Berlin, Heidelberg, 2008. Springer-Verlag.
[3]
G. E. A. P. A. Batista, R. C. Prati, and M. C. Monard. A study of the behavior of several methods for balancing machine learning training data. SIGKDD Explor. Newsl., 6:20--29, June 2004.
[4]
S. Cai, H. Sun, S. Gu, and Z. Ming. Learning concept hierarchy from folksonomy. In Web Information Systems and Applications Conference (WISA), 2011 Eighth, pages 47--51, Chongqing, China, 2011.
[5]
N. Chawla, K. Bowyer, L. Hall, and W. Kegelmeyer. Smote: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16:321--357, 2002.
[6]
P. De Meo, G. Quattrone, and D. Ursino. Exploitation of semantic relationships and hierarchical data structures to support a user in his annotation and browsing activities in folksonomies. Inf. Syst., 34(6):511--535, Sept. 2009.
[7]
D. Eynard, L. Mazzola, and A. Dattolo. Exploiting tag similarities to discover synonyms and homonyms in folksonomies. Software: Practice and Experience, 43(12):1437--1457, 2013.
[8]
P. Hart. The condensed nearest neighbor rule. Information Theory, IEEE Transactions on, 14(3):515--516, 1968.
[9]
P. Heymann and H. Garcia-Molina. Collaborative creation of communal hierarchical taxonomies in social tagging systems. Technical report, Computer Science Department, Standford University, April 2006.
[10]
H. Mousselly-Sergieh, M. Döller, E. Egyed-Zsigmond, G. Gianini, H. Kosch, and J.-M. Pinon. Tag relatedness using laplacian score feature selection and adapted jensen-shannon divergence. In MultiMedia Modeling, volume 8325 of Lecture Notes in Computer Science, pages 159--171. Springer International Publishing, 2014.
[11]
X. Si, Z. Liu, and M. Sun. Explore the structure of social tags by subsumption relations. In COLING, pages 1011--1019. Tsinghua University Press, 2010.
[12]
G. Solskinnsbakk and J. A. Gulla. A hybrid approach to constructing tag hierarchies. In Proceedings of the 2010 international conference on On the move to meaningful internet systems: Part II, OTM'10, pages 975--982, Berlin, Heidelberg, 2010. Springer-Verlag.
[13]
V. Spiliopoulos, A. G. Valarakos, and G. A. Vouros. Csr: Discovering subsumption relations for the alignment of ontologies. In S. Bechhofer, M. Hauswirth, J. Hoffmann, and M. Koubarakis, editors, ESWC, volume 5021 of Lecture Notes in Computer Science, pages 418--431. Springer, 2008.
[14]
G. Tibély, P. Pollner, T. Vicsek, and G. Palla. Extracting tag hierarchies. PLoS ONE, 8(12):e84133, 12 2013.
[15]
I. Tomek. Two Modifications of CNN. IEEE Transactions on Systems, Man, and Cybernetics, 7(2):679--772, 1976.
[16]
M. Zhou, S. Bao, X. Wu, and Y. Yu. An unsupervised model for exploring hierarchical semantics from social annotations. In The Semantic Web, volume 4825 of Lecture Notes in Computer Science, pages 680--693. Springer Berlin Heidelberg, 2007.

Cited By

View all
  • (2023)Linking Rare and Popular Tags in CQA SitesProceedings of the 31st International Conference on Information Systems Development10.62036/ISD.2023.42Online publication date: 2023
  • (2021)Mining Tag Relationships in CQA SitesConceptual Modeling10.1007/978-3-030-89022-3_27(345-355)Online publication date: 18-Oct-2021
  • (2020)Critical Parameter Identification for Safety Events in Commercial Aviation Using Machine LearningAerospace10.3390/aerospace70600737:6(73)Online publication date: 4-Jun-2020
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SAC '15: Proceedings of the 30th Annual ACM Symposium on Applied Computing
April 2015
2418 pages
ISBN:9781450331968
DOI:10.1145/2695664
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 April 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. folksonomy
  2. imbalanced datasets
  3. machine learning
  4. pairwise learning
  5. semantic
  6. subsumption detection
  7. tags

Qualifiers

  • Research-article

Funding Sources

  • CNPq and FACEPE

Conference

SAC 2015
Sponsor:
SAC 2015: Symposium on Applied Computing
April 13 - 17, 2015
Salamanca, Spain

Acceptance Rates

SAC '15 Paper Acceptance Rate 291 of 1,211 submissions, 24%;
Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)2
Reflects downloads up to 30 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Linking Rare and Popular Tags in CQA SitesProceedings of the 31st International Conference on Information Systems Development10.62036/ISD.2023.42Online publication date: 2023
  • (2021)Mining Tag Relationships in CQA SitesConceptual Modeling10.1007/978-3-030-89022-3_27(345-355)Online publication date: 18-Oct-2021
  • (2020)Critical Parameter Identification for Safety Events in Commercial Aviation Using Machine LearningAerospace10.3390/aerospace70600737:6(73)Online publication date: 4-Jun-2020
  • (2019)Mining Hypernym-Hyponym Relations from Social Tags via Tag EmbeddingArtificial Intelligence and Security10.1007/978-3-030-24271-8_29(319-328)Online publication date: 11-Jul-2019
  • (2019)Semantic Richness of Tag Sets: Analysis of Machine Generated and Folk Tag SetHandbuch Methoden der Politikwissenschaft10.1007/978-3-030-19807-7_4(35-47)Online publication date: 8-May-2019
  • (2018)Learning Relations from Social Tagging DataPRICAI 2018: Trends in Artificial Intelligence10.1007/978-3-319-97304-3_3(29-41)Online publication date: 27-Jul-2018
  • (2015)Learning Structured Knowledge from Social Tagging Data: A Critical Review of Methods and Techniques2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity)10.1109/SmartCity.2015.89(307-314)Online publication date: Dec-2015

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media