Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1835804.1835924acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Growing a tree in the forest: constructing folksonomies by integrating structured metadata

Published: 25 July 2010 Publication History

Abstract

Many social Web sites allow users to annotate the content with descriptive metadata, such as tags, and more recently to organize content hierarchically. These types of structured metadata provide valuable evidence for learning how a community organizes knowledge. For instance, we can aggregate many personal hierarchies into a common taxonomy, also known as a folksonomy, that will aid users in visualizing and browsing social content, and also to help them in organizing their own content. However, learning from social metadata presents several challenges, since it is sparse, shallow, ambiguous, noisy, and inconsistent. We describe an approach to folksonomy learning based on relational clustering, which exploits structured metadata contained in personal hierarchies. Our approach clusters similar hierarchies using their structure and tag statistics, then incrementally weaves them into a deeper, bushier tree. We study folksonomy learning using social metadata extracted from the photo-sharing site Flickr, and demonstrate that the proposed approach addresses the challenges. Moreover, comparing to previous work, the approach produces larger, more accurate folksonomies, and in addition, scales better.

Supplementary Material

JPG File (kdd2010_plangprasopchok_gtf_01.jpg)
MOV File (kdd2010_plangprasopchok_gtf_01.mov)

References

[1]
I. Bhattacharya and L. Getoor. Collective entity resolution in relational data. ACM Trans. Knowl. Discov. Data, 1(1):5, 2007.
[2]
M. Broecheler and L. Getoor. Probabilistic similarity logic. In Proceedings of International Workshop on Statistical Relational Learning, 2009.
[3]
C. H. Brooks and N. Montanez. Improved annotation of the blogosphere via autotagging and hierarchical clustering. In Proceedings of the World Wide Web conference, 2006.
[4]
J. Euzenat and P. Shvaiko. Ontology Matching. Springer-Verlag, 2007.
[5]
S. A. Golder and B. A. Huberman. Usage patterns of collaborative tagging systems. J. Inf. Sci., 32(2):198--208, April 2006.
[6]
M. A. Hearst. Automatic acquisition of hyponyms from large text corpora. In Proceedings of the Annual Meeting of the Association for Computational Linguistics, 1992.
[7]
P. Heymann and H. Garcia-Molina. Collaborative creation of communal hierarchical taxonomies in social tagging systems. Technical Report 2006--10, Stanford University, Stanford, CA, USA, April 2006.
[8]
A. Maedche and S. Staab. Measuring similarity between ontologies. In Proceedings of the Knowledge Engineering and Knowledge Management, 2002.
[9]
A. Mathes. Folksonomies: cooperative classification and communication through shared metadata. 2004.
[10]
P. Mika. Ontologies are us: A unified model of social networks and semantics. J. Web Sem., 5(1):5--15, 2007.
[11]
A. E. Monge and C. Elkan. An efficient domain-independent algorithm for detecting approximately duplicate database records. In Proceedings of the SIGMOD workshop on Data Mining and Knowledge Discovery, 1997.
[12]
A. Plangprasopchok and K. Lerman. Constructing folksonomies from user-specified relations on flickr. In Proceedings of the World Wide Web conference, 2009.
[13]
T. Rattenbury, N. Good, and M. Naaman. Towards automatic extraction of event and place semantics from flickr tags. In Proceedings of the conference on Research and development in information retrieval, 2007.
[14]
M. Sanderson and W. B. Croft. Deriving concept hierarchies from text. In Proceedings of the conference on Research and development in information retrieval, pages 206--213, 1999.
[15]
P. Schmitz. Inducing ontology from flickr tags. In Proceedings of the WWW workshop on Collaborative Web Tagging Workshop, 2006.
[16]
R. Snow, D. Jurafsky, and A. Y. Ng. Semantic taxonomy induction from heterogenous evidence. In Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2006.
[17]
O. Udrea, L. Getoor, and R. J. Miller. Leveraging data and structure in ontology integration. In SIGMOD Conference, 2007.
[18]
J. Wang and P. Domingos. Hybrid markov logic networks. In Proceedings of Association for the Advancement of Artificial Intelligence, 2008.
[19]
H. Yang and J. Callan. A metric-based framework for automatic taxonomy induction. In Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2009.

Cited By

View all
  • (2022)On the Applications and Parallelization of Multichannel Source CodingTENCON 2022 - 2022 IEEE Region 10 Conference (TENCON)10.1109/TENCON55691.2022.9977596(1-6)Online publication date: 1-Nov-2022
  • (2019)Sampling sketches for concave sublinear functions of frequenciesProceedings of the 33rd International Conference on Neural Information Processing Systems10.5555/3454287.3454409(1363-1373)Online publication date: 8-Dec-2019
  • (2016)Research on Model Beginning Standard of Multi-objective Evolutionary Algorithm Based on EntropyJournal of Software10.17706/jsw.11.7.685-69411:7(685-694)Online publication date: Jul-2016
  • Show More Cited By

Index Terms

  1. Growing a tree in the forest: constructing folksonomies by integrating structured metadata

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
      July 2010
      1240 pages
      ISBN:9781450300551
      DOI:10.1145/1835804
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 25 July 2010

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. collective knowledge
      2. data mining
      3. folksonomies
      4. relational clustering
      5. social information processing
      6. social metadata
      7. taxonomies

      Qualifiers

      • Research-article

      Conference

      KDD '10
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

      Upcoming Conference

      KDD '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 16 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)On the Applications and Parallelization of Multichannel Source CodingTENCON 2022 - 2022 IEEE Region 10 Conference (TENCON)10.1109/TENCON55691.2022.9977596(1-6)Online publication date: 1-Nov-2022
      • (2019)Sampling sketches for concave sublinear functions of frequenciesProceedings of the 33rd International Conference on Neural Information Processing Systems10.5555/3454287.3454409(1363-1373)Online publication date: 8-Dec-2019
      • (2016)Research on Model Beginning Standard of Multi-objective Evolutionary Algorithm Based on EntropyJournal of Software10.17706/jsw.11.7.685-69411:7(685-694)Online publication date: Jul-2016
      • (2016)Folksonomy-Based Visual Ontology Construction and Its ApplicationsIEEE Transactions on Multimedia10.1109/TMM.2016.252760218:4(702-713)Online publication date: Apr-2016
      • (2016)Applying Community Detection Methods to Cluster Tags in Multimedia Search Results2016 IEEE International Symposium on Multimedia (ISM)10.1109/ISM.2016.0106(467-474)Online publication date: Dec-2016
      • (2015)Radial CursorProceedings of the Annual Meeting of the Australian Special Interest Group for Computer Human Interaction10.1145/2838739.2838828(73-77)Online publication date: 7-Dec-2015
      • (2015)The Role of Structural Information for Designing Navigational User InterfacesProceedings of the 26th ACM Conference on Hypertext & Social Media10.1145/2700171.2791025(59-68)Online publication date: 24-Aug-2015
      • (2015)Learning Structured Knowledge from Social Tagging Data: A Critical Review of Methods and Techniques2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity)10.1109/SmartCity.2015.89(307-314)Online publication date: Dec-2015
      • (2015)Are folksonomies shared conceptualizations?2015 IEEE 19th International Conference on Computer Supported Cooperative Work in Design (CSCWD)10.1109/CSCWD.2015.7230969(265-270)Online publication date: May-2015
      • (2014)A semantic case based web 2.0 tag hierarchy construction framework2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)10.1109/FSKD.2014.6980905(616-622)Online publication date: Aug-2014
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media