Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2663761.2664212acmconferencesArticle/Chapter ViewAbstractPublication PagesracsConference Proceedingsconference-collections
research-article

A semantic weighting method for document classification based on Markov logic networks

Published: 05 October 2014 Publication History

Abstract

This paper proposes a semantic weighting method to classify textural documents. Human lives in the world where web documents have a great potential and the amount of valuable information has been consistently growing over the year. There is a problem that finding relevant web documents corresponding to what users want is more difficult due to the huge amount of web size. For this reason, there have been many researchers overcome this problem. The most important thing is document classification. All documents are composed of numerous words. Many classification methods have been extracted keywords from documents and then analyzed keywords pattern or frequency. In this paper, we propose Category Term Weight (CTW) using keywords from documents in order to enhance performance in document classification. CTW combines keywords frequency with semantic information. The frequency and semantic information have a great potential to find similarities between documents. That is why we calculates CTW from collection of training documents. After this step, CTW from unknown document and CTW in previous Category Term Database will be applied by designed Markov Logic Networks Model. Our designed MLNs Model and existing Naive-bayse Model will be compared by applied CTW. The experimental results shows the improvement of precision compare with the existing model.

References

[1]
http://en.wikipedia.org/wiki/Information_overload
[2]
Byeongkyu, K., Dongjin, C., Chang, C., Junho, C., and Pankoo, K. 2012. Data Classification through Specified Building n-gram. In proceedings of the International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing (The Palermo, Italy, July 04-06, 2012). IMIS '12. IEEE, 171--176.
[3]
Mitchell, T., 1997. Machine Learning, McGraw-Hill.
[4]
Byeongkyu, K., Kunseok, O., and Pankoo, K. 2012. A Research for Web Documents Genre Classification using STW. Journal of Information Technology and Architecture. 9, 4 (Dec. 2012), 413--422.
[5]
http://en.wikipedia.org/wiki/Tf%E2%80%93idf
[6]
http://en.wikipedia.org/wiki/Mutual_information
[7]
Chungseok, H., SooWon, L., and Sangyong, P. 2012. A Document Classification System Using Modified ECCD and Category Weight for each Document, Korea Information Processing Society. 19-B, 4 (Aug. 2012), 237--242.
[8]
Matthew, R., and Pedro, D. 2006. Markov logic networks, Machine Learning. 62, 1--2 (Feb. 2006), 107--136.
[9]
Sebastian, R., and Ivan, M. R. 2008. Collective semantic role labelling with markov logic. In Proceedings of the international Conference on Computational Natural Language Learning (The Manchester, UK, August 16--17, 2008) CoNLL '08. ACM, The Stroudsburg, PA, USA, 193--197.
[10]
Chang, C., Junho, C., Eunji, L., Ilsun, Y., and Pankoo, K. 2013. Probabilistic spatio-temporal inference for motion event understanding, Neurocomputing. 122, (Dec. 2013), 24--32.
[11]
Pedro, O. 2009, Probabilistic Reasoning in the Semantic Web using Markov Logic. Master Thesis. University of Coimbra.
[12]
Pedro, D., and Daniel, L. 2009. Markov Logic: An Interface Layer for Artificial Intelligence, Morgan and Claypool Publishers.
[13]
YongHwan, L. 2013. Detection and Recognition of Facial Emotion using Bezier Curves, IT Convergence Practice (INPRA). 1,2(Jun. 2013), 11--19.
[14]
Houda, B., and Max, B. 2004. Impact on Performance of Hypertext Classification of Selective Rich HTML Capture, Artificial Intelligence Applications and Innovations. 154, (Aug. 2004), 22--27.
[15]
Lingling, M., Runqing, H., and Junzhong, G. 2013. A Review of Semantic Similarity Measures in WordNet, International Journal of Hybrid Information Technology. 6, 1 (Jan. 2013), 1--12.
[16]
Bing, L., Wynne, H., and Yiming, M. 1998. Integrating Classification and Association Rule Mining, In Proceedings of Knowledge Discovery and Data Mining (New york, USA, August 27-31, 1998). KDD'98. 80--86.

Cited By

View all
  • (2018)Markov logic networks based emotion classification for Chinese microblogsInternational Journal of Intelligent Information and Database Systems10.1504/IJIIDS.2016.0754329:2(197-211)Online publication date: 14-Dec-2018

Index Terms

  1. A semantic weighting method for document classification based on Markov logic networks

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      RACS '14: Proceedings of the 2014 Conference on Research in Adaptive and Convergent Systems
      October 2014
      386 pages
      ISBN:9781450330602
      DOI:10.1145/2663761
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 05 October 2014

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. document classification
      2. markov logic networks
      3. semantic weight

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      RACS '14
      Sponsor:

      Acceptance Rates

      RACS '14 Paper Acceptance Rate 59 of 251 submissions, 24%;
      Overall Acceptance Rate 393 of 1,581 submissions, 25%

      Upcoming Conference

      RACS '24

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)1
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 29 Sep 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2018)Markov logic networks based emotion classification for Chinese microblogsInternational Journal of Intelligent Information and Database Systems10.1504/IJIIDS.2016.0754329:2(197-211)Online publication date: 14-Dec-2018

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media