Abstract
Data-driven approaches based on web resources have been actively investigated for domain ontology construction and expansion. Although document filtering is important for domain ontology management, few studies have sought to develop a method for automatically selecting domain-relevant documents from the web. To address this gap, we propose a document filtering scheme that identifies documents relevant to a domain ontology based on concept preferences. In tests with a business domain ontology on 1,409 YahooPicks web pages, the proposed scheme yielded promising filtering results and outperformed the baseline system.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kang, BY., Kim, HG. (2006). Document Filtering for Domain Ontology Based on Concept Preferences. In: Mizoguchi, R., Shi, Z., Giunchiglia, F. (eds) The Semantic Web – ASWC 2006. ASWC 2006. Lecture Notes in Computer Science, vol 4185. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11836025_38
DOI: https://doi.org/10.1007/11836025_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-38329-1
Online ISBN: 978-3-540-38331-4
eBook Packages: Computer Science, Computer Science (R0)