Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1007/978-3-319-09153-2_25guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Heuristics for Semantic Path Search in Wikipedia

Published: 30 June 2014 Publication History

Abstract

In this paper an approach based on Heuristic Semantic Walk (HSW) is presented, where semantic proximity measures among concepts are used as heuristics in order to guide the concept chain search in the collaborative network of Wikipedia, encoding problem-specific knowledge in a problem-independent way. Collaborative information and multimedia repositories over the Web represent a domain of increasing relevance, since users cooperatively add to the objects tags, label, comments and hyperlinks, which reflect their semantic relationships, with or without an underlying structure. As in the case of the so called Big Data, methods for path finding in collaborative web repositories require solving major issues such as large dimensions, high connectivity degree and dynamical evolution of online networks, which make the classical approach ineffective. Experiments held on a range of different semantic measures show that HSW lead to better results than state of the art search methods, and points out the relevant features of suitable proximity measures for the Wikipedia concept network. The extracted semantic paths have many relevant applications such as query expansion, synthesis of explanatory arguments, and simulation of user navigation.

References

[1]
Bollegala, D., Matsuo, Y., Ishizukain, M.: A Web Search Engine-Based Approach to Measure Semantic Similarity between Words. IEEE Transactions on Knowledge and Data Engineering (2011)
[2]
Cilibrasi, R., Vitanyi, P.: The Google Similarity Distance. ArXiv.org (2004)
[3]
Church, K.W., Hanks, P.: Word association norms, mutual information and lexicography. In: ACL, vol.ä27 (1989)
[4]
Franzoni, V., Milani, A.: PMING Distance: A Collaborative Semantic Proximity Measure. In: WI-IAT, 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, vol.ä2, pp. 442—449 (2012)
[5]
Kurant, M., Markopoulou, A., Thiran, P.: On the bias of BSF. ITC (2010)
[6]
Milne, D., Witten, I.H.: An effective, low-cost measure of semantic relatedness obtained from Wikipedia links. WIKIAI (2008)
[7]
Yeh, E., Ramage, D., Manning, C.D., Agirre, E., Soroa, A.: WikiWalk: Random walks on Wikipedia for Semantic Relatedness. In: Proc. Graph-based Methods for Natural Language Processing (2009)
[8]
Newman, M.E.J.: Fast algorithm for detecting community structure in networks. University of Michigan, MI (2003)
[9]
Cao, G., Gao, J., Nie, J.Y., Bai, J.: Extending query translation to cross-language query expansion with markov chain models. CIKM, ATM (2007)
[10]
Turney, P.D.: Mining the web for synonyms: PMI-IR versus LSA on TOEFL. In: Flach, P.A., De Raedt, L. (eds.) ECML 2001. LNCS (LNAI), vol.ä2167, pp. 491—502. Springer, Heidelberg (2001)
[11]
Xu, Z., Luo, X., Yu, J., Xu, W.: Measuring semantic similarity between words by removing noise and redundancy in web snippets. Concurrency Computat: PEä23 (2011)
[12]
Wu, L., Hua, X.S., Yu, N., Ma, W.Y., Li, S.: Flickr Distance. Microsoft Research Asia (2008)
[13]
Leung, C.H.C., Li, Y., Milani, A., Franzoni, V.: Collective Evolutionary Concept Distance Based Query Expansion for Effective Web Document Retrieval. In: Murgante, B., Misra, S., Carlini, M., Torre, C.M., Nguyen, H.-Q., Taniar, D., Apduhan, B.O., Gervasi, O. (eds.) ICCSA 2013, Part IV. LNCS, vol.ä7974, pp. 657—672. Springer, Heidelberg (2013)
[14]
Gori, M., P.: A random-walk based scoring algorithm with application to recommender systems for large-scale e-commerce. In: 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)
[15]
Franzoni, V., Milani, A.: Heuristic Semantic Walk. In: Murgante, B., Misra, S., Carlini, M., Torre, C.M., Nguyen, H.-Q., Taniar, D., Apduhan, B.O., Gervasi, O. (eds.) ICCSA 2013, Part IV. LNCS, vol.ä7974, pp. 643—656. Springer, Heidelberg (2013)
[16]
Lew, M.S., Sebe, N., Djeraba, C., Jain, R.: Content-based multimedia information retrieval: State of the art and challenges. ACM Trans. Multimedia Comp. Com. App (2006)
[17]
Franzoni, V., Milani, A.: Heuristic semantic walk for concept chaining in collaborative networks. International Journal of Web Information Systemsä10(1), 85—103 (2014).
[18]
Franzoni, V., Milani, A., Mengoni, P., Mencacci, M.: Semantic Heuristic Search in Collaborative Networks: Measures and Contexts. In: WI-IAT, 2014 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology (2014) (accepted for)
[19]
Cheng, V.C., Leung, C.H.C., Liu, J., Milani, A.: Probabilistic Aspect Mining Model for Drug Reviews. IEEE Transactions on Knowledge and Data Engineeringä99, 1 (preprint, 2014).
[20]
Milani, A., Santucci, V.: Community of scientist optimization: An autonomy oriented approach to distributed optimization. AI Commun.ä25(2), 157—172 (2012).
[21]
Leung, C.H.C., Chan, A.W.S., Milani, A., Liu, J., Li, Y.: Intelligent Social Media Indexing and Sharing Using an Adaptive Indexing Search Engine. ACM TISTä3(3), 47 (2012).
[22]
Baioletti, M., Milani, A., Poggioni, V., Rossi, F.: Experimental evaluation of pheromone models in ACOPlan. Ann. Math. Artif. Intell.ä62(3-4), 187—217 (2011).

Cited By

View all
  • (2019)Neural Network Based Approach for Learning Planning Action ModelsComputational Science and Its Applications – ICCSA 201910.1007/978-3-030-24311-1_38(526-537)Online publication date: 1-Jul-2019
  • (2017)A path-based model for emotion abstraction on facebook using sentiment analysis and taxonomy knowledgeProceedings of the International Conference on Web Intelligence10.1145/3106426.3109420(947-952)Online publication date: 23-Aug-2017

Index Terms

  1. Heuristics for Semantic Path Search in Wikipedia

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    Proceedings of the 14th International Conference on Computational Science and Its Applications — ICCSA 2014 - Volume 8584
    June 2014
    807 pages
    ISBN:9783319091525
    • Editors:
    • Beniamino Murgante,
    • Sanjay Misra,
    • Ana Rocha,
    • Carmelo Torre,
    • Jorge Rocha,
    • Maria Falcão,
    • David Taniar,
    • Bernady Apduhan,
    • Osvaldo Gervasi

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Publication History

    Published: 30 June 2014

    Author Tags

    1. collaborative networks
    2. heuristics search
    3. information retrieval
    4. random walk
    5. semantic networks
    6. semantic similarity measures

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 16 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2019)Neural Network Based Approach for Learning Planning Action ModelsComputational Science and Its Applications – ICCSA 201910.1007/978-3-030-24311-1_38(526-537)Online publication date: 1-Jul-2019
    • (2017)A path-based model for emotion abstraction on facebook using sentiment analysis and taxonomy knowledgeProceedings of the International Conference on Web Intelligence10.1145/3106426.3109420(947-952)Online publication date: 23-Aug-2017

    View Options

    View options

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media