Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/1316689.1316723dlproceedingsArticle/Chapter ViewAbstractPublication PagesvldbConference Proceedingsconference-collections
Article

Similarity search for web services

Published: 31 August 2004 Publication History

Abstract

Web services are loosely coupled software components, published, located, and invoked across the web. The growing number of web services available within an organization and on the Web raises a new and challenging search problem: locating desired web services. Traditional keyword search is insufficient in this context: the specific types of queries users require are not captured, the very small text fragments in web services are unsuitable for keyword search, and the underlying structure and semantics of the web services are not exploited.
We describe the algorithms underlying the Woogle search engine for web services. Woogle supports similarity search for web services, such as finding similar web-service operations and finding operations that compose with a given one. We describe novel techniques to support these types of searches, and an experimental study on a collection of over 1500 web-service operations that shows the high recall and precision of our algorithms.

References

[1]
{1} Binding point. http://www.bindingpoint.com/.
[2]
{2} Grand central. http://www.grandcentral.com/directory/.
[3]
{3} Salcentral. http://www.salcentral.com/.
[4]
{4} Web service list. http://www.webservicelist.com/.
[5]
{5} Wordnet. http://www.cogsci.princeton.edu/ wn/.
[6]
{6} rainbow. http://www.cs.cmu.edu/ mccallum/bow, 2003.
[7]
{7} R. Agrawal, H. Mannila, R. Srikant, H. Toivonen, and A. Verkamo. Fast discovery of association rules. Advances in Knowledge Discovery and Data Mining, 1996.
[8]
{8} J. Cardoso. Quality of Service and Semantic Composition of Workflows. PhD thesis, University of Georgia, 2002.
[9]
{9} D.-S. Coalition. Daml-s: Web service description for the semantic web. In ISWC, 2002.
[10]
{10} S. Cost and S. Salzberg. A weighted nearest neighbor algorithm for learning with symbolic features. Machine Learning, 10:57-78, 1993.
[11]
{11} S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. JASIS, 41(6):391-407, 1990.
[12]
{12} H.-H. Do and E. Rahm. COMA - A System for Flexible Combination of Schema Matching Approaches. In Proc. of VLDB, 2002.
[13]
{13} A. Doan, P. Domingos, and A. Halevy. Reconciling schemas of disparate data sources: a machine learning approach. In Proc. of SIGMOD, 2001.
[14]
{14} D. Hand, H. Mannila, and P. Smyth. Principles of Data Mining. The MIT Press, 2001.
[15]
{15} A. Hess and N. Kushmerick. Learning to attach semantic metadata to web services. In ISWC, 2003.
[16]
{16} K. S. Jones. Automatic keyword classification for information retrieval. Archon Books, 1971.
[17]
{17} G. Karypis, E. H. Han, and V. Kumar. Chameleon: A hierarchical clustering algorithm using dynamic modeling. COMPUTER, 32, 1999.
[18]
{18} L. Kaufman and P. J. Rousseeuw. Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, New York, 1990.
[19]
{19} L. S. Larkey. Automatic essay grading using text classification techniques. In Proc. of ACM SIGIR, 1998.
[20]
{20} L. S. Larkey and W. Croft. Combining classifiers in text categorization. In Proc. of ACM SIGIR, 1996.
[21]
{21} V. Levenshtein. Binary codes capable of correcting deletions, insertions and reversals. Soviet Physics Daklady, 10:707-710, 1966.
[22]
{22} S. Melnik, H. Garcia-Molina, and E. Rahm. Similarity Flooding: A Versatile Graph Matching Algorithm. In Proc. of ICDE, 2002.
[23]
{23} M. Paolucci, T. Kawmura, T. Payne, and K. Sycara. Semantic matching of web services capabilities. In Proc. of International Semantic Web Conference(ISWC), 2002.
[24]
{24} E. Rahm and P. A. Bernstein. A survey on approaches to automatic schema matching. VLDB Journal, 10(4), 2001.
[25]
{25} G. Salton, editor. The SMART Retrieval System-Experiments in Automatic Document Retrieval. Prentice Hall Inc., Englewood Cliffs, NJ, 1971.
[26]
{26} E. Sirin, J. Hendler, and B. Parsia. Semi-automatic composition ofweb services using semantic descriptions. In WSMAI-2003, 2003.
[27]
{27} Y. Yang and J. Pedersen. A comparative study on feature selection in text categorization. In International Conference on Machine Learning, 1997.
[28]
{28} A. M. Zaremski and J. M. Wing. Specification matching of software components. TOSEM, 6:333-369, 1997.

Cited By

View all
  • (2020)Complex Network-Based Web Service for Web-API DiscoveryProceedings of the Australasian Computer Science Week Multiconference10.1145/3373017.3373035(1-10)Online publication date: 4-Feb-2020
  • (2019)A Fitness-Based Evolving Network for Web-APIs DiscoveryProceedings of the Australasian Computer Science Week Multiconference10.1145/3290688.3290709(1-10)Online publication date: 29-Jan-2019
  • (2017)Human-aware plan recognitionProceedings of the Thirty-First AAAI Conference on Artificial Intelligence10.5555/3298023.3298103(3686-3692)Online publication date: 4-Feb-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
VLDB '04: Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
August 2004
1380 pages

Sponsors

  • VLDB Endowment: Very Large Database Endowment

Publisher

VLDB Endowment

Publication History

Published: 31 August 2004

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2020)Complex Network-Based Web Service for Web-API DiscoveryProceedings of the Australasian Computer Science Week Multiconference10.1145/3373017.3373035(1-10)Online publication date: 4-Feb-2020
  • (2019)A Fitness-Based Evolving Network for Web-APIs DiscoveryProceedings of the Australasian Computer Science Week Multiconference10.1145/3290688.3290709(1-10)Online publication date: 29-Jan-2019
  • (2017)Human-aware plan recognitionProceedings of the Thirty-First AAAI Conference on Artificial Intelligence10.5555/3298023.3298103(3686-3692)Online publication date: 4-Feb-2017
  • (2017)Semantic similarity based web services composition frameworkProceedings of the Symposium on Applied Computing10.1145/3019612.3019805(1319-1325)Online publication date: 3-Apr-2017
  • (2016)Exploring web services from a network perspective using multi-level viewsJournal of Web Engineering10.5555/3177218.317722515:5-6(501-520)Online publication date: 1-Nov-2016
  • (2016)Semantic similarity based context-aware web service discovery using NLP techniquesJournal of Web Engineering10.5555/3177203.317720815:1-2(110-129)Online publication date: 1-Mar-2016
  • (2016)Discovering Underlying Plans Based on Distributed Representations of ActionsProceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems10.5555/2936924.2937091(1135-1143)Online publication date: 9-May-2016
  • (2016)Leveraging fuzzy dominance relationship and machine learning for hybrid web service discoveryInternational Journal of Web Engineering and Technology10.1504/IJWET.2016.07733611:2(107-132)Online publication date: 1-Jan-2016
  • (2016)Supporting the Design of Machine Learning Workflows with a Recommendation SystemACM Transactions on Interactive Intelligent Systems10.1145/28520826:1(1-35)Online publication date: 22-Feb-2016
  • (2016)Service querying to support process variant developmentJournal of Systems and Software10.1016/j.jss.2015.07.050122:C(538-552)Online publication date: 1-Dec-2016
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media