Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1739041.1739107acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

Fast ELCA computation for keyword queries on XML data

Published: 22 March 2010 Publication History

Abstract

Keyword search is integrated in many applications on account of the convenience to convey users' query intention. Recently, answering keyword queries on XML data has drawn the attention of web and database communities, because the success of this research will relieve users from learning complex XML query languages, such as XPath/XQuery, and/or knowing the underlying schema of the queried XML data. As a result, information in XML data can be discovered much easier.
To model the result of answering keyword queries on XML data, many LCA (lowest common ancestor) based notions have been proposed. In this paper, we focus on ELCA (Exclusive LCA) semantics, which is first proposed by Guo et al. and afterwards named by Xu and Papakonstantinou. We propose an algorithm named Hash Count to find ELCAs efficiently. Our analysis shows the complexity of Hash Count algorithm is O(kd|S1|), where k is the number of keywords, d is the depth of the queried XML document and |S1| is the frequency of the rarest keyword. This complexity is the best result known so far. We also evaluate the algorithm on a real DBLP dataset, and compare it with the state-of-the-art algorithms. The experimental results demonstrate the advantage of Hash Count algorithm in practice.

References

[1]
DBLP XML Records. In http://dblp.uni-trier.de/xml/.
[2]
Oracle Berkeley DB. In http://www.oracle.com/technology/products/berkeley-db/index.html.
[3]
S. Agrawal, S. Chaudhuri, and G. Das. Dbxplorer: A system for keyword-based search over relational databases. In ICDE, pages 5--16, 2002.
[4]
G. Bhalotia, A. Hulgeri, C. Nakhe, S. Chakrabarti, and S. Sudarshan. Keyword searching and browsing in databases using banks. In ICDE, pages 431--440, 2002.
[5]
S. Cohen, J. Mamou, Y. Kanza, and Y. Sagiv. Xsearch: A semantic search engine for xml. In VLDB, pages 45--56, 2003.
[6]
R. Goldman, N. Shivakumar, S. Venkatasubramanian, and H. Garcia-Molina. Proximity search in databases. In VLDB, pages 26--37. Morgan Kaufmann, 1998.
[7]
K. Golenberg, B. Kimelfeld, and Y. Sagiv. Keyword proximity search in complex data graphs. In SIGMOD Conference, pages 927--940, 2008.
[8]
L. Guo, F. Shao, C. Botev, and J. Shanmugasundaram. Xrank: Ranked keyword search over xml documents. In SIGMOD Conference, pages 16--27, 2003.
[9]
H. He, H. Wang, J. Yang, and P. S. Yu. Blinks: ranked keyword searches on graphs. In SIGMOD Conference, pages 305--316, 2007.
[10]
V. Hristidis and Y. Papakonstantinou. Discover: Keyword search in relational databases. In VLDB, pages 670--681, 2002.
[11]
V. Hristidis, Y. Papakonstantinou, and A. Balmin. Keyword proximity search on xml graphs. In ICDE, pages 367--378, 2003.
[12]
V. Kacholia, S. Pandit, S. Chakrabarti, S. Sudarshan, R. Desai, and H. Karambelkar. Bidirectional expansion for keyword search on graph databases. In VLDB, pages 505--516, 2005.
[13]
L. Kong, R. Gilleron, and A. Lemay. Retrieving meaningful relaxed tightest fragments for xml keyword search. In EDBT, pages 815--826, 2009.
[14]
G. Li, J. Feng, J. Wang, and L. Zhou. Effective keyword search for valuable lcas over xml documents. In CIKM, pages 31--40, 2007.
[15]
Y. Li, C. Yu, and H. V. Jagadish. Schema-free xquery. In VLDB, pages 72--83, 2004.
[16]
F. Liu, C. T. Yu, W. Meng, and A. Chowdhury. Effective keyword search in relational databases. In SIGMOD Conference, pages 563--574, 2006.
[17]
Z. Liu and Y. Chen. Identifying meaningful return information for xml keyword search. In SIGMOD Conference, pages 329--340, 2007.
[18]
Z. Liu and Y. Chen. Reasoning and identifying relevant matches for xml keyword search. PVLDB, 1(1):921--932, 2008.
[19]
Y. Luo, X. Lin, W. Wang, and X. Zhou. Spark: top-k keyword query in relational databases. In SIGMOD Conference, pages 115--126, 2007.
[20]
L. Qin, J. X. Yu, and L. Chang. Keyword search in databases: the power of rdbms. In SIGMOD Conference, pages 681--694, 2009.
[21]
A. Schmidt, M. L. Kersten, and M. Windhouwer. Querying xml documents made easy: Nearest concept queries. In ICDE, pages 321--329, 2001.
[22]
C. Sun, C. Y. Chan, and A. K. Goenka. Multiway slca-based keyword search in xml data. In WWW, pages 1043--1052, 2007.
[23]
W. Wang, X. Wang, and A. Zhou. Hash-search: An efficient slca-based keyword search algorithm on xml documents. In DASFAA, pages 496--510, 2009.
[24]
Y. Xu and Y. Papakonstantinou. Efficient keyword search for smallest lcas in xml databases. In SIGMOD Conference, pages 537--538, 2005.
[25]
Y. Xu and Y. Papakonstantinou. Efficient lca based keyword search in xml data. In EDBT, pages 535--546, 2008.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
EDBT '10: Proceedings of the 13th International Conference on Extending Database Technology
March 2010
741 pages
ISBN:9781605589459
DOI:10.1145/1739041
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 March 2010

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Funding Sources

Conference

EDBT/ICDT '10
EDBT/ICDT '10: EDBT/ICDT '10 joint conference
March 22 - 26, 2010
Lausanne, Switzerland

Acceptance Rates

Overall Acceptance Rate 7 of 10 submissions, 70%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)1
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Review on Keyword Search and Ranking Techniques for Semi-Structured DataKnowledge-Intensive Economies and Opportunities for Social, Organizational, and Technological Growth10.4018/978-1-5225-7347-0.ch013(248-270)Online publication date: 2019
  • (2019)A Crowdsourcing-based Approach for Efficient XML Keyword Search2019 5th International Conference on Web Research (ICWR)10.1109/ICWR.2019.8765281(16-21)Online publication date: Apr-2019
  • (2019)XPloreRankWorld Wide Web10.1007/s11280-018-0630-x22:4(1727-1750)Online publication date: 1-Jul-2019
  • (2019)Enabling Real Time Analytics over Raw XML DataReal-Time Business Intelligence and Analytics10.1007/978-3-030-24124-7_8(113-132)Online publication date: 11-Oct-2019
  • (2018)Discussions on Subgraph Ranking for Keyworded Search2018 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData)10.1109/Cybermatics_2018.2018.00179(935-936)Online publication date: Jul-2018
  • (2017)Towards heterogeneous keyword searchProceedings of the ACM Turing 50th Celebration Conference - China10.1145/3063955.3064802(1-6)Online publication date: 12-May-2017
  • (2017)A review on XML keyword query processing2017 International Conference on Innovative Mechanisms for Industry Applications (ICIMIA)10.1109/ICIMIA.2017.7975610(238-241)Online publication date: Feb-2017
  • (2017)Effective XML keyword query processing2017 International conference of Electronics, Communication and Aerospace Technology (ICECA)10.1109/ICECA.2017.8203739(523-528)Online publication date: Apr-2017
  • (2017)A query refinement framework for xml keyword searchWorld Wide Web10.1007/s11280-017-0447-z20:6(1469-1505)Online publication date: 1-Nov-2017
  • (2017)Top-k coupled keyword recommendation for relational keyword queriesKnowledge and Information Systems10.1007/s10115-016-0959-350:3(883-916)Online publication date: 1-Mar-2017
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media