Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1247480.1247637acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Information discovery in loosely integrated data

Published: 11 June 2007 Publication History

Abstract

We model heterogeneous data sources with cross references, such as those crawled on the (enterprise) web, as a labeled graph with data objects as typed nodes and references or links as edges. Given the labeled data graph, we introduce flexible and efficient querying capabilities that go beyond existing capabilities by additionally discovering meaningful relationships between objects that satisfy keyword and/or structured query filters. We introduce the relationship search operator that exploits the link structure between data objects to rank objects related to the result of a filter. We implement the search operator using the ObjectRank [1] algorithm that uses the random surfer model. We study several alternatives for constructing summary graphs for query results that consist of individual and aggregate nodes that are somehow linked to qualifying result nodes. Some of the summary graphs are useful for presenting query results to the user, while others could be used to evaluate subsequent queries efficiently without considering all the nodes and links in the original data graph.

References

[1]
Andrey Balmin, Vagelis Hristidis, Yannis Papakonstantinou: ObjectRank: Authority-Based Keyword Search in Databases. VLDB 2004: 564--575.
[2]
Paul Brown, Peter J. Haas, Jussi Myllymaki, Hamid Pirahesh, Berthold Reinwald, Yannis Sismanis: Toward Automated large-scale Information Integration and Discovery. Data Management in a Connected World 2005: 161--180.
[3]
IBM Entity Analytic Solutions (EAS)-Solution Overview www.ibm.com/software/data/db2/eas/
[4]
Yannis Sismanis, Paul Brown, Peter J. Haas, Berthold Reinwald: GORDIAN: Efficient and Scalable Discovery of Composite Keys. VLDB 2006: 691--702.

Cited By

View all
  • (2018)BinRankIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2010.8522:8(1176-1190)Online publication date: 31-Dec-2018
  • (2009)BinRankProceedings of the 2009 IEEE International Conference on Data Engineering10.1109/ICDE.2009.94(66-77)Online publication date: 29-Mar-2009

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data
June 2007
1210 pages
ISBN:9781595936868
DOI:10.1145/1247480
  • General Chairs:
  • Lizhu Zhou,
  • Tok Wang Ling,
  • Program Chair:
  • Beng Chin Ooi
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. XML
  2. information discovery
  3. search

Qualifiers

  • Article

Conference

SIGMOD/PODS07
Sponsor:

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 23 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2018)BinRankIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2010.8522:8(1176-1190)Online publication date: 31-Dec-2018
  • (2009)BinRankProceedings of the 2009 IEEE International Conference on Data Engineering10.1109/ICDE.2009.94(66-77)Online publication date: 29-Mar-2009

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media