DOI: 10.1145/1353343.1353351

Semantic peer, here are the neighbors you want!

Published: 25 March 2008

Abstract

Peer Data Management Systems (PDMSs) have been introduced as a solution to the problem of large-scale sharing of semantically rich data. A PDMS consists of semantic peers connected through semantic mappings. Querying a PDMS may yield very poor results, because each semantic mapping a query traverses introduces approximations that degrade its semantics. This raises the problem of how to boost a network of mappings in a PDMS.
In this paper we propose a strategy for the incremental maintenance of a flexible network organization that clusters semantically related peers into Semantic Overlay Networks (SONs) while maintaining a high degree of node autonomy. Semantic features, a summarized representation of clusters, are stored in a "light" structure that effectively assists a newly entering peer in choosing its semantically closest overlay networks. Each peer is then supported in selecting its own neighbors within each overlay network according to two policies: range-based selection and k-NN selection. For both policies, we introduce specific algorithms that exploit a distributed indexing mechanism for efficient network navigation. The proposed approach has been implemented in a prototype, on which its effectiveness and efficiency have been extensively tested.
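
The difference between the two neighbor-selection policies can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the abstract does not say how peers are represented or which distance is used, so peers are modeled here as sets of concept labels compared with Jaccard distance, and the names `jaccard_distance`, `range_select`, and `knn_select` are hypothetical. The sketch scans all candidates in one process; it does not reproduce the paper's algorithms, which rely on a distributed indexing mechanism.

```python
import heapq
from typing import Dict, FrozenSet, List

Features = FrozenSet[str]


def jaccard_distance(a: Features, b: Features) -> float:
    """Assumed semantic distance: 1 - |a & b| / |a | b| (0 = identical)."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def range_select(query: Features, peers: Dict[str, Features],
                 radius: float) -> List[str]:
    """Range-based policy: every peer within `radius`, closest first."""
    scored = [(jaccard_distance(query, feats), name)
              for name, feats in peers.items()]
    return [name for dist, name in sorted(scored) if dist <= radius]


def knn_select(query: Features, peers: Dict[str, Features],
               k: int) -> List[str]:
    """k-NN policy: the k closest peers, however far away they are."""
    scored = ((jaccard_distance(query, feats), name)
              for name, feats in peers.items())
    return [name for _, name in heapq.nsmallest(k, scored)]


# Hypothetical peers already in one overlay network.
peers = {
    "p1": frozenset({"author", "title", "year"}),
    "p2": frozenset({"author", "title", "publisher"}),
    "p3": frozenset({"gene", "protein"}),
    "p4": frozenset({"author", "title", "year", "venue", "pages"}),
}
newcomer = frozenset({"author", "title", "year", "venue"})

print(range_select(newcomer, peers, radius=0.3))
print(knn_select(newcomer, peers, k=3))
```

With range-based selection the number of neighbors depends on how many peers fall inside the radius, whereas k-NN selection fixes the number of neighbors and lets the distance vary. Under the same assumptions, `knn_select` could equally be pointed at cluster summaries instead of individual peers to rank candidate overlay networks for a newly entering peer.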


Published In

EDBT '08: Proceedings of the 11th international conference on Extending database technology: Advances in database technology
March 2008
762 pages
ISBN: 9781595939265
DOI: 10.1145/1353343

Publisher

Association for Computing Machinery

New York, NY, United States


Qualifiers

  • Research-article

Conference

EDBT '08

Acceptance Rates

Overall Acceptance Rate 7 of 10 submissions, 70%

Cited By

  • (2015) Increasing Coverage in Distributed Search and Recommendation with Profile Diversity. Transactions on Large-Scale Data- and Knowledge-Centered Systems XXII, Volume 9430, pages 115-144. DOI: 10.1007/978-3-662-48567-5_4. Online publication date: 1-Jul-2015
  • (2012) Ontology-Based Clustering in a Peer Data Management System. International Journal of Distributed Systems and Technologies, 3:2, pages 1-21. DOI: 10.4018/jdst.2012040101. Online publication date: 1-Apr-2012
  • (2011) Gossiping correspondences to reduce semantic heterogeneity of unstructured P2P systems. Proceedings of the 4th international conference on Data management in grid and peer-to-peer systems, pages 37-48. DOI: 10.5555/2040132.2040138. Online publication date: 1-Sep-2011
  • (2011) GROUP. Proceedings of the 11th international conference and 4th international conference on Smart spaces and next generation wired/wireless networking, pages 496-507. DOI: 10.5555/2033707.2033766. Online publication date: 22-Aug-2011
  • (2011) Peer-to-Peer Data Management. Synthesis Lectures on Data Management, 3:2, pages 1-150. DOI: 10.2200/S00338ED1V01Y201104DTM015. Online publication date: 31-May-2011
  • (2011) SoFA. Expert Systems with Applications: An International Journal, 38:1, pages 94-105. DOI: 10.1016/j.eswa.2010.06.020. Online publication date: 1-Jan-2011
  • (2011) Gossiping Correspondences to Reduce Semantic Heterogeneity of Unstructured P2P Systems. Data Management in Grid and Peer-to-Peer Systems, pages 37-48. DOI: 10.1007/978-3-642-22947-3_4. Online publication date: 2011
  • (2011) GROUP: A Gossip Based Building Community Protocol. Smart Spaces and Next Generation Wired/Wireless Networking, pages 496-507. DOI: 10.1007/978-3-642-22875-9_45. Online publication date: 2011
  • (2011) Efficient Data Sharing over Large-Scale Distributed Communities. Intelligent Decision Systems in Large-Scale Distributed Environments, pages 149-164. DOI: 10.1007/978-3-642-21271-0_7. Online publication date: 2011
  • (2009) Rewiring strategies for semantic overlay networks. Distributed and Parallel Databases, 26:2-3, pages 181-205. DOI: 10.1007/s10619-009-7046-7. Online publication date: 1-Dec-2009
