Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1242572.1242818acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
Article

On building graphs of documents with artificial ants

Published: 08 May 2007 Publication History

Abstract

We present an incremental algorithm for building a neighborhood graph from a set of documents. This algorithm is based on a population of artificial agents that imitate the way real ants build structures with self-assembly behaviors. We show that our method outperforms standard algorithms for building such neighborhood graphs (up to 2230 times faster on the tested databases with equal quality) and how the user may interactively explore the graph.

References

[1]
H. Azzag, C. Guinot, and G. Venturini. Anttree: web document clustering using artificial ants. In R. L. de M'antaras and L. Saitta, editors, Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 04), pages 480--484. IOS Press, 8 2004.
[2]
T. Fruchterman and E. Reingold. Graph drawing by force-directed placement. In Software -- Practice and Experience, volume 21, pages 1129--1164, 1991.
[3]
C. Guinot, D. J.-M. Malvy, F. Morizot, M. Tenenhaus, J. Latreille, S. Lopez, E. Tschachler, and L. Dubertret. Classification of healthy human facial skin. Textbook of Cosmetic Dermatology Third edition, 2003.
[4]
H. Hacid and D. A. Zighed. An effective method for locally neighborhood graphs updating. In DEXA 2005, pages 930--939, 2005.
[5]
E.-H. Han, D. Boley, M. Gini, R. Gross, K. Hastings, G. Karypis, V. Kumar, B. Mobasher, and J. Moore. Webace: a web agent for document categorization and exploration. In AGENTS '98: Proceedings of the second international conference on Autonomous agents, pages 408--415, New York, NY, USA, 1998. ACM Press.
[6]
G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Inf. Process. Manage., 24(5):513--523, 1988.
[7]
G. T. Toussaint. The relative neighborhood graphs in a finite planar set. In Pattern recognition, chapter 12, pages 261--268. 1980.

Recommendations

Reviews

Suma Adabala

The task of clustering similar or related documents is important to information retrieval systems, like search engines. This is done by building graphs, where the given set of documents form the nodes and the edges represent the similarity between the documents and nodes. The authors present a graph-building algorithm that closely follows the self-assembly behavior observed when ants build living structures by connecting their bodies together. The tabulated results show that the proposed algorithm outperforms standard methods, such as relative neighborhood graphs (RNG) methods, for building graphs, while finding more similarity, that is, creating more links between documents. The key principle of the proposed algorithm is that the graph is built incrementally. When a new document is added, it follows the path of maximum similarity: it is connected to all neighboring nodes and documents whose similarity to the new document is higher than a given similarity threshold. As this is a short poster session paper, details of the algorithm and evaluation are omitted. It may be worthwhile to investigate other related publications by the authors, as the high performance of this algorithm makes it a promising substitute for current clustering algorithms. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '07: Proceedings of the 16th international conference on World Wide Web
May 2007
1382 pages
ISBN:9781595936547
DOI:10.1145/1242572
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 May 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. artificial ants
  2. clustering
  3. documents
  4. graph
  5. interactive visualization
  6. web

Qualifiers

  • Article

Conference

WWW'07
Sponsor:
WWW'07: 16th International World Wide Web Conference
May 8 - 12, 2007
Alberta, Banff, Canada

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 215
    Total Downloads
  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media