- short-paper, July 2024
A Learning-to-Rank Formulation of Clustering-Based Approximate Nearest Neighbor Search
SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Pages 2261–2265, https://doi.org/10.1145/3626772.3657931
A critical piece of the modern information retrieval puzzle is approximate nearest neighbor search. Its objective is to return a set of k data points that are closest to a query point, with its accuracy measured by the proportion of exact nearest ...
- research-article, December 2019
Influence constraint based Top-k spatial keyword preference query
AIIPCC '19: Proceedings of the International Conference on Artificial Intelligence, Information Processing and Cloud Computing, Article No.: 60, Pages 1–6, https://doi.org/10.1145/3371425.3371492
The traditional Top-k spatial keyword preference query processing mode usually selects the range and nearest neighbor as the spatial constraints. It focuses on the influence of the distance between a spatial object and a feature object on the query ...
- research-article, March 2013
Efficient processing of containment queries on nested sets
EDBT '13: Proceedings of the 16th International Conference on Extending Database Technology, Pages 227–238, https://doi.org/10.1145/2452376.2452404
We study the problem of computing containment queries on sets which can have both atomic and set-valued objects as elements, i.e., nested sets. Containment is a fundamental query pattern with many basic applications. Our study of nested set containment ...
- Article, October 2012
An efficient video copy detection method combining vocabulary tree and inverted file
IScIDE'12: Proceedings of the third Sino-foreign-interchange conference on Intelligent Science and Intelligent Data Engineering, Pages 613–621, https://doi.org/10.1007/978-3-642-36669-7_75
In this paper, we present an efficient content-based video copy detection method based on vocabulary tree and inverted files. The copy detection system exploits complementary local features and video sequence matching. Using two different local features,...
- poster, October 2011
Index tuning for query-log based on-line index maintenance
CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management, Pages 1997–2000, https://doi.org/10.1145/2063576.2063874
The existing query-log based on-line index maintenance approaches rely on frequency distribution of terms in the static query-log. Though these approaches are proved to be efficient, but in real world, the frequency distribution of the terms changes ...
- Article, June 2011
An Effective and Efficient Indexing Scheme for Audio Fingerprinting
MUE '11: Proceedings of the 2011 Fifth FTRA International Conference on Multimedia and Ubiquitous Engineering, Pages 48–52, https://doi.org/10.1109/MUE.2011.20
An audio fingerprint is a content-based compact signature that summarizes an audio recording. A song can be recognized by matching an extracted fingerprint to a database of known fingerprints. Audio fingerprinting must solve the two key problems of ...
- demonstration, October 2010
A technical demonstration of large-scale image object retrieval by efficient query evaluation and effective auxiliary visual feature discovery
MM '10: Proceedings of the 18th ACM international conference on Multimedia, Pages 1559–1562, https://doi.org/10.1145/1873951.1874286
In this demonstration, we present a real-time system that addresses three essential issues of large-scale image object retrieval: 1) image object retrieval-facilitating pseudo-objects in inverted indexing and novel object-level pseudo-relevance feedback ...
- Article, August 2010
Research in Automatic Search Engine Replacement Algorithm for Web Caching Based on User Behavior
WISA '10: Proceedings of the 2010 Seventh Web Information Systems and Applications Conference, Pages 142–145, https://doi.org/10.1109/WISA.2010.25
To improve the retrieval efficiency and performance of the large scale information retrieval systems, analyzed existing replacement algorithm for WEB caching, due to the diversity of the WEB traffic pattern, the traditional algorithms for cache updating ...
- Article, December 2009
Semantic Image Retrieval Using Region Based Inverted File
DICTA '09: Proceedings of the 2009 Digital Image Computing: Techniques and Applications, Pages 242–249, https://doi.org/10.1109/DICTA.2009.48
Image data is as common as textual data in this digital world. There is an urgent demand of image management tools as efficient as those text search engines. Decades of research on image retrieval has found there is a significant gap between the ...
- research-article, November 2009
On-line index maintenance using horizontal partitioning
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management, Pages 435–444, https://doi.org/10.1145/1645953.1646010
In this paper, we propose a new merge-based index maintenance strategy for Information Retrieval systems. The new model is based on partitioning of the inverted index across the terms in it. We exploit the query log to partition the on-disk inverted ...
- Article, July 2009
Search Mathematical Formulas by Mathematical Formulas
Proceedings of the Symposium on Human Interface 2009 on Conference Universal Access in Human-Computer Interaction. Part I: Held as Part of HCI International 2009, Pages 404–411, https://doi.org/10.1007/978-3-642-02556-3_46
Users cannot search information by mathematical formulas as queries in existing search engines. This is because mathematical formulas are not expressed as a sequence of characters. Some formulas are expressed in a complex structure like fractional ...
- Article, August 2008
Mitos: Design and Evaluation of a DBMS-Based Web Search Engine
PCI '08: Proceedings of the 2008 Panhellenic Conference on Informatics, Pages 49–53, https://doi.org/10.1109/PCI.2008.46
Engineering a Web search engine offering effective and efficient information retrieval is a challenging task. Mitos is a recently developed search engine that offers a wide spectrum of functionalities. A rather unusual design choice is that its index is ...
- Article, May 2007
Using d-gap patterns for index compression
WWW '07: Proceedings of the 16th international conference on World Wide Web, Pages 1209–1210, https://doi.org/10.1145/1242572.1242769
Sequential patterns of d-gaps exist pervasively in inverted lists of Web document collection indices due to the cluster property. In this paper the information of d-gap sequential patterns is used as a new dimension for improving inverted index ...
- Article, October 2005
Fast on-line index construction by geometric partitioning
CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management, Pages 776–783, https://doi.org/10.1145/1099554.1099739
Inverted index structures are the mainstay of modern text retrieval systems. They can be constructed quickly using off-line merge-based methods, and provide efficient support for a variety of querying modes. In this paper we examine the task of on-line ...
- article, March 2005
A statistics-based approach to incrementally update inverted files
Information Processing and Management: an International Journal (IPRM), Volume 41, Issue 2, Pages 275–288, https://doi.org/10.1016/j.ipm.2003.10.004
Many information retrieval systems use the inverted file as indexing structure. The inverted file, however, requires inefficient reorganization when new documents are to be added to an existing collection. Most studies suggest dealing with this problem ...
- article, January 2003
Inverted file compression through document identifier reassignment
Information Processing and Management: an International Journal (IPRM), Volume 39, Issue 1, Pages 117–131, https://doi.org/10.1016/S0306-4573(02)00020-1
The inverted file is the most popular indexing mechanism for document search in an information retrieval system. Compressing an inverted file can greatly improve document search rate. Traditionally, the d-gap technique is used in the inverted file ...
- Article, December 2002
An effective region-based image retrieval framework
MULTIMEDIA '02: Proceedings of the tenth ACM international conference on Multimedia, Pages 456–465, https://doi.org/10.1145/641007.641106
We present a region-based image retrieval framework that integrates efficient region-based representation in terms of storage and retrieval and effective on-line learning capability. The framework consists of methods for image segmentation and grouping, ...
- Article, November 2001
Compressing inverted files in scalable information systems by binary decision diagram encoding
SC '01: Proceedings of the 2001 ACM/IEEE conference on Supercomputing, Page 60, https://doi.org/10.1145/582034.582094
One of the key challenges of managing very huge volumes of data in scalable Information retrieval systems is providing fast access through keyword searches. The major data structure in the information retrieval system is an inverted file, which records ...
- article, July 1993
Query processing and inverted indices in shared: nothing text document information retrieval systems
The VLDB Journal — The International Journal on Very Large Data Bases (VLDB), Volume 2, Issue 3, Pages 243–276
The performance of distributed text document retrieval systems is strongly influenced by the organization of the inverted text. This article compares the performance impact on query processing of various physical organizations for inverted lists. We ...
- article, December 1979
Design of a balanced multiple-valued file-organization scheme with the least redundancy
ACM Transactions on Database Systems (TODS), Volume 4, Issue 4, Pages 518–530, https://doi.org/10.1145/320107.320123
A new balanced file-organization scheme of order two for multiple-valued records is presented. This scheme is called HUBMFS2 (Hiroshima University Balanced Multiple-valued File-organization Scheme of order two). It is assumed that records are ...