DOI: 10.1145/1646396.1646421
research-article

Evaluation of GIST descriptors for web-scale image search

Published: 08 July 2009

Abstract

The GIST descriptor has recently received increasing attention in the context of scene recognition. In this paper, we evaluate the search accuracy and complexity of the global GIST descriptor for two applications for which a local description is usually preferred: same location/object recognition and copy detection. We identify the cases in which a global description can reasonably be used.
The comparison is performed against a state-of-the-art bag-of-features representation. To evaluate the impact of GIST's spatial grid, we compare GIST with a bag-of-features restricted to the same spatial grid as in GIST.
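As a rough illustration of what restricting a bag-of-features to a spatial grid can mean, the sketch below builds one visual-word histogram per grid cell and concatenates them. It is a minimal sketch under stated assumptions, not the representation evaluated in the paper: the function name `grid_bof`, the `(x, y, word_id)` keypoint format, the default 4x4 grid, and the L2 normalization are all hypothetical choices made for the example.

```python
from typing import List, Sequence, Tuple


def grid_bof(
    keypoints: Sequence[Tuple[float, float, int]],
    width: int,
    height: int,
    vocab_size: int,
    grid: int = 4,  # assumed grid resolution, chosen for illustration
) -> List[float]:
    """Concatenate one visual-word histogram per cell of a grid x grid layout.

    Each keypoint is (x, y, word_id), where word_id is the index of the
    visual word the local descriptor was quantized to.
    """
    hist = [0.0] * (grid * grid * vocab_size)
    for x, y, word in keypoints:
        # Map the pixel position to a cell, clamping points on the far border.
        col = min(int(x * grid / width), grid - 1)
        row = min(int(y * grid / height), grid - 1)
        cell = row * grid + col
        hist[cell * vocab_size + word] += 1.0

    # L2-normalize so that images with different keypoint counts are comparable.
    norm = sum(v * v for v in hist) ** 0.5
    return [v / norm for v in hist] if norm > 0 else hist


if __name__ == "__main__":
    points = [(0.0, 0.0, 1), (99.0, 99.0, 0)]
    vec = grid_bof(points, width=100, height=100, vocab_size=2, grid=2)
    print(len(vec), vec)
```

Unlike a plain bag-of-features, the resulting vector changes when image content moves from one cell to another, which is the property shared with a grid-based global descriptor.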
Finally, we propose an indexing strategy for global descriptors that optimizes the trade-off between memory usage and precision. Our scheme provides reasonable accuracy in some widespread application cases together with very high efficiency: in our experiments, querying an image database of 110 million images takes 0.18 seconds per image on a single machine. For common copyright attacks, this efficiency is obtained without noticeably sacrificing search accuracy compared with state-of-the-art approaches.
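One generic way to trade memory for precision with global descriptors is to filter candidates with short binary codes and then re-rank a small shortlist with the full vectors. The sketch below illustrates that idea only; it is not claimed to be the indexing scheme proposed in the paper. The names `binarize`, `hamming`, and `search`, the per-dimension thresholds, and the toy two-dimensional descriptors are all assumptions made for the example.

```python
from typing import List, Sequence


def binarize(vec: Sequence[float], thresholds: Sequence[float]) -> int:
    """Pack one bit per dimension into an int: 1 if the value exceeds its threshold."""
    code = 0
    for v, t in zip(vec, thresholds):
        code = (code << 1) | (1 if v > t else 0)
    return code


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two binary codes."""
    return bin(a ^ b).count("1")


def sq_l2(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two full descriptors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def search(
    query: Sequence[float],
    database: Sequence[Sequence[float]],
    thresholds: Sequence[float],
    shortlist: int = 2,
    top: int = 1,
) -> List[int]:
    """Return indices of the `top` nearest database vectors to `query`."""
    # In a real system the codes would be precomputed and kept in memory,
    # while the full descriptors could live on slower storage.
    codes = [binarize(d, thresholds) for d in database]
    q_code = binarize(query, thresholds)

    # Stage 1: cheap filtering on compact codes.
    order = sorted(range(len(database)), key=lambda i: hamming(q_code, codes[i]))
    candidates = order[:shortlist]

    # Stage 2: exact re-ranking of the shortlist only.
    return sorted(candidates, key=lambda i: sq_l2(query, database[i]))[:top]


if __name__ == "__main__":
    db = [[0.9, 0.1], [0.2, 0.8], [0.8, 0.2]]
    print(search([0.88, 0.1], db, thresholds=[0.5, 0.5]))
```

A longer code or a larger shortlist costs more memory or time and typically improves precision, which is the kind of trade-off the abstract refers to.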

Published In

CIVR '09: Proceedings of the ACM International Conference on Image and Video Retrieval
July 2009
383 pages
ISBN: 9781605584805
DOI: 10.1145/1646396
Publisher

Association for Computing Machinery

New York, NY, United States
