Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleDecember 2024
Learning-Based Sub-image Retrieval in Historical Document Images
AbstractThe goal of this paper is to propose an unsupervised learning-based framework in order to deal with any kind of one-shot object detection scenario, focusing on the tasks of sub-image retrieval and pattern spotting in historical document images. ...
- ArticleDecember 2024
Fashion Image Retrieval with Occlusion
AbstractWith the growth of online fashion platforms and independent content creators, there is a growing interest in visually searching for similar clothing items as shown online. In real-world settings, clothes are often covered by other objects, making ...
- ArticleNovember 2024
Recovering Latent Hierarchical Relationships in Image Datasets Through Hyperbolic Embeddings
Progress in Pattern Recognition, Image Analysis, Computer Vision, and ApplicationsPages 92–103https://doi.org/10.1007/978-3-031-76607-7_7AbstractHyperbolic space has emerged as a promising alternative to Euclidean space for embedding high-dimensional data, including images. In particular, Hyperbolic embeddings have shown to be more effective in discovering hierarchical relationships ...
- research-articleNovember 2024
TPTE: Text-Guided Patch Token Exploitation for Unsupervised Fine-Grained Representation Learning
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 20, Issue 11Article No.: 352, Pages 1–18https://doi.org/10.1145/3673657Recent advances in pre-trained vision-language models have successfully boosted the performance of unsupervised image representation in many vision tasks. Most of existing works focus on learning global visual features with Transformers and neglect ...
- research-articleOctober 2024
Making Archives Searchable: Vision-Language Models for Classification of Historical Aerial Imagery
GeoSearch '24: Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Searching and Mining Large Collections of Geospatial DataPages 1–8https://doi.org/10.1145/3681769.3698578Historical aerial imagery archives contain valuable geospatial data for studying urban development, environmental changes, and historical events. However, the volume of data and inconsistencies in metadata and georeferencing complicate content ...
-
- ArticleNovember 2024
IRGen: Generative Modeling for Image Retrieval
- Yidan Zhang,
- Ting Zhang,
- Dong Chen,
- Yujing Wang,
- Qi Chen,
- Xing Xie,
- Hao Sun,
- Weiwei Deng,
- Qi Zhang,
- Fan Yang,
- Mao Yang,
- Qingmin Liao,
- Jingdong Wang,
- Baining Guo
AbstractWhile generative modeling has become prevalent across numerous research fields, its integration into the realm of image retrieval remains largely unexplored and underjustified. In this paper, we present a novel methodology, reframing image ...
- ArticleNovember 2024
MeshVPR: Citywide Visual Place Recognition Using 3D Meshes
AbstractMesh-based scene representation offers a promising direction for simplifying large-scale hierarchical visual localization pipelines, combining a visual place recognition step based on global features (retrieval) and a visual localization step ...
- ArticleOctober 2024
Statewide Visual Geolocalization in the Wild
AbstractThis work presents a method that is able to predict the geolocation of a street-view photo taken in the wild within a state-sized search region by matching against a database of aerial reference imagery. We partition the search region into ...
- ArticleSeptember 2024
BVRCC: Bootstrapping Video Retrieval via Cross-Matching Correction
Artificial Neural Networks and Machine Learning – ICANN 2024Pages 19–33https://doi.org/10.1007/978-3-031-72347-6_2AbstractExisting video retrieval datasets suffer heavily from the ignorance of cross-matching between captions and videos. Typically, captions actually match with multiple videos but are incorrectly labeled as exclusive to ones, leading to numerous ...
- ArticleAugust 2024
R-DiP: Re-ranking Based Diffusion Pre-computation for Image Retrieval
AbstractIn image retrieval tasks, although efficient methods based on pre-computing information related to retrieval and effective methods utilizing re-ranking have been proposed, developing a method that achieves both efficiency and effectiveness at the ...
- ArticleAugust 2024
Exemplar-Free Deep Incremental Hashing for Efficient Image Retrieval
Advanced Intelligent Computing Technology and ApplicationsPages 386–400https://doi.org/10.1007/978-981-97-5675-9_33AbstractDeep hashing techniques have been advanced by CNNs’ semantic representations. However, existing incremental hashing methods rely on original data to maintain similarities, which is often inaccessible due to privacy, legal, and transmission ...
- ArticleJuly 2024
Alt4Blind: A User Interface to Simplify Charts Alt-Text Creation
- Omar Moured,
- Shahid Ali Farooqui,
- Karin Müller,
- Sharifeh Fadaeijouybari,
- Thorsten Schwarz,
- Mohammed Javed,
- Rainer Stiefelhagen
Computers Helping People with Special NeedsPages 291–298https://doi.org/10.1007/978-3-031-62846-7_35AbstractAlternative Texts (Alt-Text) for chart images are essential for making graphics accessible to people with blindness and visual impairments. Traditionally, Alt-Text is manually written by authors but often encounters issues such as ...
- research-articleAugust 2024
Multi-Proxy Deep Hashing for Image Retrieval
MVRMLM '24: Proceedings of 2024 ACM ICMR Workshop on Multimodal Video RetrievalPages 33–38https://doi.org/10.1145/3664524.3675368Deep hashing is an effective method for content-based image retrieval due to its low storage requirements and fast retrieval speed. However, deep hashing is known to suffer from semantic information loss and quantization errors due to the need to ...
- research-articleAugust 2024
Hashing Orthogonal Constraint Loss for Multi-Label Image Retrieval
MVRMLM '24: Proceedings of 2024 ACM ICMR Workshop on Multimodal Video RetrievalPages 27–32https://doi.org/10.1145/3664524.3675367With the exponential growth of image data on the Internet, large-scale image retrieval has become increasingly important. Hash coding serves as a fundamental technique to achieve efficient retrieval. Traditional deep hashing methods typically optimize ...
- research-articleAugust 2024
Deep Fisher-Vector Descriptors for Image Retrieval and Scene Recognition
MVRMLM '24: Proceedings of 2024 ACM ICMR Workshop on Multimodal Video RetrievalPages 20–26https://doi.org/10.1145/3664524.3675365This study presents a novel architecture that significantly enhances the capabilities of large-scale image retrieval and recognition systems. We introduce a novel multi-stream Fisher vector network that integrates a convolutional neural network (CNN) ...
- ArticleMarch 2024
PDTW150K: A Dataset for Patent Drawing Retrieval
AbstractWe introduce a new large-scale patent dataset termed PDTW150K for patent drawing retrieval. The dataset contains more than 150,000 patents associated with text metadata and over 850,000 patent drawings. We also provide a set of bounding box ...
- ArticleJanuary 2024
Pseudo-label Based Unsupervised Momentum Representation Learning for Multi-domain Image Retrieval
AbstractAlthough many current cross-domain image retrieval researches have made good progress, most of the works is targeted at specific domains. At the same time, we also noticed that many works are based on manually annotated images. In this paper, in ...
- research-articleMarch 2024
DeepHashDetection: Adversarial Example Detection Basedon Similarity Image Retrieval
CCEAI '24: Proceedings of the 2024 8th International Conference on Control Engineering and Artificial IntelligencePages 220–224https://doi.org/10.1145/3640824.3640859Deep learning systems are extensively utilized in applications such as machine vision, autonomous driving, and audio recognition. However, there are concerns about their reliability and trustworthiness within both academic and industrial circles, ...
- research-articleJanuary 2024
Targeted Transferable Attack against Deep Hashing Retrieval
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaArticle No.: 48, Pages 1–7https://doi.org/10.1145/3595916.3626420With the extensive utilization of deep hashing, there exists a surging interest in studying adversarial attacks against it. Previous methods have demonstrated the superior white-box attack performance against deep hashing. However, the more challenging ...