Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleOctober 2024
Data Augmentation via Latent Diffusion for Saliency Prediction
AbstractSaliency prediction models are constrained by the limited diversity and quantity of labeled data. Standard data augmentation techniques such as rotating and cropping alter scene composition, affecting saliency. We propose a novel data augmentation ...
- research-articleOctober 2024
MAGNet: Multi-scale Awareness and Global fusion Network for RGB-D salient object detection
AbstractIn recent years, excellent RGB-D salient object detection performance has been achieved. However, existing detection methods generally require a large number of model parameters in pursuit of high accuracy. To alleviate this problem, we propose a ...
- short-paperApril 2024
Perceptual Impact of Facial Quality in MPEG V-PCC-encoded Volumetric Videos
MMVE '24: Proceedings of the 16th International Workshop on Immersive Mixed and Virtual Environment SystemsPages 71–74https://doi.org/10.1145/3652212.3652221Volumetric video, a technique used in augmented reality (AR) and virtual reality (VR) applications, presents unique challenges in rendering and compression. To enable efficient compression, video-based point cloud compression (V-PCC) techniques have been ...
- research-articleApril 2024
Exploring the benefits of images with frequency visual content in predicting human ocular scanpaths using Artificial Neural Networks
Expert Systems with Applications: An International Journal (EXWA), Volume 239, Issue Chttps://doi.org/10.1016/j.eswa.2023.121839AbstractWe present a study of an artificial neural architecture that predict human ocular scanpaths while they are free-viewing different images types. This analysis is made by comparing different metrics that encompass scanpath patterns, these metrics ...
- research-articleMarch 2024
Blind quality-based pairwise ranking of contrast changed color images using deep networks
AbstractNext-generation multimedia networks are expected to provide systems and applications with top Quality of Experience (QoE) to users. To this end, robust quality evaluation metrics are critical. Unfortunately, most current research focuses only on ...
Highlights- A new architecture based on a suite of deep-learning models is introduced. The proposed architecture exploits relevant visual information, such as spatial color attributes and visual attention mechanisms, to accurately rank contrast-...
-
- research-articleOctober 2023
Multi-sentence video captioning using spatial saliency of video frames and content-oriented beam search algorithm
Expert Systems with Applications: An International Journal (EXWA), Volume 228, Issue Chttps://doi.org/10.1016/j.eswa.2023.120454AbstractVideo captioning algorithms aim at expressing the information and activities contained in a video clip in the form of lingual sentences. Most existing video captioning approaches have used only one sentence to describe the semantic content of a ...
- research-articleOctober 2023
VSGAN: Visual Saliency guided Generative Adversarial Network for data augmentation
IMXw '23: Proceedings of the 2023 ACM International Conference on Interactive Media Experiences WorkshopsPages 69–75https://doi.org/10.1145/3604321.3604382Deep learning approaches have allowed for a great leap in the performances of visual saliency models. However, the lack of annotated data remains the main challenge for visual saliency prediction. In this paper, we leverage image inpainting methods to ...
- research-articleJune 2023
Importance First: Generating Scene Graph of Human Interest
International Journal of Computer Vision (IJCV), Volume 131, Issue 10Pages 2489–2515https://doi.org/10.1007/s11263-023-01817-7AbstractScene graph aims to faithfully reveal humans’ perception of image content. When humans look at a scene, they usually focus on their interested parts in a special priority. This innate habit indicates a hierarchical preference about human ...
- research-articleApril 2023
The Dahu graph-cut for interactive segmentation on 2D/3D images
Highlights- An efficient method to compute the Dahu pseudo-distance on multivariate images.
Interactive image segmentation is an important application in computer vision for selecting objects of interest in images. Several interactive segmentation methods are based on distance transform algorithms. However, the most known ...
- research-articleApril 2023
Loop closure detection with patch-level local features and visual saliency prediction
Engineering Applications of Artificial Intelligence (EAAI), Volume 120, Issue Chttps://doi.org/10.1016/j.engappai.2023.105902AbstractLoop closure detection (LCD) is essential in the field of visual Simultaneous Localization and Mapping (vSLAM). In the LCD system, geometrical verification based on image matching plays a crucial role in avoiding erroneous detections. ...
Highlights- Create a novel LCD-oriented saliency prediction dataset (Saliency-LCD).
- Design ...
- research-articleOctober 2022
An enhanced image quality assessment by synergizing superpixels and visual saliency
Journal of Visual Communication and Image Representation (JVCIR), Volume 88, Issue Chttps://doi.org/10.1016/j.jvcir.2022.103610Highlights- Three limitations between the superpixel-based and VS-based FR-IQA models are found.
- We found that the two approaches have a complementary principle.
- Base on this principle, a FR-IQA model for synergizing superpixel and VS is ...
Superpixel and saliency-based evaluation methods play important roles in full reference image quality assessment (FR IQA). However, we find that these methods have one complementary principle and three limitations: (1) the weighted maps of ...
- ArticleJanuary 2023
Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual Saliency
AbstractThe intelligent video surveillance system (IVSS) can automatically analyze the content of the surveillance image (SI) and reduce the burden of the manual labour. However, the SIs may suffer quality degradations in the procedure of acquisition, ...
- research-articleAugust 2022
Objective quality assessment of retargeted images based on RBF neural network with structural distortion and content change
Multimedia Tools and Applications (MTAA), Volume 82, Issue 5Pages 7463–7477https://doi.org/10.1007/s11042-022-13662-wAbstractObjective quality assessment of retargeted images aims to find the best retargeting method for showing an image on different display terminals. This paper uses a Radial Basis Function (RBF) neural network to assess the quality of retargeted ...
- research-articleJuly 2022
Impact of visual saliency on multi-distorted blind image quality assessment using deep neural architecture
Multimedia Tools and Applications (MTAA), Volume 81, Issue 18Pages 25283–25300https://doi.org/10.1007/s11042-022-12060-6AbstractNo-referenceimage quality assessment (NR-IQA) techniques try to assess the quality of images without anyinformation regarding the pristine version of the image. NR-IQA becomes more challenging for images affected by multiple distortions and images ...
- research-articleJune 2022
Saliency-aware color harmony models for outdoor signboard
Computers and Graphics (CGRS), Volume 105, Issue CPages 25–35https://doi.org/10.1016/j.cag.2022.04.012AbstractThis paper introduces a geometric approach for assessing color harmony of a signboard, and color coherence of a signboard with the environment. We propose to incorporate visual saliency as an inherent color characteristic residing in ...
Graphical abstractDisplay Omitted
Highlights- A public dataset with 5.2 K valid subjective ratings on 375 real-world signboards.
- research-articleMay 2022
Personalized saliency prediction using color spaces
Multimedia Tools and Applications (MTAA), Volume 81, Issue 13Pages 18181–18202https://doi.org/10.1007/s11042-022-12341-0AbstractSaliency is the ability of being important, noticeable or attention worthy. Finding salient regions in images has important applications in automatic image cropping, image compression and advertisements. The salient regions for an individual in an ...
- research-articleJanuary 2022
SCVS: blind image quality assessment based on spatial correlation and visual saliency
The Visual Computer: International Journal of Computer Graphics (VISC), Volume 39, Issue 1Pages 443–458https://doi.org/10.1007/s00371-021-02340-xAbstractWe propose a no-reference image quality assessment (NR-IQA) approach to predict the perceptual quality score of a given image without using any reference image. Our model consists of two steps and trains two similar convolutional neural networks (...
- research-articleJanuary 2022
BiconNet: An edge-preserved connectivity-based approach for salient object detection
Highlights- A new connectivity-based CNN (BiconNet) for salient object detection is proposed.
Salient object detection (SOD) is viewed as a pixel-wise saliency modeling task by traditional deep learning-based methods. A limitation of current SOD models is insufficient utilization of inter-pixel information, which usually ...
- research-articleJanuary 2022
An adaptive enhancement algorithm based on visual saliency for low illumination images
Applied Intelligence (KLU-APIN), Volume 52, Issue 2Pages 1770–1792https://doi.org/10.1007/s10489-021-02466-4AbstractIn order to improve the brightness and contrast of low illumination color images and avoid over enhancement, an adaptive image enhancement algorithm based on visual saliency is proposed. Firstly, the original low illumination image is transformed ...
- research-articleOctober 2021
Silicone mask face anti-spoofing detection based on visual saliency and facial motion
Neurocomputing (NEUROC), Volume 458, Issue CPages 416–427https://doi.org/10.1016/j.neucom.2021.06.033AbstractFace recognition systems are widely used for target recognition and identity authentication, such as automated teller machines, mobile phones, and entrance guard systems. However, face recognition systems are vulnerable to presentation ...