TIP: Vol 31, No

Volume 31

2022

Publisher: IEEE Press

ISSN: 1057-7149

research-article

Semi-Supervised Structured Subspace Learning for Multi-View Clustering

Pages 1–14, https://doi.org/10.1109/TIP.2021.3128325

Multi-view clustering aims at simultaneously obtaining a consensus underlying subspace across multiple views and conducting clustering on the learned consensus subspace, which has attracted considerable interest in image processing. In this paper, we propose ...

research-article

A Novel Hybrid Level Set Model for Non-Rigid Object Contour Tracking

Pages 15–29, https://doi.org/10.1109/TIP.2021.3112051

Most existing trackers use bounding boxes for object tracking. However, the background contained in the bounding box inevitably decreases the accuracy of the target model, which affects the performance of the tracker and is particularly pronounced for non-...

research-article

Spatio-Temporal Correlation Guided Geometric Partitioning for Versatile Video Coding

Pages 30–42, https://doi.org/10.1109/TIP.2021.3126420

Geometric partitioning has attracted increasing attention for its remarkable motion field description capability in the hybrid video coding framework. However, the existing geometric partitioning (GEO) scheme in Versatile Video Coding (VVC) causes a non-...

research-article

AVLSM: Adaptive Variational Level Set Model for Image Segmentation in the Presence of Severe Intensity Inhomogeneity and High Noise

Pages 43–57, https://doi.org/10.1109/TIP.2021.3127848

Intensity inhomogeneity and noise are two common issues in images that inevitably pose significant challenges for image segmentation, particularly when the two issues appear simultaneously in one image. As a result, most existing level ...

research-article

View-Wise Versus Cluster-Wise Weight: Which Is Better for Multi-View Clustering?

Pages 58–71, https://doi.org/10.1109/TIP.2021.3128323

Weighted multi-view clustering (MVC) aims to combine the complementary information of multi-view data (such as image data with different types of features) in a weighted manner to obtain a consistent clustering result. However, when the cluster-wise ...

research-article

A Domain Gap Aware Generative Adversarial Network for Multi-Domain Image Translation

Pages 72–84, https://doi.org/10.1109/TIP.2021.3125266

Recent image-to-image translation models have shown great success in mapping local textures between two domains. Existing approaches rely on a cycle-consistency constraint that supervises the generators to learn an inverse mapping. However, learning the ...

research-article

M⁵L: Multi-Modal Multi-Margin Metric Learning for RGBT Tracking

Pages 85–98, https://doi.org/10.1109/TIP.2021.3125504

Classifying hard samples in the course of RGBT tracking is a quite challenging problem. Existing methods only focus on enlarging the boundary between positive and negative samples, but ignore the relations of multilevel hard samples, which are crucial for ...

research-article

Remote Sensing Scene Classification via Multi-Branch Local Attention Network

Pages 99–109, https://doi.org/10.1109/TIP.2021.3127851

Remote sensing scene classification (RSSC) has been a hotspot in recent years and plays a very important role in the field of remote sensing image interpretation. With the recent development of convolutional neural networks, a significant breakthrough has been ...

research-article

Passive Non-Line-of-Sight Imaging Using Optimal Transport

Pages 110–124, https://doi.org/10.1109/TIP.2021.3128312

Passive non-line-of-sight (NLOS) imaging has drawn great attention in recent years. However, all existing methods share the limitations of simple hidden scenes, low-quality reconstruction, and small-scale datasets. In this paper, we propose NLOS-OT, a ...

research-article

Euclidean Distance Approximations From Replacement Product Graphs

Pages 125–137, https://doi.org/10.1109/TIP.2021.3128319

We introduce a new chamfering paradigm, locally connecting pixels to produce path distances that approximate Euclidean space by building a small network (a replacement product) inside each pixel. These “...

research-article

Edge Tracing Using Gaussian Process Regression

Pages 138–148, https://doi.org/10.1109/TIP.2021.3128329

We introduce a novel edge tracing algorithm using Gaussian process regression. Our edge-based segmentation algorithm models an edge of interest using Gaussian process regression and iteratively searches the image for edge pixels in a recursive Bayesian ...

research-article

A Prototypical Knowledge Oriented Adaptation Framework for Semantic Segmentation

Pages 149–163, https://doi.org/10.1109/TIP.2021.3128311

A prevalent family of fully convolutional networks is capable of learning discriminative representations and producing structural predictions in semantic segmentation tasks. However, such supervised learning methods require a large amount of labeled data ...

research-article

Defocus Image Deblurring Network With Defocus Map Estimation as Auxiliary Task

Pages 216–226, https://doi.org/10.1109/TIP.2021.3127850

Unlike object motion blur, defocus blur is caused by the limited depth of field of cameras. The defocus amount can be characterized by the parameter of the point spread function and thus forms a defocus map. In this paper, we ...

research-article

Loss Re-Scaling VQA: Revisiting the Language Prior Problem From a Class-Imbalance View

Pages 227–238, https://doi.org/10.1109/TIP.2021.3128322

Recent studies have pointed out that many well-developed Visual Question Answering (VQA) models are heavily affected by the language prior problem. It refers to making predictions based on the co-occurrence pattern between textual questions and answers ...

research-article

Triple-Level Model Inferred Collaborative Network Architecture for Video Deraining

Pages 239–250, https://doi.org/10.1109/TIP.2021.3128327

Video deraining is an important issue for outdoor vision systems and has been investigated extensively. However, designing optimal architectures by the aggregating model formation and data distribution is a challenging task for video deraining. In this ...

research-article

Efficient and Accurate Stitching for 360° Dual-Fisheye Images and Videos

Pages 251–262, https://doi.org/10.1109/TIP.2021.3130531

Back-to-back dual-fisheye cameras are the most cost-effective devices to capture 360° visual content. However, image and video stitching for such cameras often suffer from the effect of fisheye distortion, photometric inconsistency between the two ...

research-article

Completely Blind Quality Assessment of User Generated Video Content

Pages 263–274, https://doi.org/10.1109/TIP.2021.3130541

In this work, we address the challenging problem of completely blind video quality assessment (BVQA) of user generated content (UGC). The challenge is twofold since the quality prediction model is oblivious of human opinion scores, and there are no well-...

research-article

Variational Abnormal Behavior Detection With Motion Consistency

Pages 275–286, https://doi.org/10.1109/TIP.2021.3130545

Abnormal crowd behavior detection has recently attracted increasing attention due to its wide applications in computer vision research areas. However, it is still an extremely challenging task due to the great variability of abnormal behavior coupled with ...

research-article

Dynamic Facial Expression Recognition Under Partial Occlusion With Optical Flow Reconstruction

Pages 446–457, https://doi.org/10.1109/TIP.2021.3129120

Video facial expression recognition is useful for many applications and has received much interest lately. Although some methods give good results in controlled environments (no occlusion), recognition in the presence of partial facial occlusion remains a ...

research-article

Contrastive Self-Supervised Pre-Training for Video Quality Assessment

Pages 458–471, https://doi.org/10.1109/TIP.2021.3130536

The video quality assessment (VQA) task is an ongoing small sample learning problem due to the costly effort required for manual annotation. Since existing VQA datasets are of limited scale, prior research tries to leverage models pre-trained on ImageNet to ...

research-article

Spectral-Spatial Boundary Detection in Hyperspectral Images

Pages 499–512, https://doi.org/10.1109/TIP.2021.3131942

In this paper, we propose a novel method for boundary detection in close-range hyperspectral images. This method can effectively predict the boundaries of objects of similar colour but different materials. To effectively extract the material information ...

research-article

JigsawGAN: Auxiliary Learning for Solving Jigsaw Puzzles With Generative Adversarial Networks

Pages 513–524, https://doi.org/10.1109/TIP.2021.3120052

The paper proposes a solution based on Generative Adversarial Network (GAN) for solving jigsaw puzzles. The problem assumes that an image is divided into equal square pieces, and asks to recover the image according to information provided by the pieces. ...

research-article

Toward Scalable and Unified Example-Based Explanation and Outlier Detection

Pages 525–540, https://doi.org/10.1109/TIP.2021.3127847

When neural networks are employed for high-stakes decision-making, it is desirable that they provide explanations for their prediction in order for us to understand the features that have contributed to the decision. At the same time, it is important to ...

research-article

Two-Stage Copy-Move Forgery Detection With Self Deep Matching and Proposal SuperGlue

Pages 541–555, https://doi.org/10.1109/TIP.2021.3132828

Copy-move forgery detection identifies a tampered image by detecting pasted and source regions in the same image. In this paper, we propose a novel two-stage framework specifically for copy-move forgery detection. The first stage is a backbone self deep ...

research-article

Fast Parameter-Free Multi-View Subspace Clustering With Consensus Anchor Guidance

Pages 556–568, https://doi.org/10.1109/TIP.2021.3131941

Multi-view subspace clustering has attracted intensive attention to effectively fuse multi-view information by exploring appropriate graph structures. Although existing works have made impressive progress in clustering performance, most of them suffer ...

research-article

Dynamic Neural Network for Lossy-to-Lossless Image Coding

Pages 569–584, https://doi.org/10.1109/TIP.2021.3132825

Lifting-based wavelet transform has been extensively used for efficient compression of various types of visual data. Generally, the performance of such coding schemes strongly depends on the lifting operators used, namely the prediction and update ...

research-article

Person Foreground Segmentation by Learning Multi-Domain Networks

Pages 585–597, https://doi.org/10.1109/TIP.2021.3097169

Separating the dominant person from a complex background is important for human-related research and photo-editing applications. Existing segmentation algorithms are either too general to separate the person region accurately, or not capable ...

research-article

Universal Adversarial Patch Attack for Automatic Checkout Using Perceptual and Attentional Bias

Pages 598–611, https://doi.org/10.1109/TIP.2021.3127849

Adversarial examples are inputs with imperceptible perturbations that easily mislead deep neural networks (DNNs). Recently, adversarial patch, with noise confined to a small and localized patch, has emerged for its easy feasibility in real-world ...

research-article

Adaptive Affinity for Associations in Multi-Target Multi-Camera Tracking

Pages 612–622, https://doi.org/10.1109/TIP.2021.3131936

Data associations in multi-target multi-camera tracking (MTMCT) usually estimate affinity directly from re-identification (re-ID) feature distances. However, we argue that it might not be the best choice given the difference in matching scopes between re-...

research-article

Learning From Pixel-Level Label Noise: A New Perspective for Semi-Supervised Semantic Segmentation

Pages 623–635, https://doi.org/10.1109/TIP.2021.3134142

This paper addresses semi-supervised semantic segmentation by exploiting a small set of images with pixel-level annotations (strong supervisions) and a large set of images with only image-level annotations (weak supervisions). Most existing approaches aim ...

IEEE Transactions on Image Processing
