
Article

A Novel Tensor Ring Sparsity Measurement for Image Completion

Junhua Zeng, Yuning Qiu, Yumeng Ma, Andong Wang and Qibin Zhao
1 School of Automation, Guangdong University of Technology, Guangzhou 510006, China
2 RIKEN Center for Advanced Intelligence Project (AIP), Tokyo 103-0027, Japan
* Author to whom correspondence should be addressed.
Entropy 2024, 26(2), 105; https://doi.org/10.3390/e26020105
Submission received: 12 December 2023 / Revised: 15 January 2024 / Accepted: 22 January 2024 / Published: 24 January 2024
(This article belongs to the Section Multidisciplinary Applications)

Abstract

As a promising data analysis technique, sparse modeling has gained widespread traction in the field of image processing, particularly for image recovery. The matrix rank, serving as a measure of data sparsity, quantifies the sparsity within the Kronecker basis representation of a given piece of data in matrix format. Nevertheless, in practical scenarios, much of the data are intrinsically multi-dimensional, and thus using a matrix format for data representation will inevitably yield sub-optimal outcomes. Tensor decomposition (TD), as a high-order generalization of matrix decomposition, has been widely used to analyze multi-dimensional data. As a direct generalization of the matrix rank, low-rank tensor modeling has been developed for multi-dimensional data analysis and has achieved great success. Despite its efficacy, the connection between the TD rank and the sparsity of the tensor data is not direct. In this work, we introduce a novel tensor ring sparsity measurement (TRSM) for measuring the sparsity of a tensor. This metric relies on the tensor ring (TR) Kronecker basis representation of the tensor, providing a unified interpretation akin to matrix sparsity measurements, wherein the Kronecker basis serves as the foundational representation component. Moreover, TRSM can be efficiently computed as the product of the ranks of the mode-2 unfolded TR cores. To enhance the practical performance of TRSM, the minimax concave penalty (MCP), a folded-concave penalty, is introduced as a nonconvex relaxation. Lastly, we extend TRSM to the tensor completion problem and solve it with an alternating direction method of multipliers (ADMM) scheme. Experiments on image and video data completion demonstrate the effectiveness of the proposed method.

1. Introduction

Image completion is a well-known problem in the field of image processing, which aims to recover the missing entries of partially observed image data [1,2]. Image completion methods have extensive applications in practical scenarios, including hyperspectral image recovery [3] and video inpainting [4]. These techniques have also been successfully applied across various domains of the natural sciences. For example, the authors of [5,6] used image completion methods to obtain reliable, high-quality seismic data, which is crucial before subsequent processing steps, and the authors of [7] used image completion to acquire dense earthquake data, which greatly advances the understanding of the structure and dynamics of the Earth.
Sparsity is an important property in data representation and is extremely important for data analysis; examples include anisotropic interaction rule recognition in biological groups [8], face modeling [9], image compressive sensing [10,11], MRI compressive sensing [12,13,14], and signal restoration [15,16]. For image completion, capturing the inherent sparsity of the image data is important for the successful recovery of the missing entries. A common way of measuring the sparsity of the data uses the rank of the matrix, which essentially counts the number of second-order Kronecker bases $u_i v_i^T$ ($u_i$ and $v_i$ are vectors), $i = 1, \dots, R$, used for the matrix representation, where $i$ indexes the $i$-th Kronecker basis and $R$ is the matrix rank. The image completion task can thus be formulated as an optimization problem whose objective is to minimize the rank of the candidate matrices. However, rank minimization is NP-hard due to its discrete nature, and the trace norm is widely used as its convex surrogate [17,18]. A successful application of trace norm minimization to image completion can be found in [19]. Furthermore, it has been proved theoretically that the rank minimization problem can be solved by trace norm minimization under some reasonable conditions [20].
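To make the Kronecker basis view concrete, the following numpy sketch (ours, for illustration only; all names are illustrative) recovers the rank-one bases $u_i v_i^T$ of a matrix from its SVD and evaluates the trace norm surrogate.

```python
import numpy as np

# A rank-3 matrix: it can be written as a sum of exactly 3 second-order
# Kronecker bases u_i v_i^T obtained from its SVD.
rng = np.random.default_rng(0)
X = rng.standard_normal((20, 3)) @ rng.standard_normal((3, 15))

U, s, Vt = np.linalg.svd(X, full_matrices=False)
R = int(np.sum(s > 1e-10))                  # matrix rank = number of bases
bases = [s[i] * np.outer(U[:, i], Vt[i]) for i in range(R)]

assert np.allclose(sum(bases), X)           # X is the sum of R rank-one bases
trace_norm = s.sum()                        # convex surrogate of rank(X)
print(R, trace_norm)
```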
However, in practice, images are usually generated from the interaction of multiple factors. While a matrix is well suited to representing data arising from the interaction of two factors, employing the matrix rank for image completion in such cases requires transforming the data into a matrix format. This transformation may break the multi-mode correlation of the data and result in sub-optimal performance [21]. As a high-order generalization of a matrix, a tensor provides a natural way to represent such multi-way data and has been widely used for data representation in many areas, such as electrodynamics [22,23], signal processing [24,25,26], and computer vision [27,28,29]. One of the most important tensor analysis techniques is tensor decomposition, which provides a compact representation of a tensor. The most classical tensor decomposition method, called CANDECOMP/PARAFAC or canonical polyadic (CP) decomposition, was proposed by Carroll et al. [30]. CP decomposition decomposes a tensor into a sum of rank-one tensors $\sum_{r=1}^{R} a_r^{(1)} \circ \cdots \circ a_r^{(d)}$, where $a_r^{(k)}, k = 1, \dots, d$, are vectors, $d$ is the order of the tensor, $R$ is the CP rank, and $\circ$ is the outer product operation. It can be observed that the CP rank serves as a natural generalization of the matrix sparsity measurement, since matrix decomposition and the matrix rank are the special cases of CP decomposition with $d = 2$. However, unlike its lower-order counterpart, the computation of the CP rank is NP-hard [31], and it is also difficult to establish a solvable relaxation form [32].
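As an illustration of the CP picture described above, the short numpy sketch below (a toy example of ours, not code from the paper) builds a three-order tensor as a sum of $R$ rank-one terms $a_r^{(1)} \circ a_r^{(2)} \circ a_r^{(3)}$.

```python
import numpy as np
from functools import reduce

rng = np.random.default_rng(1)
R, dims = 4, (5, 6, 7)                      # CP rank and tensor size
# One factor vector a_r^(k) per mode k and rank-one component r.
factors = [rng.standard_normal((n, R)) for n in dims]

# X = sum_{r=1}^R a_r^(1) o a_r^(2) o a_r^(3), with o the outer product.
X = sum(reduce(np.multiply.outer, (A[:, r] for A in factors)) for r in range(R))
print(X.shape)                              # (5, 6, 7), built from R Kronecker bases
```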
Besides CP decomposition, many studies focus on the Tucker decomposition proposed in [33], which decomposes a tensor into a core tensor multiplied by factor matrices along each mode. Image completion algorithms based on low-rank Tucker decomposition can be found in [34,35]. Recently, tensor network decomposition, originally developed in the physics community for quantum many-body simulations, has been brought to the signal processing community and is emerging as a powerful tool for the analysis of tensor data [36,37]. Among these, the tensor train (TT) decomposition [38], the quantized tensor train (QTT) decomposition [39], and the tensor ring (TR) decomposition [40] have attracted the most research interest. Many low-rank TT and TR completion algorithms have been developed [41,42,43], and the experimental results demonstrate the superiority of tensor network methods over Tucker decomposition methods in image completion, which is mainly due to the powerful representation ability of tensor network decomposition.
While the Tucker, TT, and TR decomposition ranks are commonly employed in tensor data analysis, they lack the interpretation of tensor sparsity that the CP rank enjoys. To bridge this gap, several studies measure the sparsity of a tensor using the Kronecker bases constructed from the factor matrices of the Tucker decomposition [44,45,46,47]. Different from these studies, and inspired by the strong expressiveness of tensor networks, in this work we focus on TR decomposition, as shown in Figure 1. The main contributions of the paper are the following:
  • We define a novel tensor sparsity measurement, termed tensor ring sparsity measurement (TRSM), which can be efficiently computed as the product of the ranks of the mode-2 unfolded TR cores. Specifically, it measures the sparsity of a tensor as the number of Kronecker bases, constructed from the TR cores, that are used for the tensor representation. A graphical demonstration of the Kronecker bases representation of a tensor based on TR decomposition is shown in Figure 2.
  • To improve the practicality of TRSM, the folded-concave minimax concave penalty (MCP), which has previously been applied in computer vision and pattern recognition, is introduced as a nonconvex relaxation, and the result is applied to the tensor completion problem. We thereby formulate a new tensor completion model called tensor ring sparsity measurement tensor completion (TRSM-TC) and develop an efficient algorithm based on the alternating direction method of multipliers (ADMM) to optimize it. Experiments show that TRSM-TC outperforms other algorithms in recovering hyperspectral images and videos with high missing rates.
The remainder of this paper is organized as follows. Section 2 introduces the preliminaries of tensor algebra and TR decomposition. Section 3 presents the proposed TRSM. The TRSM-TC problem and its optimization are formulated in Section 4. The complexity analysis is presented in Section 5. Section 6 demonstrates the experimental results and is followed by the conclusions in Section 7.

2. Preliminaries

2.1. Tensor Algebra

In this paper, some notations and preliminaries of tensors [48] are adopted. Scalars, vectors, and matrices are denoted by lowercase letters (e.g., $x \in \mathbb{R}$), boldface lowercase letters (e.g., $\mathbf{x} \in \mathbb{R}^{n_1}$), and capital letters (e.g., $X \in \mathbb{R}^{n_1 \times n_2}$), respectively. A tensor is a multidimensional array and is denoted by calligraphic letters (e.g., $\mathcal{X} \in \mathbb{R}^{n_1 \times \cdots \times n_d}$), where $n_i$ is the size of the corresponding mode. $\mathcal{X}(i_1, i_2, \dots, i_d)$ or $x_{i_1 i_2 \cdots i_d}$ denotes an element of tensor $\mathcal{X} \in \mathbb{R}^{n_1 \times \cdots \times n_d}$ at position $(i_1, i_2, \dots, i_d)$. A mode-$k$ fiber of tensor $\mathcal{X} \in \mathbb{R}^{n_1 \times \cdots \times n_d}$ is a vector denoted as $x_{i_1 \cdots i_{k-1} \, : \, i_{k+1} \cdots i_d}$, where a colon indicates all elements of a mode. A tensor sequence $\{\mathcal{X}^{(1)}, \dots, \mathcal{X}^{(d)}\}$ can be denoted as $\{\mathcal{X}^{(k)}\}_{k=1}^{d}$ or, where appropriate, as $[\mathcal{X}]$. Matrix sequences and vector sequences are defined similarly. The inner product of two tensors $\mathcal{X}, \mathcal{Y}$ of the same size is defined as $\langle \mathcal{X}, \mathcal{Y} \rangle = \sum_{i_1 i_2 \cdots i_d} x_{i_1 i_2 \cdots i_d} y_{i_1 i_2 \cdots i_d}$. Furthermore, the Frobenius norm of $\mathcal{X}$ is defined by $\|\mathcal{X}\|_F = \sqrt{\langle \mathcal{X}, \mathcal{X} \rangle}$.
Two types of tensor unfolding are defined in this article. The mode-$k$ unfolding of tensor $\mathcal{X} \in \mathbb{R}^{n_1 \times \cdots \times n_d}$ converts the tensor to a matrix, denoted as $X_{(k)} \in \mathbb{R}^{n_k \times n_1 \cdots n_{k-1} n_{k+1} \cdots n_d}$; using the multi-indices defined in [36], its elements are given by $X_{(k)}(i_k, \overline{i_1 \cdots i_{k-1} i_{k+1} \cdots i_d}) = \mathcal{X}(i_1, i_2, \dots, i_d)$. Another mode-$k$ unfolding of tensor $\mathcal{X}$ is denoted as $X_{[k]} \in \mathbb{R}^{n_k \times n_{k+1} \cdots n_d n_1 \cdots n_{k-1}}$, with elements $X_{[k]}(i_k, \overline{i_{k+1} \cdots i_d i_1 \cdots i_{k-1}}) = \mathcal{X}(i_1, i_2, \dots, i_d)$. Furthermore, the inverse operation of tensor unfolding is matrix folding, which transforms matrices back to higher-order tensors. In this paper, we only define the folding operation $\operatorname{fold}_k(\cdot)$ for the first type of mode-$k$ unfolding, as $\operatorname{fold}_k(X_{(k)}) = \mathcal{X}$.
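As a sketch of these conventions, the numpy snippet below (ours) implements the two mode-$k$ unfoldings and the $\operatorname{fold}_k$ inverse via transpose and reshape. The function names are illustrative, and the exact linear ordering of the column multi-index follows numpy's row-major layout, which may differ from the convention of [36] by a permutation of columns.

```python
import numpy as np

def unfold(X, k):
    # Mode-k unfolding X_(k): rows indexed by i_k, columns by the
    # remaining indices (i_1 ... i_{k-1} i_{k+1} ... i_d).
    perm = [k] + [i for i in range(X.ndim) if i != k]
    return X.transpose(perm).reshape(X.shape[k], -1)

def unfold_circ(X, k):
    # Second unfolding X_[k]: columns ordered (i_{k+1} ... i_d i_1 ... i_{k-1}).
    perm = [k] + list(range(k + 1, X.ndim)) + list(range(k))
    return X.transpose(perm).reshape(X.shape[k], -1)

def fold(M, k, shape):
    # Inverse of unfold(., k): fold_k(X_(k)) = X.
    perm = [k] + [i for i in range(len(shape)) if i != k]
    tmp = M.reshape([shape[i] for i in perm])
    return tmp.transpose(np.argsort(perm))

X = np.arange(24.0).reshape(2, 3, 4)
assert np.allclose(fold(unfold(X, 1), 1, X.shape), X)
print(unfold(X, 1).shape, unfold_circ(X, 1).shape)   # (3, 8) (3, 8)
```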

2.2. Tensor Ring Decomposition

TR decomposition is proposed to represent a high-order tensor by a sequence of three-order tensors that are multiplied together circularly. Specifically, given a $d$-order tensor $\mathcal{X} \in \mathbb{R}^{n_1 \times \cdots \times n_d}$, TR decomposition decomposes it into a sequence of latent tensors $\mathcal{Z}_k \in \mathbb{R}^{R_k \times n_k \times R_{k+1}}$, $k = 1, \dots, d$, which can be expressed in an element-wise form given by

$$\mathcal{X}(i_1, i_2, \dots, i_d) = \operatorname{Tr}\{Z_1(i_1) Z_2(i_2) \cdots Z_d(i_d)\}, \qquad (1)$$

where $Z_k(i_k)$ is the $i_k$-th lateral slice matrix of the latent tensor $\mathcal{Z}_k$, which is of size $R_k \times R_{k+1}$. The last latent tensor $\mathcal{Z}_d$ is of size $R_d \times n_d \times R_1$, i.e., $R_{d+1} = R_1$. The $\mathcal{Z}_k$ are called TR cores, and the collection of $R_k$, $k = 1, 2, \dots, d$, is defined as the TR ranks. An illustration of TR decomposition is shown in Figure 1.
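For concreteness, the following toy sketch (ours, with illustrative core shapes and names) evaluates a single entry of a TR-format tensor via the trace formula of Equation (1).

```python
import numpy as np

rng = np.random.default_rng(2)
dims, ranks = (4, 5, 6), (3, 2, 4)          # ring closure: R_{d+1} = R_1
cores = [rng.standard_normal((ranks[k], dims[k], ranks[(k + 1) % 3]))
         for k in range(3)]

def tr_entry(cores, idx):
    # X(i_1,...,i_d) = Tr{ Z_1(i_1) Z_2(i_2) ... Z_d(i_d) }, where Z_k(i_k)
    # is the i_k-th lateral slice of core Z_k.
    M = cores[0][:, idx[0], :]
    for Z, i in zip(cores[1:], idx[1:]):
        M = M @ Z[:, i, :]
    return np.trace(M)

print(tr_entry(cores, (1, 2, 3)))
```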

3. Tensor Ring Sparsity Measurement

By expressing the TR decomposition of Equation (1) in tensor form, we obtain the following expression:

$$\mathcal{X} = \sum_{r_1, \dots, r_d}^{R_1, \dots, R_d} z_1(r_1, r_2) \circ z_2(r_2, r_3) \circ \cdots \circ z_d(r_d, r_1), \qquad (2)$$

where $z_k(r_k, r_{k+1})$ is a mode-2 fiber of the TR core $\mathcal{Z}_k$. As we can see, based on TR decomposition, a tensor $\mathcal{X}$ can be represented by $\prod_{i=1}^{d} R_i$ Kronecker bases constructed from the TR cores. In Figure 2, we provide a graphical demonstration of this representation for a three-order tensor. In consequence, when considering tensors represented through TR decomposition, the total number of Kronecker bases required in Equation (2) naturally provides a way to measure the sparsity of the tensor. In this work, we utilize the following formulation, defined upon the TR decomposition, to measure the Kronecker bases sparsity of a tensor $\mathcal{X}$:

$$K(\mathcal{X}) = \prod_{i=1}^{d} \operatorname{rank}(Z_{i(2)}). \qquad (3)$$
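The two formulas above can be checked numerically. The sketch below (ours, with illustrative sizes) assembles a tensor from the $\prod_i R_i$ outer products of mode-2 fibers in Equation (2), verifies one entry against the trace form of Equation (1), and computes $K(\mathcal{X})$ from the mode-2 unfoldings.

```python
import numpy as np
from functools import reduce
from itertools import product

rng = np.random.default_rng(3)
dims, ranks = (4, 5, 6), (3, 2, 4)
cores = [rng.standard_normal((ranks[k], dims[k], ranks[(k + 1) % 3]))
         for k in range(3)]

# Equation (2): sum over prod_i R_i Kronecker bases of mode-2 fibers.
X = np.zeros(dims)
for r1, r2, r3 in product(*(range(r) for r in ranks)):
    fibers = (cores[0][r1, :, r2], cores[1][r2, :, r3], cores[2][r3, :, r1])
    X += reduce(np.multiply.outer, fibers)

# Sanity check against the trace form of Equation (1).
M = cores[0][:, 1, :] @ cores[1][:, 2, :] @ cores[2][:, 3, :]
assert np.isclose(X[1, 2, 3], np.trace(M))

# Equation (3): TRSM as the product of mode-2 unfolding ranks.
K = np.prod([np.linalg.matrix_rank(Z.transpose(1, 0, 2).reshape(Z.shape[1], -1))
             for Z in cores])
print(K)   # for generic random cores, prod_i min(n_i, R_i * R_{i+1})
```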
The following theorem explains the relation between $K(\mathcal{X})$ and the TR Kronecker bases sparsity of the tensor.
Theorem 1.
Given a $d$-th order tensor $\mathcal{X} \in \mathbb{R}^{n_1 \times \cdots \times n_d}$ that can be represented by Equation (1), the following inequality holds:

$$\sqrt{\prod_{i=1}^{d} \operatorname{rank}(Z_{i(2)})} \le \prod_{i=1}^{d} R_i. \qquad (4)$$

Proof.
For the mode-2 unfolded TR cores $Z_{i(2)} \in \mathbb{R}^{n_i \times R_i R_{i+1}}$, according to the properties of the matrix rank, the following inequality holds:

$$\operatorname{rank}(Z_{i(2)}) \le \min\{n_i, R_i R_{i+1}\} \le R_i R_{i+1}. \qquad (5)$$

The proof is completed by

$$\prod_{i=1}^{d} \operatorname{rank}(Z_{i(2)}) \le \prod_{i=1}^{d} R_i R_{i+1} = \prod_{i=1}^{d} R_i^2 = \Big(\prod_{i=1}^{d} R_i\Big)^2. \qquad (6)$$

□
We can conclude that $\sqrt{\prod_{i=1}^{d} \operatorname{rank}(Z_{i(2)})}$ serves as a lower bound of the TR Kronecker bases sparsity of the tensor. However, there is no straightforward method for handling a matrix optimization problem involving a square root, so we drop the square root and relax it to the form of $K(\mathcal{X})$. Moreover, directly optimizing $\operatorname{rank}(Z_{i(2)})$ leads to combinatorial optimization when applied. To ensure the practical performance of the proposed method, instead of using the trace norm as the convex surrogate, we use a nonconvex relaxation of the matrix rank. Specifically, the MCP folded-concave penalty [49] is chosen here due to its nearly unbiased statistical properties in variable selection. As a result, we obtain the following relaxation of $K(\mathcal{X})$:
$$K_{\mathrm{MCP}}(\mathcal{X}) = \prod_{i=1}^{d} \operatorname{rank}_{\mathrm{MCP}}(Z_{i(2)}). \qquad (7)$$
Here, $\operatorname{rank}_{\mathrm{MCP}}(\cdot)$ denotes the MCP folded-concave penalty of a matrix, defined as

$$\operatorname{rank}_{\mathrm{MCP}}(X) = \sum_i \psi_{\mathrm{mcp}}(\sigma_i(X)), \qquad (8)$$

where $\sigma_i(X)$ denotes the $i$-th singular value of matrix $X$ and

$$\psi_{\mathrm{mcp}}(t) = \begin{cases} \lambda |t| - \dfrac{t^2}{2a}, & \text{if } |t| < a\lambda, \\[4pt] a\lambda^2/2, & \text{if } |t| \ge a\lambda. \end{cases} \qquad (9)$$

The function $\psi_{\mathrm{mcp}}(\cdot)$ is the MCP folded-concave penalty, which can be seen as a continuous interpolation between the $\ell_1$ penalty (when $a = \infty$) and the $\ell_0$ penalty (when $a \to 0^+$). Therefore, applying the MCP penalty to the singular values results in a tighter surrogate of the matrix rank than the nuclear norm. Here, $\lambda$ and $a$ are two hyperparameters that play different roles in shaping $\psi_{\mathrm{mcp}}(\cdot)$: $a$ mainly controls the concavity of the function, while $\lambda$ controls the penalty level [49].
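A minimal sketch (ours) of the MCP penalty of Equations (8) and (9) and its derivative, which is used later by the linearized updates; the hyperparameter values are illustrative.

```python
import numpy as np

def psi_mcp(t, lam=1.0, a=3.0):
    # Equation (9): lam*|t| - t^2/(2a) for |t| < a*lam, else a*lam^2/2.
    t = np.abs(t)
    return np.where(t < a * lam, lam * t - t**2 / (2 * a), a * lam**2 / 2)

def psi_mcp_deriv(t, lam=1.0, a=3.0):
    # Derivative for t >= 0: lam - t/a inside the concave region, 0 beyond.
    t = np.abs(t)
    return np.where(t < a * lam, lam - t / a, 0.0)

# Equation (8): rank_MCP(X) applies psi_mcp to the singular values of X.
X = np.random.default_rng(4).standard_normal((8, 6))
rank_mcp = psi_mcp(np.linalg.svd(X, compute_uv=False)).sum()
print(rank_mcp)
```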

4. Tensor Ring Sparsity Measurement-Based Tensor Completion

The core problem of missing value estimation lies in how to build up the relationship between the known elements and the unknown ones. In this paper, we incorporate the proposed TRSM into the tensor completion problem as the optimization objective and formulate the TRSM-TC model as

$$\min_{[\mathcal{Z}], \mathcal{X}} \; \prod_{i=1}^{d} \operatorname{rank}_{\mathrm{MCP}}(Z_{i(2)}) + \frac{\lambda}{2} \|\mathcal{X} - \Psi([\mathcal{Z}])\|_F^2 \quad \text{s.t.} \quad P_\Omega(\mathcal{X}) = P_\Omega(\mathcal{T}), \qquad (10)$$

where $P_\Omega(\mathcal{T})$ denotes all the observed entries with respect to the index set $\Omega$ of observed entries, $\mathcal{T}$ denotes the observed tensor, and $\Psi([\mathcal{Z}])$ denotes the TR-format tensor generated from $[\mathcal{Z}]$; every element of $\Psi([\mathcal{Z}])$ is calculated by Equation (1).
Due to the complex folded-concave penalties, a slightly modified ADMM inspired by [46] is needed to solve the model. In the modified ADMM, the solution process for Equation (10) is divided into two stages. In the first stage, we use ADMM to solve the trace norm version of Equation (10), given as

$$\min_{[\mathcal{Z}], [M], \mathcal{X}} \; \prod_{i=1}^{d} \|M_{i(2)}\|_* + \frac{\lambda}{2} \|\mathcal{X} - \Psi([\mathcal{Z}])\|_F^2 \quad \text{s.t.} \quad P_\Omega(\mathcal{X}) = P_\Omega(\mathcal{T}), \; M_{i(2)} = Z_{i(2)}, \; i = 1, 2, \dots, d. \qquad (11)$$
Here, we introduce $[M]$ as auxiliary variables over the TR cores, and the updated values of $M_{i(2)}$, $i = 1, \dots, d$, will be used in the subsequent optimization. It is worth mentioning that the first stage of ADMM runs for only one iteration. In the second stage, we instead solve the following problem:
$$\min_{[\mathcal{Z}], [M], \mathcal{X}} \; \prod_{i=1}^{d} \widehat{\operatorname{rank}}_{\mathrm{MCP}}(M_{i(2)} \mid M_{i(2)}^0) + \frac{\lambda}{2} \|\mathcal{X} - \Psi([\mathcal{Z}])\|_F^2 \quad \text{s.t.} \quad P_\Omega(\mathcal{X}) = P_\Omega(\mathcal{T}), \; M_{i(2)} = Z_{i(2)}, \; i = 1, 2, \dots, d. \qquad (12)$$
Here, $\widehat{\operatorname{rank}}_{\mathrm{MCP}}(X \mid Y) = \sum_i \hat{\psi}_{\mathrm{MCP}}(\sigma_i(X) \mid \sigma_i(Y))$, where $\hat{\psi}_{\mathrm{MCP}}(\sigma_i(X) \mid \sigma_i(Y)) = \psi_{\mathrm{MCP}}(\sigma_i(Y)) + \psi'_{\mathrm{MCP}}(\sigma_i(Y)) (\sigma_i(X) - \sigma_i(Y))$ and $\psi'_{\mathrm{MCP}}(\cdot)$ is the derivative of $\psi_{\mathrm{MCP}}(\cdot)$; that is, the penalty is linearized around the previous iterate. The $M_i^0$, $i = 1, \dots, d$, are fixed parameters of the optimization problem, whose values are assigned element-wise from the $M_i$ obtained in the first stage. For Equation (12), the augmented Lagrangian function is
$$L([\mathcal{Z}], \mathcal{X}, [M], [Y]) = \prod_{i=1}^{d} \widehat{\operatorname{rank}}_{\mathrm{MCP}}(M_{i(2)} \mid M_{i(2)}^0) + \sum_{k=1}^{d} \frac{\mu}{2} \Big\|M_k - Z_k + \frac{1}{\mu} Y_k\Big\|_F^2 + \frac{\lambda}{2} \|\mathcal{X} - \Psi([\mathcal{Z}])\|_F^2 \quad \text{s.t.} \quad P_\Omega(\mathcal{X}) = P_\Omega(\mathcal{T}), \qquad (13)$$
where $[Y]$ are the Lagrangian multipliers and $\mu > 0$ is a penalty parameter. For $k = 1, \dots, d$, the variables $Z_k$, $M_k$, and $Y_k$ are all independent, and we update them using the following schemes.
Update of $Z_k$. For $k = 1, \dots, d$, the augmented Lagrangian function with respect to $Z_k$ can be simplified as

$$L(Z_k) = \frac{\mu}{2} \Big\|M_k - Z_k + \frac{1}{\mu} Y_k\Big\|_F^2 + \frac{\lambda}{2} \|\mathcal{X} - \Psi([\mathcal{Z}])\|_F^2 + C_{Z_k}, \qquad (14)$$
where the constant $C_{Z_k}$ collects the parts of the Lagrangian function that are irrelevant to updating $Z_k$. This is a least squares problem, whose minimizer is obtained by setting the derivative of Equation (14) to zero; thus, for $k = 1, \dots, d$, $Z_k$ can be updated by

$$Z_k = \operatorname{fold}_2\Big(\big(\mu M_{k(2)} + Y_{k(2)} + \lambda X_{[k]} Z_{[2]}^{\ne k}\big)\big(\mu I + \lambda (Z_{[2]}^{\ne k})^T Z_{[2]}^{\ne k}\big)^{-1}\Big), \qquad (15)$$

where $I \in \mathbb{R}^{R_k^2 \times R_k^2}$ denotes the identity matrix and $Z_{[2]}^{\ne k}$ is the mode-2 unfolding of the TR subchain with respect to mode $k$, whose definition can be found in [40].
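A compact sketch of the closed-form update of Equation (15), assuming the mode-2 unfolded subchain $Z_{[2]}^{\ne k}$ has already been contracted as in [40]; the function and argument names are ours.

```python
import numpy as np

def update_Z(M2, Y2, Xk, Zsub, mu, lam):
    # Equation (15): least-squares update of the mode-2 unfolded core.
    #   M2, Y2 : mode-2 unfoldings of M_k and Y_k, shape (n_k, R_k * R_{k+1})
    #   Xk     : unfolding X_[k] of the current estimate
    #   Zsub   : mode-2 unfolded subchain Z^{!=k}, shape (prod_{i!=k} n_i, R_k * R_{k+1})
    num = mu * M2 + Y2 + lam * Xk @ Zsub
    den = mu * np.eye(M2.shape[1]) + lam * Zsub.T @ Zsub
    return np.linalg.solve(den, num.T).T     # num @ inv(den); den is symmetric

# Toy shapes only, to show the call pattern.
Z2 = update_Z(np.zeros((6, 4)), np.zeros((6, 4)),
              np.ones((6, 30)), np.ones((30, 4)), mu=1.0, lam=0.1)
print(Z2.shape)                              # (6, 4)
```

The result would then be folded back to a three-order core with $\operatorname{fold}_2(\cdot)$, matching Equation (15).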
Update of $M_i$. For $i = 1, \dots, d$, the augmented Lagrangian function with respect to $M_i$ is expressed as

$$L(M_i) = a_i \, \widehat{\operatorname{rank}}_{\mathrm{MCP}}(M_{i(2)} \mid M_{i(2)}^0) + \langle Y_i, M_i - Z_i \rangle + \frac{\mu}{2} \|M_i - Z_i\|_F^2 + C_{M_i}, \qquad (16)$$

where $a_i = \prod_{k=1, k \ne i}^{d} \widehat{\operatorname{rank}}_{\mathrm{MCP}}(M_{k(2)} \mid M_{k(2)}^0)$. Here, $a_i$ is introduced as an auxiliary variable, for brevity, to represent the constant factor that does not depend on the optimization variable $M_i$. The above formulation has a closed-form solution given by

$$M_i = \operatorname{fold}_2\Big(\hat{D}_{\frac{a_i}{\mu}, v_i}\Big(Z_{i(2)} - \frac{1}{\mu} Y_{i(2)}\Big)\Big), \qquad (17)$$

where $v_i = \big(\psi'_{\mathrm{MCP}}(\sigma_1(M_{i(2)}^0)), \dots, \psi'_{\mathrm{MCP}}(\sigma_r(M_{i(2)}^0))\big)^T$ collects the MCP derivatives evaluated at the singular values of $M_{i(2)}^0$, and $\hat{D}_{r, w}(X) = U \Sigma_{r, w} V^T$ is the generalized shrinkage operator over $X$, with $\Sigma_{r, w} = \operatorname{diag}(\max(\sigma_i - r w_i, 0))$, where $\sigma_i$ is the $i$-th singular value of $X$ and $w_i$ is the $i$-th element of $w$.
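The generalized shrinkage operator $\hat{D}_{r,w}(\cdot)$ admits a direct implementation. The sketch below (ours, with illustrative data and weights) applies the weighted thresholding to the singular values, as used in Equation (17).

```python
import numpy as np

def shrink_weighted(X, r, w):
    # D^_{r,w}(X) = U diag(max(sigma_i - r * w_i, 0)) V^T.
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return (U * np.maximum(s - r * w[: len(s)], 0.0)) @ Vt

# Illustrative use mirroring Equation (17): shrink Z_i(2) - Y_i(2)/mu with
# weights v_i given by the MCP derivatives at the singular values of M_i(2)^0.
rng = np.random.default_rng(5)
A = rng.standard_normal((10, 8))
w = np.linspace(1.0, 0.1, 8)                 # stand-in for v_i
print(np.linalg.matrix_rank(shrink_weighted(A, 2.0, w)))
```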
Update of $\mathcal{X}$. The augmented Lagrangian function with respect to $\mathcal{X}$ is given by

$$L(\mathcal{X}) = \frac{\lambda}{2} \|\mathcal{X} - \Psi([\mathcal{Z}])\|_F^2 + C_{\mathcal{X}} \quad \text{s.t.} \quad P_\Omega(\mathcal{X}) = P_\Omega(\mathcal{T}). \qquad (18)$$

At every iteration, the recovered tensor $\mathcal{X}$ is updated by keeping the observed values in the corresponding entries and approximating the missing entries by the updated TR factors $[\mathcal{Z}]$:

$$\mathcal{X} = P_\Omega(\mathcal{T}) + P_{\bar{\Omega}}(\Psi([\mathcal{Z}])), \qquad (19)$$

where $\bar{\Omega}$, the complement of $\Omega$, is the set of indices of the missing entries.
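The update of Equation (19) is a simple masked assignment; a toy sketch (ours, with $\Omega$ drawn at random):

```python
import numpy as np

rng = np.random.default_rng(6)
T = rng.standard_normal((4, 5, 6))           # observed tensor
omega = rng.random(T.shape) < 0.2            # Omega: indices of observed entries
X_tr = rng.standard_normal(T.shape)          # stands in for Psi([Z])

# Equation (19): keep observations on Omega, fill the complement from the TR model.
X = np.where(omega, T, X_tr)
assert np.allclose(X[omega], T[omega])
```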
Update of $Y_k$. For $k = 1, \dots, d$, the Lagrangian multiplier $Y_k$ is updated as

$$Y_k = Y_k + \mu (M_k - Z_k). \qquad (20)$$
The penalty term of the Lagrangian function $L$ is weighted by $\mu$, which is updated at every iteration by $\mu = \min(\rho\mu, \mu_{\max})$, where $1 < \rho < 1.5$ is a tuning hyperparameter. Moreover, at the end of each iteration, the values of $M_{i(2)}$ are assigned element-wise to $M_{i(2)}^0$, for $i = 1, \dots, d$.
The ADMM-based solver iterates the above updating schemes. Two stopping conditions are set: (i) a maximum number of iterations, maxiter, and (ii) the relative difference between two iterations (i.e., $\|\mathcal{X} - \mathcal{X}_{\mathrm{last}}\|_F / \|\mathcal{X}\|_F$), which is thresholded by the tolerance tol. The implementation of TRSM-TC is summarized in Algorithm 1, and the convergence behavior of the proposed algorithm is demonstrated in Figure 3.
Algorithm 1 Tensor Ring Sparsity Measurement Tensor Completion (TRSM-TC)
Input: $P_\Omega(\mathcal{T})$; initial TR cores $\{\mathcal{Z}_k\}_{k=1}^{d}$; initial auxiliary variables $\{M_i\}_{i=1}^{d}$; initial Lagrangian multipliers $\{Y_i\}_{i=1}^{d}$; initial $\mathcal{X}$; $\lambda$, $\mu$, $\mu_{\max}$, $\rho$, tol, maxiter
Output: completed tensor $\mathcal{X}$ and TR cores $\{\mathcal{Z}_k\}_{k=1}^{d}$
 1: $M_i^0 = M_i$ for $i = 1, \dots, d$, where the $M_i$ are obtained from one ADMM iteration on Equation (11);
 2: for $i = 1$ to maxiter do
 3:   if $i > 1$ then
 4:     $M_i^0 = M_i$ for $i = 1, \dots, d$;
 5:   end if
 6:   Update $\{\mathcal{Z}_k\}_{k=1}^{d}$ by Equation (15);
 7:   Update $\{M_i\}_{i=1}^{d}$ by Equation (17);
 8:   $\mathcal{X}_{\mathrm{last}} = \mathcal{X}$;
 9:   Update $\mathcal{X}$ by Equation (19);
10:   Update $\{Y_i\}_{i=1}^{d}$ by Equation (20);
11:   $\mu = \min(\rho\mu, \mu_{\max})$;
12:   if $\|\mathcal{X} - \mathcal{X}_{\mathrm{last}}\|_F / \|\mathcal{X}\|_F <$ tol then
13:     break;
14:   end if
15: end for
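The penalty schedule and stopping test of lines 11 and 12 amount to the following glue code; a trivial sketch with our names.

```python
import numpy as np

def converged(X, X_last, tol=1e-6):
    # Relative change ||X - X_last||_F / ||X||_F thresholded by tol.
    return np.linalg.norm(X - X_last) / np.linalg.norm(X) < tol

def step_mu(mu, rho=1.05, mu_max=1e4):
    # Penalty schedule: grow mu geometrically, capped at mu_max.
    return min(rho * mu, mu_max)
```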

5. Computational Complexity

We analyze the computational complexity of the proposed TRSM-TC as follows. For a tensor $\mathcal{X} \in \mathbb{R}^{n_1 \times \cdots \times n_d}$ with TR ranks $R_1 = R_2 = \cdots = R_d = j$, the main computational cost of updating the auxiliary variables $[M]$ comes from the SVD operation, which is $\mathcal{O}\big(\sum_{k=1}^{d} 2(d-1) n_k^2 j^2\big)$. The computational complexities of calculating $Z_{[2]}^{\ne k}$ and updating $[\mathcal{Z}]$ are $\mathcal{O}\big(d j^3 \prod_{i=1, i \ne k}^{d} n_i\big)$ and $\mathcal{O}\big(d j^2 \prod_{i=1}^{d} n_i + d j^6\big)$, respectively. Assuming $n_1 = n_2 = \cdots = n_d = n$, the overall complexity of the proposed algorithm can be written as $\mathcal{O}(d j^2 n^d + d j^6)$.

6. Experimental Results

In this section, tensor completion algorithms, including Tmac [50], FBCP [51], CP-WOPT [52], HaLRTC [53], TRLRF [42], TR-WOPT [54], are chosen as the baseline methods, and we evaluate the effectiveness of the proposed TRSM-TC method on color images, multispectral images, and video data completion.

6.1. Color Images Completion

In this section, we test the proposed TRSM-TC against other completion algorithms on the eight benchmark color images shown in Figure 4. In recent years, reshaping low-order tensors into high-order tensors and then selecting an appropriate high-order form has been a commonly used strategy to improve the performance of TT/TR-based methods on visual data completion [41,43]. To evaluate the performances of different methods in high-order forms and investigate in which forms they perform best, we further reshaped the color images of size $(256 \times 256 \times 3)$ (3D) to $(16 \times 16 \times 16 \times 16 \times 3)$ (5D), $(4 \times 4 \times 4 \times 4 \times 4 \times 4 \times 4 \times 4 \times 3)$ (9D), and a visual data tensorization (VDT) form (9D). VDT tensorization was introduced in [55]. The idea is simple: if an RGB image has size $u \times v \times 3$ with $u = u_1 \times u_2 \times \cdots \times u_l$ and $v = v_1 \times v_2 \times \cdots \times v_l$, the image can be tensorized to an $(l+1)$-dimensional tensor of size $u_1 v_1 \times u_2 v_2 \times \cdots \times u_l v_l \times 3$. In this experiment, we first reshape the three-way color image to a seventeen-way tensor of size $2 \times 2 \times \cdots \times 2 \times 3$ and permute the tensor according to the order $\{1, 9, 2, 10, 3, 11, 4, 12, 5, 13, 6, 14, 7, 15, 8, 16, 17\}$. Then, we reshape the tensor to a nine-way tensor of size $(4 \times 4 \times 4 \times 4 \times 4 \times 4 \times 4 \times 4 \times 3)$.
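The VDT recipe above translates directly into reshape and transpose calls; the sketch below (ours) uses the 0-based numpy equivalent of the stated permutation order.

```python
import numpy as np

img = np.zeros((256, 256, 3))                # u = v = 2**8, so l = 8
t = img.reshape([2] * 16 + [3])              # seventeen-way tensor 2 x ... x 2 x 3
order = [0, 8, 1, 9, 2, 10, 3, 11, 4, 12, 5, 13, 6, 14, 7, 15, 16]
t = t.transpose(order)                       # interleave row and column factors
vdt = t.reshape([4] * 8 + [3])               # nine-way VDT form
print(vdt.shape)                             # (4, 4, 4, 4, 4, 4, 4, 4, 3)
```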
In this experiment, we set the random missing rates to 0.8 and 0.9 and evaluated the recovery performance using RSE, which is defined as

$$\mathrm{RSE} = \|\mathcal{X} - \mathcal{T}\|_F / \|\mathcal{T}\|_F, \qquad (21)$$

where $\mathcal{X}$ is the ground-truth tensor and $\mathcal{T}$ is the tensor recovered by the tensor completion algorithm. A smaller RSE means a better recovery result. In addition to RSE, we also adopt the structural similarity index measure (SSIM) [56] to assess the performance of image recovery; a higher SSIM value indicates better recovery performance. For the TR-related algorithms, we set the TR ranks to the same value (i.e., $R_1 = R_2 = \cdots = R_d$) and vary them to obtain the best results. The parameters of the other algorithms are tuned to achieve their best performance.
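For reference, RSE as defined in Equation (21) is a one-liner (our sketch, following the paper's convention that $\mathcal{X}$ is the ground truth and $\mathcal{T}$ the recovery):

```python
import numpy as np

def rse(X, T):
    # Equation (21): ||X - T||_F / ||T||_F; smaller is better.
    return np.linalg.norm(X - T) / np.linalg.norm(T)
```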
Table 1 lists the average performance of the eight color images over the different algorithms under different sampling rates and tensor shapes. It can be seen from Table 1 that the proposed method and the other TR-based algorithms (i.e., TRLRF, TR-WOPT) obtain significantly lower RSEs in 5D. These results suggest that an appropriate high-order tensorization scheme does help improve the performance of TR-based methods. In contrast, the other compared methods achieve worse performance after reshaping into high-order forms. Compared with the other methods, the proposed TRSM-TC obtains the best recovery performance in the 5D form. In the following experiments, we keep the 5D tensorization scheme for the TR-based methods and adopt the original data structure for the other methods. In Table 2, we compare the recovery SSIM of the different methods: the proposed method consistently achieves the highest SSIM across missing rates, demonstrating its effectiveness in recovering color images.
In Figure 5, we visualize the house image recovered under a 0.9 missing rate using a portion of the Kronecker bases constructed from the TR cores learned by the different TR-based completion methods. Besides the fully recovered tensors constructed by the TR-related algorithms, the images reconstructed from the first 200,000, 210,000, 220,000, and 230,000 largest Kronecker bases (ranked by the Frobenius norm of each Kronecker basis) are also shown. As we can see, when more Kronecker bases are added, the recovery results of all the methods improve. However, the improvements of TR-WOPT and TRLRF are relatively small. Moreover, TRSM-TC consistently achieves better recovery than TR-WOPT and TRLRF under the same number of Kronecker bases. These results indicate that the proposed TRSM-TC is encouraged to find more meaningful and expressive TR Kronecker bases.

6.2. Noisy Color Images Completion

In practice, image data contain additive noise. Therefore, in this section, we test the robustness of the proposed method to noise. Taking the eight benchmark color images described in the last section as the ground truth, we add Gaussian noise of different intensities according to signal-to-noise ratios (SNRs) of 21, 26, and 31. Then, we set the random missing rates to 0.8 and 0.9 and evaluate the recovery performance of the different methods using RSE and SSIM.
In Table 3, we report the average recovery performances of the different methods over the eight color images under different missing rates and noise levels. From the results, we can see that the recovery performances of all the methods become worse than in their noise-free cases. However, the proposed TRSM-TC still outperforms the other tested methods in the noisy cases in terms of both RSE and SSIM.

6.3. Multispectral Images Completion

In this section, we use the Columbia Multispectral Image Database (CAVE) [57], which contains multispectral images of 32 real-world scenes, each with a spatial resolution of 512 × 512 and 31 bands (varying from 400 nm to 700 nm in 10 nm steps). Each image is resized to 256 × 256 for all spectral bands and rescaled to [0, 1]. For the TR-related methods, the multispectral images of size ( 256 × 256 × 31 ) (3D) are directly reshaped to ( 16 × 16 × 16 × 16 × 31 ) (5D).
In this experiment, we considered the high missing rates of 0.95 and 0.98. RSE, SSIM, and the erreur relative globale adimensionnelle de synthèse (ERGAS) [58] were employed for performance evaluation, where ERGAS is defined as

$$\mathrm{ERGAS} = 100 \sqrt{\frac{1}{s} \sum_{i=1}^{s} \frac{\mathrm{MSE}(\mathcal{X}_{::i}, \mathcal{T}_{::i})}{\mathrm{MEAN}(\mathcal{T}_{::i})^2}}, \qquad (22)$$

where $s$ equals the spectral dimension of the tensor, $\mathcal{X}_{::i}$ and $\mathcal{T}_{::i}$ are the $i$-th frontal slices of the ground-truth tensor $\mathcal{X}$ and the recovered tensor $\mathcal{T}$, $\mathrm{MSE}(\cdot)$ is the mean square error of two matrices, and $\mathrm{MEAN}(\cdot)$ is the mean value of a matrix. For ERGAS, good completion results correspond to smaller values.
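A direct sketch of ERGAS in Equation (22) over the frontal slices (ours; the slice axis is assumed to be the last):

```python
import numpy as np

def ergas(X, T):
    # Equation (22), with s frontal slices along the last axis.
    s = T.shape[-1]
    terms = [np.mean((X[..., i] - T[..., i]) ** 2) / np.mean(T[..., i]) ** 2
             for i in range(s)]
    return 100.0 * np.sqrt(np.mean(terms))
```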
The average performances over the 32 multispectral images under different sampling rates are summarized in Table 4. As shown in Table 4, TRSM-TC performs the best with respect to all evaluation metrics. Moreover, it should be noted that recovering images with a 0.98 missing rate is challenging for image completion algorithms; while most of the algorithms fail to achieve satisfactory completion, the proposed method still achieves high performance.
In order to visually compare the competing methods at a 0.98 missing rate, Figure 6 shows the 31st band of fake and real apples and the 29th band of sponges. We only demonstrate the results of TRSM-TC, TR-WOPT, TRLRF, and FBCP because the other methods fail in this case. The superiority of the proposed method can be observed from the figures.

6.4. Videos Completion

In the video completion experiment, six videos of size ( 112 × 160 × 3 × 32 ) (4D) were chosen from the video trace library [59]. For the TR-related methods, we reshaped them to ( 8 × 14 × 16 × 10 × 3 × 32 ) (6D).
To further demonstrate the superiority of the proposed method in recovering data with a high missing rate, we consider the case where only 2% of the video data are observed; the average completion results are shown in Table 5. From the table, we can see that TRSM-TC achieves the lowest RSE and ERGAS and the highest SSIM among all methods. Moreover, the visual completion results of TRSM-TC, TRLRF, TR-WOPT, and FBCP are shown in Figure 7. It can be seen from the results that the proposed method outperforms all the other algorithms; in particular, it recovers both the finer-grained textures and the coarser-grained structures of the videos, while the other methods cannot.
Furthermore, since the proposed method imposes an explicit low-rank constraint on the TR cores, the resulting TR decomposition tends to be low-rank even when larger initial TR ranks are set. In the following experiment, we test the rank robustness of the proposed algorithm. Figure 8 shows the recovery results of container and akiyo under different choices of TR ranks. The best completion results for container and akiyo are obtained when the TR ranks are set to 15 and 21, respectively. Beyond that, as the TR ranks increase, the recovery performance declines due to overfitting. However, the results indicate that the performance remains relatively stable: in the case of container, even when the TR ranks increase to 21, which is 6 higher than the best rank, the overall structure of the image is still preserved. These findings confirm the rank robustness of the proposed method.

6.5. Parameters Analysis

In this section, we investigate the effects of varying $\lambda$, the hyperparameter that controls the MCP penalty level, and $R_1$, the first TR rank, on the recovery performance of the proposed method. We conduct the study using one image each from the color, multispectral, and video datasets, with missing rates of 0.9 for the color image and 0.98 for both the multispectral image and the video. Furthermore, we set the TR ranks to 12 for the color image and the video and to 8 for the multispectral image. As shown in the upper-left panel of Figure 9, the RSE for both datasets first decreases and then increases once $\lambda$ surpasses a certain threshold. This suggests that both too-small and too-large values of $\lambda$ can hurt the method's performance; the underlying reason is that $\lambda$ governs the regularization strength of the penalty term in the objective function. Based on these results, choosing $\lambda$ within $[0.6, 1.4]$ is recommended. To evaluate the effect of varying $R_1$, we keep all the other TR ranks fixed and only adjust the value of the first TR rank. The results in Figure 9 indicate that tuning the first TR rank has a noticeable impact on performance across the datasets, with the most pronounced improvement observed in the multispectral case; this is because most of the redundancy of a multispectral image lies in the spectral mode [60], which is controlled by the first TR rank. These findings imply that tuning the first TR rank independently can be beneficial in practical applications, particularly when significant low-rankness is present in either the first or the last mode of the tensor.

7. Conclusions

In this paper, we propose a novel tensor sparsity measurement based on TR decomposition, termed TRSM. The proposed TRSM has a unified interpretation with matrix sparsity measurements, taking the Kronecker basis as the fundamental representation component. By incorporating TRSM into the tensor completion problem as an optimization objective and solving it with a modified ADMM, we conducted extensive image and video data completion experiments that demonstrate the effectiveness of the proposed TRSM.
In future research, we will apply TRSM to tensor robust principal component analysis (TRPCA) and other tensor analysis applications to improve their performance.

Author Contributions

Conceptualization, J.Z., Y.Q., Y.M., A.W. and Q.Z.; investigation, J.Z., Y.Q., Y.M. and A.W.; methodology, J.Z., Y.Q., Y.M., A.W. and Q.Z.; project administration, Q.Z.; software, J.Z. and Y.M.; validation, J.Z., Y.Q. and Y.M.; visualization, J.Z. and Y.Q.; writing—original draft, J.Z., Y.Q. and Y.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (No. 62103110, No. 62071132 and No. 62203124) and Guangdong Natural Science Foundation (No. 2023A1515012916).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Bertalmio, M.; Sapiro, G.; Caselles, V.; Ballester, C. Image inpainting. In Proceedings of the 27th Annual Conference on Computer Graphics and Interactive Techniques, New Orleans, LA, USA, 23–28 July 2000; pp. 417–424. [Google Scholar]
  2. Komodakis, N. Image completion using global optimization. In Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06), New York, NY, USA, 17–22 June 2006; Volume 1, pp. 442–452. [Google Scholar]
  3. He, W.; Yokoya, N.; Yuan, L.; Zhao, Q. Remote sensing image reconstruction using tensor ring completion and total variation. IEEE Trans. Geosci. Remote Sens. 2019, 57, 8998–9009. [Google Scholar] [CrossRef]
  4. Ding, T.; Sznaier, M.; Camps, O.I. A rank minimization approach to video inpainting. In Proceedings of the 2007 IEEE 11th International Conference on Computer Vision, Rio De Janeiro, Brazil, 14–21 October 2007; pp. 1–8. [Google Scholar]
  5. Siahsar, M.A.N.; Gholtashi, S.; Abolghasemi, V.; Chen, Y. Simultaneous denoising and interpolation of 2D seismic data using data-driven non-negative dictionary learning. Signal Process. 2017, 141, 309–321. [Google Scholar] [CrossRef]
  6. Nazari Siahsar, M.A.; Gholtashi, S.; Kahoo, A.R.; Chen, W.; Chen, Y. Data-driven multitask sparse dictionary learning for noise attenuation of 3D seismic data. Geophysics 2017, 82, V385–V396. [Google Scholar] [CrossRef]
  7. Chen, Y.; Bai, M.; Chen, Y. Obtaining free USArray data by multi-dimensional seismic reconstruction. Nat. Commun. 2019, 10, 4434. [Google Scholar] [CrossRef] [PubMed]
  8. Chen, D.; Xu, B.; Zhu, T.; Zhou, T.; Zhang, H.T. Anisotropic interaction rules in circular motions of pigeon flocks: An empirical study based on sparse Bayesian learning. Phys. Rev. E 2017, 96, 022411. [Google Scholar] [CrossRef] [PubMed]
  9. Shashua, A. On photometric issues in 3D visual recognition from a single 2D image. Int. J. Comput. Vis. 1997, 21, 99–122. [Google Scholar] [CrossRef]
  10. Benezeth, Y.; Jodoin, P.M.; Emile, B.; Laurent, H.; Rosenberger, C. Comparative study of background subtraction algorithms. J. Electron. Imaging 2010, 19, 033003. [Google Scholar]
  11. Cao, W.; Wang, Y.; Sun, J.; Meng, D.; Yang, C.; Cichocki, A.; Xu, Z. Total variation regularized tensor RPCA for background subtraction from compressive measurements. IEEE Trans. Image Process. 2016, 25, 4075–4090. [Google Scholar] [CrossRef]
  12. Lustig, M.; Donoho, D.L.; Santos, J.M.; Pauly, J.M. Compressed sensing MRI. IEEE Signal Process. Mag. 2008, 25, 72–82. [Google Scholar] [CrossRef]
  13. Uecker, M.; Hohage, T.; Block, K.T.; Frahm, J. Image reconstruction by regularized nonlinear inversion—joint estimation of coil sensitivities and image content. Magn. Reson. Med. 2008, 60, 674–682. [Google Scholar] [CrossRef]
  14. Wang, A.; Zhou, G.; Jin, Z.; Zhao, Q. Noisy Tensor Completion via Orientation Invariant Tubal Nuclear Norm. Pac. J. Optim. 2023, 19, 273–313. [Google Scholar]
  15. Wright, J.; Ganesh, A.; Rao, S.; Ma, Y. Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Matrices via Convex Optimization; Report No. UILU-ENG-09-2210, DC-243; Coordinated Science Laboratory: Urbana, IL, USA, 2009. [Google Scholar]
  16. Zhao, Q.; Meng, D.; Xu, Z.; Zuo, W.; Zhang, L. Robust principal component analysis with complex noise. In Proceedings of the International Conference on Machine Learning, PMLR, Beijing, China, 22–24 June 2014; pp. 55–63. [Google Scholar]
  17. Cai, J.F.; Candès, E.J.; Shen, Z. A singular value thresholding algorithm for matrix completion. SIAM J. Optim. 2010, 20, 1956–1982. [Google Scholar] [CrossRef]
  18. Ma, S.; Goldfarb, D.; Chen, L. Fixed point and Bregman iterative methods for matrix rank minimization. Math. Program. 2011, 128, 321–353. [Google Scholar] [CrossRef]
  19. Recht, B. A simpler approach to matrix completion. J. Mach. Learn. Res. 2011, 12, 3413–3430. [Google Scholar]
  20. Candès, E.J.; Tao, T. The power of convex relaxation: Near-optimal matrix completion. IEEE Trans. Inf. Theory 2010, 56, 2053–2080. [Google Scholar] [CrossRef]
  21. Signoretto, M.; Van de Plas, R.; De Moor, B.; Suykens, J.A. Tensor versus matrix completion: A comparison with application to spectral data. IEEE Signal Process. Lett. 2011, 18, 403–406. [Google Scholar] [CrossRef]
  22. Giannakopoulos, I.I.; Guryev, G.D.; Serrallés, J.E.; Georgakis, I.P.; Daniel, L.; White, J.K.; Lattanzi, R. Compression of volume-surface integral equation matrices via Tucker decomposition for magnetic resonance applications. IEEE Trans. Antennas Propag. 2021, 70, 459–471. [Google Scholar] [CrossRef]
  23. Giannakopoulos, I.I.; Guryev, G.D.; Serrallés, J.E.; Georgakis, I.P.; Daniel, L.; White, J.K.; Lattanzi, R. A tensor train compression scheme for remote volume-surface integral equation interactions. In Proceedings of the 2021 International Applied Computational Electromagnetics Society Symposium (ACES), Chengdu, China, 28–31 July 2021; pp. 1–4. [Google Scholar]
  24. De Lathauwer, L.; De Moor, B. From matrix to tensor: Multilinear algebra and signal processing. In Proceedings of the Institute of Mathematics and Its Applications Conference Series; Citeseer: State College, PA, USA, 1998; Volume 67, pp. 1–16. [Google Scholar]
  25. Sidiropoulos, N.D.; Bro, R.; Giannakis, G.B. Parallel factor analysis in sensor array processing. IEEE Trans. Signal Process. 2000, 48, 2377–2388. [Google Scholar] [CrossRef]
  26. Qiu, Y.; Zhou, G.; Wang, Y.; Zhang, Y.; Xie, S. A generalized graph regularized non-negative tucker decomposition framework for tensor data representation. IEEE Trans. Cybern. 2020, 52, 594–607. [Google Scholar] [CrossRef]
  27. Vasilescu, M.A.O.; Terzopoulos, D. Multilinear analysis of image ensembles: Tensorfaces. In Proceedings of the European Conference on Computer Vision; Springer: Berlin/Heidelberg, Germany, 2002; pp. 447–460. [Google Scholar]
  28. Qiu, Y.; Zhou, G.; Zhao, Q.; Xie, S. Noisy tensor completion via low-rank tensor ring. IEEE Trans. Neural Netw. Learn. Syst. 2022, 35, 1127–1141. [Google Scholar] [CrossRef]
  29. Zheng, Y.B.; Huang, T.Z.; Zhao, X.L.; Zhao, Q.; Jiang, T.X. Fully-connected tensor network decomposition and its application to higher-order tensor completion. In Proceedings of the AAAI Conference on Artificial Intelligence, Online, 2–9 February 2021; Volume 35, pp. 11071–11078. [Google Scholar]
  30. Carroll, J.D.; Chang, J.J. Analysis of individual differences in multidimensional scaling via an N-way generalization of “Eckart-Young” decomposition. Psychometrika 1970, 35, 283–319. [Google Scholar] [CrossRef]
  31. Håstad, J. Tensor rank is NP-Complete. In Proceedings of the International Colloquium on Automata, Languages, and Programming; Springer: Berlin/Heidelberg, Germany, 1989; pp. 451–460. [Google Scholar]
  32. Zheng, Y.B.; Huang, T.Z.; Zhao, X.L.; Jiang, T.X.; Ji, T.Y.; Ma, T.H. Tensor N-tubal rank and its convex relaxation for low-rank tensor recovery. Inf. Sci. 2020, 532, 170–189. [Google Scholar] [CrossRef]
  33. Tucker, L.R. Some mathematical notes on three-mode factor analysis. Psychometrika 1966, 31, 279–311. [Google Scholar] [CrossRef] [PubMed]
  34. Liu, Y.; Long, Z.; Huang, H.; Zhu, C. Low CP rank and tucker rank tensor completion for estimating missing components in image data. IEEE Trans. Circuits Syst. Video Technol. 2019, 30, 944–954. [Google Scholar] [CrossRef]
  35. Tong, X.; Cheng, L.; Wu, Y.C. Bayesian Tensor Tucker Completion with A Flexible Core. IEEE Trans. Signal Process. 2023, 71, 4077–4091. [Google Scholar] [CrossRef]
  36. Cichocki, A.; Lee, N.; Oseledets, I.; Phan, A.H.; Zhao, Q.; Mandic, D.P. Tensor networks for dimensionality reduction and large-scale optimization: Part 1 low-rank tensor decompositions. Found. Trends Mach. Learn. 2016, 9, 249–429. [Google Scholar] [CrossRef]
  37. Cichocki, A.; Phan, A.H.; Zhao, Q.; Lee, N.; Oseledets, I.V.; Sugiyama, M.; Mandic, D. Tensor networks for dimensionality reduction and large-scale optimizations: Part 2 applications and future perspectives. arXiv 2017, arXiv:1708.09165. [Google Scholar]
  38. Oseledets, I.V. Tensor-train decomposition. SIAM J. Sci. Comput. 2011, 33, 2295–2317. [Google Scholar] [CrossRef]
  39. Oseledets, I.V. Approximation of $2^d \times 2^d$ matrices using tensor decomposition. SIAM J. Matrix Anal. Appl. 2010, 31, 2130–2145. [Google Scholar] [CrossRef]
  40. Zhao, Q.; Zhou, G.; Xie, S.; Zhang, L.; Cichocki, A. Tensor ring decomposition. arXiv 2016, arXiv:1606.05535. [Google Scholar]
  41. Bengua, J.A.; Phien, H.N.; Tuan, H.D.; Do, M.N. Efficient tensor completion for color image and video recovery: Low-rank tensor train. IEEE Trans. Image Process. 2017, 26, 2466–2479. [Google Scholar] [CrossRef]
  42. Yuan, L.; Li, C.; Mandic, D.; Cao, J.; Zhao, Q. Tensor ring decomposition with rank minimization on latent space: An efficient approach for tensor completion. In Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 27 January–1 February 2019; Volume 33, pp. 9151–9158. [Google Scholar]
  43. Yu, J.; Zhou, G.; Li, C.; Zhao, Q.; Xie, S. Low tensor-ring rank completion by parallel matrix factorization. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 3020–3033. [Google Scholar] [CrossRef] [PubMed]
  44. Xue, J.; Zhao, Y.; Liao, W.; Chan, J.C.W.; Kong, S.G. Enhanced sparsity prior model for low-rank tensor completion. IEEE Trans. Neural Netw. Learn. Syst. 2019, 31, 4567–4581. [Google Scholar] [CrossRef] [PubMed]
  45. Xue, J.; Zhao, Y.; Huang, S.; Liao, W.; Chan, J.C.W.; Kong, S.G. Multilayer sparsity-based tensor decomposition for low-rank tensor completion. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 6916–6930. [Google Scholar] [CrossRef]
  46. Zhao, Q.; Meng, D.; Kong, X.; Xie, Q.; Cao, W.; Wang, Y.; Xu, Z. A novel sparsity measure for tensor recovery. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; pp. 271–279. [Google Scholar]
  47. Xie, Q.; Zhao, Q.; Meng, D.; Xu, Z. Kronecker-basis-representation based tensor sparsity and its applications to tensor recovery. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 40, 1888–1902. [Google Scholar] [CrossRef]
  48. Kolda, T.G.; Bader, B.W. Tensor decompositions and applications. SIAM Rev. 2009, 51, 455–500. [Google Scholar] [CrossRef]
  49. Zhang, C.H. Nearly unbiased variable selection under minimax concave penalty. Ann. Stat. 2010, 38, 894–942. [Google Scholar] [CrossRef]
  50. Xu, Y.; Hao, R.; Yin, W.; Su, Z. Parallel matrix factorization for low-rank tensor completion. arXiv 2013, arXiv:1312.1254. [Google Scholar] [CrossRef]
  51. Zhao, Q.; Zhang, L.; Cichocki, A. Bayesian CP factorization of incomplete tensors with automatic rank determination. IEEE Trans. Pattern Anal. Mach. Intell. 2015, 37, 1751–1763. [Google Scholar] [CrossRef]
  52. Morup, M.; Dunlavy, D.M.; Acar, E.; Kolda, T.G. Scalable tensor factorizations with missing data. In Technical Report; Sandia National Laboratories: Albuquerque, NM, USA; Livermore, CA, USA, 2010. [Google Scholar]
  53. Liu, J.; Musialski, P.; Wonka, P.; Ye, J. Tensor completion for estimating missing values in visual data. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 35, 208–220. [Google Scholar] [CrossRef]
  54. Yuan, L.; Cao, J.; Zhao, X.; Wu, Q.; Zhao, Q. Higher-dimension tensor completion via low-rank tensor ring decomposition. In Proceedings of the 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Honolulu, HI, USA, 12–15 November 2018; pp. 1071–1076. [Google Scholar]
  55. Yuan, L.; Zhao, Q.; Gui, L.; Cao, J. High-order tensor completion via gradient-based optimization under tensor train format. Signal Process. Image Commun. 2019, 73, 53–61. [Google Scholar] [CrossRef]
  56. Wang, Z.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [Google Scholar] [CrossRef] [PubMed]
  57. Yasuma, F.; Mitsunaga, T.; Iso, D.; Nayar, S.K. Generalized assorted pixel camera: Postcapture control of resolution, dynamic range, and spectrum. IEEE Trans. Image Process. 2010, 19, 2241–2253. [Google Scholar] [CrossRef] [PubMed]
  58. Wald, L. Data Fusion: Definitions and Architectures: Fusion of Images of Different Spatial Resolutions; Presses des MINES: Paris, France, 2002. [Google Scholar]
  59. Seeling, P.; Reisslein, M. Video traffic characteristics of modern encoding standards: H.264/AVC with SVC and MVC extensions and H.265/HEVC. Sci. World J. 2014, 2014, 189481. [Google Scholar] [CrossRef]
  60. Chang, Y.; Yan, L.; Chen, B.; Zhong, S.; Tian, Y. Hyperspectral image restoration: Where does the low-rank property exist. IEEE Trans. Geosci. Remote Sens. 2020, 59, 6869–6884. [Google Scholar] [CrossRef]
Figure 1. Illustrations of the TR decomposition.
Figure 2. Illustrations of TR decomposition and its Kronecker bases representation.
Figure 3. Illustration of convergence property for TRSM-TC under different choices of TR-ranks. A synthetic tensor with TR structure (size (7 × 8 × 7 × 8 × 7) with TR-rank [5, 5, 5, 5, 5], missing rate 0.8) is tested. The experiment records the change in the objective function values along the number of iterations. Each independent experiment is conducted 100 times, the average results are shown in the graphs, and the convergence curve is presented.
Figure 4. The eight benchmark color images.
Figure 5. Visual completion result of the house image for a missing rate of 0.9 with 5D tensorization and TR-ranks 12. Top row from left to right: the original image, its 0.9 missing case, and the ground truth error image. The second and third rows show the images recovered by TRSM-TC and their corresponding error images, respectively; the fourth and fifth rows show the images recovered by TRLRF and their corresponding error images, respectively; the sixth and seventh rows show the images recovered by TR-WOPT and their corresponding error images, respectively. (a) The fully recovered images. (b) The images recovered by the first 230,000 Kronecker bases. (c) The images recovered by the first 220,000 Kronecker bases. (d) The images recovered by the first 210,000 Kronecker bases. (e) The images recovered by the first 200,000 Kronecker bases. In each figure, we zoom in on the small red box located on the left and display the result in the upper right area.
Figure 6. Visual completion result of the 31st band of fake and real apples and the 29th band of sponges of the 0.98 missing rate. The first and second rows represent the recovery results of fake and real apples and sponges, respectively, and from left to right are the original images, the 0.98 missing rate case of the images, and the images recovered by algorithms TRSM-TC, TR-WOPT, TRLRF, and FBCP, respectively. In each figure, we zoom in on the small red box and display the result in the larger red box.
Figure 7. Visual completion results of 6 videos under a 0.98 missing rate. The columns from left to right: the original videos, the 0.98 missing cases and the videos recovered by algorithms TRSM-TC, TRLRF, TR-WOPT, and FBCP, respectively.
Figure 8. Visual completion result of the 3rd frame of the container and the 24th frame of akiyo using TRSM-TC. The top row shows the original images and its 0.98 missing cases for container and akiyo. The second and third rows show the recovered results of container and akiyo under different TR-ranks.
Figure 9. Ablation studies examining the effects of λ and the first TR-rank. In figures focusing on the first TR-rank, the horizontal dashed line represents results achieved when the first TR-rank is set equal to the other ranks.
Table 1. The average performance comparison of 7 competing TC methods with different missing rates on 8 color images. All the methods' best performances over different tensorization schemes are highlighted in bold.

Methods  | mr = 0.8                          | mr = 0.9
         | 3D      5D      9D      VDT       | 3D      5D      9D      VDT
CP-WOPT  | 0.1402  0.1607  0.2335  0.2403    | 0.2038  0.2129  0.2663  0.2849
FBCP     | 0.1343  0.1551  0.2343  0.2532    | 0.1799  0.1984  0.2716  0.2812
HaLRTC   | 0.1327  0.2801  0.3276  0.3291    | 0.1889  0.3309  0.3565  0.3581
Tmac     | 0.1242  0.1373  0.1326  0.1463    | 0.1785  0.1853  0.1964  0.2002
TRLRF    | 0.1279  0.1148  0.1299  0.1349    | 0.1972  0.1610  0.1646  0.1982
TR-WOPT  | 0.1396  0.1044  0.1143  0.1278    | 0.2092  0.1434  0.1433  0.1695
TRSM-TC  | 0.1343  0.0925  0.1320  0.1340    | 0.2179  0.1283  0.1734  0.1716
Table 2. The averaged recovery SSIM between 7 competing TC methods with different missing rates on 8 color images. The best results are highlighted in bold.

MR   | CP-WOPT | FBCP   | HaLRTC | Tmac   | TRLRF  | TR-WOPT | TRSM-TC
0.8  | 0.5589  | 0.6341 | 0.6726 | 0.6348 | 0.6814 | 0.6981  | 0.7627
0.9  | 0.4003  | 0.4469 | 0.4946 | 0.4492 | 0.493  | 0.551   | 0.6174
Table 3. The average performance comparison of 7 competing TC methods with different missing rates and SNRs on 8 color images. The upper and lower sub-tables show the recovery performances under missing rates 0.8 and 0.9, respectively. The best RSE and SSIM are highlighted in bold.

Missing rate 0.8
SNR | CP-WOPT     | FBCP        | HaLRTC      | Tmac        | TRLRF       | TR-WOPT     | TRSM-TC
    | RSE   SSIM  | RSE   SSIM  | RSE   SSIM  | RSE   SSIM  | RSE   SSIM  | RSE   SSIM  | RSE   SSIM
31  | 0.143 0.546 | 0.137 0.622 | 0.136 0.658 | 0.128 0.621 | 0.117 0.674 | 0.108 0.691 | 0.098 0.740
26  | 0.147 0.543 | 0.138 0.597 | 0.141 0.632 | 0.136 0.590 | 0.121 0.650 | 0.112 0.667 | 0.103 0.719
21  | 0.156 0.495 | 0.147 0.540 | 0.153 0.580 | 0.158 0.508 | 0.133 0.602 | 0.123 0.618 | 0.114 0.657

Missing rate 0.9
SNR | CP-WOPT     | FBCP        | HaLRTC      | Tmac        | TRLRF       | TR-WOPT     | TRSM-TC
    | RSE   SSIM  | RSE   SSIM  | RSE   SSIM  | RSE   SSIM  | RSE   SSIM  | RSE   SSIM  | RSE   SSIM
31  | 0.208 0.402 | 0.184 0.440 | 0.192 0.487 | 0.182 0.445 | 0.163 0.529 | 0.145 0.553 | 0.129 0.604
26  | 0.210 0.393 | 0.186 0.428 | 0.195 0.475 | 0.187 0.426 | 0.164 0.525 | 0.148 0.535 | 0.133 0.584
21  | 0.221 0.367 | 0.191 0.397 | 0.205 0.443 | 0.205 0.379 | 0.172 0.482 | 0.156 0.498 | 0.143 0.540
Table 4. The average performance comparison of 7 competing TC methods with different missing rates on 32 MSI images. The best results achieved over different recovery metrics are highlighted in bold.

MR    | Metric | CP-WOPT  | FBCP     | HaLRTC   | Tmac     | TRLRF    | TR-WOPT  | TRSM-TC
0.95  | RSE    | 0.1835   | 0.1352   | 0.3343   | 0.1764   | 0.1381   | 0.1344   | 0.1162
      | ERGAS  | 164.1529 | 123.0992 | 309.2216 | 163.4394 | 128.0540 | 123.4724 | 107.6782
      | SSIM   | 0.8506   | 0.8716   | 0.7672   | 0.8251   | 0.853    | 0.8805   | 0.896
0.98  | RSE    | 0.6466   | 0.2374   | 0.6314   | 0.3901   | 0.2696   | 0.2308   | 0.1980
      | ERGAS  | 522.4653 | 216.4430 | 591.0829 | 354.5717 | 249.1188 | 210.0180 | 182.5890
      | SSIM   | 0.4372   | 0.7461   | 0.4469   | 0.6186   | 0.6653   | 0.7700   | 0.7754
Table 5. The average video completion performances of 7 competing TC methods with a missing rate of 0.98. The best results achieved over different recovery metrics are highlighted in bold.

Metric | CP-WOPT  | FBCP     | HaLRTC   | Tmac     | TRLRF    | TR-WOPT  | TRSM-TC
RSE    | 0.1775   | 0.1165   | 0.4204   | 0.2316   | 0.1067   | 0.1082   | 0.0858
ERGAS  | 218.6831 | 143.0337 | 515.8984 | 286.2220 | 131.8164 | 133.7670 | 107.7512
SSIM   | 0.5941   | 0.7413   | 0.4073   | 0.5392   | 0.7493   | 0.7432   | 0.8079
