Nothing Special   »   [go: up one dir, main page]

Next Article in Journal
Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems
Previous Article in Journal
A Machine Learning Approach for Gearbox System Fault Diagnosis
You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Field Theoretical Approach for Signal Detection in Nearly Continuous Positive Spectra I: Matricial Data

by
Vincent Lahoche
1,
Dine Ousmane Samary
1,2,* and
Mohamed Tamaazousti
1
1
Université Paris-Saclay, CEA, List, F-91120 Palaiseau, France
2
International Chair in Mathematical Physics and Applications (ICMPA-UNESCO Chair), University of Abomey-Calavi, Cotonou 072 P.O. Box 50, Benin
*
Author to whom correspondence should be addressed.
Entropy 2021, 23(9), 1132; https://doi.org/10.3390/e23091132
Submission received: 25 July 2021 / Revised: 14 August 2021 / Accepted: 25 August 2021 / Published: 31 August 2021
(This article belongs to the Section Signal and Data Analysis)
Figure 1
<p>Qualitative picture of the signal detection issue in a nearly continuous spectrum.</p> ">
Figure 2
<p>Qualitative illustration of the RG flow. The UV scale is described by the classical action <math display="inline"><semantics> <mi mathvariant="script">S</mi> </semantics></math>, while the IR scale is described by an effective object <math display="inline"><semantics> <mi mathvariant="sans-serif">Γ</mi> </semantics></math>, where microscopic effects are hidden in the different parameters involved in its definition.</p> ">
Figure 3
<p>Qualitative illustration of the meaning of the effective averaged action <math display="inline"><semantics> <msub> <mi mathvariant="sans-serif">Γ</mi> <mi>k</mi> </msub> </semantics></math>, as the effective action of the UV degrees of freedom which have been integrated-out.</p> ">
Figure 4
<p>Qualitative illustration of the RG flow behavior. Some different UV initial conditions lead to the same (universal) IR physics, up to negligible differences with regard to the experimental precision.</p> ">
Figure 5
<p>A step of the RG flow. On the left, integration of momenta between <math display="inline"><semantics> <mrow> <mi>s</mi> <mi mathvariant="sans-serif">Λ</mi> </mrow> </semantics></math> and <math display="inline"><semantics> <mi mathvariant="sans-serif">Λ</mi> </semantics></math>. On the right, dilatation of the remaining momenta with a factor <math display="inline"><semantics> <mrow> <mn>1</mn> <mo>/</mo> <mi>s</mi> </mrow> </semantics></math>.</p> ">
Figure 6
<p>Numerical flow associated to the MP law (data without signal) and for the quartic truncation. The main directions of the flow are highlighted by the black arrows (which are oriented from UV to IR). We observe the existence of a region reminiscent of the standard Wilson–Fisher fixed point.</p> ">
Figure 7
<p>The canonical dimension for MP distribution with <math display="inline"><semantics> <mrow> <mi>K</mi> <mo>=</mo> <mn>1</mn> </mrow> </semantics></math> and <math display="inline"><semantics> <mrow> <mi>σ</mi> <mo>=</mo> <mn>0.5</mn> </mrow> </semantics></math> (on the right), <math display="inline"><semantics> <mrow> <mi>σ</mi> <mo>=</mo> <mn>1</mn> </mrow> </semantics></math> (in the middle) and <math display="inline"><semantics> <mrow> <mi>σ</mi> <mo>=</mo> <mn>2</mn> </mrow> </semantics></math> (on the left). The purple curve corresponds to the MP distribution.</p> ">
Figure 8
<p>RG trajectories starting from <math display="inline"><semantics> <mrow> <mi>k</mi> <mo>=</mo> <msub> <mi mathvariant="sans-serif">Λ</mi> <mn>0</mn> </msub> </mrow> </semantics></math> for <math display="inline"><semantics> <msub> <mi>u</mi> <mn>2</mn> </msub> </semantics></math> (blue curves), <math display="inline"><semantics> <msub> <mi>u</mi> <mn>4</mn> </msub> </semantics></math> (red curves), and <math display="inline"><semantics> <msub> <mi>u</mi> <mn>6</mn> </msub> </semantics></math> (green curves).</p> ">
Figure 9
<p>A typical DNMP distribution for <math display="inline"><semantics> <mrow> <mi>P</mi> <mo>=</mo> <mn>1500</mn> </mrow> </semantics></math> and <math display="inline"><semantics> <mrow> <mi>N</mi> <mo>=</mo> <mn>2000</mn> </mrow> </semantics></math>.</p> ">
Figure 10
<p>Numerical flow associated to a DNMP distribution in the learnable region.</p> ">
Figure 11
<p>Three points of view of the compact region <math display="inline"><semantics> <msub> <mi mathvariant="script">R</mi> <mn>0</mn> </msub> </semantics></math> (illustrated with purple dots) in the vicinity of the Gaussian fixed point (illustrated with a black dot). In this region RG trajectories end in the symmetric phase, and thus are compatible with a symmetry restoration scenario for initial conditions corresponding to an explicit symmetry breaking. The top plots are associated to the case of pure noise and the bottom plots are, respectively, associated to the case with signal.</p> ">
Figure 12
<p>Illustration of the evolution of the potential associated to an example of initial conditions of the coupling <math display="inline"><semantics> <msub> <mi>u</mi> <mn>2</mn> </msub> </semantics></math>, <math display="inline"><semantics> <msub> <mi>u</mi> <mn>4</mn> </msub> </semantics></math>, and <math display="inline"><semantics> <msub> <mi>u</mi> <mn>6</mn> </msub> </semantics></math> where the RG trajectories end in the symmetric phase in the case of pure noise (on the left) and stay in the non-symmetric phase when we add a signal (on the right).</p> ">
Figure 13
<p>Illustration of the evolution of the <math display="inline"><semantics> <msub> <mi>u</mi> <mn>2</mn> </msub> </semantics></math> for eigenvalues between <math display="inline"><semantics> <mrow> <mn>2.5</mn> </mrow> </semantics></math> and <math display="inline"><semantics> <mrow> <mn>3.4</mn> </mrow> </semantics></math> in the case of pure noise (NMP distribution). We can see that the values of <math display="inline"><semantics> <msub> <mover accent="true"> <mi>u</mi> <mo stretchy="false">¯</mo> </mover> <mn>2</mn> </msub> </semantics></math> for these examples are of the same magnitude as <math display="inline"><semantics> <mrow> <mi>N</mi> <mo>=</mo> <mn>2000</mn> </mrow> </semantics></math>. This highlights the existence of some RG trajectories associated to physically relevant states in the deep infrared.</p> ">
Figure 14
<p>Illustration of the evolution of <math display="inline"><semantics> <mi>κ</mi> </semantics></math>, obtained with the LPA representation, for eigenvalues between <math display="inline"><semantics> <mrow> <mn>2.5</mn> </mrow> </semantics></math> and <math display="inline"><semantics> <mrow> <mn>3.4</mn> </mrow> </semantics></math> in the case of data without signal. For some RG trajectories (on the left), <math display="inline"><semantics> <mi>κ</mi> </semantics></math> decreases to zero, which correspond to a restoration of the symmetry. For other RG trajectories (on the right), <math display="inline"><semantics> <mi>κ</mi> </semantics></math> stays almost constant in the range of eigenvalues that we consider, and does not lead to a restoration of the symmetry.</p> ">
Figure 15
<p>Illustration of the evolution of the potential associated to an example of initial conditions of the coupling <math display="inline"><semantics> <msub> <mi>u</mi> <mn>2</mn> </msub> </semantics></math>, <math display="inline"><semantics> <msub> <mi>u</mi> <mn>4</mn> </msub> </semantics></math>, and <math display="inline"><semantics> <msub> <mi>u</mi> <mn>6</mn> </msub> </semantics></math>. We see that the RG trajectories, obtained with the LPA representation, end in the symmetric phase in the case of pure noise (on the left) and stay in the non symmetric phase when we add a signal (on the right).</p> ">
Figure 16
<p>Illustration of the evolution of <math display="inline"><semantics> <mi>η</mi> </semantics></math>, obtained by the LPA’ representation, for eigenvalues between <math display="inline"><semantics> <mrow> <mn>2.5</mn> </mrow> </semantics></math> and <math display="inline"><semantics> <mrow> <mn>3.4</mn> </mrow> </semantics></math> in the case of data without signal. We see that for these RG trajectories, the anomalous dimension <math display="inline"><semantics> <mi>η</mi> </semantics></math> remains small. This highlights that there is no significant change when we use the LPA’ representation instead of the LPA one.</p> ">
Review Reports Versions Notes

Abstract

:
Renormalization group techniques are widely used in modern physics to describe the relevant low energy aspects of systems involving a large number of degrees of freedom. Those techniques are thus expected to be a powerful tool to address open issues in data analysis when datasets are highly correlated. Signal detection and recognition for a covariance matrix having a nearly continuous spectra is currently one of these opened issues. First, investigations in this direction have been proposed in recent investigations from an analogy between coarse-graining and principal component analysis (PCA), regarding separation of sampling noise modes as a UV cut-off for small eigenvalues of the covariance matrix. The field theoretical framework proposed in this paper is a synthesis of these complementary point of views, aiming to be a general and operational framework, both for theoretical investigations and for experimental detection. Our investigations focus on signal detection. They exhibit numerical investigations in favor of a connection between symmetry breaking and the existence of an intrinsic detection threshold.

1. Introduction

Statistical physics was born to deal with systems involving a very large number of interacting degrees of freedom, to extract relevant features at large scales, when classical mechanics are no longer a feasible option [1]. These relevant features generally take the form of an effective description involving a small number of parameters related to very large number of parameters used to describe microscopic states. From the point of view of information theory, statistical physics looks like a consequence of a statistical inference based on the maximum entropy estimate, disregarding the specific aspects of the microscopic problem, and it is for this reason a general paradigm [2,3,4]. In the last century, Wilson and Kadanoff [5,6,7,8,9] discovered that an effective description valid for large scales can be deduced from the knowledge of a microscopic description following a coarse-graining procedure called renormalization group (RG), which averages recursively over the fluctuations that have short wavelengths, with respect to a reference scale. In modern physics [10,11], RG becomes a powerful tool to address the questions of universality and simplicity of the large-scale physical laws [12,13,14,15,16,17,18,19].
Modern data analysis aims to deal with very large datasets, which are strongly correlated, and principal component analysis (PCA) is one of the most popular methods to extract information, i.e., to find relevant features [20,21,22,23,24,25,26,27,28,29,30,31]. Even though different realisations are present in the literature, the principle is always the same. PCA is a linear projection onto the lower dimensional subspace spanned by the eigendirections corresponding to the larger eigenvalues (the relevant features). For the datasets taking the form of a suitably mean-shifted and normalized N × P matrix X a i , with a { 1 , , P } and i { 1 , , N } , the covariance matrix C (if one wishes to work with different random variables, for instance associated to different systems, one should also reduce the matrix X, and work with the correlation matrix) is defined as the average of X T X , describing correlations between type-i variables. Standard PCA works well when the largest eigenvalues can be easily distinguished from the other ones. In this case, a small number of modes captures the most relevant features of the covariance. Such an effective description is reminiscent of the famous large river effect of the RG flow in statistical physics, referring to the general property of this flow to be dragged toward a finite-dimensional subspace corresponding to relevant and marginal operators for sufficiently large scales [32].
The connection between PCA and RG can be traced from information theory [33,34,35,36,37,38,39,40] as a consequence of the ability of the RG to extract large scale relevant features of a microscopic system. From an information theory point of view, the RG describes how a theory valid for small distance physics flows toward a simpler theory (i.e., with a reduced set of parameters) at larger distance, as information is progressively lost (due to coarse-graining). In turn, the inference problem of recovering the elementary theory from the knowledge of the large scale simpler model is the same as finding the equivalent class of elementary models having the same large scale limits. The distinction between elements of the different equivalent family is based on the existence of an intrinsic criterion which quantifies the intrinsic ability for a perturbation to a given microscopic state to survive at large scale. The relevant operators of quantum fields, that survives at a large distance, are the only ones whose provide a clean distinction between asymptotic states for some large scale observer. Relevance can be defined intrinsically from information theory, according to a specific notions of distance between states. In information geometry theory, where the state space looks as a differentiable manifold, a computable notion of distance is given by the Fisher information metric, and the notion of equivalence class can be defined with respect to this distance, since all the asymptotic states having distance smaller than some working precision are undistinguishable [36,37,38,39].
In the case of a continuous spectrum [21,22,41,42], the standard PCA fails to provide a clean separation between noise and information. We propose to exploit the link between PCA and RG to address this separation with an objective physical criterion. A first step in this direction was done in [20,21,22]. The authors considered an effective field theory framework, where the separation between information and noise looks like an arbitrary cut-off scale Λ in the eigenvalue spectrum of the covariance matrix (see Figure 1). In [23], the authors introduced the non-perturbative Wetterich–Morris framework. In that formalism the bare (i.e., the microscopic) action is left unchanged, but infrared contributions are suppressed from the effective action. Hence, the formalism focus on the effective action Γ k for the integrated-out degrees of freedom up to the scale k rather than the microscopic action for the remaining (infrared) degrees of freedom. In some sense, we rather focus on determining what the “noise” is than what the “information” is, and k looks as an IR rather UV cut-off (the notion of infrared (IR) and ultraviolet (UV) are related, respectively, to the smaller and higher values of the scale parameter k.). In this previous study however, the investigations of the author were essentially restricted to the power-counting aspects for distributions around the universal Marchenko–Pastur (MP) noise [43]. The conclusions were: (1) for a purely noisy data the first quartic perturbation to the Gaussian distribution is relevant from coarse-graining; (2) a strong enough signal must change the power counting to make the perturbation irrelevant, and the effective description goes perturbatively toward the Gaussian point.
The aim of the following work is to consider the same question and to go beyond these dimensional aspects. More precisely, we address the following question: within our field theoretical formalism, is it possible to find objective criteria to decide if a continuous spectrum associated to a dataset contains information or not? Focusing on deformations around MP law, our result shows that in this formalism, the presence of a signal in the spectrum can be identified from a symmetry breaking corresponding to a non-zero value of the field theory vacuum. The symmetry is progressively restored as the signal strength is turned to zero. Note that the conclusions of this paper have been extended to tensorial PCA in [25].
Let us note that phase transitions are usually associated to signal detection in PCA [44,45,46]. A classic example is provided by the one-spiked matrix model [47], which exhibits a sharp phase transition in the larger eigenvalue distribution. In that elementary case the signal is materialized by a single vector u R N with length 1 and the noise by a N × N matrix M along the Gaussian orthogonal ensemble. Let us denote by λ the size (intensity) of the signal. For λ < 1 , the largest eigenvalue λ c in the sum M + λ u u T follows semi-circle law with Tracy–Widom distribution for N . In contrast, for λ > 1 , the largest eigenvalue converges toward λ c = λ + 1 / λ , and its statistic follows a Gaussian error function. From the point of view of signal detection, this result as a consequence that as soon as λ > 1 , the signal can be easily detected using standard iterative methods; whereas detection is impossible in practice for λ < 1 . The main difference with our point of view is that we do not focus on the largest eigenvalue distribution, but on the ability of the spectrum to support large-scale non-trivial structures embedded in a field theory.
The manuscript is organized as follows. Section 2 provides a summary of the field theoretical framework introduced in [23], specifying some subtle points not discussed in this previous work, especially with regard to the kinetic classical action. Moreover, anticipating the results of the next section, we discuss the relevance of interactions and argue the existence of a “wall” in generalized momenta, defining a physical cut-off, below which the relevant sector diverges and the field theoretical embedding breaks down. Working above this wall, we show that only a few number of local couplings are relevant, essentially quartic and sixtic couplings for small perturbations around MP law. We close the section with the Wetterich–Morris non-perturbative formalism [21,22,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52], that we discuss in this context. Section 3 is devoted to an analysis of the exact RG equation using standard local potential approximation (LPA) and its improved version (LPA ) which takes into account field renormalization effects and anomalous dimension. Within this approximation scheme, we are able to identify the presence of a signal (materialized by a few number of discrete spikes disturbing the purely noisy data) as a lack of symmetry restoration in the deep IR, for k 1 / N . Finally, in Section 4, we summarize our investigations, and discuss some open issues, together with a plan to address them in the future.

2. The RG in Field Theory

Despite its origins in particle physics, the RG is probably one of the most important and universal concepts discovered during the last century. RG enables to understand physics at different scales. Technically, in the field theoretical framework, this is a consequence of the ability of the microscopic degrees of freedom to be reabsorbed by a set of parameters, that predictivity is required to be finite to design an effective field theory. Such a theory has, thus, the property to be valid only up to a certain scale, where the more fundamental degrees of freedom cannot be distinguished from their effective description. The same procedure can then be repeated, resulting in an effective chain of theories. This coarse-graining is at the core of the RG idea, as originally formulated by Kadanoff and Wilson [5,6,7,8,9] (see Figure 2). In particular, the RG aims to address the following question: to what extent can two different microscopic states be distinguishable under coarse-graining? To be more formal, let us consider a system built with a large number N of interacting degrees of freedom ζ i for i [ 1 , N ] . A microscopic state is a set ζ { ζ 1 , ζ 2 , , ζ N } . The nature of the elementary states ζ i describing a single degree of freedom depends on the problem that one considers (for the standard Ising model, for instance [53], ζ i ’s are discrete variables ζ i = ± 1 ). Assuming the maximum entropy prescription, these states are associated to a probability distribution p [ ζ ] = e S [ ζ ] , where S [ ζ ] is called classical action or fundamental Hamiltonian in physics. This microscopic level is conventionally called ultraviolet scale (UV scale), and the dominant configurations, say classics, are given by the saddle point equation S / ζ i = 0 . The momenta of the distribution are generated by:
Z [ j ] = ζ p [ ζ ] e j ζ ,
where j { j 1 , j 2 , , j N } and j ζ : = j 1 ζ 1 + + j N ζ N . Moreover, the formal sum ∑ have to be replaced by a functional path integral for continuous degrees of freedom. The classical field M : = { M i } is defined as the means value of ζ i : M i : = ζ ζ i p [ ζ ] . The cumulants of the distribution are generated by the free energy W [ j ] = ln Z [ j ] , taking successive derivatives with respect to the source j and setting j = 0 . In the field theoretical vocabulary, W [ j ] is the generating functional of connected correlations functions (i.e., the correlations functions which cannot be factorized as a product of two-point correlation functions). The physical configurations for M are fixed, for j = 0 by an equation taking the same form as the saddle point equation for ζ , but involving the effective action Γ [ M ] , Γ / M i = 0 . This effective action is formally defined as the Legendre transform of the free energy:
Γ [ M ] + W [ j ] = j M .
We call infrared scale (IR scale) the effective description where fluctuations are integrated out. From a statistical point of view, Γ [ M ] is the generating function of one particle irreducible (1PI) diagrams or effective vertices, in the sense that they represent effective couplings between components of the field M entering in the construction of the functional Γ [ M ] .
Then, the Wilson RG procedure [17,18] assumes a progressive dilution of the information, integrating-out progressively the fluctuating degrees of freedom following a given slicing. In that way, RG constructs a path from UV to IR scales (see Figure 2), in which each step provides an effective description, associated to an effective classical action describing fluctuations of non-integrated degrees of freedom. Thus, the effect of the microscopic degrees of freedom that we ignore is hidden in the parameters defining this action. Thus, RG transformations define a mapping from an action to another at different scales, and the successive positions of the classical action through the functional space of allowed actions construct a trajectory, starting in the UV and ending in the IR. This functional space is usually called theory space. Along this trajectory, the couplings (i.e., the parameters defining the action) change; and the RG equations take the form of a dynamical system describing this running behavior of the couplings along RG trajectories.
However, the existence of such a path is guaranteed only if it is possible to define a criterion for the choice of the renormalization scale of such fluctuations. In standard field theory, this criterion is given by the energy of the modes, the high energy modes being associated to small scales whereas low energy modes are associated to large scales. These energy levels correspond to the eigenvalues of some physically relevant operator. In standard field theory, for instance, for a classical action describing a scalar field ϕ ( x ) on R d ,
S [ ϕ ] : = 1 2 R d d d x ϕ ( x ) ( Δ + m 2 ) ϕ ( x ) + g 4 ! R d d d x ϕ 4 ( x ) ,
the operator allowing to classify the modes is the kinetic operator K = Δ + m 2 ; or simply the Laplacian Δ , whose eigenvectors are the Fourier modes.

2.1. A Field Theoretical Embedding for Data Analysis

As in statistical physics, in the big data area, a state is a point in a space with a very large number of dimensions. PCA, in turn, aims to find the most relevant features out of a very large number of variables. In the case of a continuous spectrum, the relevance is fixed by some sensitivity threshold, discriminating between large eigenvalues that we keep and small eigenvalues that we discard. In other words, PCA constructs effective descriptions, which are valid as long as we can ignore the small eigenvalue effects. This picture is reminiscent of what RG do. From this, it could exist a criterion to distinguish between a noisy spectrum and another containing information, based on the differences between their respective effective asymptotic states. Let us consider the example of the field theory described by the classical action (3). The dimension of coupling constants like g depends on the dimension of the background space R d . In this example, [ g ] = d 4 . The relevance in the vicinity of the Gaussian fixed point ( g 0 ) depends on the value of this dimension. For d > 4 , the operator ϕ 4 ( x ) is irrelevant, meaning that for sufficiently large scale, the theory is essentially Gaussian. In the opposite situation, for d < 4 , the RG flow moves away from the Gaussian fixed point (The point m 2 = g = 0 is a fixed point of the RG flow, thus any partial integration leads to another Gaussian model in virtue of the Gaussian integration properties. Thus, the relevance of the operators in the deep IR is usually determined by the dimension of the space.) However, the latter determines also the shape of the distribution for the Laplacian eigenvalues p 2 , which is ρ ( p 2 ) = ( p 2 ) d / 2 1 , and relevance can be alternatively viewed as a property of the momentum distribution, without reference to the background space dimension. This is exactly what the authors of references [20,21,22,23] did: the scaling dimension is defined through the coarse graining, from the requirement that no explicit scale dependence occurs in the flow equations, excepts eventually at the linear level (see Section 3.1).
We propose a framework allowing to construct a field theoretical approximation of the fundamental RG flow. This point of view is familiar in condensed matter physics, and especially in the physics of critical phenomena. The classical example is the Ising model, whose critical behavior may be well approximated by an effective field theory in the critical domain [7]. Let us consider a set of N random variables, ϕ = { ϕ 1 , ϕ 2 , , ϕ N } R N ; providing an archetypal example of field. Moreover, we assume that there exists a distribution p [ ϕ ] able to reproduce the covariance matrix C , at least for sufficiently large scale (in eigenvalue space). An elementary formal realization of this is given by the Gaussian states:
p [ ϕ ] exp 1 2 i , j ϕ i C i j 1 ϕ j ,
ensuring that ϕ i ϕ j = C i j . The bracket notation X [ ϕ ] defines the mean value of X with respect to the distribution p [ ϕ ] . For such a Gaussian description, all the non-vanishing momenta of the distributions, ϕ i ϕ j ϕ k reduce to a sum of the product of 2-points function following Wick theorem, and only the second cumulant does not vanish. In the field theory language, a theory with this property is said to be free. From an RG point of view, this description makes sense only if the Gaussian point is stable, i.e., if any perturbation around the Gaussian point ends up disappearing after some steps of the RG. In the next section we show that for MP law the Gaussian point is unstable. Moreover for a realistic dataset, correlations for n-point functions fail to be a product of 2-point functions, as a purely Gaussian law would have required. In the field theory language, this failing of the Gaussian description indicates the presence of interactions, materialized as non-Gaussian terms in the action. The coupling g ϕ 4 ( x ) in (3) provides an elementary example.
We, therefore, consider an interacting field theory. In standard field theory, there exist powerful principles, inherited from physics or mathematical consistency, to guide the choice of interactions, and the relevant domains of the theory space. In the absence of such a guide, we use the same simplicity argument already considered in [20,23]. We focus on purely local interactions of the form g i ϕ i 2 n , with fields interacting on the same point, with the same coupling constant. In that way, near to the Gaussian point our distribution p [ ϕ ] is suitably expanded as
p [ ϕ ] exp 1 2 i , j ϕ i C ˜ i j 1 ϕ j g 4 ! i ϕ i 4 + .
Note that, in our assumptions we kept only even interaction terms, ignoring for instance couplings like ϕ i 3 . This hypothesis is equivalent to assume the reflection symmetry (note that, truncating around quartic interactions, adding a term like i , j ϕ i 2 ϕ j 2 , which is invariant under to the rotational group O ( N ) , enlarges the discrete group Z 2 to the hypercubic symmetry. Thus, the purely local model is, with this respect, the less structured one.) ϕ i ϕ i . Finally, note that, in principle, C ˜ 1 C 1 . Indeed, what is known “empirically” is the full 2-point function C i j ; and the probability distribution must be:
d ϕ ϕ i ϕ j p [ ϕ ] = C i j .
and from perturbation theory
d ϕ ϕ i ϕ j p [ ϕ ] = C ˜ i j + O ( g ) .
Note that, from an information theory point of view, probability density (5) corresponds to the maximum entropy solution, compatible with constraint (5) and the existence of undetermined non-Gaussian correlations. From that point of view, the model is minimal in the sense that it carries the least possible structure, as stated in [20]. In that setting C i j = C ˜ i j only at first order, and when non-Gaussian contributions cannot be neglected, C ˜ i j receives quantum corrections, depending on the couplings in a non-trivial way. Inferring the Gaussian kernel C ˜ i j from the knowledge of C i j is a very challenging problem in field theory. In some approximation schemes however, relevant to extracting non-perturbative information about the behavior of the RG flow, this difficulty is not a limitation of our investigation. In the local potential approximation that we will consider in this paper for instance, we assume that C i j 1 and C ˜ i j 1 differ by a constant, C i j 1 = C ˜ i j 1 + k ; the constant k capturing all the quantum corrections. One would expect that an approximation works, essentially, in the region of small eigenvalues for C i j 1 , the IR regime in the field theoretical language. We hope that our methods can track the presence of a signal. We will return on this discussion in Section 3.1. A finer analysis would require more elaborate methods, beyond the scope of this paper. We are then able to construct an approximation of the RG flow, which is not autonomous due to the lack of dilatation invariance of the eigenvalue distribution for C i j 1 (see Section 3). Finally, let us mention a remark about the field theoretical embedding. In Section 3, we show that even for the MP law, the number of relevant interaction becomes arbitrarily large in the first 66 % of the smallest eigenvalues. This introduces an unconventional difficulty in field theory, which can be alternatively viewed as a limitation of the field theory approximation. The breaking down of the field theory up to a certain scale is not a novelty. It is well known for instance that the Ising model behaves like an effective ϕ 4 field theory like (3) in the vicinity of the ferromagnetic transition. Thus, a failure of the field theoretical approximation may be alternatively viewed as a signal that a more elementary description is required. Then, it is interesting to remark that the field theory considered in (5) may be essentially deduced from the Ising-like model:
p Ising ( { S } ) exp 1 2 S i C i j S j ,
where the S i = ± 1 are discrete Ising spins. Indeed, introducing N reals variables ϕ i , and using the standard Gaussian trick to rewrite the quadratic term in S i ;
p Ising ( { S } ) i d ϕ i exp 1 2 ϕ i C i j 1 ϕ j + S i ϕ i .
Thus, summing over { S i } configurations generates an effective i cosh ( ϕ i ) ; and expanding it in power of ϕ i reproduces the terms appearing in the local expansion in (5). The model described by (8) is reminiscent of the standard spin-glass models, as the Sherrington–Kirkpatrick model [54,55,56,57]. Its derivation, moreover, follows directly from the maximum entropy prescription with constraint (6) if we assume to work with discrete spins. We can alternatively see our model of field theory as coming from this binary model, derived from a principle of maximum entropy with fixed correlations [2,3,4].

2.2. The Model

In this section, we provide a mathematical definition of the field theoretical model that we consider, an extended discussion can be found in [20,23]. Following these references, we work in the eigenbasis of the matrix C i j 1 , more suitable for a coarse-graining approach. Note that, with our assumption, this eigenbasis is the same as the one for C ˜ i j 1 . In that way, the Gaussian (or kinetic) part of the classical action of p [ ϕ ] takes the form;
S kinetic [ ψ ] = 1 2 μ ψ μ λ μ ψ μ ,
where λ μ denote the eigenvalues of C ˜ i j 1 , labeled with the discrete index μ ; and the fields { ψ μ } are the eigen-components of the expansion of ϕ i along the normalized eigenbasis u i ( μ ) :
ψ μ = i ϕ i u i ( μ ) , j C ˜ i j 1 u j ( μ ) = λ μ u i ( μ ) .
Let then m 2 be the smallest eigenvalue. We introduce the positive definite quantities p μ 2 : = λ μ m 2 . This way, the kinetic action takes the form of a standard kinetic action in field theory:
S kinetic [ ψ ] = 1 2 μ ψ μ ( p μ 2 + m 2 ) ψ μ .
In the continuum limit, for N sufficiently large, it is suitable to use the empirical eigenvalue distribution χ ( λ ) and to replace sums by integrals. This distribution is empirically inferred from μ δ ( λ λ μ ) / N , and the distribution ρ ( p μ 2 ) for the momenta p μ 2 follows. Moreover, with our assumptions on C ˜ i j 1 , this distribution can be directly deduced from the spectrum of C i j 1 . Finally, for purely random matrices with i.i.d entries, the MP theorem states that the empirical distribution has to converge toward an analytic form that can be used to do exact computations.
It is, however, difficult to deal with interactions in this formalism. A simplification, already considered by the authors of [23] is to work with momentum-dependent field ψ ( p ) rather than with the field ψ μ (labeled with the positive quantity p μ 2 ). This can be motivated by the observation that the model (5) is no rather fundamental than any other model able to reproduce effective correlations in some approximation scheme. In local approximation, that we consider in this paper, the two formulations are expected to be in the same universality class when we define locality in momentum space as:
Definition 1.
An interaction is said to be local of order P if it involves P fields and if it is conservative, i.e., if it is of the form:
U [ ψ ] { p α } δ 0 , α = 1 P p α α = 1 P ψ ( p α ) .
where δ p , p denotes the standard Kronecker delta, equal to 1 for p = p and zero otherwise. By extension, we say that a functional U [ ψ ] is local if its expansion in power of ψ involves only local terms.
This definition follows the standard one in classical and quantum field theory, and it is moreover consistent with the idea that locality is defined ‘‘at contact point” in the original representation (5).

2.3. Wetterich–Morris Framework

Among the different incarnations of the Kadanoff–Wilson’s coarse-graining idea, the Wetterich–Morris (WM) framework has the advantage to be well suited to non-perturbative approximation methods [58,59]. Rather than Kadanoff–Wilson approach, which focuses on the effective classical action S k for IR modes below the scale k, the WM formalism focuses on the effective averaged action Γ k ; i.e., the effective action for integrated-out modes above the scale k. As recalled in the previous section, the fundamental ingredient to describe IR scales, when all degrees of freedom have been integrated out is the effective action Γ [ M ] , which is defined as the Legendre transform of the free energy W [ j ] (Equation (2)); the classical field M = { M μ } being defined as:
M μ = W [ j ] j μ .
The starting point of the WM formalism is to modify the classical action S [ ψ ] adding a scale dependents term Δ S k [ ψ ] , depending on a continuous index k running from k = Λ , for some fundamental UV scale Λ , to k = 0 . In such a way, we define a continuous family of models, described by a free energy W k [ j ] defined as:
W k [ j ] : = ln [ d ψ ] p [ ψ ] e Δ S k [ ψ ] + μ j ( p μ ) ψ ( p μ ) .
The regulator function Δ S k [ ψ ] behaves like a mass, whose value depends on the momentum scale:
Δ S k [ ψ ] = 1 2 μ ψ ( p μ ) r k ( p μ 2 ) ψ ( p μ ) .
The momenta scale r k ( p μ 2 ) provides an operational description of the Kadanoff–Wilson’s coarse-graining procedure, and it is chosen, such that:
  • r k = 0 ( p 2 ) = 0 p 2 , meaning that for k = 0 , W k W , all the fluctuations are integrated out;
  • r k = Λ ( p 2 ) 1 , meaning that in the deep UV, all fluctuations are frozen with a very large mass;
  • r k ( p 2 ) 0 for p 2 / k 2 < 1 , meaning that high energy modes with respect to the scale k 2 are essentially unaffected by the regulator. In contrast, low energy modes must have a large mass which decouples them from long-distance physics.
The two boundaries conditions ensure that we recover the effective descriptions, respectively, in the UV limit, where no fluctuations are integrated out, and in the deep IR where all the fluctuations are integrated out. In other words, we interpolate between the classical action S and the effective action Γ . This can be achieved by introducing the effective averaged action Γ k defined as:
Γ k [ M ] + W k [ j ] = μ j ( p μ ) M ( p μ ) 1 2 μ M ( p μ ) r k ( p μ 2 ) M ( p μ ) ,
such that Γ k = 0 Γ and, from the conditions on r k , Γ k = Λ S . The meaning of Γ k is illustrated in the Figure 3. Along the path from k = Λ to k = 0 , Γ k goes through the theory space, and the different coupling changes. The dynamics of the couplings can be deduced considering a small variation k k + d k , and we can show that Γ k obeys to the WM equation [58,59]:
Γ ˙ k = 1 2 μ r ˙ k ( p μ 2 ) Γ k ( 2 ) + r k μ , μ 1 .
This equation is the one that we will use in this paper to investigate RG flow for datasets. The dot notation Γ ˙ k represents the partial derivation of Γ k with respect to the scale k.

3. RG, from Theory to Numerical Investigations

In this section, we investigate the behavior of the RG flow, focusing on the evolution of the field expectation value and symmetry restoration aspects. However, since the functional space has infinite dimensions, solving the non-perturbative Equation (18) is a difficult task, requiring approximations that we discuss as well.
As a first approximation, we focus on the symmetric phase [60,61,62,63,64,65,66], which can be defined as the region of the whole phase space where it makes sense to expands the averaged effective action Γ k [ M ] in the power of M, around M = 0 . Regions where M = 0 become unstable vacua and the field expansion can be improved by an expansion around a non-zero vacuum. This works well in the local potential approximation (LPA), neglecting the momentum dependence of the classical field. Corrections to the strict LPA take the form of a perturbative expansion in the power of p 2 , called derivative expansion (this terminology is inherited from the standard field theory, where an expansion in the power of the momentum p 2 is nothing but an expansion in the power of Δ , the standard Laplacian in R d .) (DE). In this paper, we consider only the first terms in the derivative expansion, provided by the kinetic action contribution 1 2 p 2 M ( p ) M ( p ) to Γ k [ M ] . In strict LPA, the coefficient in front of p 2 (the field strength) remains equal to 1. A slight improvement to the LPA, called LPA takes into account the field strength flow Z ( k ) : 1 2 p 2 M ( p ) M ( p ) 1 2 Z ( k ) p 2 M ( p ) M ( p ) , so that the anomalous dimension does not vanish. We will consider both these approximations, showing explicitly that the corrections provided to LPA remain small into the range of scales that we consider, thus ensuring the validity of the LPA, as well the reliability of our conclusions.

3.1. Solving the Exact RG Equation into the Symmetric Phase

3.1.1. Generalities

As explained before, a truncation is generally required to solve the RG Equation (18). In other words, a truncation is nothing but an ansatz for Γ k , and, thus, a specific parametrization of a finite-dimensional region of the full phase space. The reliability of the method is, however, no guarantee in general, and a deep inspection is always required to validate the conclusions of the truncations. Generally, there are two main sources of shortcomings.
The first one comes from the choice of the regulator r k . Indeed, formally, the boundary conditions ensured by r k and Γ k are such that different choices for r k lead to different trajectories into the theory space, with the same boundary conditions Γ k = 0 = Γ . This formal device however does not survive the truncation procedure in general, and it is well known that a spurious dependence on the regulator appears for physically relevant quantities like critical exponents. The knowledge of exact results or exact relations enables in favorable cases, to improve the choice of the regulator. Some general considerations based on optimization criteria can be of some help in other cases [67,68,69]. For our purpose, since we essentially focus on the shape of the effective potential rather than on the specific value of a physical quantity, one expects that such dependence is not too relevant.
The second one is the choice of truncation. A general criterion is based on the relative relevance of the different ingredients entering in the definition of Γ k . In the worst case, the parametrization may conflict with exact relations, coming, for instance, from symmetries like Ward identities [60,61,62,63,64,65,66]. Once again, one expects that such a pathological effect is not likely to appear here.
In this section, we aim to focus on the symmetric phase, where Γ k is assumed to be well expanded in the power of M. With this assumption, it is suitable to write Γ k [ M ] = Γ k , kin [ M ] + U k [ M ] , where Γ k , kin [ M ] , the kinetic part, keeps only the quadratic terms in M and U k [ M ] , the potential, is expanded in power of M higher than 2. In the LPA, the potential U k [ M ] is a purely local function, in the sense of the Definition 1. Moreover, we assume that U k is an even function, i.e., that the symmetry M M holds. In contrast, Γ k , kin [ M ] , whose inverse propagates the local modes, may involve non-local contributions, and its generic parametrization reads as:
Γ k , kin [ M ] = 1 2 p M ( p ) ( Z ( k , p 2 ) p 2 + u 2 ( k ) ) M ( p ) ,
where Z ( k , p 2 ) expands in power of p 2 as Z ( k , p 2 ) = Z ( k ) + O ( p 2 ) . In this paper, we focus on the first order of the DE, keeping only the term of order ( p 2 ) 0 in the expansion of Z ( k , p 2 ) . In the symmetric phase moreover, assuming that U k [ x ] is an even function, the flow equation for Z ( k ) vanishes exactly. Thus, it is suitable to fix the normalization of fields, such that Z ( k ) = 1 k .
As explained in Section 2.1, the field theory framework that we consider is non-conventional in the sense that the full kinetic action is known in the deep IR, but not at the microscopic level. We, thus, have to infer the microscopic kinetic action from the IR regime. Such an inference problem is reputed to be a hard problem (Figure 4), especially because the coarse-graining is not invertible. A consequence of this is the large river effect [70]. Usually, when a sufficiently large number of degrees of freedom have been integrated out, all the RG trajectories converge toward a finite-dimensional region of the full phase space, spanned by relevant and marginal (by power counting) interactions. In other words, different microscopic physics may have the same effective behavior at sufficiently large scale, the difference, spanned by irrelevant (i.e., non-renormalizable) interactions falling below the experimental precision threshold on a large enough scale. Thus, the best compromise is an equivalence class of microscopic models, that are not distinguishable (up to some experimental precision) in the deep IR. This hard inference problem is simplified within the LPA, because the expression for the effective kinetic action differs only by the mass parameter u 2 ( k ) .
The derivation of the flow equations follows the general strategy [65]. Taking the second derivative of the Equation (18) with respect to M μ , we get:
Γ ˙ k , μ 1 μ 2 ( 2 ) = 1 2 μ r ˙ k ( p μ 2 ) G k , μ μ Γ k , μ μ μ 1 μ 2 ( 4 ) G k , μ μ .
The different terms involved in this expression can be explicitly derived from the truncation. Indeed, from:
Γ k [ M ] = 1 2 p M ( p ) ( p 2 + u 2 ( k ) ) M ( p ) + u 4 ( k ) 4 ! N { p i } δ i p i i = 1 4 M ( p i ) + u 6 ( k ) 6 ! N 2 { p i } δ i p i i = 1 6 M ( p i ) + O ( M 6 ) ,
we straightforwardly deduce that:
Γ k , μ 1 μ 2 ( 2 ) = δ p μ 1 , p μ 2 p μ 1 2 + u 2 ( k ) ,
and:
Γ k , μ 1 μ 2 μ 3 μ 4 ( 4 ) = g 4 ! N π δ 0 , p π ( μ 1 ) + p π ( μ 2 ) + p π ( μ 3 ) + p π ( μ 4 ) ,
where π denotes elements of the permutation group of four elements. Note that, the origin of the factors 1 / N and 1 / N 2 can be easily traced. As we will see below, the 1 / N in front of u 4 ensures that (20) can be rewritten as an integral in the large N limit, involving the effective distribution ρ ( p 2 ) . The 1 / N 2 in front of u 6 ensures that all the contributions to the flow of u 4 receive the same power in 1 / N . For the same reason, u 8 have to scale as 1 / N 3 and u 2 p as 1 / N p 1 . Finally, the division by 1 / ( 2 p ) ! ensures that the symmetry factors of the Feynman diagrams match exactly with the dimension of its own discrete symmetry group.
From (22), we easily deduce that
G k , μ μ = 1 p μ 2 + u 2 + r k ( p μ 2 ) δ p μ , p μ .
To compute the flow equation, we have to make a choice for the regulator. From the expected form of the propagator, it is suitable to chose the Litim regulator—which is optimized in the sense of [67,68]:
r k ( p μ 2 ) = ( k 2 p μ 2 ) θ ( k 2 p μ 2 ) ,
where θ ( x ) denotes the standard Heaviside function. The flow equation for u 2 follows:
u ˙ 2 = 1 2 N 2 k 2 ( k 2 + u 2 ) 2 μ θ ( k 2 p μ 2 ) Γ k , μ μ μ 1 μ 1 ( 4 ) | p μ 1 = 0 .
In the large N limit, it is suitable to convert the sum as an integration, following [23]. For power law distributions ρ ( p 2 ) = ( p 2 ) α , the resulting equations are exactly the same as for standard field theory in dimension d, for which ρ ( p 2 ) = ( p 2 ) d / 2 1 . The RG proceed usually in two steps. As a first step we integrate degrees of freedom into some range of momenta p [ s Λ , Λ ] ( s < 1 ), providing a change of cut-off Λ s Λ . The second step is a global dilatation p p / s , ensuring that the original UV cut-off Λ is restored (see Figure 5).
The shape of a power-law distribution is globally invariant to such transformations. The consequence is the existence of a global scale-dependent rescaling u p k d p u ¯ p for all couplings, such that flow equations become autonomous. In that equation, d p is the scaling (or canonical) dimension for u p . The distribution that we consider in this paper, like MP law, do not enjoy this shape invariance property, and the flow equation never look as an autonomous system. The best compromise that we can do is a local definition of the canonical dimension, as in [23] with the example of quartic coupling. Here, we reproduce some parts of this analysis, providing a deeper investigation of the local scaling dimensions for higher couplings.

3.1.2. Flow Equations, Scaling and Dimension

Because of the asymptotic nature for u 2 , it is suitable to assume that it must scale as k 2 , and following [23], we define the dimensionless mass as u ¯ 2 = k 2 u 2 . Thus, without assumptions on the distribution ρ we get:
u ¯ ˙ 2 = 2 u ¯ 2 2 u 4 ( 1 + u ¯ 2 ) 2 1 k 2 0 k ρ ( p 2 ) p d p ,
with the notation X ˙ = k d X / d k . For a power law distribution, L : = 0 k ρ ( p 2 ) p d p equals L = k 2 α + 2 / ( 2 α + 2 ) ; therefore
d ln ( L ) = ( 2 α + 2 ) d ln ( k ) .
The variation of the loop integral is proportional to the variation of the time flow t = ln ( k ) . This is why the parameter t is as well relevant for ordinary QFT. For ρ being not a power law however, it is suitable to use the time τ defined as d τ : = d L . In this parametrization we get straightforwardly:
d u ¯ 2 d τ = 2 d t d τ u ¯ 2 2 u 4 ( 1 + u ¯ 2 ) 2 ρ ( k 2 ) k 2 d t d τ 2 ,
and we define the τ -dimension for u 2
dim τ ( u 2 ) = 2 d t d τ .
The τ -dimension for u 4 can be defined in the same way,
u 4 ρ ( k 2 ) k 2 d t d τ 2 = : u ¯ 4 ,
ensuring that the non autonomous character of the flow is entirely contained in the linear term of the flow equations. We obtain finally:
d u ¯ 2 d τ = 2 d t d τ u ¯ 2 2 u ¯ 4 ( 1 + u ¯ 2 ) 2 .
For the coupling u 4 , taking the fourth derivative of the flow Equation (18) and excluding the odd functions which vanish, we get:
d u 4 d τ = 2 u 6 ( 1 + u ¯ 2 ) 2 ρ ( k 2 ) d t d τ 2 + 12 u 4 2 ( 1 + u ¯ 2 ) 3 ρ ( k 2 ) k 2 d t d τ 2 .
Thus, rescaling u 6 in such a way that only the linear term in u ¯ 4 is scale-dependent enforces the definition:
u 6 k 2 ρ ( k 2 ) k 2 d t d τ 2 2 = : u ¯ 6 .
Therefore:
d u ¯ 4 d τ = dim τ ( u 4 ) u ¯ 4 2 u ¯ 6 ( 1 + u ¯ 2 ) 2 + 12 u ¯ 4 2 ( 1 + u ¯ 2 ) 3 ,
where:
dim τ ( u 4 ) : = 2 t t + t 1 2 d ln ρ d t 1 ,
denoting as X for d X / d τ . Finally, we get for u 6 :
u ¯ 6 = dim τ ( u 6 ) u ¯ 6 + 60 u ¯ 4 u ¯ 6 ( 1 + u ¯ 2 ) 3 108 u ¯ 6 3 ( 1 + u ¯ 2 ) 4 ;
where:
dim τ ( u 6 ) : = 2 d t d τ + 4 t t + t 1 2 d ln ρ d t 1 .
In the same way, we get for u 2 p :
dim τ ( u 2 p ) = 2 ( p 2 ) d t d τ ( p 1 ) dim τ ( u 4 ) .

3.1.3. Analytical Noisy Data, MP Distribution

For our numerical investigations we need to keep control of the size of the signal and numerical approximations. To this end, we consider deformations around a model of noise. We focus on the MP law, which has the double advantage to be a familiar model of noise and to be analytic. For X N × P matrix with i.i.d. entries and variance σ 2 < , the MP distribution μ ( x ) gives the spectrum of the correlation matrix Z : = X T X P for both N , P but P / N = : K remains finite [43]. Explicitly:
μ ( x ) = 1 2 π σ 2 ( a + x ) ( x a ) K x ,
where a ± = σ 2 ( 1 ± K ) 2 . The distribution ρ for eigenvalues of the inverse matrix can be easily deduced from (40). Figure 6 provides a picture of the numerical flow for a quartic truncation. Interestingly, the behavior of the RG flow looks very close to the familiar flow for ϕ 4 theory in dimension d < 4 . In particular, we show the existence of two regions, one in which the flow goes toward positive mass and the second one toward the negative mass. Usually, this splitting is governed by a fixed point, the Wilson–Fisher fixed point. Even though we have no true fixed point, in this case, we show that an analogous effect appears, the role of the fixed point being played by an extended attractive region that we call pseudo-fixed point.
In Figure 7 we plotted the canonical dimensions of the couplings up to p = 5 , for K = 1 and σ = 0.5 , 1 and 2, respectively. This is the property announced in Section 2.1. In the deep UV sector, i.e., in the domain of very small eigenvalues, the canonical dimension is positive for an arbitrarily large number of interactions. In the RG language, this means that an arbitrarily large number of operators are relevant toward the IR scales, and the description of the flow becomes very difficult, requiring to consider very large truncations in a very small range of scales. In contrast, up to a scale, Λ 0 ( σ ) , defined such that:
d t d τ 3 4 dim τ ( u 4 ) t = ln ( Λ 0 ) = 0 ,
only the local couplings u 4 and u 6 are relevant. Numerically, this point is reached in the vicinity of the eigenvalue λ λ 0 / 3 , λ 0 denoting the largest eigenvalue of the analytic spectrum. We, thus, have essentially revealed the existence of two regions: the deep noisy region (DNR), for p 2 > Λ 0 , where the number of relevant operators and their respective canonical dimensions increases arbitrarily, and the learnable region (LR) for p 2 < Λ 0 , where only two couplings are relevant and standard field theoretical methods are expected to work. In this paper we only focus on this region; using RG to track the presence of a signal. Figure 8 shows numerical evolution of couplings u 2 , u 4 , and u 6 , starting the RG flow from k = Λ 0 . Note, finally, that the behavior of the canonical dimension can be expected from the small p behavior of the MP law. Indeed for small p, ρ ( p 2 ) α with α = 1 / 2 . Following the dimensional analysis in [23], the corresponding canonical dimension for the local couplings u 2 p must be dim t ( u 2 p ) = 2 ( 1 ( p 1 ) α ) , and, thus, interactions are irrelevant for p > 3 . The asymptotic behavior of the distribution provides, therefore, a first indication of the relevant interactions in the asymptotic region, and we call critical dimension the corresponding value for α .

3.1.4. A First Look on Numerical Investigations

After the previous analytic observations, we provide in this section a first inspection on numerical aspects for non-analytic signals, as illustrated in Figure 9.
To keep control on the strength of the signal, we focus on two kind of eigenvalue distributions for our numerical investigations. The first one is a model of noise, that we call NMP (numerical Marchenko–Pastur). It is obtained by constructing the covariance matrix for a N × P matrix with i.i.d. random entries and variance equal to 1. The distribution of the eigenvalues of such matrix converges, for large P and N, to the MP’s law, and we set P = 1500 and N = 2000 for all our simulations that we discuss in this section. The second distribution that we call DNMP (disturbed numerical Marchenko–Pastur) is obtained from the first one by adding the spikes associated to a matrix of rank R = 65 (defining the size of the signal). The variance being fixed to 1, the canonical dimensions for the purely noisy data are given by Figure 7. We focus on the learnable region (LR), for eigenvalues between 2.5 and 3.4 , where only the ϕ 4 and ϕ 6 interactions are relevant.
In [23], the authors pointed out that the canonical dimension of relevant operators decreases with the presence of a signal in the LR, and in particular for strong enough signal, [ g ] becomes negative. Therefore, we expect the existence of a sufficiently small neighbourhood near the Gaussian fixed point where the field theory goes toward an asymptotic Gaussian behavior, arbitrarily close to the mass axis. This heuristic behavior based on dimensional considerations illustrates how the presence of the signal can affect the equivalence class of asymptotic states. Here, we are aiming to go beyond these dimensional considerations, and investigate non-perturbative effects with regard to the field expectation value.
Figure 10 shows the RG flow for the NMP law disturbed by a signal materialized with discrete spikes. Comparing with the purely MP law (Figure 6), we show that the pseudo-fixed point moved toward the Gaussian fixed point. This illustrate how RG may be used to track the presence of a signal. Indeed, the (pseudo-)fixed point controls the critical behavior. If its position changes, one expects that IR physics may be affected for some initial conditions. Among these IR properties, we focus in this paper on the field vacuum expectation value. In the truncation that we considered, and neglecting the momentum dependence of the classical field, the effective potential writes as a sixtic polynomial:
U ( m , { u 2 n } ) = 1 2 u 2 m 2 + u 4 4 ! m 4 + u 6 6 ! m 6 ,
up to the rescaling M = : N m . The classical configuration is such that U / m = 0 , and it depends on the values and on the signs of the different couplings. We focus on the sixtic truncations, and in the region u 6 > 0 , ensuring integrability. Under these conditions, we investigate, in the vicinity of the Gaussian fixed point, the set of initial conditions ending in the symmetric phase, i.e., such that the values of the couplings ensure m = 0 . The set of these points takes the form of a compact region R 0 around the Gaussian point.
Figure 11 shows this compact region R 0 for a purely noisy data (on the top) and for a small perturbation of the previous one with a multi-spike signal (on the bottom). Figure 12 illustrates what happens in regard to the shape of the effective potential. On the left, we show the evolution of the effective potential in the purple region for a purely noisy data. On the right, we present the evolution of the potential for the same initial conditions, when the noise is disturbed with some spikes.
This observation highlights a strong equivalence between the presence of a signal and the lack of symmetry restoration for some RG trajectories. However, a moment of reflection shows that region R 0 is too large, and that physically relevant states have to be researched as a compact subset ε 0 of this region constructed as the intersection of R 0 with the constraint imposed to the probability distribution in the IR. The relevant one in the LPA is that the mass u 2 (interpreted as the largest eigenvalue) remains finite, implying that u ¯ 2 scales as k 2 for small k. Since the separation between two eigenvalues is of order 1 / N , one expects that the smallest value for k 2 is ∼1/N. Figure 13 shows that trajectories satisfying this requirement exist in R 0 , and ε 0 .
The existence of a non-vanishing set of physically relevant states grants the possibility to propose the following scenario. We showed that the size of the region R 0 decreases due to the presence of a signal in the spectrum. However, as long as this collapse does not affect the subset ε 0 , the presence of the signal has no relevant consequences with respect to the physical states, at least concerning the expectation value of the field. Our observations suggest that this is happening only for a signal which is large enough, thus providing evidence in favor of the existence of an intrinsic detection threshold working with the expectation value. We do not address the issue of the precise determination of the shape of this subset ε 0 , which is the purpose of the companion paper [24].

3.2. Venturing into the Non-Symmetric Phase

3.2.1. LPA and LPA

In this section, we consider the LPA and its improved version LPA . This way, our assumptions about Γ k , kin (Equation (19)) hold, but we include the mass contribution into the local potential U k [ M ] . Moreover, we neglect the momentum dependence of the classical field M ( p ) , dominated by the zero-momentum (large scale) value:
M ( p ) M δ p 0 .
This approximation usually holds in the IR region, which is exactly what we consider. Moreover, it is not hard to show that such an expansion around M = 0 reproduces exactly the same equations as the truncation (21) for local operators (i.e., neglecting the momentum dependence of the effective vertices Γ k ( 2 p ) ). This approximation works well at large scale, where a symmetry breaking scenario is expected, requiring an expansion around a non-vanishing vacuum M 0 . For this reason, we consider the following parametrization:
U k [ χ ] = u 4 ( k ) 2 ! χ κ ( k ) 2 + u 6 ( k ) 3 ! χ κ ( k ) 3 + ,
where χ : = M 2 / 2 , and κ ( k ) is the running vacuum. The global normalization is such that, for M 0 ( p ) = M δ p 0 , Γ k [ M = M 0 ] = N U k [ χ ] . The 2-point vertex Γ k ( 2 ) moreover is defined as:
Γ k , μ μ ( 2 ) = Z ( k ) p 2 + 2 U k M 2 δ p μ , p μ ,
and, thus, replaces the formula (22), the role of the mass being played by the second derivative of the potential. The flow equation for U k can be deduced from (18), setting M = M 0 on both sides. Assuming, once again, that N is large and using the continuum setting, we get:
U ˙ k [ M ] = 1 2 p d p k k ( r k ( p 2 ) ) ρ ( p 2 ) 1 Γ k ( 2 ) + r k ( p , p ) .
Note that in the definition (45) we introduced the anomalous dimension Z ( k ) , which has a non-vanishing flow equation for κ 0 . To take into account the non vanishing flow for Z, it is suitable to slightly modify the Litim regulator as:
r k ( p 2 ) = Z ( k ) ( k 2 p 2 ) θ ( k 2 p 2 ) .
This modification simplifies the computation of the integrals [67,68]. In the computation of the flow equations, however it is suitable to rescale the dimensionless couplings u ¯ 2 p Z p u ¯ 2 p , such that the coefficient in front of p 2 in the kinetic action remains equal to 1. This additional rescaling adds a term n η ( k ) in the flow equation, where η , the anomalous dimension is defined as:
η ( k ) = Z ˙ ( k ) Z ( k ) .
Despite the fact that it simplifies the computation, the factor Z in front of the regulator (47) must not affect the boundary conditions Γ k = S and Γ k = 0 Γ . In particular, the first one requires that r k 1 k r , for positive r. This is obviously the case for Z = 1 , r k 1 k 2 . However, it is possible for Z to break this condition. This may be the case, for instance, if the flow reaches a fixed point p. At this point, the anomalous dimension takes a value η p , thus Z ( k ) = k η p and r k 1 k 2 + η p . The requirement r > 0 then imposes η p > 2 . Obviously, this is a limitation of the regulator, not of the method. Moreover, the non-autonomous nature of the RG equation prevents the existence of exact fixed points, so that the criteria should be more finely defined. Generally, one expects that the LPA approximation makes sense only in regimes where η is not so large, and becomes spurious in regime where | η | 1 [71].
RG equation for η = 0
As a first approximation, standard LPA sets Z ( k ) = 1 , or equivalently η = 0 . From (46), we arrive to the expression:
U ˙ k [ χ ] = 2 0 k ρ ( p 2 ) p d p k 2 k 2 + χ U k ( χ ) + 2 χ χ 2 U k ( χ ) .
Introducing the flow parameter τ defined in Section 3.1, we get:
U k [ χ ] = k 2 ρ ( k 2 ) d t d τ 2 k 2 k 2 + χ U k ( χ ) + 2 χ χ 2 U k ( χ ) ,
First, we define the scaling of the effective potential as:
χ U k ( χ ) k 2 = χ ¯ U ¯ k ( χ ¯ ) , χ χ 2 U k ( χ ) k 2 = χ ¯ χ ¯ 2 U ¯ k ( χ ¯ ) ,
therefore:
U k [ χ ] = d t d τ 2 k 2 ρ ( k 2 ) 1 + χ ¯ U ¯ k ( χ ¯ ) + 2 χ ¯ χ ¯ 2 U ¯ k ( χ ¯ )
The Equation (51) fixes the relative scaling of U k and χ . The previous relation moreover fixes the absolute scaling (the word “absolute” simply means that all the flow equations remain invariant) under a global reparametrization. This property, moreover, can be read directly in the partition function, and it reflects the invariance of the path integral measure of U k :
U k [ χ ] : = U ¯ k [ χ ¯ ] k 2 ρ ( k 2 ) d t d τ 2 .
In order to find the appropriate rescaling for χ , we introduce a scale dependent factor A, and define χ ¯ as χ = A χ ¯ . From global coherence, χ ¯ has to be such that:
U k [ χ ] : = U ¯ k [ A 1 χ ] k 2 ρ ( k 2 ) d t d τ 2 .
Therefore, expanding in power of χ , we find that the linear term becomes:
χ U k ( χ = 0 ) χ = χ ¯ U ¯ k [ χ ¯ = 0 ] χ ¯ k 2 ρ ( k 2 ) d t d τ 2 ,
or, from (51):
χ U k ( χ = 0 ) χ = χ U k ( χ = 0 ) χ A 1 ρ ( k 2 ) d t d τ 2 .
Then, assuming χ U k ( χ = 0 ) χ 0 , we get:
A = ρ ( k 2 ) d t d τ 2 ,
and:
χ = ρ ( k 2 ) d t d τ 2 χ ¯ .
This equation, obviously fixes the dimension of κ which must be the same as χ . The flow equations for the different couplings must be derived from definition:
U k χ | χ = κ = 0 ,
2 U k χ 2 | χ = κ = u 4 ( k ) ,
3 U k χ 3 | χ = κ = u 6 ( k ) .
The first equation is nothing but the mathematical translation of the requirement that the expansion is made around a local minimum. The two other equations are consequence of the parametrization of U k . In order to derive the flow equations for dimensionless couplings, it is suitable to work with a flow equation at fixed χ ¯ rather than fixed χ :
U k [ χ ] = ρ ( k 2 ) d t d τ 2 U ¯ k [ χ ¯ ] + dim τ ( U k ) U ¯ k [ χ ¯ ] dim τ ( χ ) χ ¯ χ ¯ U ¯ k [ χ ¯ ] ,
where dim τ ( U k ) and dim τ ( χ ) denote, respectively, the canonical dimension of U k and χ , respectively. To compute them, we return on their definitions, explicitly:
dim τ ( U k ) = t d d t ln k 2 ρ ( k 2 ) d t d τ 2 ,
and
dim τ ( χ ) = t d d t ln ρ ( k 2 ) d t d τ 2 .
The final expression for the effective potential RG equation then becomes:
U ¯ k [ χ ¯ ] = dim τ ( U k ) U ¯ k [ χ ¯ ] + dim τ ( χ ) χ ¯ χ ¯ U ¯ k [ χ ¯ ] + 1 1 + χ ¯ U ¯ k ( χ ¯ ) + 2 χ ¯ χ ¯ 2 U ¯ k ( χ ¯ ) .
The next steps are standard. From the definition (59) we must have χ ¯ U ¯ k [ χ ¯ = κ ¯ ] = u ¯ 4 κ ¯ . Thus, taking the second derivative of (65), we get for κ ¯ :
κ ¯ = dim τ ( χ ) κ ¯ + 2 3 + 2 κ ¯ u ¯ 6 u ¯ 4 ( 1 + 2 κ ¯ u ¯ 4 ) 2
In the same way, taking second and third derivatives, and from the conditions (60) and (61), we get:
u ¯ 4 = dim τ ( u 4 ) u ¯ 4 + dim τ ( χ ) κ ¯ u ¯ 6 10 u ¯ 6 ( 1 + 2 κ ¯ u ¯ 4 ) 2 + 4 ( 3 u ¯ 4 + 2 κ ¯ u ¯ 6 ) 2 ( 1 + 2 κ ¯ u ¯ 4 ) 3 ,
and
u ¯ 6 = dim ( u 6 ) u ¯ 6 12 ( 3 u ¯ 4 + 2 κ ¯ u ¯ 6 ) 3 ( 1 + 2 κ ¯ u ¯ 4 ) 4 + 40 u ¯ 6 3 u ¯ 4 + 2 κ ¯ u ¯ 6 ( 1 + 2 κ ¯ u ¯ 4 ) 3 .
The flow equation for η
We now assume that η ( k ) 0 . From definition, assuming that Z depends only on the value of the vacuum, we must have:
Z [ M = κ ] d d p 2 Γ k ( 2 ) ( p , p ) | M = 2 κ .
Therefore:
η ( k ) : = 1 Z k d Z d k = 1 Z d d p 2 Γ ˙ k ( 2 ) ( p , p ) .
The flow equation for Γ k ( 2 ) can be deduced from (18), taking the second derivative with respect to the classical field. Due to the fact that, the effective vertex are momentum independent, in the LPA representation, the contributions involving Γ k ( 4 ) have to be discarded from the flow equation for Z. Finally:
Z ˙ : = ( Γ k , 000 ( 3 ) ) 2 d d p 2 q r ˙ k ( q 2 ) G 2 ( q 2 ) G ( ( q + p ) 2 ) | M = 2 κ , p = 0 ,
where, according to LPA, we evaluate the right hand side over uniform configurations. Therefore, G ( p , p ) = : G ( p ) δ ( p + p ) is the inverse of Γ k ( 2 ) ( p , p ) + r k ( p 2 ) δ ( p + p ) , with Γ k ( 2 ) given by equation (45). The expression of Γ k , 000 ( 3 ) can be easily obtained; taking the third derivative of the effective potential for M:
Γ k , 000 ( 3 ) = 3 u 4 2 κ + u 6 ( 2 κ ) 3 / 2 .
We arrive to the following expression for anomalous dimension (see Appendix A):
η ( k ) = 2 ( t ) 2 ( 3 2 κ ¯ u ¯ 4 + ( 2 κ ¯ ) 3 / 2 u ¯ 6 ) 2 ( 1 + 2 κ ¯ u ¯ 4 ) 4 .
Note that, to derive this expression we have to take into account that the additional rescaling coming from Z accordingly to the requirement that the coefficient in front of p 2 in the kinetic action remains equals to 1. Requiring κ ¯ Z 1 κ ¯ with respect to the strict LPA definition. Due to the factors Z in the definition of barred quantities, η ( k ) appears in the flow equations. The net result is a translation of canonical dimensions
dim τ ( u 2 n ) dim τ ( u 2 n ) n d t d τ η ( k )
in the equations obtained within strict LPA.

3.2.2. Numerical Investigations

The main goal in this section is to show that the general behavior that we observed for the DE in the symmetric phase holds using the LPA formalism, expanding around a non-zero vacuum. Figure 14 shows the existence of some RG trajectories for which the symmetry is restored within the range where the eigenvalues are between 2.5 and 3.4 (corresponding to the range where only the ϕ 4 and ϕ 6 interactions are relevant for the MP distribution with σ = 1 and K = 0.75 ). This is manifested by the fact that κ decreases to zero. We also show in the same figure that there are other RG trajectories which do not allow a restoration of the symmetry. Once again, we can identify a set of initial conditions in the vicinity of the Gaussian fixed point where symmetry is always restored in the deep IR. Furthermore, we show that there are initial coupling conditions that are of great interest for signal detection. In fact, for these initial conditions, we have a restoration of the symmetry when we consider data without signal and, conversely, we do not have such restoration when we add the signal in the data. This is illustrated in Figure 15 in the form of potentials for a specific initial coupling condition. Finally, we emphasize that there is no significant change in this general behavior when we apply the LPA’ representation instead of the LPA one, i.e., when we take into account the non-zero anomalous dimension ( η ) in the formalism. Indeed, we show in Figure 16, that this anomalous dimension remains very small for the range of eigenvalues that we consider. This moreover is expected to be a good indication of the convergence of the derivative expansion [71], which improves the reliability of our conclusions.

4. Concluding Remarks and Open Issues

Let us summarize our investigations in this paper:
  • In order to keep control on the size of the signal and numerical approximations, we constructed datasets as perturbations around the MP law. We showed that the field theory approximation works well up to some scale Λ 0 . From this scale, the relevant sector, spanned by relevant couplings, diverges (its dimension becomes arbitrarily large, and couplings have arbitrary large dimension), and we expect that standard approximation fails up to this scale.
  • Above the scale Λ 0 , and focusing on the local interactions, the relevant sector has dimension 2, spanned by ϕ 4 and ϕ 6 interactions, in agreement with a naive power counting based on the critical dimension α = 1 / 2 of the MP law;
  • For MP distribution, we showed the existence of a compact region R 0 in the vicinity of the Gaussian fixed point, whose RG trajectories end in the symmetric region, and thus are compatible with symmetry restoration scenario;
  • Disturbing the MP spectrum with a strong enough signal reduces the size of this compact region, continuously deforming the effective potential from a symmetric toward a broken shape. In that picture, the role played by the signal strength is reminiscent of the role played by the inverse temperature β : = 1 / T in the physics of phase transition;
  • Finally, considering intersection ε 0 between R 0 and the physical conditions imposed to the IR 2-point function, we provided evidence in favor of the existence of an intrinsic detection threshold in the LPA approximation. This region is fully investigated in the companion paper [24].
These conclusions have to be completed by some important remarks concerning the different approximations that we did, and by perspectives for forthcoming works.
The first one is about the approximation procedure used to solve the RG Equation (18). Indeed, despite the limitations of the field theory approximation, the standard recipes to solve RG equations present limitations. In particular, the LPA neglects the momentum dependence of the coupling (i.e., deviations from the strict local approximation). We have no doubts that such an approximation makes sense in the deep IR regime, at the tail of the spectrum, where momenta are weak, and non-local interactions appear less relevant than local ones. However, as we explore the small eigenvalue scales, the effect of derivative couplings can no longer be neglected. As long as these terms can be treated as corrections, it is expected that our conclusions will not change significantly. However, these corrections could play a role in the estimation of the detection criterion. Note that, in regimes where momenta take large values and DE breaks down, other approximation schemes exist, enabling to keep the full dependence of the effective vertices. The most popular being the so-called Blaizot–Mendez–Wschebor (BMW) method [21,22,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51], which, combined with exact relations such as Ward identities, gives the possibility to provide exact (i.e., scheme independent) results [61]. Another current source of disagreement concerns the choice of the regulator. However, our conclusions are based on the behavior of the effective potential rather than on a specific value of a physically relevant quantity as a critical exponent. We thus expect them to be solid with respect to this issue [62,63,64,72,72].
The other source of approximation is the theoretical embedding. We showed that such an embedding offers a satisfactory description only for a small eigenvalue region. As we pointed out, such a limitation is not a novelty in physics, and it may be the sign that a more fundamental description has to replace the field theory approximation. Equation (8), involving discrete spins, provides an example of such a description. Note that, conversely, our field theory can be viewed as an effective description of such a binary model described with a constrained maximum entropy distribution. Finally, we focused on equilibrium, i.e., maximum entropy states, although we have adopted the standard field theory view of the RG. With this respect, let us mention the interesting possibility to view the exact RG equations as a form of entropy dynamics [40].
Our results focused on a specific model of noise, closer to the analytical MP law. Obviously, this is far from exhausting the large diversity of models. One might expect our conclusions to be much more general, and that they could be a universal property of all statistical noise models. This conjecture however has to be supported by deeper investigations, and we plan to address this issue for other models of noise. A first step in this direction has also been done recently for data materialized by random tensors rather than matrices [25], as considered in the topic of tensorial PCA [73].

Author Contributions

Conceptualization, V.L. and D.O.S.; Data curation, M.T.; Formal analysis, V.L. and D.O.S.; Funding acquisition, M.T.; Investigation, V.L., D.O.S. and M.T.; Methodology, D.O.S. and M.T.; Project administration, V.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not Applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Derivation of the Anomalous Dimension

In this appendix, we provide the main steps of the derivation of the anomalous dimension (73).
Using the modified Litim regulator, we get:
r ˙ k ( p 2 ) = η ( k ) r k ( p 2 ) + 2 Z k 2 θ ( k 2 p 2 ) ,
and
d d p 2 r k ( p 2 ) = Z θ ( k 2 p 2 ) .
In the improved LPA, the diagonal components of the effective propagator take the form:
G ( p 2 ) = 1 Z p 2 + Z ( k 2 p 2 ) θ ( k 2 p 2 ) + M 2 ( g , h , κ ) ,
where M 2 denotes the effective mass, i.e., the second derivative of the effective action. Therefore, we have to compute integrals like
I n ( k , p ) = k k ρ ( q 2 ) q ( q 2 ) n d q G ( ( p + q ) 2 ) .
We focus on small and positive p. The integral decomposes as I n ( k , p ) = I n ( + ) ( k , p ) + I n ( ) ( k , p ) , where:
I n ( ± ) ( k , p ) = ± 0 ± k ρ ( q 2 ) q ( q 2 ) n d q G ( ( p + q ) 2 ) .
Since p > 0 , in the negative branch ( q + p ) 2 < k 2 , and:
I n ( ) ( k , p ) = 1 Z k 2 + M 2 × k 0 ρ ( q 2 ) q ( q 2 ) n d q ,
which is independent of p. In the positive branch, in contrast:
I n ( + ) ( k , p ) = 1 Z k 2 + M 2 0 k p ρ ( q 2 ) q ( q 2 ) n d q + k p k ρ ( q 2 ) q ( q 2 ) n d q 1 Z ( q + p ) 2 + M 2 .
Taking the first derivative with respect to p, we get:
d d p I n ( + ) ( k , p ) = 1 Z k 2 + M 2 ρ ( q 2 ) q ( q 2 ) n | q = k p + ρ ( q 2 ) q ( q 2 ) n d q 1 Z ( q + p ) 2 + M 2 | q = k p 2 Z k p k ρ ( q 2 ) q ( q 2 ) n d q ( q + p ) ( Z ( q + p ) 2 + M 2 ) 2 .
The first two terms cancels exactly, and then:
d d p I n ( + ) ( k , 0 ) = 2 Z k p k ρ ( q 2 ) q ( q 2 ) n d q ( q + p ) ( Z ( q + p ) 2 + M 2 ) 2 .
Finally, taking second derivative and setting p = 0 , we get:
1 2 d 2 d p 2 I n ( k , 0 ) = Z ρ ( k 2 ) ( k 2 ) n + 1 ( Z k 2 + M 2 ) 2 = : I n ( k , 0 ) .
Therefore:
Z η ( k ) = ( 3 u 4 2 κ + u 6 ( 2 κ ) 3 / 2 ) 2 ( Z k 2 + M 2 ) 2 ( 2 Z k 2 I 0 ( k , 0 ) + Z η ( k ) ( k 2 I 0 ( k , 0 ) I 1 ( k , 0 ) ) ) .
In order to introduce τ -dimensionless quantities, we remark that both u 4 κ and u 6 κ 2 have the same scaling dimension. Finally, using renormalized and dimensionless quantities u ¯ 2 p , and replacing the effective mass by its value:
M ¯ 2 = χ ¯ U ¯ k ( κ ¯ ) + 2 κ ¯ χ ¯ 2 U ¯ k ( κ ¯ ) = 2 κ ¯ u ¯ 4 ,
we arrive to the expression (73).

References

  1. Feynman, R.P. Statistical Mechanics; Advanced Book Classics; Westview Press: Nashville, TN, USA, 1998. [Google Scholar]
  2. Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev. 1957, 106, 620–630. [Google Scholar] [CrossRef]
  3. Jaynes, E.T. Information theory and statistical mechanics II. Phys. Rev. 1957, 108, 171–190. [Google Scholar] [CrossRef]
  4. Jaynes, E.T. Gibbs vs. Boltzmann Entropies. Am. J. Phys. 1965, 33, 391. [Google Scholar] [CrossRef]
  5. Wilson, K.G. Renormalization group and critical phenomena. 1. Renormalization group and the Kadanoff scaling picture. Phys. Rev. B 1971, 4, 3174. [Google Scholar] [CrossRef] [Green Version]
  6. Wilson, K.G. Renormalization group and critical phenomena. 2. Phase space cell analysis of critical behavior. Phys. Rev. B 1971, 4, 3184. [Google Scholar] [CrossRef] [Green Version]
  7. Hu, B. Introduction to Real Space Renormalization Group Methods in Critical and Chaotic Phenomena. Phys. Rep. 1982, 91, 233. [Google Scholar] [CrossRef]
  8. Kadanoff, L.P. Lectures on the Application of Renormalization Group Techniques to Quarks and Strings. Rev. Mod. Phys. 1977, 49, 267. [Google Scholar] [CrossRef] [Green Version]
  9. Kadanoff, L.P. Variational Principles and Approximate Renormalization Group Calculations. Phys. Rev. Lett. 1975, 34, 1005. [Google Scholar] [CrossRef] [Green Version]
  10. Zinn-Justin, J. From Random Walks to Random Matrices; Oxford University Press: Oxford, UK, 2019. [Google Scholar]
  11. Zinn-Justin, J. Quantum Field Theory and Critical Phenomena; Oxford University Press: Oxford, UK, 2002. [Google Scholar]
  12. Yeo, J.H.; Moore, M.A. Renormalization group analysis of the M-p-spin glass model with p = 3 and M = 3. Phys. Rev. B 2012, 85, 100405. [Google Scholar] [CrossRef] [Green Version]
  13. Castellana, M.; Parisi, G. Renormalization group computation of the critical exponents of hierarchical spin glasses. Phys. Rev. E 2010, 82, 040105. [Google Scholar] [CrossRef] [Green Version]
  14. Pezzella, U.; Coniglio, A. Spin glasses and frustrated percolation: A renormalization group approach. Physics A 1997, 237, 353. [Google Scholar] [CrossRef]
  15. Dotsenko, V.S. Towards A Renormalization Group Theory of Spin Glasses. IC/87/154. Available online: https://inis.iaea.org/search/search.aspx?orig_q=RN:19033970 (accessed on 8 August 2021).
  16. Collet, P.; Eckmann, J.P.; Glaser, V.J.; Martin, A. A spin glass with random couplings. J. Statist. Phys. 1984, 36, 89. [Google Scholar] [CrossRef] [Green Version]
  17. Kadanoff, L.P. Static Phenomena Near Critical Points: Theory and Experiment. Rev. Mod. Phys. 1967, 39, 395. [Google Scholar] [CrossRef]
  18. Kenneth, G.W. The renormalization group and critical phenomena. Rev. Mod. Phys. 1982, 55, 3. [Google Scholar]
  19. Sokal, A.D.; van Enter, A.C.D.; Fernandez, R. Regularity properties and pathologies of position space renormalization group transformations: Scope and limitations of Gibbsian theory. J. Statist. Phys. 1994, 72, 879. [Google Scholar] [CrossRef] [Green Version]
  20. Bradde, S.; Bialek, W. PCA meets RG. J. Stat. Phys. 2017, 167, 462–475. [Google Scholar] [CrossRef]
  21. Meshulam, L.; Gauthier, J.L.; Brody, C.D.; Tank, D.W.; Bialek, W. Coarse–graining and hints of scaling in a population of 1000+ neurons. arXiv 2018, arXiv:1812.11904. [Google Scholar]
  22. Meshulam, L.; Gauthier, J.L.; Brody, C.D.; Tank, D.W.; Bialek, W. Coarse Graining, Fixed Points, and Scaling in a Large Population of Neurons. Phys. Rev. Lett. 2019, 123, 1079–7114. [Google Scholar] [CrossRef] [Green Version]
  23. Lahoche, V.; Samary, D.O.; Tamaazousti, M. Generalized scale behavior and renormalization group for principal component analysis. arXiv 2002, arXiv:2002.10574. [Google Scholar]
  24. Lahoche, V.; Samary, D.O.; Tamaazousti, M. Signal detection in nearly continuous spectra and symmetry breaking. arXiv 2011, arXiv:2011.05447. [Google Scholar]
  25. Lahoche, V.; Samary, D.O.; Tamaazousti, M. Field theoretical approach for signal detection in nearly continuous positive spectra II: Tensorial data. Entropy 2021, 23, 795. [Google Scholar] [CrossRef]
  26. Shlens, J. A tutorial on principal components analysis. arXiv 2014, arXiv:1404.1100. [Google Scholar]
  27. Campeti, P.; Poletti, D.; Baccigalupi, C. Principal component analysis of the primordial tensor power spectrum. J. Cosmol. Astropart. Phys. 1909, 9, 055. [Google Scholar] [CrossRef] [Green Version]
  28. Hotelling, H. Analysis of a complex of statistical variables into principal components. J. Educ. Psychol. 1933, 24, 417–441, 498–520. [Google Scholar] [CrossRef]
  29. Abdi, H.; Williams, L.J. Principal component analysis. Wiley Interdisciplinary Reviews. Comput. Stat. 2010, 2, 433–459. [Google Scholar] [CrossRef]
  30. Yue, G.; Jennifer, D. Sparse Probabilistic Principal Component Analysis. In Proceedings of the Twelth International Conference on Artificial Intelligence and Statistics, Clearwater Beach, FL, USA, 16–19 April 2009. [Google Scholar]
  31. Seddik, M.E.A.; Tamaazousti, M.; Couillet, R. A kernel random matrix-based approach for sparse PCA. In Proceedings of the International Conference on Learning Representations, New Orleans, LA, USA, 6–9 May 2019. [Google Scholar]
  32. Delamotte, B. An Introduction to the nonperturbative renormalization group. Lect. Notes Phys. 2012, 852, 49. [Google Scholar] [CrossRef] [Green Version]
  33. Bény, C. Inferring relevant features: From QFT to PCA. Int. J. Quant. Inf. 2018, 16, 1840012. [Google Scholar] [CrossRef] [Green Version]
  34. Foreman, S.; Giedt, J.; Meurice, Y.; Unmuth-Yockey, J. RG inspired Machine Learning for lattice field theory. EPJ Web Conf. 2018, 175, 11025. [Google Scholar] [CrossRef]
  35. Bény, C. Coarse-grained distinguishability of field interactions. Quantum 2018, 2, 67. [Google Scholar] [CrossRef]
  36. Bény, C.; Osborne, T.J. Information-geometric approach to the renormalization group. Phys. Rev. A 2015, 92, 022330. [Google Scholar] [CrossRef] [Green Version]
  37. Ay, N.; Jost, J.; Le, H.V.; Schwachhofer, L. Information Geometry (Ergebnisse der Mathematik und ihrer Grenzgebiete, 3. Folge A Series of Modern Surveys in Mathematics 64); Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
  38. Amari, S.I. Information Geometry and Its Applications; Springer: Berlin/Heidelberg, Germany, 2016; Volume 194. [Google Scholar]
  39. Amari, S.I.; Nagaoka, H. Methods of Information Geometry; American Mathematical Society: Providence, RI, USA, 2000; Volume 191. [Google Scholar]
  40. Pessoa, P.; Caticha, A. Exact Renormalization Groups As a Form of Entropic Dynamics. Entropy 2017, 20, 25. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Tkacik, G.; Marre, O.; Amodei, D.; Schneidman, E.; Bialek, W.; Berry, M.J., II. Searching for collective behavior in a large network of sensory neurons. PLoS Comput. Biol. 2014, 10, e1003408. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Marsili, M. Dissecting financial markets: Sectors and states. Quant. Financ. 2002, 2, 297–302. [Google Scholar] [CrossRef]
  43. Marčenko, V.A.; Pastur, L.A. Distribution of eigenvalues for some sets of random matrices. Math. USSR-Sb. 1967, 1, 457. [Google Scholar] [CrossRef]
  44. Baik, J.; Arous, G.B.; Peche, S. Phase transition of the largest eigenvalue for non-null complex sample covariance matrices. Ann. Probab. 2005, 33, 1643–1697. [Google Scholar] [CrossRef]
  45. Perry, A.; Wein, A.S.; Bandeira, A.S.; Moitra, A. Optimality and sub-optimality of PCA I: Spiked random matrix models. Ann. Stat. 2018, 46, 2416–2451. [Google Scholar] [CrossRef] [Green Version]
  46. Woloshyn, R.M. Learning phase transitions: Comparing PCA and SVM. arXiv 2019, arXiv:1905.08220. [Google Scholar]
  47. Wang, D. Spiked Models in Wishart Ensemble. Ph.D. Thesis, Brandeis University, Waltham, MA, USA, 2008. [Google Scholar]
  48. Dyson, F.J. Statistical theory of the energy levels of complex systems. I. J. Math. Phys. 1962, 3, 140. [Google Scholar] [CrossRef]
  49. Alekseev, O.; Mineev-Weinstein, M. Statistical mechanics of stochastic growth phenomena. Phys. Rev. E 2017, 96, 010103. [Google Scholar] [CrossRef] [Green Version]
  50. Dupuis, N.; Canet, L.; Eichhorn, A.; Metzner, W.; Pawlowski, J.M.; Tissier, M.; Wschebor, N. The nonperturbative functional renormalization group and its applications. arXiv 2006, arXiv:2006.04853. [Google Scholar]
  51. Blaizot, J.P.; Mendez-Galain, R.; Wschebor, N. Non perturbative renormalisation group and momentum dependence of n-point functions (I). Phys. Rev. E 2006, 74, 051116. [Google Scholar] [CrossRef] [Green Version]
  52. Blaizot, J.P.; Mendez-Galain, R.; Wschebor, N. Non perturbative renormalization group and momentum dependence of n-point functions. II. Phys. Rev. E 2006, 74, 051117. [Google Scholar] [CrossRef] [Green Version]
  53. Hu, B. The Classical Ising Model: A Quantum Renormalization Group Approach. Phys. Lett. A 1979, 71, 83–86. [Google Scholar] [CrossRef]
  54. Sherrington, D.; Kirkpatrick, S. Solvable Model of a Spin-Glass. Phys. Rev. Lett. 1975, 35, 1792. [Google Scholar] [CrossRef]
  55. Bouchaud, J.P.; Cugliandolo, L.F.; Kurchan, J.; Mezard, M. Out of equilibrium dynamics in spin-glasses and other glassy systems. arXiv 1997, arXiv:cond-mat/9702070. [Google Scholar]
  56. Bouchaud, J.P.; Cugliandolo, L.; Kurchan, J.; Mezard, M. Mode coupling approximations, glass theory and disordered systems. Phys. A 1996, 226, 243. [Google Scholar] [CrossRef] [Green Version]
  57. Panchenko, D. The Sherrington-Kirkpatrick model: An overview. J. Stat. Phys. 2012, 149, 362–383. [Google Scholar] [CrossRef] [Green Version]
  58. Wetterich, C. The Average action for scalar fields near phase transitions. Z. Phys. C 1993, 57, 451. [Google Scholar] [CrossRef]
  59. Wetterich, C. Exact evolution equation for the effective potential. Phys. Lett. B 1993, 301, 90. [Google Scholar] [CrossRef] [Green Version]
  60. Lahoche, V.; Samary, D.O. Ward identity violation for melonic T4-truncation. Nucl. Phys. B 2019, 940, 190. [Google Scholar] [CrossRef]
  61. Lahoche, V.; Samary, D.O. Nonperturbative renormalization group beyond the melonic sector: The effective vertex expansion method for group fields theories. Phys. Rev. D 2018, 98, 126010. [Google Scholar] [CrossRef] [Green Version]
  62. Lahoche, V.; Samary, D.O. Revisited functional renormalization group approach for random matrices in the large-N limit. Phys. Rev. D 2020, 101, 106015. [Google Scholar] [CrossRef]
  63. Lahoche, V.; Samary, D.O. Reliability of the local truncations for the random tensor models renormalization group flow. Phys. Rev. D 2020, 102, 056002. [Google Scholar] [CrossRef]
  64. Pawlowski, J.M.; Scherer, M.M.; Schmidt, R.; Wetzel, S.J. Physics and the choice of regulators in functional renormalisation group flows. Ann. Phys. 2015, 384, 1–2. [Google Scholar] [CrossRef] [Green Version]
  65. Lahoche, V.; Samary, D.O. Pedagogical comments about nonperturbative Ward-constrained melonic renormalization group flow. Phys. Rev. D 2020, 101, 024001. [Google Scholar] [CrossRef] [Green Version]
  66. Lahoche, V.; Samary, D.O. Ward-constrained melonic renormalization group flow for the rank-four ϕ6 tensorial group field theory. Phys. Rev. D 2019, 100, 086009. [Google Scholar] [CrossRef] [Green Version]
  67. Litim, D.F. Optimization of the exact renormalization group. Phys. Lett. B 2000, 486, 92. [Google Scholar] [CrossRef] [Green Version]
  68. Litim, D.F. Derivative expansion and renormalization group flows. J. High Energy Phys. 2001, 0111, 059. [Google Scholar] [CrossRef] [Green Version]
  69. Canet, L.; Delamotte, B.; Mouhanna, D.; Vidal, J. Optimization of the derivative expansion in the nonperturbative renormalization group. Phys. Rev. D 2003, 67, 065004. [Google Scholar] [CrossRef] [Green Version]
  70. Bagnuls, C.; Bervillier, C. Classical to critical crossovers from field theory. Phys. Rev. E 2002, 65, 066132. [Google Scholar] [CrossRef] [Green Version]
  71. Balog, I.; Chaté, H.; Delamotte, B.; Marohnic, M.; Wschebor, N. Convergence of Nonperturbative Approximations to the Renormalization Group. Phys. Rev. Lett. 2019, 123, 240604. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  72. Defenu, N.; Mati, P.; Marian, I.G.; Nandori, I.; Trombettoni, A. Truncation Effects in the Functional Renormalization Group Study of Spontaneous Symmetry Breaking. J. High Energy Phys. 2015, 1505, 141. [Google Scholar] [CrossRef] [Green Version]
  73. Montanari, A.; Richard, E. A statistical model for tensor PCA. In Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada, 8–13 December 2014. [Google Scholar]
Figure 1. Qualitative picture of the signal detection issue in a nearly continuous spectrum.
Figure 1. Qualitative picture of the signal detection issue in a nearly continuous spectrum.
Entropy 23 01132 g001
Figure 2. Qualitative illustration of the RG flow. The UV scale is described by the classical action S , while the IR scale is described by an effective object Γ , where microscopic effects are hidden in the different parameters involved in its definition.
Figure 2. Qualitative illustration of the RG flow. The UV scale is described by the classical action S , while the IR scale is described by an effective object Γ , where microscopic effects are hidden in the different parameters involved in its definition.
Entropy 23 01132 g002
Figure 3. Qualitative illustration of the meaning of the effective averaged action Γ k , as the effective action of the UV degrees of freedom which have been integrated-out.
Figure 3. Qualitative illustration of the meaning of the effective averaged action Γ k , as the effective action of the UV degrees of freedom which have been integrated-out.
Entropy 23 01132 g003
Figure 4. Qualitative illustration of the RG flow behavior. Some different UV initial conditions lead to the same (universal) IR physics, up to negligible differences with regard to the experimental precision.
Figure 4. Qualitative illustration of the RG flow behavior. Some different UV initial conditions lead to the same (universal) IR physics, up to negligible differences with regard to the experimental precision.
Entropy 23 01132 g004
Figure 5. A step of the RG flow. On the left, integration of momenta between s Λ and Λ . On the right, dilatation of the remaining momenta with a factor 1 / s .
Figure 5. A step of the RG flow. On the left, integration of momenta between s Λ and Λ . On the right, dilatation of the remaining momenta with a factor 1 / s .
Entropy 23 01132 g005
Figure 6. Numerical flow associated to the MP law (data without signal) and for the quartic truncation. The main directions of the flow are highlighted by the black arrows (which are oriented from UV to IR). We observe the existence of a region reminiscent of the standard Wilson–Fisher fixed point.
Figure 6. Numerical flow associated to the MP law (data without signal) and for the quartic truncation. The main directions of the flow are highlighted by the black arrows (which are oriented from UV to IR). We observe the existence of a region reminiscent of the standard Wilson–Fisher fixed point.
Entropy 23 01132 g006
Figure 7. The canonical dimension for MP distribution with K = 1 and σ = 0.5 (on the right), σ = 1 (in the middle) and σ = 2 (on the left). The purple curve corresponds to the MP distribution.
Figure 7. The canonical dimension for MP distribution with K = 1 and σ = 0.5 (on the right), σ = 1 (in the middle) and σ = 2 (on the left). The purple curve corresponds to the MP distribution.
Entropy 23 01132 g007
Figure 8. RG trajectories starting from k = Λ 0 for u 2 (blue curves), u 4 (red curves), and u 6 (green curves).
Figure 8. RG trajectories starting from k = Λ 0 for u 2 (blue curves), u 4 (red curves), and u 6 (green curves).
Entropy 23 01132 g008
Figure 9. A typical DNMP distribution for P = 1500 and N = 2000 .
Figure 9. A typical DNMP distribution for P = 1500 and N = 2000 .
Entropy 23 01132 g009
Figure 10. Numerical flow associated to a DNMP distribution in the learnable region.
Figure 10. Numerical flow associated to a DNMP distribution in the learnable region.
Entropy 23 01132 g010
Figure 11. Three points of view of the compact region R 0 (illustrated with purple dots) in the vicinity of the Gaussian fixed point (illustrated with a black dot). In this region RG trajectories end in the symmetric phase, and thus are compatible with a symmetry restoration scenario for initial conditions corresponding to an explicit symmetry breaking. The top plots are associated to the case of pure noise and the bottom plots are, respectively, associated to the case with signal.
Figure 11. Three points of view of the compact region R 0 (illustrated with purple dots) in the vicinity of the Gaussian fixed point (illustrated with a black dot). In this region RG trajectories end in the symmetric phase, and thus are compatible with a symmetry restoration scenario for initial conditions corresponding to an explicit symmetry breaking. The top plots are associated to the case of pure noise and the bottom plots are, respectively, associated to the case with signal.
Entropy 23 01132 g011
Figure 12. Illustration of the evolution of the potential associated to an example of initial conditions of the coupling u 2 , u 4 , and u 6 where the RG trajectories end in the symmetric phase in the case of pure noise (on the left) and stay in the non-symmetric phase when we add a signal (on the right).
Figure 12. Illustration of the evolution of the potential associated to an example of initial conditions of the coupling u 2 , u 4 , and u 6 where the RG trajectories end in the symmetric phase in the case of pure noise (on the left) and stay in the non-symmetric phase when we add a signal (on the right).
Entropy 23 01132 g012
Figure 13. Illustration of the evolution of the u 2 for eigenvalues between 2.5 and 3.4 in the case of pure noise (NMP distribution). We can see that the values of u ¯ 2 for these examples are of the same magnitude as N = 2000 . This highlights the existence of some RG trajectories associated to physically relevant states in the deep infrared.
Figure 13. Illustration of the evolution of the u 2 for eigenvalues between 2.5 and 3.4 in the case of pure noise (NMP distribution). We can see that the values of u ¯ 2 for these examples are of the same magnitude as N = 2000 . This highlights the existence of some RG trajectories associated to physically relevant states in the deep infrared.
Entropy 23 01132 g013
Figure 14. Illustration of the evolution of κ , obtained with the LPA representation, for eigenvalues between 2.5 and 3.4 in the case of data without signal. For some RG trajectories (on the left), κ decreases to zero, which correspond to a restoration of the symmetry. For other RG trajectories (on the right), κ stays almost constant in the range of eigenvalues that we consider, and does not lead to a restoration of the symmetry.
Figure 14. Illustration of the evolution of κ , obtained with the LPA representation, for eigenvalues between 2.5 and 3.4 in the case of data without signal. For some RG trajectories (on the left), κ decreases to zero, which correspond to a restoration of the symmetry. For other RG trajectories (on the right), κ stays almost constant in the range of eigenvalues that we consider, and does not lead to a restoration of the symmetry.
Entropy 23 01132 g014
Figure 15. Illustration of the evolution of the potential associated to an example of initial conditions of the coupling u 2 , u 4 , and u 6 . We see that the RG trajectories, obtained with the LPA representation, end in the symmetric phase in the case of pure noise (on the left) and stay in the non symmetric phase when we add a signal (on the right).
Figure 15. Illustration of the evolution of the potential associated to an example of initial conditions of the coupling u 2 , u 4 , and u 6 . We see that the RG trajectories, obtained with the LPA representation, end in the symmetric phase in the case of pure noise (on the left) and stay in the non symmetric phase when we add a signal (on the right).
Entropy 23 01132 g015
Figure 16. Illustration of the evolution of η , obtained by the LPA’ representation, for eigenvalues between 2.5 and 3.4 in the case of data without signal. We see that for these RG trajectories, the anomalous dimension η remains small. This highlights that there is no significant change when we use the LPA’ representation instead of the LPA one.
Figure 16. Illustration of the evolution of η , obtained by the LPA’ representation, for eigenvalues between 2.5 and 3.4 in the case of data without signal. We see that for these RG trajectories, the anomalous dimension η remains small. This highlights that there is no significant change when we use the LPA’ representation instead of the LPA one.
Entropy 23 01132 g016
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Lahoche, V.; Ousmane Samary, D.; Tamaazousti, M. Field Theoretical Approach for Signal Detection in Nearly Continuous Positive Spectra I: Matricial Data. Entropy 2021, 23, 1132. https://doi.org/10.3390/e23091132

AMA Style

Lahoche V, Ousmane Samary D, Tamaazousti M. Field Theoretical Approach for Signal Detection in Nearly Continuous Positive Spectra I: Matricial Data. Entropy. 2021; 23(9):1132. https://doi.org/10.3390/e23091132

Chicago/Turabian Style

Lahoche, Vincent, Dine Ousmane Samary, and Mohamed Tamaazousti. 2021. "Field Theoretical Approach for Signal Detection in Nearly Continuous Positive Spectra I: Matricial Data" Entropy 23, no. 9: 1132. https://doi.org/10.3390/e23091132

APA Style

Lahoche, V., Ousmane Samary, D., & Tamaazousti, M. (2021). Field Theoretical Approach for Signal Detection in Nearly Continuous Positive Spectra I: Matricial Data. Entropy, 23(9), 1132. https://doi.org/10.3390/e23091132

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop