The helicity of electromagnetic fields has received research attention since at least the 1960s [
1,
2,
3,
4,
5,
6,
7,
8]. Recently, the topic has picked up a considerable pace, partly because of the relevance of helicity in chiral light–matter interactions [
9,
10,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24]. In this context, one of the most basic quantities is the average value of electromagnetic helicity for a given free electromagnetic field, which can be interpreted as the pseudo-scalar that measures the difference between the number of left- and right-handed polarized photons contained in the field [
1,
2]. The most common expression for average helicity is [
1,
2,
3,
6,
7,
8,
11,
12,
13,
14,
15,
17,
21,
22,
23,
24]:
where
and
are the real-valued electric and magnetic fields[potentials], respectively. The use of two potentials is a common strategy in this context [
2,
8,
11,
12,
13,
17,
25]. In particular, it allows the obtaining of an integrand which is local in
. The above equation has been arrived at in several ways. This article contains a different one.
In this article, several different integral expressions for the average helicity of a given electromagnetic field are obtained starting from the inner product that ensures invariance of the result under the largest symmetry group of Maxwell equations: The conformal group. We show that the result from the derived expressions coincides with that of the above integral. In our approach, all the fields are complex because only positive frequencies are included. The advantages of this choice regarding the treatment of helicity with Riemann-Silberstein-like fields and their corresponding potentials are discussed. Integral expressions in momentum space,
space, and
space are obtained. The numerical evaluation of the time-harmonic
expressions, whose integrands are local in
, can be conveniently performed using common numerical Maxwell solvers. The formalism is used to obtain expressions for the computation of molecular circular dichroism in
Appendix C.
We start by writing down the conformally invariant form of the average helicity and showing that the result coincides with the common definition. To such end, we consider
, the vector space of finite-energy solutions to Maxwell’s equations in free space. We denote vectors in such space by kets such as
, which represent particular electromagnetic field solutions. We are interested in the average helicity of the field. The key ingredient for the definition of average properties is an inner product between two vectors
and
, denoted
, which endows
with the structure of a Hilbert space. Properties such as energy, linear momentum, helicity, etc.…, are represented by Hermitian operators which map elements of
back onto itself. Then, for a given field
, the average value of a given property represented by a Hermitian operator
is the quadratic form
that is, the projection of the vector
onto the vector
. Since
is Hermitian,
must be a real number.
The crucial question of
which inner product to choose was settled by Gross by requiring the inner product to be invariant under the conformal group [
26]. That is: Given any two solutions
and
, and their corresponding transformed versions under any transformation in the conformal group,
and
, the inner product must be so that
. Gross showed in Ref. [
26] that this requirement essentially determines the exact expression of the inner product, which we will use later. The conformal group includes space–time translations, spatial rotations, and Lorentz boosts, which together form the Poincaré group, plus space–time scalings, and special conformal transformations [
26]. The conformal group is the largest symmetry group of Maxwell equations in free space. Conformally invariant results have hence the maximum possible validity in electromagnetism.
An important distinction is in order at this point. The use of the conformally invariant inner product ensures the maximal validity for average quantities as defined by Equation (
1): The projection of the vector
onto the vector
is equal to the projection of
onto
, i.e.,
, for any transformation
T in the conformal group, where
is the Hermitian adjoint of
T. Satisfying this demand amounts to showing that an inner product exists with respect to which the conformal group acts unitarily (
for all
T, where
I is the identity) on the vector space of solutions of Maxwell equations [
26]. Loosely speaking, this means that the value of the averages in Equation (
1) will not change regardless of “the conformal point of view” or “conformal change of coordinate system”. This will hold for average helicity, and also for average momentum, average angular momentum, etc.… A
different matter is whether the average quantity in a conformally transformed field is the same as the average quantity in the initial field, for all conformal transformations. In this case, we are asking whether
and
have the same average value of a given property
, i.e., whether
which is often not the case, such as for example when a Lorentz boost simultaneously changes the energy and momentum of a given field. Incidentally, it will be clear later that Equation (
2) is actually met in the case of average helicity.
Writing down an explicit expression for Equation (
1) requires us to choose an explicit representation for the vectors in
and the operators acting on them. We choose the following representation for the vectors in
:
where the
define the plane-wave components of a version of the Riemann-Silberstein vectors [
5]
where
,
,
, and
are the vacuum’s permittivity, permeability, speed of light, and impedance, respectively,
is the wavevector, and
is the angular frequency. The
can be further decomposed as
, where
are complex-valued scalar functions and
are the
-dependent polarization vectors for each handedness(helicity) (The
can be obtained by the rotation of
, the two vectors corresponding to
:
, where
and
). We note that
, which makes the
transverse functions, namely
. The origin
is removed in the integral in Equation (
4) because we are considering electrodynamics and excluding electro- and magneto-statics, whereby
needs to be excluded. It important to note that
only positive frequencies are included in Equation (
4). This amounts to considering positive energies only, which is possible in electromagnetism since the photon is its own anti-particle. Only one sign of the energy (frequency) is needed because the same information is contained on both sides of the spectrum ([
5] § 3.1 and [
4]). When only positive frequencies are included,
,
,
and
in Equation (
4) are complex-valued fields. With
standing for
,
,
or
:
We define the complex-valued fields so that the typical real-valued versions are obtained as
The restriction to positive frequencies is particularly consequential for the treatment of helicity, the generalized polarization handednesses of the field. One of the advantages of the Riemann-Silberstein vectors is their ability to encode the helicity content of the field. They are the eigenstates of the helicity operator and potentially allow for the splitting of the two polarization handednesses in any field, including near and evanescent fields. However, when they are defined by means of real-valued fields, as in
, their use for splitting the two helicities is not as simple as it becomes when complex-valued fields are used. With real-valued fields we have that the two ± fields determine each other through complex conjugation
, which is at odds with the a priori physical independence of the two helicity components of the electromagnetic field. For example, the complex conjugation connection means that the two ± squared norms
, which could intuitively be thought of as the
-local helicity intensities, become equal at all space–time points
. This contradicts, for example, the fact that there can be electromagnetic fields containing only one of the two helicities, e.g., any linear combination of plane-waves with the same polarization handedness. The restriction to positive frequencies overcomes these limitations: In Equation (
4)
contains no information about
, in particular
.
The choice of the representation in Equation (
3), where the two helicity components are distinguished, as opposed to other more common possibilities where the electric and magnetic fields are distinguished, can also be motivated by the transformation properties of the two different options with respect to the conformal group. Namely helicity is invariant under the conformal group [
27,
28], i.e., all the generators of the conformal group commute with the helicity operator. This was established by Mack and Todorov in Ref. [
27] when they showed that a Casimir operator of the conformal group is linearly related to the helicity operator. The invariance can also be inferred from the facts that (i) helicity is ultimately proportional to the cosine of an angle (The cosine of the angle between the vector of spin-1 matrices
and the linear momentum operator
is defined as
. Then, using the definition of the helicity operator
in Equation (
8) we can write
. However, the action of
on members of
is trivial since
where
I is the identity. This can be seen in ([
29] Equation (5.54).), and is readily verified by direct calculation using the spin-1 matrices.), and that, (ii) the preservation of angles is guaranteed by conformal transformations. In the representation of Equation (
3) this invariance means that no matter which conformal transformation is applied to
, the
upper components of
will never end up on the lower part, and vice-versa. This reduces the algebraic complexity of some expressions and manipulations. In sharp contrast to this, what is meant by electric and magnetic fields is not conformally invariant. Actually, the meanings of “electric” and “magnetic” are not even relativistically invariant since electric and magnetic fields are intermixed by Lorentz boosts ([
30] Equation (11.149)). An important physical fact about helicity can be deduced from its conformal invariance. Since the helicity operator (
) commutes with any
T in the conformal group (
), and
T is unitary with respect to the chosen inner product (
), we can readily see that Equation (
2) is met (
.): The average helicity of a conformally transformed field is the same as the average helicity of the initial field.
Let us go on to computing the average helicity of a given field as a conformally invariant inner product. We will explicitly keep the constants
,
,
, and
in the expressions, and use the four fields
,
,
and
. These choices [
21,
23] facilitate the re-use of the formulas when a description such as the one in Equation (
4) is possible in a non-vacuum background, such as for example in an infinite isotropic and homogeneous linear medium.
Following Gross [
26], and Bialynicki-Birula ([
31] § 9) and ([
5] § 5), the definitions in Equations (
1) and (
3) allow us to write the average value of any property
as (Note: This is seen by comparing Equation (
4) with ([
5] Equations (4.11)–(4.12)), and Equation (
7) particularized to the energy operator
with ([
5] Equation (4.13)), and setting
.)
where
means transpose conjugate. Equation (
7) is an explicit expression of the conformally invariant inner product between
and
(Note: When Equation (
7) is brought to the
domain, the
-local expression results in the double integral
of a manifestly non-
-local integrand including a term such as
(see [
26] Equation (
6) and [
5] Equation (5.7))).
We are now ready to focus our attention on the average value of helicity. The helicity operator
is defined as the projection of the angular momentum operator vector
onto the direction of the linear momentum operator vector
:
where for electromagnetism,
is the vector of spin-1 matrices (The second equality can be seen to follow, for example, from considering the coordinate representation of the angular momentum and linear momentum operator vectors, (ref. [
5] Equations (5.24) and (5.25)):
,
. Their inner product then reads
. The first term vanishes since it is the divergence of a curl).
We start by particularizing Equation (
7) to the helicity operator
.
where the last expression contains the explicit form of the helicity operator in our choice of representation (This follows from the definition of helicity in Equation (
8):
, where the equivalence follows from applying ([
5] Equation (2.2))) in momentum space where
). We now use the fact that the
are eigenstates of helicity, namely
, to write
We will now show that Equation (
10) is equivalent to the most common integral expression of the helicity average. To such end, and taking advantage of the fact that
, we define the helicity potentials
which in the
representation, and recalling that
, are seen to be related to
as
from where we can use ([
21] Equation (
2)), namely
, to recognize that these helicity potentials are linear combinations of complex versions of the transverse real-valued “magnetic”
and “electric”
potentials [
2,
8,
11,
12,
13,
17,
25].
Appendix A contains some background information about the electric potential. Linear combinations very similar to Equation (
13) have been recently introduced by Elbistan et al. in Ref. [
17], albeit using real-valued vector functions instead of our complex
and
. As previously discussed, this difference is relevant for treating helicity. When real-valued fields are used in the right hand side of Equation (
13), it follows that
, which ultimately leads to a zero value of the average helicity as reported in [
17].
It is also worth pointing out that the
functions are transverse, i.e.,
, which follows from Equation (
11) and the previously mentioned property
. The helicity potentials only contain the transverse degrees of freedom, the same as the free electromagnetic field, which ensures that the results obtained using
are gauge independent.
We proceed by using Equation (
11) and the central expression in Equation (
10) to obtain
Equation (
14) can now be brought to the
domain as follows. First, we apply the substitutions
, and
These changes do not affect the result, but allow us to see from Equation (
4) that the
are the three-dimensional Fourier transforms (
) of
. The same relation holds between
and
. We can now apply applying Parseval’s theorem, i.e., the unitarity of the inverse Fourier transform
, to each of the two terms in the subtraction in Equation (
15):
where the integrand is local in
. We show in
Appendix A that when Equation (
10) is brought to the
domain instead, the
term results in the double integral
of a manifestly non-
-local integrand including a term such as
. The inconvenient
term is absorbed in the definition of the potentials in Equation (
11).
To further approach the most common expression of the average helicity, we now substitute
into Equation (
16) and obtain
which is a complex version of the well-known integral for the average helicity featuring real-valued fields, as found e.g., in ([
21] Equation (
6)).
Appendix B shows that the results of the complex and real versions coincide.
The
-domain expressions in Equations (
10) and (
14), and the
-domain expression in Equation (
16) produce the correct result. We now obtain
-domain expressions. The time-harmonic decomposition is often used in both theoretical investigations and numerical computations.
We start by noting that the result of the integral in Equation (
16) is independent of time (Indeed, the simplifying arbitrary choice
is made by Gross in [
26] for evaluating the inner product with integrals featuring
-dependent integrands). The time independence of
, manifest in Equations (
10) and (
14), is ultimately due the fact that
must be invariant under time translations since such transformations are in the conformal group, i.e.,
cannot depend on time. This can be exploited to obtain expressions for
involving the time-harmonic decomposition of the fields. To such end, we go back to Equation (
16), and expand each term in the integrand into their frequency components
Let us examine the last line of Equation (
19). Because
cannot depend on time, and since the two helicities are independent of each other, it follows that only the
components can contribute to the end result. This allows us to obtain the following three equivalent expressions:
where the equalities readily follow from
, which follows from Equation (
12).
Expressions that are local in
, such as Equations (
14) and (
20), justify the consideration of the average helicity in a finite volume
. This then allows use of the corresponding expressions in practical situations where numerical solvers calculate the fields in finite regions of space. The expressions in Equation (
20) are particularly adapted to the output of finite-element-method solvers such as COMSOL and JCM, which use the time-harmonic decomposition of
-dependent fields.
Finally, regarding applications, the electromagnetic helicity is particularly relevant in chiral light–matter interactions. Among these, the interaction of the field with chiral molecules is one of the most researched cases, partly because the optical sensing of chiral molecules is important in chemistry and pharmaceutical applications. In
Appendix C we use the above formalism to derive expressions for computing the circular dichroism signal for two different settings of the light-molecule interaction: The
dipolarizability tensor and the T-matrix.
In conclusion, several equivalent expressions for the average value of the electromagnetic helicity of a given field have been obtained from a starting point featuring maximal electromagnetic invariance, i.e., from an expression whose result is invariant under the conformal group. Some of the obtained expressions can be conveniently evaluated using the outputs of common Maxwell solvers.