Sensor Selection and Precoding Strategies For Wireless Sensor Networks
Sensor Selection and Precoding Strategies For Wireless Sensor Networks
Sensor Selection and Precoding Strategies For Wireless Sensor Networks
4411
AbstractSensor selection has recently received a growing interest in the literature, motivated by the worldwide deployment
of wireless sensor networks and by the increase in the number of
available applications. In our framework, sensors take remote measurements of a quantity of interest and communicate their observations through a noisy, multiantenna wireless communication link.
In this context, we propose a scheme for optimally selecting out
sensor nodes on the basis of the amount of information they
of
convey to a common receiver/actuator. Moreover, a suitable linear
precoder is employed and optimized at each transmitter with the
aim of maximizing the mutual information between the observed
variable and the signal received by the actuator. The sensor selection problem is known to be combinatorial, and several computable relaxations are available in the literature. In this paper,
the optimality conditions are formally expressed in an information-theoretic context, and both semi-denite-programming relaxations and greedy schemes, leading to computable techniques for
large values of and , are presented. Moreover, specic results
for the cases of high and low signal-to-noise ratio on the wireless
channel are derived. Numerical simulations show that knowledge
of the channel state at the transmitter may lead to an increase of the
achievable mutual information and determine a different choice of
sensors, thus pointing out that our approach signicantly improves
upon selection schemes that neglect the characteristics of the communication layer.
Index TermsChannel precoding, convex relaxation, green networks, mutual information, sensor selection, wireless sensor networks.
I. INTRODUCTION
IRELESS SENSOR NETWORKs (WSNs) and wireless sensor-and-actuator networks (WSANs) have
found applications in many elds, including trafc control,
weather forecast, pollution control, environmental monitoring
and surveillance, see for instance the surveys [1], [2]. In
WSNs/WSANs, the sensors measure a given physical variable
and transmit their observations to a processing center, sometimes coinciding with the network gateway, which processes
the received information.
Manuscript received June 27, 2014; revised February 20, 2015; accepted May
04, 2015. Date of publication June 01, 2015; date of current version July 09,
2015. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Y.-W. Peter Hong.
The authors are with IEIIT-CNR (Institute of Electronics, Telecommunications and Information Engineering of the Italian National Research
Council), Torino 10129, Italy (e-mail: alessandro.nordio@polito.it; alberto.tarable@polito.it; fabrizio.dabbene@polito.it; roberto.tempo@polito.it).
Color versions of one or more of the gures in this paper are available online
at http://ieeexplore.ieee.org.
Digital Object Identier 10.1109/TSP.2015.2439239
The technology growth of recent years has allowed for a reduction in the production and deployment cost of sensors. As
a consequence, the number of deployed sensors is often larger
than needed for processing. On the other hand, the operational
cost of sensors is usually still rather large, due to battery consumption, especially in the signal transmission phase. To cope
with this issue, at xed time intervals, the processing center
needs to perform a sensor selection, with the aim of reducing
the number of transmitting sensors, while keeping the others in
sleep mode. This green-network strategy allows power saving
and increases the lifetime of sensor batteries.
Motivated by these issues, the literature considering the
problem of optimally selecting a subset of sensors from a set of
possible choices has received increasing interest. The interested
reader is referred to [3] for a survey on different schemes
currently used to select sensors, based on different kinds of
metrics (coverage, target-tracking and localization schemes,
single/multiple mission assignments). Clearly, whenever the
number of sensors is large, optimal selection becomes a computationally difcult combinatorial problem. Several approaches
have been proposed to obtain suitablerelaxations with the goal
of making the problem computationally feasible, even for a
moderate-to-large number of sensors.
For example, in [4] the sensor scheduling problem is reformulated as a nonlinear deterministic optimal control problem that
could be in principle attacked via a tree-search approach, but this
solution is generally impractical for a relatively large number of
sensors. The work [5] proposes a convex relaxation technique
to reduce the combinatorial problem to a convex programming
problem, which can be easily solved in polynomial time. The
most prominent advantages of this approach are its immediate
applicability, due to the powerful existing convex optimization
algorithms, and the possibility of handling various performance
criteria with energy and topology constraints. In [6], the case of
multistep measurements is considered, and the process state is
estimated by means of a Kalman lter with the goal of minimizing an objective function related to the Kalman lter error
covariance matrix.
An information-theoretic approach to sensor selection and localization in sensor networks for target tracking is studied in [7].
There, the problem of choosing a sensor from a set of candidates
is formulated, with the goal of minimizing the expected conditional entropy of the posterior target location distribution. In this
case, both the prior target location distribution and the locations
and observation models of the candidate sensors are assumed to
be known. In [8], [9], it is shown that the information gain is a
submodular function. This property allows to prove guaranteed
1053-587X 2015 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
4412
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 63, NO. 16, AUGUST 15, 2015
and
is the -th observation
where
is a vector of size
(different for each sensor). The meamatrix of size
surement noise vectors are modeled as independent Gaussian
,
random variables with zero mean and covariance matrices
. Similarly, we assume that the quantity is a
Gaussian random vector with zero mean and covariance matrix
.
As summarized in Fig. 1, in the setup considered in this paper,
the sensors have to transmit their observations through a wireless channel, and are hence equipped with transmit antennas.
1Throughout the paper, uppercase and lowercase boldface letters denote matrices and vectors, respectively. The identity matrix is denoted by , and
denotes the determinant of the matrix .
NORDIO et al.: SENSOR SELECTION AND PRECODING STRATEGIES FOR WIRELESS SENSOR NETWORKS
4413
(5)
denotes the mutual information between and
where
, conditioned on the knowledge of
and . For Gaussian
is dened as
random variables
(6)
(1)
The sensing devices transmit their (precoded) signals to a
common processing center, which consists of a wireless receiver equipped with
antennas. Signals are assumed to be
transmitted over MIMO orthogonal channels, so that they do
not interfere with each other. The signal received by the processing center from the -th sensor is then given by
where
and
are
, respectively. Also, the
the covariance matrices of and
symbol in (6) denotes conditioning. It follows that the mutual
can be written as
information
(2)
where the
matrix
represents the -th MIMO channel
represents additive noise, assumed independent of ,
and
and modeled as a complex Gaussian random vector with zero
. More specically, the
-th
mean and covariance matrix
is the gain of the link connecting the
entry of the matrix
-th transmit antenna to the -th receive antenna. In the case
all sensors transmit, the received signals can be arranged in the
, of size
, given by
vector
..
.
..
.
..
.
(3)
,
We dene the matrices
, and
. Note that and
are block-diagonal of size
and
respectively, and is of size
. Then, we can rewrite
where
(3) more compactly as
and
collect the measurement noise and the thermal noise, respectively. The equivalent
noise at the receiver is represented by the random vector
, which has covariance matrix
and
(7)
(8)
(4)
(9)
4414
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 63, NO. 16, AUGUST 15, 2015
where
,
, and
are obtained from
the matrices
the eigenvalue decomposition of the Hermitian matrices
and
, i.e.,
and
;
the eigenvalues are ordered in decreasing order, i.e,
where
and
where
;
is an
rectangular diagonal matrix whose diagonal elements are given by
(15)
(11)
Then, the maximization problem in (5) can be rewritten as
(12)
where
. We observe that the solution
of the above problem involves a joint optimization of all pre,
coders. In general, the optimal precoder for the -th sensor,
and
, but rather depends on the
is function not only of
and . This follows from the fact that the
whole matrices
does not have a block-diagonal structure and the
matrix
cost function in (12) cannot be written as a sum of terms, each
depending on the parameters of a single sensor. Thus, the computation of the optimal precoder, , requires the knowledge of
the matrices and at every sensor. This assumption is however unrealistic in our case since it would require an intensive
exchange of information among sensors.
We then resort to a locally optimal solution: instead of
, we compute the precoder
nding the optimal precoder
where
is obtained by solving the
local maximization problem
, where
and
. Moreover
, and the parameter
is chosen so that
.
The expression of the locally optimal precoder in (14) has
the following intuitive explanation: the purpose of the unitary
and
is to rotate the channel and observation
matrices
matrices, respectively, in order to allow the precoder to act
directly over the system eigenvalues. The term
normalizes to unity the average power of the signal
, and
shares the available power,
, among
the diagonal matrix
the available channel modes. The proof of (14) is given in
Appendix A. Expressions for the locally optimal precoder in
the high- and low-SNR regimes are derived in Section VI.
IV. SENSOR SELECTION PROBLEM
As discussed in Section I, we are interested in selecting a
maximizing
suitable subset of sensors of cardinality
an appropriate performance metric, i.e, in our case, mutual information. In practice, to estimate the quantity , the processing
available, at
center makes use of sensors only, out of the
the advantage of a lower complexity and energy saving. Clearly,
better estimates can be obtained if the amount of information
contained in the selected signals is larger.
Mathematically, the sensor selection problem can be formalselection matrix , which
ized by introducing a
left-multiplies the vector . This matrix is formed by
sub-blocks of size
each, and the
-th block of ,
, is dened as
if sensor is the -th selected sensor
otherwise
(13)
. In (13) we recalled the denition of
for
given in (10). We moreover observe that the dependence on
can be found only in the term
. Then, solving the
the mutual
above problem is equivalent to maximize over
information
at each sensor, and the solution,
, does
and
for
. Therefore no exchange of
not depend on
information among sensors is required.
Proposition 1: The locally optimal precoder
that maximizes the problem in (13) is given by
(14)
and
. Also has the following
for
, there exists one and only
properties: i) for each
, and ii) for each
,
one index such that
.
there is at most one index such that
In particular, i) guarantees that exactly sensors are selected,
while ii) ensures that sensors are selected no more than once.
With this notation dened, the optimal sensor selection can be
formulated as the following problem.
Problem 1 (Wireless sensor selection): Based on the knowledge of sensor characteristics (i.e., measurement matrices
and measurement noise covariance matrices
,
), and of the wireless communication channels
,
and thermal
(i.e., transmission matrices
NORDIO et al.: SENSOR SELECTION AND PRECODING STRATEGIES FOR WIRELESS SENSOR NETWORKS
noise variance
), the problem of choosing the selection
, can
matrix that maximizes the mutual information
be formulated as:
(16)
where denotes the received signal in the case the optimal
precoders
have been designed according to (14), i.e.,
where
and
. The mutual information obtained by an optimal choice of
. In the
signals received from sensors is thus given by
next section we show how this problem can be reformulated in
terms of the notation introduced so far.
4415
A. Problem Reformulation
Since and are Gaussian random variables we rst rewrite
as
the mutual information
(17)
where
and
are,
and
, and where
respectively, the covariance matrices of
denotes the covariance matrix of . Thus (17) rewrites
(18)
It is important to observe that, by construction, the covariance matrix
has a block-diagonal structure with blocks of
. Such a property is due to the fact that both
size
and
are block-diagonal matrices. Therefore we can write
whence it follows that the mutual
information (18) can be rewritten as
(19)
where in
we have applied Sylvesters theorem, and in
we dened
, we exploited the fact that the matrix
is idempotent (i.e.,
) and that, because of the block-diagonal structure of
,
. Note also that is
block-diagonal matrix with blocks
.
a
Specically the -th block of , , can be written as
where
,
, are Boolean variables. It
turns out that the sensor selection problem can be reformulated
deby introducing the Boolean selection vector
where
means that the -th
ned as
is
sensor has been selected. It follows that the matrix
where
block-diagonal and its -th block is given by
is the -th diagonal block of
and is given by
(22)
where
is dened in (20), and
denotes the -norm,
equal to the number of nonzero elements in .
Remark 1: Note that the problem formulated in Proposition
2 can be solved based solely on the knowledge of the quantities
,
,
,
, , and thus can be solved prior to performing the actual measurements, thus allowing to switch off
sensors that are not selected. This will permit an
the
optimization of the energy resources.
In the next Section, we show relaxations and greedy algorithms aiming at reducing the complexity of the sensor selection
problem in Proposition 2.
V. SENSOR SELECTION ALGORITHMS
The solution of the optimal wireless sensor selection problem
formulated in Proposition 2 amounts at solving a hard nonconvex optimization problem, due to its intrinsic combinatorial
nature. In principle, a solution would require the evaluation of
possible selections
the performance index for each of the
of active sensors. Specic branch-and-bound techniques, in the
spirit of [18], [19], can be devised for the numerical solution
of this problem, but this approach is clearly not practical unare relatively small. In the other cases, different
less and
approximations or relaxations are possible, as discussed in the
next subsections.
A. Concave Relaxation
A computable continuous relaxation of the optimization
problem (22) can be immediately obtained by relaxing the
requirement that the selection vector is to be binary. In fact,
is concave for
the cost function
. This approach, analogous of that introduced in [5],
leads to the following semidenite program (SDP)
(23)
Thus
where
(20)
4416
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 63, NO. 16, AUGUST 15, 2015
problems, and has been applied, e.g., in the context of compressive sensing in [20]. The complexity of this SDP algorithm
, and hence it can be applied also to rather large
scales as
networks. In general, the optimal solution of this SDP relaxation will take fractional values, so that some kind of sorting
and rounding is necessary to obtain the desired solution. The
simplest approach consists in selecting the elements of
with the largest values. A more sophisticated technique, which
leads to better solutions, consists in applying an iterative procedure, where each iteration solves a reweighted -problem. This
procedure has been suggested in [21] to enhance the sparsity of
the solution of a -minimization problem, and is summarized
in Algorithm 1.
1) 1) Initialization:
,
2) 2) Solve the weighted SDP relaxation:
larger than
, GOTO 2.
1) 1) Initialization:
,
,
2) 2) Select greedily the next sensor:
(25)
3) 3) Update the selected measurement set:
(26)
4) 4) Set
, IF
GOTO 2.
(27)
In other words, a submodular function satises the so-called
diminishing increments property. The submodular function is
,
.
monotone if
The importance of the submodularity stems from the results
proven in [26], where the authors study the problem of maximizing a monotone submodular function under a simple cardinality constraint, that is
(28)
They prove that the solution of the greedy procedure applied to
times the optimal
problem (28) is not less than
one. Since the mutual information in (21) is a submodular function [23], an immediate application of Denition 1 provides the
following lemma.
Lemma 1 (Suboptimality of the greedy algorithm): Let
be the mutual information obtained by solving Algorithm 2 and
let be the mutual information achieved by the optimal solution
of (22). Then,
NORDIO et al.: SENSOR SELECTION AND PRECODING STRATEGIES FOR WIRELESS SENSOR NETWORKS
where
4417
comes
VI. HIGH- AND LOW-SNR REGIMES
In this section, we consider two particular scenarios for
which the sensor selection problem considerably simplies, i.e., the cases of high and low SNR on the wireless
channel. More precisely, the high-SNR regime is dened
, while the low-SNR regime corby the condition
. For the whole section, let the
responds to
eigenvalue decompositions of
and
be, respecand
tively,
where
and
.
A. High-SNR Regime
We start from the high-SNR regime and assume that
,
. This condition can be
, and supposing that
obtained by choosing
are all full rank: the latter assumption
the channel matrices
is physically common both in the case when the receiver is in
line of sight, provided that it is not exceedingly far from the
transmitter, and when the receiver is not in line of sight, in
presence of a rich-scattering propagation environment. With
such hypotheses, we can prove that optimal sensor selection be,
,
comes independent of the channel matrices
and boils down to the same problem of, e.g., [5]. As a direct
consequence, in the high-SNR regime it is inessential to precode the transmitted signals.
,
Proposition 3(High-SNR regime): If
, then, for
, the mutual information
does not depend anymore on the channel realizations. More precisely, the optimal sensor selection problem dened in (22) can
be rewritten as
(29)
Proof: Under the hypotheses of the theorem, from (15) we
see that, for
, the optimal precoder for the -th sensor
satises
(30)
. By
(31)
(32)
(33)
which does not depend on the channel matrices .
Next, we derive the expression of the optimal sensor
selection problem in the high-SNR regime. We rst oband any nonzero constant ,
serve that for any matrix
. Thus the term
in (20) can be rewritten as
where
is given in (10) and
. By substituting this result into (20) we obtain,
for
(34)
Substituting the above expression into (22), we obtain (29).
The fact that, in the high-SNR regime, the optimal sensor selection does not depend on channel realizations has a great practical importance. Indeed, the problem (29) can be solved once
at the beginning and can be used whenever the channels are in
good conditions, without the need for changing the selection according to channel time variations. However, notice that the receiver must know the channel realizations for the selected nodes
in order to properly exploit the received signals.
full rank will
Notice also that any precoder that makes
give the same mutual information as in (33) for
. Even
, the rewithout precoding, i.e., with a choice like
sulting mutual information would be the same as in the case of
optimal precoding. Seeing the overall channel from to as a
cascade of the measurement channel from to and the comsaturates
munication channel from to , at high SNR
, and the sensor selection problem becomes equivato
lent to the one treated in [5].
B. Low-SNR Regime
In the next proposition, we consider instead the low-SNR
regime and we show that, in such case, the cost function to be
maximized is a linear combination of the contributions of each
sensor.
, the optimal
Proposition (Low-SNR regime): If
sensor selection problem in (22) reduces to
(35)
4418
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 63, NO. 16, AUGUST 15, 2015
(36)
(37)
where we have used the fact that for
,
, and
. The trace
in (37) can be written as
(38)
where we have used the expression of the optimal precoder in
(14) and the value of
in (36). Substituting (38) into (37) and
operator the constant terms innoticing that under the
dependent of can be removed, we nd that the optimal sensor
selection can be written as in (35).
The locally optimal precoder can be computed by observing
that the problem in (35) can be easily solved by a greedy algorithm that takes the largest values of
NORDIO et al.: SENSOR SELECTION AND PRECODING STRATEGIES FOR WIRELESS SENSOR NETWORKS
4419
Fig. 4. Outage probability plotted vs. SNR for 5.5 bit threshold mutual information (a), and plotted vs. the threshold mutual information for
and
.
In both gures
(b).
is
given by
where
is a constant allowing
is the -th eigenvalue of
. In
for power normalization and
Fig. 4(a), we show the outage probability versus SNR for a mutual information threshold equal to 5.5 bit, while in Fig. 4(b) we
depict the outage probability versus the threshold on the mutual
. In both gures, we observed
information for
10000 channel realizations per point and we selected
sensors out of a total of
.
provides a
As it can be seen from Fig. 4(a), the choice
in terms of both
signicant gain with respect to the case
SNR and decay slope. In general, the decay slope increases with
due to the diversity effect provided by the channel matrices,
which are assumed to be independent. For all considered values
of , the locally optimal precoder can save up to 3 dB of SNR
w.r.t. the case where no precoder is employed, at an outage prob. Instead, the suboptimal precoder loses 12 dB
ability of
there is a further gain
against the no-precoder case. For
, at the price
of about 45 dB with respect to the case
of a small increase in the complexity of the greedy algorithm.
These results clearly show the impact of the precoder on the performance of sensor selection and that a careful precoder design
cannot neglect the information on the sensor matrices.
In Fig. 4(b), the outage probability provided by the greedy seand
is depicted
lection algorithm for
as a function of the threshold mutual information. In this case,
an increase of from 2 to 3 yields a gain of about 1.5 bits at
, while a further increase of to 4
an outage probability of
provides an additional gain of more than 1 bit, for all considered
precoder optimization entails
precoders. Notice that for
a gain of almost 1 bit.
VIII. CONCLUSIONS
In this paper, we studied the sensor selection problem in the
case where the observations are transmitted through noisy wireless MIMO channels. At each transmitter we employed and op-
4420
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 63, NO. 16, AUGUST 15, 2015
APPENDIX B
PRECODER OPTIMIZATION IN THE LOW-SNR REGIME
To prove that for
the result in (36) holds we rst regiven in (15). Also, we remind that
call the solution for
and
appearing in (15) are the
and
, respectively, sorted in deeigenvalues of
creasing order. We start by noticing that for any value of ,
if
(42)
After some elementary manipulations, such condition can be
written as
where
,
.
and
Notice that, since
, then also
. From
what we have said, it follows that there are exactly nonzero
if and only if
precoder gains
. Now suppose that
and that
, i.e., only the
rst precoder gain is nonzero. As a consequence, we have:
(43)
where the second equality comes from the power normalization.
gives
Solving for
(44)
is given by
and the
(40)
, it has been shown in [14]
For the case of a square matrix
is diagonal. Even though in our case
is
that the optimal
rectangular, the same result applies. In particular the diagonal
, are given by
elements of
where
and
.
This result can be obtained by following the same steps as in
[14, Section III] and generalizing for the case of a rectangular
. For a diagonal
(40) takes the form
(41)
The value of the variables
solving (41) can be obtained by
applying the Karush-Kuhn-Tucker conditions, as done in [14],
and is given by (15).
NORDIO et al.: SENSOR SELECTION AND PRECODING STRATEGIES FOR WIRELESS SENSOR NETWORKS
4421