CN111683023B

CN111683023B - Model-driven large-scale equipment detection method based on deep learning

Info

Publication number: CN111683023B
Application number: CN202010305189.0A
Authority: CN
Inventors: 邵晓丹; 陈晓明
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2020-04-17
Filing date: 2020-04-17
Publication date: 2021-08-24
Anticipated expiration: 2040-04-17
Also published as: CN111683023A

Abstract

The invention discloses a model-driven large-scale equipment detection method based on deep learning. In the cellular internet of things, a base station equipped with a large-scale antenna array simultaneously serves a large number of single-antenna devices. Only a portion of the devices are active at any given time slot, while the other devices are in a dormant state. The invention adopts an authorization-free random access protocol, namely, the activation equipment simultaneously sends a known pilot frequency sequence to the base station. The base station constructs a deep learning network based on a vector approximation message transfer model, trains parameters of the deep learning network in a back propagation mode, detects the equipment state with less training data and estimates the channel information of the equipment. And finally, the base station performs data interaction with the activation equipment by using the estimated channel information. The invention has strong robustness on the distribution of the equipment state matrix and the ill-conditioned pilot frequency matrix, and provides an efficient equipment detection and channel estimation method for the cellular Internet of things with large-scale equipment access.

Description

Model-driven large-scale equipment detection method based on deep learning

Technical Field

The invention relates to the field of wireless communication, in particular to large-scale active device detection and channel estimation in a cellular internet of things.

Background

Enhanced mobile broadband, large-scale machine communication and ultra-reliable low-delay communication are three main scenes of a 6G wireless network. Particularly, with the rise and development of the internet of things and artificial intelligence, a future wireless network needs to support simultaneous access of massive terminal devices, so that large-scale machine communication becomes a hotspot for research in academia and industry. An important feature of large-scale machine communication is that the activation mode of the terminal devices in the network is usually intermittent. In each time slot, only a portion of the terminals are active and need to communicate with the base station. Other devices are temporarily in a dormant state to conserve energy, and they are only activated when triggered by an external event. The activated terminal equipment simultaneously transmits pilot frequency sequences to the base station at the beginning stage of each time slot, and the base station learns which terminals are in an activated state and obtains corresponding channel state information thereof through an activation detection and channel estimation algorithm. And then, in the remaining duration of each time slot, performing uplink and downlink data interaction between the base station and the activated terminal equipment.

Large-scale machine communication requires a communication system to have the capability of processing a large amount of wireless data, and to accurately recognize and dynamically detect the device status and realize high-speed information interaction. Therefore, terminal activation detection in future wireless communication systems is necessarily highly intelligent. As a commonly used artificial intelligence method, machine learning, especially deep learning, has enjoyed great success in the field of computer vision and natural language processing. Recently, machine learning has been applied to wireless communications such as physical layer communications, resource allocation, and intelligent traffic control. However, most existing deep learning networks are data-driven, using standard neural networks like a black box structure, for extensive data training. Which is often scarce in wireless communication networks. And due to the huge potential equipment in the large-scale machine communication system and the use of large-scale antenna arrays, the dimension of each training data is very high, so that the training process consumes a lot of time.

Most of the current large-scale terminal detection algorithms are only suitable for the situation that the pilot matrixes are gaussian distributions which are independent and distributed, and the performance of the algorithms is seriously reduced due to ill-conditioned pilot matrixes. In addition, these detection algorithms generally assume that the device state matrix conforms to a bernoulli-gaussian distribution, however this assumption is limiting because the channel in a real environment typically exhibits more complex statistical properties. In general, it is more reasonable to assume that the device state matrix obeys a bernoulli-mixed gaussian distribution. How to use the characteristics of the channels, the pilot matrix, the intermittent activation of the equipment and the like to construct a new detection algorithm. How to construct a deep network based on a detection algorithm so as to train parameters in the deep network in a back propagation mode and how to realize large-scale active device detection with less training data and shorter training time. These problems become particularly critical in future large-scale machine communication.

Disclosure of Invention

The invention aims to solve the problems that when a base station is provided with a large-scale antenna array, in the existing large-scale access system based on machine learning, the parameter training time of a terminal activation detection and channel estimation scheme is long, the calculation complexity is high and the distribution of a required pilot matrix is limited, and provides a model-driven large-scale equipment detection method based on deep learning.

The invention adopts the following specific technical scheme:

a model-driven deep learning-based large-scale equipment detection method comprises the following steps:

1) at the beginning stage of each time slot with the length of T, all activated terminal equipment simultaneously sends pilot sequences with the length of L to a base station;

2) after receiving the pilot frequency sequence, the base station maps the received signal from a high-dimensional space to a low-dimensional space based on a data decomposition method so as to reduce the algorithm complexity;

3) in a low-dimensional space, a base station constructs an activation detector based on deep learning based on a vector approximation message transfer model;

4) after the base station obtains the activated detector in the step 3), training unknown parameters in the model layer by using a back propagation mode;

5) the base station substitutes the trained parameters into the activation detector in the step 3), detects the activated terminal equipment and estimates the channel state information of the activated equipment;

6) and in the time length with the remaining length of T-L of each time slot, the base station performs uplink and downlink data interaction with the activation equipment by using the channel estimation value.

On the basis of the technical scheme, each step can be further realized by adopting the following specific mode.

Preferably, the data decomposition method in step 2) is as follows:

firstly, the base station carries out singular value decomposition on a received signal Y:

wherein S_sdIs a unitary matrix, V_sdIs a matrix of singular values and is,

is a unitary matrix; then obtain

Wherein

Is S_sdFront r of^eThe columns of the image data are,

is composed of V_sdUpper left corner r of^e×r^eA square matrix of elements, where r^eIs the rank of the unknown signal that needs to be detected; followed by taking

Front r of^eObtaining U; data decomposition satisfies

And V has a rank r^e，

And UU^HI, where I is the identity matrix and M is the number of antennas of the base station.

Preferably, the activation detector based on deep learning in step 3) is:

a) setting the training parameters to

And

wherein

The power adjusting parameter of the nth terminal device in the T-layer network, T represents the transposition operation,

representing the training parameters at the t-th iteration, wherein

The probability value of the jth component in the bernoulli-gaussian mixture distribution of the nth terminal device in the t-th layer network,

the Gaussian distribution variance of the jth component in the Bernoulli-Gaussian mixture distribution of the nth terminal equipment in the t-th network is obtained; the subscript N ∈ {1,2, …, N } represents the nth device, N represents the total number of terminal devices, J ∈ {1,2, …, J } represents the jth component in the Bernoulli-Gaussian mixture, J is the total number of components; upper bound of iteration number is tau_1,max，τ_2,maxAnd T_max；

3.b) initializing the number of internal iterations τ to 0 for any

Updating non-linear estimation values of elements in state matrix of terminal equipment of tau iteration in sequence

Intermediate variables

Sum noise accuracy

First of all update

Wherein e_nVariable representing activation probability of nth device

Is calculated by

Representing the r-th noise-accuracy value, an intermediate variable, of the t-th iteration

Representing variables

Meet the mean of 0 and variance of

Normal distribution, intermediate variable of

Representing variables

Meet the mean of 0 and variance of

Normal distribution of (2);

then, update

Wherein

Is an intermediate variable, function g'_1,nrIs calculated by

Then, update

Wherein||.||₂Represents a two-norm of the signal,

is the intermediate variable(s) of the variable,

is formed by

A vector of components; updating iteration time τ ← τ +1, and then repeating the next iteration until τ ═ τ ← τ +1_1,maxStopping the circulation and finally outputting an estimated value

Wherein

Is formed by

A matrix of compositions;

3, c) to any

Non-linear estimation of state matrix for sequentially updating t-th iteration

Accuracy of noise

Accuracy of noise

And intermediate variables

3, d) updating the linear estimates of the state matrix of the t-th iteration in turn according to the following formula

Intermediate variables

Accuracy of noise

Accuracy of noise

And intermediate variables

Wherein A (. beta.) is^t)＝Adiag(β^t) Where A represents the pilot matrix, diag (β)^t) The representation forms a vector beta^tIs a diagonal matrix of diagonal elements, σ²Representing the variance of the noise, I_NRepresenting a diagonal matrix of dimension NxN with diagonal elements of 1, v_rRepresenting the r-th column of the low-dimensional spatial reception matrix V.

Preferably, the method for training the unknown parameters in the model layer by using the back propagation method in the step 4) comprises the following steps:

initializing training parameters

And

are each beta⁰And Ω⁰Setting T_maxSetting a network layer number identifier t as 0 for training the upper bound of the layer number, and starting parameter learning of the t-th iteration by using 3.b), 3.c) and 3.d) in step 3:

first fix it

Study omega^tTo achieve a cost function that minimizes the denoising step

(ii) wherein | |. calorifies_FRepresenting the F norm, wherein S is the true value of a state matrix of equipment in the system;

then learn Ω^tAnd

to achieve a cost function that minimizes the denoising step

The object of (a);

then fix omega^tAnd

learning beta^tTo achieve a cost function that minimizes the linear least squares step

The object of (a); matrix array

Is composed of a vector

A matrix of formations;

finally study

To achieve a minimum linear least squares step cost function

The object of (a);

after the iteration of one layer of network is completed, updating external iteration times T ← T +1, and repeating the parameter learning of the next layer of network again until T ═ T_maxAnd stopping the circulation when the model is in-1, and finishing the training of unknown parameters in the model.

Preferably, the method for detecting device activation and channel estimation in step 5) comprises: equivalently considering the number of training layers and the number of iterations, performing the following iterations:

a) initializing the intermediate variables of the r-th column with the number of external iterations t equal to 0

Sum noise accuracy

Initial value of (2)Are respectively arranged as

And

maximum number of iterations is T_max1The base station substitutes the model parameters trained in the step 4) into the activation detector constructed in the step 3);

5.b) performing said 3.b), 3.c) and 3.d) one time;

5, c) to any

Updating the external iteration time T ← T +1, and then re-performing the next iteration, namely performing the step 5.b), until T ═ T ← T +_max1Time-out loop, and finally output the estimated value of the state matrix

D) using the activation criterion:

to determine which terminal devices are in an active state, where k is a terminal device identifier, v is an adjustable parameter,

is composed of

The (c) th row of (a),

a set of identities representing the detected active devices; reuse relational expression

Recovering the signal estimation value of the original high-dimensional space, thereby obtaining the specific channel estimation value of the active device

Wherein

Represents an estimate of the unknown state vector in the high-dimensional space,

representing and getting

Neutralization of

Corresponding partial row xi_kIs the transmitted energy of the pilot.

The invention has the beneficial effects that: the deep learning-based large-scale terminal activation detection and channel estimation method provided by the invention can realize more accurate terminal activation detection and channel estimation by using less training data, and solves a series of problems of high computational complexity, low detection accuracy and the like in the traditional large-scale terminal activation detection and channel estimation problems. The method is suitable for a wider pilot matrix, and can effectively reduce the problem of detection performance reduction caused by a sick pilot matrix.

Drawings

FIG. 1 is a diagram of a deep learning ensemble framework;

FIG. 2 is a block diagram of a per-layer learning network;

FIG. 3 is a graph of normalized mean square error of channel estimation versus the number of deep learning layers when the pilot matrix obeys Gaussian distribution with mean 1 and variance 1, comparing the deep learning-based large-scale terminal channel estimation method of the present invention with other commonly used terminal channel estimation methods;

fig. 4 is a relationship between a detection error rate and a signal-to-noise ratio when comparing the large-scale terminal detection method based on deep learning of the present invention with other common terminal detection methods.

Detailed Description

In this embodiment, a base station of a large-scale access system is provided with M antennas, each terminal is configured with 1 antenna, only a small number of terminals are randomly activated to communicate with the base station in each time slot, and other terminals are temporarily in a sleep state. And the activated terminal can directly access the network without being authorized by the base station. That is, the active terminal transmits pilot sequence to the base station at the same time in the beginning of each time slot, and the base station obtains which terminals are in the active stage and obtains corresponding channel state information through large-scale terminal detection and channel estimation algorithm. And in the rest part of each time slot, activating the terminal to perform data interaction with the base station.

Based on the base station, the embodiment provides a model-driven deep learning-based large-scale device detection method, which includes the following steps:

1) at the beginning of each time slot of length T, all active terminal devices simultaneously transmit pilot sequences of length L to the base station.

2) After receiving the pilot sequence, the base station maps the received signal from a high-dimensional space to a low-dimensional space based on a data decomposition method to reduce the algorithm complexity.

In this step, the data decomposition method is:

wherein S_sdIs a unitary matrix, V_sdIs a matrix of singular values and is,

is a unitary matrix; then obtain

Wherein

Is S_sdFront r of^eThe columns of the image data are,

Front r of^eObtaining U; data decomposition satisfies

And V has a rank r^e，

3) In a low-dimensional space, the base station constructs an activation detector based on deep learning based on a vector approximation message transfer model.

In this step, the vector approximation message transmission detection method is developed according to the outer iteration identification, i.e. each iteration t is a layer of network, and each layer of network input is

And

the output is

And

in total of T_maxThe layer networks are connected in series to form a deep learning network. The deep learning-based activation detector specifically comprises:

a) setting the training parameters to

And

wherein

representing the training parameters at the t-th iteration, wherein

the Gaussian distribution variance of the jth component in the Bernoulli-Gaussian mixture distribution of the nth terminal equipment in the t-th network is obtained; the subscript N ∈ {1,2, …, N } represents the nth device, N represents the total number of terminal devices, J ∈ {1,2, …, J } represents the jth component in the Bernoulli-Gaussian mixture, J is the total number of components; upper bound of iteration number is tau_1,max，τ_2,maxAnd T_max. t and τ are the outer loop variable and the inner loop variable, respectively.

3.b) initializing the number of internal iterations τ to 0 for any

Intermediate variables

Sum noise accuracy

First of all update

Wherein e_nVariable representing activation probability of nth device

Is calculated by

For the variance of the jth component of the device n for t iterations,

Representing variables

Meet the mean of 0 and variance of

Normal distribution, intermediate variable of

Representing variables

Meet the mean of 0 and variance of

Normal distribution of (2);

then, update

Wherein

Is an intermediate variable, function g'_1,nrIs calculated by

Then, update

Wherein | |. calo | |)₂Represents a two-norm of the signal,

is the intermediate variable(s) of the variable,

is formed by

Wherein

Is formed by

A matrix of compositions;

3, c) to any

Non-linear estimation of state matrix for sequentially updating t-th iteration

Accuracy of noise

Accuracy of noise

And intermediateMeasurement of

Intermediate variables

Accuracy of noise

Accuracy of noise

And intermediate variables

4) After the base station obtains the activated detector in the step 3), the unknown parameters in the model are trained layer by using a back propagation mode.

In this step, the method for training the unknown parameters in the model layer by using the back propagation mode comprises the following steps:

initializing training parameters

And

first fix it

Study omega^tTo achieve a cost function of minimizing the denoising stepNumber of

then learn Ω^tAnd

to achieve a cost function that minimizes the denoising step

The object of (a);

then fix omega^tAnd

The object of (a); matrix array

Is composed of a vector

A matrix of formations;

finally study

To achieve a minimum linear least squares step cost function

The object of (a);

5) And the base station substitutes the trained parameters into the activation detector in the step 3), detects the activated terminal equipment and estimates the channel state information of the activated equipment.

In this step, the deep learning model of the invention is divided into an off-line training stage and an on-line detection stage, wherein the off-line training stage is to obtain the value of the trained parameter, and t is the t-th network; in the on-line detection stage, the trained parameters are substituted into the algorithm structure, and t is equivalently regarded as the t-th iteration. Therefore, the device activation detection and channel estimation method is specifically as follows:

equivalently considering the number of training layers and the number of iterations, performing the following iterations:

Sum noise accuracy

Are respectively set as

And

5.b) performing said 3.b), 3.c) and 3.d) one time;

5, c) to any

D) using the activation criterion:

is composed of

The (c) th row of (a),

Wherein

representing and getting

Neutralization of

Corresponding partial row xi_kIs the transmitted energy of the pilot.

The deep learning overall framework and the schematic diagram of each layer of learning network are respectively shown in fig. 1 and fig. 2. As can be seen by computer simulation: as shown in fig. 3, the pilot matrix is set to obey gaussian distribution with mean 1 and variance 1, and compared with the conventional channel estimation scheme, i.e., the approximate message passing algorithm, the large-scale terminal channel estimation scheme of the present invention has significantly improved estimation accuracy of the vector approximate message passing algorithm. Fig. 4 shows that the detection accuracy of the large-scale detection method provided by the invention is obviously improved compared with the detection accuracy of the traditional orthogonal matching pursuit algorithm, and the influence of the estimation value of the state matrix rank on the detection accuracy is within an acceptable range. The advantages are that on the basis of inheriting the original advantages of the traditional vector approximation message transfer algorithm, the scheme combines more accurate Bernoulli-Gaussian mixture distribution of the state matrix, combines deep learning to optimize the network structure of the vector approximation message transfer algorithm, and effectively trains system parameters. Therefore, the terminal activation detection and channel estimation scheme provided by the invention can provide an efficient terminal activation detection and channel estimation method for a large-scale communication system.

Claims

1. A model-driven large-scale equipment detection method based on deep learning is characterized by comprising the following steps:

6) in the time length of the rest length of each time slot being T-L, the base station utilizes the channel estimation value to carry out uplink and downlink data interaction with the activation equipment;

the activation detector based on deep learning in the step 3) is as follows:

a) setting the training parameters to

And

wherein

representing the training parameters at the t-th iteration, wherein

3.b) initializing the number of internal iterations τ to 0 for any

Intermediate variables

Sum noise accuracy

First of all update

Wherein e_nVariable representing activation probability of nth device

Is calculated by

Representing variables

Meet the mean of 0 and variance of

Normal distribution, intermediate variable of

Representing variables

Meet the mean of 0 and variance of

Normal distribution of (2);

then, update

Wherein

Is an intermediate variable, function g'_1,nrIs calculated by

Then, update

Wherein | |. calo | |)₂Represents a two-norm of the signal,

is the intermediate variable(s) of the variable,

is formed by

Wherein

Is formed by

A matrix of compositions;

3, c) to any

Non-linear estimation of state matrix for sequentially updating t-th iteration

Accuracy of noise

Accuracy of noise

And intermediate variables

Intermediate variables

Accuracy of noise

Accuracy of noise

And intermediate variables

2. The model-driven deep learning-based large-scale equipment detection method according to claim 1, wherein the data decomposition method in step 2) is as follows:

wherein S_sdIs a unitary matrix, V_sdIs a matrix of singular values and is,

is a unitary matrix; then obtain

Wherein

Is S_sdFront r of^eThe columns of the image data are,

Front r of^eObtaining U; data decomposition satisfies

And V has a rank r^e，

3. The model-driven deep learning-based large-scale equipment detection method according to claim 1, wherein the method for training the unknown parameters in the model layer by layer in the back propagation manner in step 4) comprises:

initializing training parameters

And

first fix it

Study omega^tTo achieve a cost function that minimizes the denoising step

then learn Ω^tAnd

to achieve a cost function that minimizes the denoising step

The object of (a);

then fix omega^tAnd

The object of (a); matrix array

Is composed of a vector

A matrix of formations;

finally study

To achieve a minimum linear least squares step cost function

The object of (a);

4. The model-driven deep learning based large-scale device detection method as claimed in claim 1, wherein the device activation detection and channel estimation method in step 5) is: equivalently considering the number of training layers and the number of iterations, performing the following iterations:

Sum noise accuracy

Are respectively set as

And

maximum number of iterations is T_max1The base station substitutes the model parameters trained in the step 4) into the activation detector constructed in the step 3)Performing the following steps;

5.b) performing said 3.b), 3.c) and 3.d) one time;

5, c) to any