1. Introduction
One of the fundamental problems in control theory is to regulate the output of the plant, which is known in the literature as model predictive regulation (MPR) [
1]. Consider a controlled nonlinear system on a manifold
M as
with an exosystem
and the system output
where
and
The optimal MPR problem is to determine a feedback control law
that regulates the system output asymptotically to zero, i.e.,
An optimal regulation control law is synthesized from an infinite horizon optimal control problem, i.e.,
subject to the controlled dynamics and a fixed initial condition
where the control Lagrangian,
, is sufficiently well-behaved and is zero if and only if
.
The output regulation problem for linear multivariate systems is formulated and its solvability conditions are derived, for the first time, by Francis in [
2]. The regulation problem is further generalized for nonlinear systems [
3] in which a feedforward control law is synthesized by solving the
Francis–Byrnes–Isidori (FBI) partial differential equations (PDEs). The optimal output regulation problem is first studied by Krener [
4], which proposes an optimal feedback control law synthesis in two steps: First, a feedforward control law is designed by solving the FBI PDEs using series solutions [
5]. Second, using the feedforward control, the regulations problem is then translated to an optimal stabilization problem for designing an optimal feedback control law using the Al’brekht method [
5]. In addition, Krener has developed a Nonlinear Systems Toolbox [
6] for obtaining series solutions of the FBI PDEs and the
Hamilton–Jacobi–Bellman (HJB) PDEs associated with the optimal stabilization problem [
5].
Note that the FBI and the HJB PDEs are solved up to a polynomial approximation of a certain degree d, and therefore, the output regulation may not be achieved asymptotically if the output signal is not representable by a degree d polynomial. Due to such limitations of series solutions, output regulations may not be guaranteed for sophisticated output signals such as transcendental functions of the states. In this article, we address a class of output regulation problems in which the system states need to track signals generated by exosystems. For such output regulation problems, we derive novel FBI PDEs that are computationally efficient and achieve the output regulation asymptotically.
Another challenge is to extend the optimal output regulation problem to controlled systems on manifolds. A technique to deal with the output regulation problems on manifolds is to define equivalent output regulation problems on ambient Euclidean space and then employ well studied tools of optimal regulation on Euclidean space. A coherent way is to extend the system dynamics from the manifold to an ambient Euclidean space such that the manifold is a stable attractor of the extended system dynamics [
7,
8,
9], and define the optimal regulation problem on that Euclidean space. An alternate approach is to define the regulation problem on local charts. This approach severely limits the applicability of the regulation problem due to local representation of the system dynamics. Moreover, the output regulation, under this approach, will only be limited to that particular local chart instead of the whole manifold.
In this article, we address two problems: First, we derive a reduced set of FBI PDEs for a class of output regulation problems on Euclidean space. We establish that, under the proposed approach, output regulation for a class of systems is achieved asymptotically by feedback control laws obtained via series solutions [
5]. Second, we generalize the optimal regulation method on the Euclidean space to manifold via stable embedding of the controlled system dynamics on manifold into Euclidean spaces, and this optimal regulation problem was not addressed in [
8]. Our proposed technique is then employed to the quadcopter tracking control and the rigid body attitude tracking system. It is observed from the numerical studies that the proposed approach provides better tracking performance as compared to the classical optimal regulation. Moreover, the controller built for the quadcopter in this paper is globally defined without any singularity, whereas the model predictive regulator designed in [
10] is inherently locally defined since its use of Euler angles. This also demonstrates the superiority of our method. All the systems addressed in this article are continuous-time systems, in parallel to which there have been developments on model predictive regulation for discrete-time systems [
1,
11,
12] and stochastic systems [
13]. However, we do not treat either discrete-time or stochastic systems in this particle, leaving them as future research work.
This article unfolds as follows:
Section 2 is devoted to a class of output regulation problems on Euclidean spaces and the derivation of the novel FBI PDEs. A generalized technique for the optimal output regulation of systems on manifolds is presented in
Section 3. The proposed output regulation technique is well supported by the numerical studies conducted on representative examples of quadcopter tracking control and the rigid body attitude tracking system in
Section 4. The concluding remarks and future directions are discussed in
Section 5.
2. Optimal Output Regulation of Nonlinear Systems
Consider a nonlinear dynamic system
with the following data:
- (a)
plant state , plant input , exosystem state and system output ,
- (b)
is a map depicting the plant dynamics (
1) on
,
- (c)
is a map depicting the exosystem dynamics (
2) on
,
- (d)
accounts for the system output (
3).
An output regulation problem is to find a control law
that steers, for any set of initial conditions
, the system output of the nonlinear dynamics (
1)–(
3) asymptotically to zero, i.e.,
Output regulation problems are very commonly formulated for disturbance rejection and reference tracking by the system. The exosystem dynamics is, in general, designed to generate a reference signal or a modeled disturbance signal. Let us consider a case of reference tracking in which the plant output is the output of the plant dynamics (
1) that does not include the exosystem
for
, needs to track the reference signal,
for
, generated by the exosystem. Then, the system output
asymptotically converging to zero ensures that the plant output is asymptotically tracking the reference signal. In an identical manner, let us consider a case of disturbance rejection in which the plant output is stabilized asymptotically to zero while the plant dynamics (
1) are subjected to the disturbance generated by the exosystem. Therefore, the system output
is regulated to zero under the influence of the disturbance introduced in the plant dynamics.
Before we discuss necessary and sufficient conditions for the solvability of the output regulation problem, let us elucidate standard assumptions considered in the literature:
Assumption 1. The following assumptions for the nonlinear dynamics (1)–(3) hold: - (a)
The vector fields f and a and the map h are smooth.
- (b)
For the control input , the system dynamic (1)–(3) has an equilibrium point such that the system output is zero, i.e., . - (c)
The equilibrium exosystem state of the exosystem (2) is stable and there exists a neighborhood containing zero, such that every initial condition is Poisson stable. - (d)
The linear approximation of the plant dynamics (1) is stabilizable at the equilibrium point , i.e., the pair is stabilizable.
The output regulation problem with such a generality is difficult to solve in general. Therefore, the state feedback control law (
4) is designed in an open neighborhood
of the origin at 0, such that for any initial condition
, the system output (
3) of the dynamics (
1)–(
3) converges at zero. The solvability condition for the output regulation problem is established by the following theorem:
Theorem 1 ([
3]).
Under Assumption 1, there exist a neighborhood of 0 and a state feedback, for , that asymptotically stabilizes the output of the system dynamics (1)–(3) to zero if and only if there exist mappings with , and with , both defined in a neighborhood of 0, such thatfor all .
The PDE (
5) with the algebraic constraints (
6) is known in the literature as the FBI equation [
3,
4]. As a consequence of Theorem 1, for any initial condition
with
, the system dynamics (
1)–(
3) under the feedforward control
leads to
Thus, the feedback control law for the output regulation can be designed as
where the feedback term
with
is derived to make the so-called output regulation manifold
a stable attractor.
The problem of synthesizing optimal feedback control laws for output regulation is first proposed by Krener [
4,
14], and that is generalized to model predictive regulations [
15]. The feedback control law (
7) using Krener’s method is designed in two steps:
- (i)
The feedforward control
and the output regulation manifold
are designed by solving the FBI Equations (
5) and (
6).
- (ii)
For the feedback
, the nonlinear dynamics (
1)–(
3) is defined in the new coordinates
as
Under these new coordinate changes, the output regulation problem (
4) is posed as an optimal stabilization problem for asymptotic stabilization of the dynamics (
8) to zero as
where
is fixed and the smooth control Lagrangian
satisfies
if and only if
. Then, the feedback term
in (
7) is the feedback control law
v obtained by solving the optimal control problem (
9), i.e.,
Remark 1. Note that the PDE (5) along with the algebraic constraints (6) is often solved approximately via finite series solutions [4,5]. Assume that the solution of the PDE is approximated by polynomials of degree r of the formwhere denotes a polynomial homogeneous of degree i in α. Then, under the change of coordinatesthe optimal stabilization problem (9) leads to the following feedback control lawthat in turn ensures that the state-action pair converges asymptotically to . It is worth noting that may not be zero due to the approximation of the feedforward control and the output regulation manifold . Therefore, the series solution (12) does not guarantee asymptotic convergence of the system output (3) to zero. However, the output approximation erroris of order [4], Theorem 4.2. Equipped with a sufficient understanding of output regulation, let us design a feedback law for a class of nonlinear systems that leads to an asymptotic convergence of the system output to zero.
2.1. Problem Statement
Consider a nonlinear system
where
is the plant state with vector field
governing the plant dynamics,
is the exosystem state with vector field
a governing the exosystem dynamics and
is the system output.
Assumption 2. Assumption 1 holds for the system dynamics (13)–(16) with the choice of and that brings the dynamics (13)–(16) to the standard form (1)–(3). Note that the system dynamics (
13)–(
16) is in standard form, and therefore, Theorem 1, leads to the following necessary and sufficient condition for the solvability of the output regulation problem for the system dynamics (
13)–(
16):
Theorem 2. Under Assumption 2, there exist a neighborhood of 0 and a state feedbackthat asymptotically stabilizes the output of the system dynamics (13)–(16) to zero if and only if there exist mappings with , and with , both defined in a neighborhood of 0, satisfying the conditions Proof. We know that the dynamics (
13)–(
16) with the choice of
and
is in standard form (
1)–(
3). Hence, applying Theorem 1 to the dynamics (
13)–(
16) gives: There exists a neighborhood
of 0 and a
state feedback
that asymptotically stabilizes the output of the system dynamics (
13)–(
16) to zero if and only if there exist
mappings
with
, and
with
, both defined in a neighborhood
of 0, satisfying the conditions
The algebraic constraint (
21) is satisfied if and only if
. Therefore, substituting
in (
19) leads to (
18) and (
20) leads to (
17). This proves the assertion. □
Remark 2. Note that the PDE (17) and (18) with algebraic constraint is in the same form as (5) and (6); however, the dimension of the PDE (17) and (18) is reduced. Therefore, the reduced order PDE (17) and (18) is computationally efficient. We now turn to designing an optimal feedback control law using Krener’s method that locally regulates the system output (
16) of the dynamics (
13)–(
16) asymptotically to zero.
First, a feedforward control law is designed by solving the FBI Equations (
17) and (
18) using HJB series solutions [
4,
5]. Let the series solution of the FBI Equations (
17) and (
18) be given by
where
is a homogeneous polynomial in
w up to degree
r.
Second, the error dynamics is defined, under the change of coordinates
as
and the output regulation problem is translated to a stabilization problem as
where
is fixed and the smooth control Lagrangian
satisfies
if and only if
. The infinite horizon optimal control problem (
24) is solved using Al’brekht’s method and the feedback control law
is designed that locally stabilizes
asymptotically to zero [
4], Theorem 4.2. Therefore, the optimal feedback control
locally regulates the output of the dynamics (
13)–(
16) to zero asymptotically. The system output
y converges asymptotically to zero due to the fact that the PDE series solutions (
22) do not affect the output regulation manifold
2.2. Computational Complexity
The feedback regulation problem for the system (
13)–(
16) is solved in two ways. A feedback control law is obtained by solving one of the FBI (
5) and (
6) and the FBI (
17) and (
18). As the dimension of the PDE in FBI (
17) and (
18) is reduced by
p as compared to the FBI (
5) and (
6), it leads to a significant reduction in computation time. On the other hand, the regulation manifold of the system (
13)–(
16) is explicitly known and therefore, the feedback regulation law obtained by FBI (
5) and (
6) is more accurate as compared to FBI (
17) and (
18). A series solution of degree
r of the FBI (
5) requires the solving of a linear system of order
recursively for each degree
. Therefore, the computation time for solving the FBI (
17) and (
18) using series solutions up to degree
r is of order
and for the FBI (
5) and (
6) is of the order
It can be concluded from the computation time analysis that there will be a significant reduction in computation time when the degree of the approximate series solution is large.
Let us now generalize the output regulation problem to manifolds. We know that many robotics and aerospace systems evolve on manifolds. The optimal stabilization theory developed by Krener [
4] cannot be directly applied to the system evolving on manifolds. An intuitive way is to extended the system to the ambient Euclidean space and design the controller in that ambient space; however, such extensions may not preserve the stabilizability of the linearized system, which is one key assumption for the FBI Equations (
5) and (
6). This hurdle is circumvented by stably embedding the system dynamics into the ambient Euclidean space [
8].
3. Output Regulation on Manifolds
Consider a class of nonlinear systems on a manifold
where plant state
, plant input
, exosystem state
and system output
such that the manifold
is embedded in
with
.
The output regulation problem on the manifold is solved by stably embedding the system dynamics (
27)–(
29) to an appropriate Euclidean space such that the linearized system in the ambient Euclidean space is stabilizable. We would like to stress on the fact that the stabilizability of the linearized dynamics is one of the key assumptions for existence of an output regulating feedback control law; see Assumption 1.
A stabilizable extension of the dynamics (
27)–(
29) on the ambient Euclidean space
is conducted in two steps [
8]:
The plant dynamics (
27) is extended to
and the system output (
29) is extended on
as
such that
and
for all
. As the extended plant dynamics (
30) is identical to (
27) on
M, the manifold
M is an invariant subset of
, i.e., for initial conditions
, system trajectories of the dynamics (
30)–(
32) satisfy
Add a drift term to the extended plant dynamics (
30) such that it is stabilizable in the transversal direction to
M in
Suppose there exists a function
on open neighborhood
U of
M in
such that
and
Therefore, the extended plant dynamics (
30) is stably extended and that leads to the following linearly stabilizable extension of (
27)–(
29) on
:
where
. Here, instead of the number
, one can more generally use an
positive definite symmetric matrix-valued function. A detailed discussion on the transversal stability of
M in the stably extended dynamics (
33)–(
35) may be found in [
8].
The system dynamics (
33)–(
35) is defined in Euclidean space and therefore, Krener’s method for designing feedback control for the output regulation problem is directly applicable without any modification.
For the sake of clarity, let us consider an example of a single axis rotation of a rigid body. The state space of the dynamics is
where
(the set of
orthonormal matrices with determinant 1,) accounts for the attitude of the rigid body and the angular velocity of the body about the rotation axis lies in
. The manifold
is a Lie group and the set
, (the set of
real skew-symmetric matrices,) is its Lie algebra. The attitude dynamics for single axis rotation of the rigid body is given by
where
with
R determines the attitude of the rigid body and
determines the angular velocity of the rigid body,
is the moment of inertia,
is the torque applied along the axis of rotation, and the hat map
is the vector space isomorphism defined as follows: for
Note that the manifold
is embedded in
and therefore, the system dynamics (
36) and (
37) is naturally extended to the ambient space
. However, such natural extensions may not guarantee the stabilization of its linearized dynamics around an equilibrium point of interest. Let us define a stable extension of the dynamics (
36) and (
37) in a neighborhood
of
To this end, let us define a Lyapunov-like function,
by
for
with the usual Euclidean norm
on
, which satisfies
It leads to a stable extension of the dynamics (
36) and (
37) on
as
where
Let us consider an output regulation problem on the manifold
, where the exosystem
generates attitude signals,
with
, for the dynamics (
36) and (
37) to track. The system dynamics with an exosystem for the output regulation is defined as
The dynamics (
40)–(
43) are defined in Euclidean space and therefore, the Krener’s method [
4] for optimal regulation is readily applied to find a feedforward and feedback control law. Using Theorem 2, the feedforward control law,
with
, which makes the manifold
invariant, needs to satisfy the following FBI equations
Remark 3. Note that the FBI (44) and (45) is a PDE in with algebraic constraints in in contrast to the FBI obtained using Theorem 1 that is a PDE in with algebraic constraints in . This simple example demonstrates that the PDE dimension is reduced to a large extent and it contributes to fast computation. Remark 4. Embedding to increases the dimension of the state space by 3; however, one can identify with the unit circle and embed the unit circle in (the ambient space) which only increases the dimension of the state space by 1.
Remark 5. Note that the output regulation technique by Krener [4] does not incorporate state and control constraints. For output regulation of the safety critical systems where state and control constraints are crucial to consider at the controller design stage, a model predictive control approach is proposed by Krener [15]. The model predictive control approach is directly extended to manifolds by stably extending the system dynamics to an ambient Euclidean space.