Nothing Special   »   [go: up one dir, main page]

CN110262494B - Collaborative learning and formation control method for isomorphic multi-unmanned ship system - Google Patents

Collaborative learning and formation control method for isomorphic multi-unmanned ship system Download PDF

Info

Publication number
CN110262494B
CN110262494B CN201910560204.3A CN201910560204A CN110262494B CN 110262494 B CN110262494 B CN 110262494B CN 201910560204 A CN201910560204 A CN 201910560204A CN 110262494 B CN110262494 B CN 110262494B
Authority
CN
China
Prior art keywords
component
unmanned
unmanned ship
yaw
unmanned boat
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910560204.3A
Other languages
Chinese (zh)
Other versions
CN110262494A (en
Inventor
戴诗陆
马雨飞
王敏
董超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
South China Sea Survey Technology Center State Oceanic Administration (south China Sea Marine Buoy Center)
South China University of Technology SCUT
Original Assignee
South China Sea Survey Technology Center State Oceanic Administration (south China Sea Marine Buoy Center)
South China University of Technology SCUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by South China Sea Survey Technology Center State Oceanic Administration (south China Sea Marine Buoy Center), South China University of Technology SCUT filed Critical South China Sea Survey Technology Center State Oceanic Administration (south China Sea Marine Buoy Center)
Priority to CN201910560204.3A priority Critical patent/CN110262494B/en
Publication of CN110262494A publication Critical patent/CN110262494A/en
Application granted granted Critical
Publication of CN110262494B publication Critical patent/CN110262494B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
    • G05D1/02Control of position or course in two dimensions
    • G05D1/0206Control of position or course in two dimensions specially adapted to water vehicles

Landscapes

  • Engineering & Computer Science (AREA)
  • Aviation & Aerospace Engineering (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Control Of Position, Course, Altitude, Or Attitude Of Moving Bodies (AREA)
  • Feedback Control In General (AREA)

Abstract

本发明公开了一种同构多无人艇系统的协同学习与编队控制方法,该方法针对多个全驱动且具有相同结构的无人艇系统,提出了基于通信连接拓扑图的分布式协同学习控制方法,该方法解决了保持通信的同构无人艇之间的碰撞和保持连接问题,包括以下步骤:建立无人艇的动态模型;设计基于图论的保持通信的无人艇之间的误差;设计满足预设性能的误差转换函数;设计基于动态面控制技术的虚拟控制器;设计径向基函数(RBF)神经网络的权值更新率;设计编队控制器与基于经验的控制器。本发明所提出的满足连接保持且具有协同学习的编队控制方法可以保证,如果两个无人艇在初始时刻保持通信,在其后任意时刻都始终保持安全距离并在通信连接范围内。

Figure 201910560204

The invention discloses a collaborative learning and formation control method for a homogeneous multi-unmanned boat system. The method proposes a distributed collaborative learning based on a communication connection topology map for a plurality of unmanned boat systems that are fully driven and have the same structure. A control method, which solves the problem of collision and connection keeping between isomorphic unmanned boats maintaining communication, including the following steps: establishing a dynamic model of the unmanned boat; designing a graph theory-based communication between unmanned boats maintaining communication error; design the error transfer function that meets the preset performance; design the virtual controller based on the dynamic surface control technology; design the weight update rate of the radial basis function (RBF) neural network; design the formation controller and the experience-based controller. The formation control method that satisfies connection retention and has collaborative learning proposed by the present invention can ensure that if two unmanned boats maintain communication at the initial moment, they will always keep a safe distance and within the range of communication connection at any time thereafter.

Figure 201910560204

Description

一种同构多无人艇系统的协同学习与编队控制方法A Collaborative Learning and Formation Control Method for Isomorphic Multi-UAV Systems

技术领域technical field

本发明涉及无人艇的编队控制领域,具体涉及一种同构多无人艇系统的协同学习与编队控制方法。The invention relates to the field of formation control of unmanned boats, in particular to a collaborative learning and formation control method of a homogeneous multi-unmanned boat system.

背景技术Background technique

随着科学技术的发展及社会的需要,海上任务需要多艘无人艇同时工作,也需要多艘航行补给无人艇提供不同的服务,因此,多艇协作与控制技术得到了迅速的发展。目前,多艇协作与控制技术的应用已经涉及海底铀符、海上加油、海上捕鱼、军队海上演习等多个领域。多艇协作控制技术不仅可以完成复杂环境下的复杂任务,而且可以使人们摆脱一些危险的工作。并且多艇协作与控制技术具有按照人们意愿工作的特点,可以大大减轻人们的劳动强度、提高人们的生活质量。在不久的将来,人们寻找海底资源的愿望将为多艇协作与控制技术创造更为广阔的市场,也必将促进多艇协作与控制技术的更快发展。With the development of science and technology and the needs of society, maritime tasks require multiple unmanned boats to work at the same time, and multiple sailing and supply unmanned boats to provide different services. Therefore, multi-boat cooperation and control technology has been developed rapidly. At present, the application of multi-boat cooperation and control technology has involved many fields such as submarine uranium symbols, marine refueling, marine fishing, and military maritime exercises. Multi-boat cooperative control technology can not only complete complex tasks in complex environments, but also free people from some dangerous jobs. In addition, the multi-boat cooperation and control technology has the characteristics of working according to people's wishes, which can greatly reduce people's labor intensity and improve people's quality of life. In the near future, people's desire to find seabed resources will create a broader market for multi-vessel cooperation and control technology, and will also promote the faster development of multi-vessel cooperation and control technology.

相对于单个无人艇,多艇编队具有以下优点:Compared with a single unmanned boat, the multi-boat formation has the following advantages:

(1)多艇编队执行任务具有更好地鲁棒性、容错能力以及系统生存能力,即使其中一个无人艇失效,也不会影响整体任务;(1) Multi-boat formations perform tasks with better robustness, fault tolerance and system survivability. Even if one of the unmanned boats fails, it will not affect the overall mission;

(2)多艇编队无需单个无人艇装备性能较高的传感器设备,取而代之的是一组成本低廉的无人艇群体,进而降低的运行成本;(2) The multi-boat formation does not require a single unmanned boat to be equipped with high-performance sensor equipment, and replaces it with a group of low-cost unmanned boats, thereby reducing operating costs;

(3)单个无人艇的感知范围是有限的。多艇协同作业可以扩大整个群体的感知区域,因而可以快速高效地完成特定搜索任务。(3) The perception range of a single UAV is limited. The cooperative operation of multiple boats can expand the sensing area of the entire group, so specific search tasks can be completed quickly and efficiently.

多无人艇通过局部信息的交互,相较于单个无人艇更有效,功能更强大,大大扩展了自主无人艇的使用范围,实现了单个无人艇不能完成的任务。基于上述原因,无人艇编队无论在民用领域,还是在军事领域都有广泛应用潜力。此外由于每个无人艇的感测范围都是有限的,为了保证每个无人艇在编队运动的整个过程中都能通过传感器感测到其邻居的状态信息,还应该考虑每个无人艇与其领导者、邻居之间都需要满足最远距离限制与方位角限制。Through the interaction of local information, multiple unmanned boats are more effective and more powerful than a single unmanned boat, which greatly expands the use range of autonomous unmanned boats and realizes tasks that a single unmanned boat cannot complete. Based on the above reasons, unmanned boat formations have wide application potential in both civilian and military fields. In addition, since the sensing range of each unmanned boat is limited, in order to ensure that each unmanned boat can sense the status information of its neighbors through sensors during the entire process of formation movement, each unmanned boat should also be considered. The maximum distance and azimuth angle constraints need to be met between the boat, its leader and its neighbors.

发明内容SUMMARY OF THE INVENTION

发明的目的在于克服现有技术中的缺点与不足,提供一种同构多无人艇系统的协同学习与编队控制方法,本方法针对模型不确定的同构多无人艇设计编队控制器,既能保证在分布式领导者-跟随者编队结构中,每个无人艇都能始终获取到其领导者、邻居的信息,同时保证了编队误差的暂态性能。The purpose of the invention is to overcome the shortcomings and deficiencies in the prior art, and to provide a collaborative learning and formation control method for a homogeneous multi-UAV system. The method designs a formation controller for the homogeneous multi-UAV with uncertain models, It can not only ensure that in the distributed leader-follower formation structure, each unmanned boat can always obtain the information of its leader and neighbors, and at the same time guarantee the transient performance of the formation error.

本发明的目的可以通过如下技术方案实现:The purpose of the present invention can be realized by following technical scheme:

一种同构多无人艇系统的协同学习与编队控制方法,所述方法包括以下步骤:A collaborative learning and formation control method for a homogeneous multi-unmanned boat system, the method comprises the following steps:

步骤(1)、建立多个具有相同结构的无人艇的动态模型;Step (1), establish a plurality of dynamic models of unmanned boats with the same structure;

步骤(2)、根据相邻无人艇之间的安全距离约束和通讯连接范围约束设计跟踪误差约束条件;Step (2), design tracking error constraints according to the safety distance constraints and communication connection range constraints between adjacent unmanned boats;

步骤(3)、为满足预设的性能要求,设计跟踪误差转换函数,将跟踪误差进行转换后得到转换后的转换误差;Step (3), in order to meet the preset performance requirements, design a tracking error conversion function, and convert the tracking error to obtain the converted conversion error;

步骤(4)、应用动态面控制技术设计虚拟控制器:结合动态面控制技术与逐步后推控制器设计技术避免虚拟控制器的求导,从而避免控制器的输入包含邻居的加速度信息;Step (4), applying the dynamic surface control technology to design the virtual controller: combining the dynamic surface control technology and the step-by-step pushback controller design technology to avoid the derivation of the virtual controller, thereby avoiding that the input of the controller contains the acceleration information of the neighbor;

步骤(5)、设计RBF神经网络的权值更新率:应用RBF神经网络估计无人艇系统中的阻尼项;Step (5), design the weight update rate of the RBF neural network: apply the RBF neural network to estimate the damping term in the unmanned boat system;

步骤(6)、设计状态反馈跟踪控制器:应用李雅普诺夫稳定性理论并结合逐步后推设计方法构造稳定的跟踪控制器;Step (6), designing a state feedback tracking controller: applying the Lyapunov stability theory and combining the step-by-step backward design method to construct a stable tracking controller;

步骤(7)、利用存储知识,完成知识利用,设计基于经验的状态反馈跟踪控制器;Step (7), utilize the stored knowledge, complete the knowledge utilization, and design an experience-based state feedback tracking controller;

步骤(2)中:用无向图

Figure GDA0002512532920000021
来描述无人艇编队系统中各个体间的信息交互,
Figure GDA0002512532920000022
是有限非空集合,称为顶点集,集合
Figure GDA0002512532920000023
中的每个顶点对应有相同编号的跟随者;
Figure GDA0002512532920000024
是有限集合,称之为边集,每条边对应有相同编号且能互相通信的相邻无人艇,无向图
Figure GDA0002512532920000025
的邻接矩阵A=(ail)(N)×(N)的元素ail∈{0,1},当无人艇i能够获取无人艇j的信息时,此时j称为尾部,i称为头部,ail=1,否则ail=0;In step (2): use an undirected graph
Figure GDA0002512532920000021
To describe the information interaction between the various entities in the UAV formation system,
Figure GDA0002512532920000022
is a finite non-empty set, called the vertex set, the set
Figure GDA0002512532920000023
Each vertex in corresponds to a follower with the same number;
Figure GDA0002512532920000024
is a finite set, called an edge set, each edge corresponds to adjacent UAVs with the same number and can communicate with each other, undirected graph
Figure GDA0002512532920000025
The element a il ∈{0,1} of the adjacency matrix A=(a il ) (N)×(N) , when the unmanned boat i can obtain the information of the unmanned boat j, then j is called the tail, i is called the head, a il = 1, otherwise a il = 0;

进一步用拓展图

Figure GDA0002512532920000026
来描述包含领航者在内的编队系统中各成员间的信息交互,
Figure GDA0002512532920000027
为虚拟领导者,虚拟领导与部分跟随者保持通信;邻接矩阵A0=diag[a10,…,aN0]T的元素ai0∈{0,1},并且
Figure GDA0002512532920000028
Figure GDA0002512532920000029
Further use of expansion diagrams
Figure GDA0002512532920000026
to describe the information interaction among the members of the formation system including the navigator,
Figure GDA0002512532920000027
For the virtual leader, the virtual leader maintains communication with the partial followers; the adjacency matrix A 0 =diag[a 10 ,...,a N0 ] element a i0 ∈ {0,1} of T , and
Figure GDA0002512532920000028
Figure GDA0002512532920000029

设计一个连续可微的单调递减函数向量G(t),G(t)满足一阶可导、二阶可导,

Figure GDA00025125329200000210
Figure GDA00025125329200000211
为设计常数,G(0)为G(t)的初始值;认为每个无人艇的通信范围有限且最大为
Figure GDA00025125329200000212
如果满足
Figure GDA00025125329200000213
Figure GDA00025125329200000214
其中
Figure GDA00025125329200000215
i≠k,Gx(0)为G(t)在纵向上的初始值,Gy(0)为G(t)在横荡方向的初始值,则无人艇i与无人艇k之间具有通信,其中
Figure GDA0002512532920000031
因此定义每个邻居为:
Figure GDA0002512532920000032
增广邻居集为:
Figure GDA0002512532920000033
Figure GDA0002512532920000034
Figure GDA0002512532920000035
为纵向的设计常数,
Figure GDA0002512532920000036
Figure GDA0002512532920000037
为横荡方向的设计常数,
Figure GDA0002512532920000038
Figure GDA0002512532920000039
为航向角上的设计常数;Design a continuously differentiable monotone decreasing function vector G(t), G(t) satisfies the first-order and second-order derivables,
Figure GDA00025125329200000210
Figure GDA00025125329200000211
is a design constant, G(0) is the initial value of G(t); it is considered that the communication range of each UAV is limited and the maximum is
Figure GDA00025125329200000212
if satisfied
Figure GDA00025125329200000213
Figure GDA00025125329200000214
in
Figure GDA00025125329200000215
i≠k, G x (0) is the initial value of G(t) in the longitudinal direction, G y (0) is the initial value of G(t) in the sway direction, then the relationship between the unmanned boat i and the unmanned boat k communication between
Figure GDA0002512532920000031
So define each neighbor as:
Figure GDA0002512532920000032
The augmented neighbor set is:
Figure GDA0002512532920000033
Figure GDA0002512532920000034
and
Figure GDA0002512532920000035
is the longitudinal design constant,
Figure GDA0002512532920000036
and
Figure GDA0002512532920000037
is the design constant for the sway direction,
Figure GDA0002512532920000038
and
Figure GDA0002512532920000039
is the design constant on the heading angle;

在运用无向通讯拓扑图来表示多个无人艇之间的信息交互,使得一群通讯范围有限的无人艇在以领导-跟随者编队形式跟随给定的领导者轨迹时,同时每个无人艇也需要满足距离与方位角的约束条件以保证每个无人艇能够探测到其邻居的信息,并与其保持连接;定义误差为:The undirected communication topology diagram is used to represent the information interaction between multiple unmanned boats, so that when a group of unmanned boats with limited communication range follows the given leader trajectory in the form of leader-follower formation, each unmanned boat at the same time The human-vessel also needs to meet the constraints of distance and azimuth to ensure that each unmanned boat can detect the information of its neighbors and maintain connection with it; the error is defined as:

Figure GDA00025125329200000310
Figure GDA00025125329200000310

其中ei,1=[eix,1,eiy,1,eiψ,1]T,ei,1为将无人艇与其所有邻居位置差进行转换后求和的向量,eix,1为ei,1在纵向上的分量,eiy,1为ei,1在横荡方向上的分量,eiψ,1为ei,1在航向角上的分量,aik表示第i个无人艇是否与第k个无人艇保持连接,若保持连接aik=1,否则aik=0;ξi,k=[ξix,kiy,kiψ,k]T,ξix,k、ξiy,k、ξiψ,k分别为ξi,k在纵向、横荡方向、航向角方向的分量,

Figure GDA00025125329200000311
Figure GDA00025125329200000312
xk表示第k个无人艇在大地坐标OeXeYe下纵向的位置,yk表示第k个无人艇在大地坐标OeXeYe下横荡方向的位置,ψk为第k个无人艇的航向角,Gx(t)为纵向的衰减函数,Gy(t)为横荡方向的衰减函数,Gψ(t)为航向角上的衰减函数;where e i,1 =[e ix,1 ,e iy,1 ,e iψ,1 ] T , e i,1 is the vector summed after transforming the position difference between the UAV and all its neighbors, e ix,1 is the component of e i,1 in the longitudinal direction, e iy,1 is the component of e i,1 in the sway direction, e iψ,1 is the component of e i,1 in the heading angle, a ik represents the i-th Whether the unmanned boat remains connected to the k-th unmanned boat, if the connection a ik =1, otherwise a ik =0; ξ i,k =[ξ ix,kiy,kiψ,k ] T , ξ ix,k , ξ iy,k , ξ iψ,k are the components of ξ i,k in the longitudinal direction, yaw direction, and heading angle direction, respectively,
Figure GDA00025125329200000311
Figure GDA00025125329200000312
x k represents the longitudinal position of the k-th unmanned boat under the geodetic coordinates O e X e Y e , y k represents the position of the k-th unmanned boat in the sway direction under the geodetic coordinates O e X e Y e , ψ k is the heading angle of the k-th unmanned boat, G x (t) is the attenuation function in the longitudinal direction, G y (t) is the attenuation function in the sway direction, and G ψ (t) is the attenuation function on the heading angle;

“ξi,k为第i个无人艇与第k个无人艇位置差进行第二步转换后的向量”。i,k is the vector converted in the second step of the position difference between the i-th unmanned boat and the k-th unmanned boat".

进一步地,步骤(1)中,第i个无人艇的动态模型为:Further, in step (1), the dynamic model of the i-th unmanned boat is:

Figure GDA00025125329200000313
Figure GDA00025125329200000313

上式中前三项是系统的运动学方程,其中,

Figure GDA00025125329200000314
N表示无人艇的总个数,(xi,yi)表示第i个无人艇在大地坐标OeXeYe下的位置,xi为第i个无人艇的纵向位置,yi为第i个无人艇的横荡方向位置,ψi为第i个无人艇的航向角;ui表示第i个无人艇的纵向速度,vi表示第i个无人艇的横荡速度,ri表示第i个无人艇的转向角速度;M表示无人艇的质量矩阵,
Figure GDA0002512532920000041
表示第i个无人艇在u、v、r方向上的加速度构成的向量,C(vi)表示科氏力矩阵,其中vi=[ui,vi,ri]T为速度向量,D(vi)表示阻尼矩阵,由于多艘无人艇具有相同结构,所以每艘无人艇具有相同的M矩阵;τi=[τuiviri]T为需要设计的控制器向量,τui表示第i个无人艇纵向的推力,τvi表示第i个无人艇横荡方向的推力,τri表示第i个无人艇转向的力矩;τωi=[τωuiωviωri]T为外界时变扰动,τωui表示第i个无人艇在纵向方向受到的外部时变扰动,τωvi表示第i个无人艇在横荡方向受到的外部时变扰动,τωri表示第i个无人艇在转向角方向受到的外部时变扰动;矩阵M、C(vi)、D(vi)、J(ηi)的具体形式分别如下所示:The first three terms in the above formula are the kinematic equations of the system, where,
Figure GDA00025125329200000314
N represents the total number of unmanned boats, ( xi , y i ) represents the position of the ith unmanned boat under the geodetic coordinates O e X e Y e , and xi is the longitudinal position of the ith unmanned boat, y i is the sway direction position of the ith unmanned boat, ψ i is the heading angle of the ith unmanned boat; ui represents the longitudinal speed of the ith unmanned boat, and vi represents the ith unmanned boat s sway speed, ri represents the steering angular velocity of the i -th unmanned boat; M represents the mass matrix of the unmanned boat,
Figure GDA0002512532920000041
Represents the vector formed by the acceleration of the i-th unmanned boat in the u, v, and r directions, C(vi ) represents the Coriolis force matrix, where v i =[ u i ,vi ,r i ] T is the velocity vector , D(v i ) represents the damping matrix. Since multiple unmanned boats have the same structure, each unmanned boat has the same M matrix; τ i =[τ uiviri ] T is the required design Controller vector, τ ui represents the longitudinal thrust of the ith unmanned boat, τ vi represents the thrust of the ith unmanned boat in the sway direction, τ ri represents the turning moment of the ith unmanned boat; τ ωi = [τ ωuiωviωri ] T is the external time-varying disturbance, τ ωui represents the external time-varying disturbance received by the i-th unmanned boat in the longitudinal direction, and τ ωvi represents the external time-varying disturbance received by the i-th unmanned boat in the yaw direction Time-varying disturbance, τ ωri represents the external time-varying disturbance received by the i-th unmanned boat in the direction of the steering angle; the specific forms of the matrices M, C(vi), D(vi ) , and J(η i ) are respectively as follows: Show:

Figure GDA0002512532920000042
Figure GDA0002512532920000042

Figure GDA0002512532920000043
Figure GDA0002512532920000043

其中,m11、m22、m23、m33为常数,d11(ui)是关于ui的函数,d22(vi,ri)、d23(vi,ri)、d32(vi,ri)、d33(vi,ri)是关于vi,ri的函数;将动力学方程转换为如下形式:Among them, m 11 , m 22 , m 23 , and m 33 are constants, d 11 (u i ) is a function of u i , d 22 (vi , ri ) , d 23 (vi , ri ) , d 32 (vi , ri ), d 33 (vi , ri ) are functions of v i , ri ; transform the kinetic equation into the following form:

Figure GDA0002512532920000044
Figure GDA0002512532920000044

其中v′i=J(ηi)vi

Figure GDA0002512532920000045
τ′i=J(ηi)M-1τi,τ′ωi=J(ηi)M-1τωi
Figure GDA0002512532920000046
Figure GDA0002512532920000047
为旋转矩阵J(ηi)的导数,J-1i)为旋转矩阵J(ηi)的逆,M-1为质量矩阵M的逆。where v′ i =J(η i )v i ,
Figure GDA0002512532920000045
τ′ i =J(η i )M -1 τ i , τ′ ωi =J(η i )M -1 τ ωi ,
Figure GDA0002512532920000046
Figure GDA0002512532920000047
is the derivative of the rotation matrix J(η i ), J −1i ) is the inverse of the rotation matrix J(η i ), and M −1 is the inverse of the mass matrix M.

进一步地,步骤(3)中,设计跟踪误差约束条件如下:Further, in step (3), the design tracking error constraints are as follows:

Figure GDA0002512532920000048
Figure GDA0002512532920000048

其中j=x,y,ψ,

Figure GDA0002512532920000049
e ji(t)表示eji(t)的下界性能函数,
Figure GDA00025125329200000410
表示eji(t)的上界性能函数,
Figure GDA00025125329200000411
表示性能函数
Figure GDA00025125329200000412
的初始值,
Figure GDA00025125329200000413
表示性能函数
Figure GDA00025125329200000414
的稳态值,e ji,0表示性能函数e ji(t)的初始值,e ji,∞表示性能函数e ji(t)的稳态值,kji表示其收敛速度;因此误差跟踪转换函数设计为:where j=x, y, ψ,
Figure GDA0002512532920000049
e ji (t) represents the lower bound performance function of e ji (t),
Figure GDA00025125329200000410
represents the upper bound performance function of e ji (t),
Figure GDA00025125329200000411
Represents a performance function
Figure GDA00025125329200000412
the initial value of ,
Figure GDA00025125329200000413
Represents a performance function
Figure GDA00025125329200000414
The steady state value of , e ji,0 represents the initial value of the performance function e ji (t), e ji,∞ represents the steady state value of the performance function e ji (t), and k ji represents its convergence speed; therefore, the error tracking transfer function Designed to:

Figure GDA00025125329200000415
Figure GDA00025125329200000415

其中,zji,1表示第i个无人艇的转换误差,

Figure GDA00025125329200000416
Figure GDA00025125329200000417
表示自然底数的zji,1次幂,
Figure GDA0002512532920000051
表示自然底数的-zji,1次幂,-γji<Tji(zji,1ji)<1,
Figure GDA0002512532920000052
当且仅当zji,1=0时,Tji(zji,1ji)=0;得到如下转换误差:Among them, z ji,1 represents the conversion error of the i-th unmanned boat,
Figure GDA00025125329200000416
Figure GDA00025125329200000417
Represents the natural base z ji, the power of 1 ,
Figure GDA0002512532920000051
Represents the -z ji, 1 power of the natural base, -γ ji <T ji (z ji,1ji ) < 1,
Figure GDA0002512532920000052
T ji (z ji,1ji )=0 if and only if z ji,1 =0; the following conversion error is obtained:

Figure GDA0002512532920000053
Figure GDA0002512532920000053

进一步地,步骤(4)中,定义滤波误差为:Further, in step (4), the filter error is defined as:

ei,f=αi,fi e i,fi,fi

zi,2=v′ii,f z i,2 =v′ ii,f

其中

Figure GDA0002512532920000054
ei,f=[eix,f,eiy,f,eiψ,f]T为滤波误差向量,eix,f为ei,f在纵向上的分量,eiy,f为ei,f在横荡方向上的分量,eiψ,f为ei,f在航向角上的分量;αi=[αixiy]T为虚拟控制输入向量,αix为αi在纵向上的分量,αiy为αi在横荡方向上的分量,α为αi在航向角上的分量;αi,f=[αix,fiy,fiψ,f]T为滤波虚拟控制输入向量,αix,f表示αix的滤波虚拟控制,αiy,f表示αiy的滤波虚拟控制,αiψ,f表示α的滤波虚拟控制;zi,2=[ziu,2,ziv,2,zir,2]T为转换速度误差,ziu,2为zi,2在纵向上的分量,ziv,2为zi,2在横荡方向上的分量,zir,2为zi,2在航向角上的分量;in
Figure GDA0002512532920000054
e i,f =[e ix,f ,e iy,f ,e iψ,f ] T is the filtering error vector, e ix,f is the vertical component of e i,f , e iy,f is e i, The component of f in the sway direction, e iψ,f is the component of e i,f in the heading angle; α i =[α ixiy ] T is the virtual control input vector, α ix is α i Component in the longitudinal direction, α iy is the component of α i in the sway direction, α is the component of α i in the heading angle; α i,f =[α ix,fiy,fiψ, f ] T is the filtering virtual control input vector, α ix,f represents the filtering virtual control of α ix , α iy,f represents the filtering virtual control of α iy , α iψ,f represents the filtering virtual control of α ; =[z iu,2 ,z iv,2 ,z ir,2 ] T is the conversion speed error, z iu,2 is the vertical component of zi,2, z iv,2 is the sway of zi ,2 The component in the direction, z ir,2 is the component of z i,2 in the heading angle;

设计虚拟控制器为:Design the virtual controller as:

Figure GDA0002512532920000055
Figure GDA0002512532920000055

其中ki,1=diag[kix,1,kiy,1,kiψ,1]为设计参数,kix,1为ki,1在纵向上的分量,kix,1为ki,1在横荡方向上的分量,kiψ,1为ki,1在航向角上的分量;Gi=diag[Gix,Giy,G],Gix为Gi在纵向上的分量,Giy为Gi在横荡方向上的分量,G为Gi在航向角上的分量,ki,k=diag[kix,k,kiy,k,kiψ,k],kix,k为ki,k在纵向上的分量,kiy,k为ki,k在横荡方向上的分量,kiψ,k为ki,k在航向角上的分量;θi,1=diag[θix,1iy,1iψ,1],θix,1为θi,1在纵向上的分量,θiy,1为θi,1在横荡方向上的分量,θiψ,1为θi,1在航向角上的分量;pi,k=diag[pix,k,piy,k,piψ,k],pix,k为pi,k在纵向上的分量,piy,k为pi,k在横荡方向上的分量,piψ,k为pi,k在航向角上的分量;hi=diag[hix,hiy,h],hix为hi在纵向上的分量,hiy为hi在横荡方向上的分量,h为hi在航向角上的分量;qi,k=diag[qix,k,qiy,k,qiψ,k],qix,k为qi,k在纵向上的分量,qiy,k为qi,k在横荡方向上的分量,qix,k为qi,k在航向角上的分量;

Figure GDA0002512532920000061
Figure GDA0002512532920000062
Figure GDA0002512532920000063
j=x,y,ψ,
Figure GDA0002512532920000064
e ij,1的导数,
Figure GDA0002512532920000065
Figure GDA0002512532920000066
的导数,
Figure GDA0002512532920000067
Figure GDA0002512532920000068
在纵向上的分量,
Figure GDA0002512532920000069
Figure GDA00025125329200000610
在横荡方向上的分量,
Figure GDA00025125329200000611
Figure GDA00025125329200000612
在航向角上的分量;where k i,1 =diag[k ix,1 ,kiy, 1 ,k iψ,1 ] are design parameters, k ix,1 is the vertical component of k i,1 , k ix,1 is k i, 1 is the component in the sway direction, k iψ,1 is the component of k i,1 in the heading angle; G i =diag[G ix ,G iy ,G ], G ix is the longitudinal component of G i , G iy is the component of G i in the sway direction, G is the component of G i in the heading angle, k i,k =diag[k ix,k ,k iy,k ,k iψ,k ], k ix,k is the component of ki,k in the longitudinal direction, kiy, k is the component of ki,k in the sway direction, kiψ,k is the component of ki ,k in the heading angle; θ i, 1 =diag[θ ix,1iy,1iψ,1 ], θ ix,1 is the component of θ i,1 in the longitudinal direction, θ iy,1 is the component of θ i,1 in the yaw direction Component, θ iψ,1 is the component of θ i,1 on the heading angle; p i,k =diag[pi ix,k ,p iy,k ,p iψ,k ], p ix,k is p i,k Component in the longitudinal direction, p iy,k is the component of p i,k in the sway direction, p iψ,k is the component of p i,k in the heading angle; h i =diag[hi ix ,hi iy , hi ], h ix is the component of hi in the longitudinal direction, hi iy is the component of hi in the sway direction, hi is the component of hi in the heading angle; q i ,k =diag[q ix, k , q iy,k , q iψ,k ], q ix,k is the component of qi ,k in the longitudinal direction, q iy,k is the component of qi ,k in the sway direction, q ix,k is The component of q i,k on the heading angle;
Figure GDA0002512532920000061
Figure GDA0002512532920000062
Figure GDA0002512532920000063
j = x, y, ψ,
Figure GDA0002512532920000064
is the derivative of e ij,1 ,
Figure GDA0002512532920000065
for
Figure GDA0002512532920000066
the derivative of ,
Figure GDA0002512532920000067
for
Figure GDA0002512532920000068
component in the longitudinal direction,
Figure GDA0002512532920000069
for
Figure GDA00025125329200000610
the component in the sway direction,
Figure GDA00025125329200000611
for
Figure GDA00025125329200000612
component in the heading angle;

引入动态面控制技术并设计虚拟控制器的一阶滤波器为:Introducing the dynamic surface control technology and designing the first-order filter of the virtual controller is:

Figure GDA00025125329200000613
Figure GDA00025125329200000613

其中,αi,m=πiihi -1zi,1

Figure GDA00025125329200000614
表示αfi的导数,πi=diag[π123]为滤波器时间常数矩阵,π1为πi在纵向上的分量,π2为πi在横荡方向上的分量,π3为πi在航向角上的分量;zi,1=diag[zix,1,ziy,1,ziψ,1]为转换误差,zix,1为zi,1在纵向上的分量,ziy,1为zi,1在横荡方向上的分量,ziy,1为zi,1在航向角上的分量;αi,f(0)表示滤波虚拟控制αi,f的初始值;where α i,mii h i -1 z i,1 ,
Figure GDA00025125329200000614
Represents the derivative of α fi , π i =diag[π 123 ] is the filter time constant matrix, π 1 is the component of π i in the longitudinal direction, π 2 is the component of π i in the sway direction , π 3 is the component of π i on the heading angle; z i,1 =diag[z ix,1 ,z iy,1 ,z iψ,1 ] is the conversion error, z ix,1 is the longitudinal direction of z i,1 , ziy,1 is the component of zi ,1 in the sway direction, ziy,1 is the component of zi ,1 on the heading angle; α i,f (0) represents the filtering virtual control α i , the initial value of f ;

aik为第i个无人艇是否与第k个无人艇保持连接,若保持连接aik=1,否则aik=0;ei,1为将无人艇与其所有邻居位置差进行转换后求和的向量;a ik is whether the i-th unmanned boat maintains connection with the k-th unmanned boat, if the connection is maintained a ik =1, otherwise a ik =0; e i,1 is to convert the position difference between the unmanned boat and all its neighbors post-summed vector;

αi,m(0)为虚拟控制器在零时刻的值;α i,m (0) is the value of the virtual controller at time zero;

Gi为衰减函数;ki,k为设计常数向量;θi,1为虚拟控制器中误差ei,1的可变系数;pi,k为第i个无人艇与第k个无人艇位置差进行第一步转换后的向量;hi为第i个无人艇与第k个无人艇位置差进行第一步转换后的向量之和的倒数;qi,k为pi,k的转换向量;χi是由ei,1、ei,1的上界函数与ei,1的下界函数构成,χi的逆向量为虚拟控制器中误差ei,1的可变系数;G i is the decay function; k i,k is the design constant vector; θ i,1 is the variable coefficient of the error e i,1 in the virtual controller; p i,k is the ith unmanned boat and the kth unmanned boat The vector of the first conversion of the position difference of the human-boat The transformation vector of i,k ; χ i is composed of the upper bound function of ei , 1 , ei,1 and the lower bound function of ei,1, the inverse vector of χ i is the error of ei,1 in the virtual controller variable coefficient;

进一步地,步骤(5)中RBF神经网络:Further, the RBF neural network in step (5):

Figure GDA00025125329200000727
Figure GDA00025125329200000727

其中,ω(Zi)=[ωu(Zi),ωv(Zi),ωr(Zi)]T为神经网络补偿函数,ωu(Zi)为ω(Zi)在纵向上的分量,ωv(Zi)为ω(Zi)在横荡方向上的分量,ωr(Zi)为ω(Zi)在航向角上的分量;

Figure GDA0002512532920000071
为理想RBF神经网络权值,
Figure GDA0002512532920000072
为W*在纵向上的分量,
Figure GDA0002512532920000073
为W*在横荡方向上的分量,
Figure GDA0002512532920000074
为W*在航向角上的分量;S(Zi)为回归向量;RBF神经网络权值更新率设计如下:Among them, ω(Z i )=[ω u (Z i ), ω v (Z i ), ω r (Z i )] T is the neural network compensation function, ω u (Z i ) is ω(Z i ) in The longitudinal component, ω v (Z i ) is the component of ω(Z i ) in the sway direction, and ω r (Z i ) is the component of ω(Z i ) in the heading angle;
Figure GDA0002512532920000071
is the ideal RBF neural network weight,
Figure GDA0002512532920000072
is the component of W * in the longitudinal direction,
Figure GDA0002512532920000073
is the component of W * in the sway direction,
Figure GDA0002512532920000074
is the component of W * on the heading angle; S(Z i ) is the regression vector; the RBF neural network weight update rate is designed as follows:

Figure GDA0002512532920000075
Figure GDA0002512532920000075

其中,

Figure GDA0002512532920000076
Figure GDA0002512532920000077
的导数,
Figure GDA0002512532920000078
为第i个无人艇
Figure GDA00025125329200000724
方向上的权值,
Figure GDA0002512532920000079
为第l个无人艇
Figure GDA00025125329200000725
方向上的权值,
Figure GDA00025125329200000717
为回归函数,
Figure GDA00025125329200000718
为第i个无人艇
Figure GDA00025125329200000719
方向上的速度误差,
Figure GDA00025125329200000720
为设计参数,
Figure GDA00025125329200000721
Figure GDA00025125329200000722
修正项,
Figure GDA00025125329200000710
为协同调整系数;in,
Figure GDA0002512532920000076
for
Figure GDA0002512532920000077
the derivative of ,
Figure GDA0002512532920000078
is the i-th unmanned boat
Figure GDA00025125329200000724
weights in the direction,
Figure GDA0002512532920000079
for the lth unmanned boat
Figure GDA00025125329200000725
weights in the direction,
Figure GDA00025125329200000717
is the regression function,
Figure GDA00025125329200000718
is the i-th unmanned boat
Figure GDA00025125329200000719
velocity error in direction,
Figure GDA00025125329200000720
are design parameters,
Figure GDA00025125329200000721
for
Figure GDA00025125329200000722
corrections,
Figure GDA00025125329200000710
is the synergistic adjustment factor;

考虑

Figure GDA00025125329200000711
Figure GDA00025125329200000712
为第i个无人艇
Figure GDA00025125329200000723
方向上的神经网络权值误差,W*为理想RBF神经网络权值,
Figure GDA00025125329200000713
的一阶导数为:consider
Figure GDA00025125329200000711
Figure GDA00025125329200000712
is the i-th unmanned boat
Figure GDA00025125329200000723
The neural network weight error in the direction, W * is the ideal RBF neural network weight,
Figure GDA00025125329200000713
The first derivative of is:

Figure GDA00025125329200000714
Figure GDA00025125329200000714

其中,

Figure GDA00025125329200000715
为第l个无人艇
Figure GDA00025125329200000726
方向上的神经网络权值误差;in,
Figure GDA00025125329200000715
for the lth unmanned boat
Figure GDA00025125329200000726
The neural network weight error in the direction;

εi(Zi)为神经网络逼近误差;Zi为神经网络输入向量;ε i (Z i ) is the approximation error of the neural network; Z i is the input vector of the neural network;

ail为第i个无人艇是否与第l个无人艇保持连接,若保持连接ail=1,否则ail=0。a il is whether the i-th unmanned boat remains connected to the l-th unmanned boat, if the connection is maintained, a il =1, otherwise a il =0.

进一步地,步骤(6)中设计的状态反馈跟踪控制器如下:Further, the state feedback tracking controller designed in step (6) is as follows:

Figure GDA00025125329200000716
Figure GDA00025125329200000716

其中,τi′=[τ′iu,τ′iv,τ′ir]T为设计控制器,τ′iu为τi′在纵向上的分量,τ′iv为τi′在横荡方向上的分量,τ′ir为τi′在航向角上的分量;ki,2=diag[kix,2,kiy,2,kiψ,2]为对角矩阵,kix,2表示编队控制器τ′ix的设计参数,kiy,2表示编队控制器τ′iy的设计参数,kiψ,2表示编队控制器τ′的设计参数;

Figure GDA0002512532920000081
为虚拟控制器αi,f的导数,
Figure GDA0002512532920000082
为神经网络权值的转置,hi -1为函数hi的逆;Among them, τ i ′=[τ′ iu ,τ′ iv ,τ′ ir ] T is the design controller, τ′ iu is the component of τ i ′ in the longitudinal direction, τ′ iv is the τ i ′ in the yaw direction , τ′ ir is the component of τ i ′ on the heading angle; k i,2 =diag[k ix,2 ,k iy,2 ,k iψ,2 ] is the diagonal matrix, k ix,2 represents the formation The design parameters of the controller τ′ ix , k iy,2 represents the design parameters of the formation controller τ′ iy , and k iψ,2 represents the design parameters of the formation controller τ′ ;
Figure GDA0002512532920000081
is the derivative of the virtual controller α i,f ,
Figure GDA0002512532920000082
is the transpose of the neural network weights, h i -1 is the inverse of the function h i ;

zi,2为转换速度误差。z i,2 is the conversion speed error.

进一步地,步骤(7)中设计的基于经验的状态反馈跟踪控制器如下:Further, the experience-based state feedback tracking controller designed in step (7) is as follows:

Figure GDA0002512532920000083
Figure GDA0002512532920000083

其中,

Figure GDA0002512532920000084
为第i个无人艇的神经网络权值常数,
Figure GDA0002512532920000085
Figure GDA0002512532920000086
在纵向上的分量,
Figure GDA0002512532920000087
Figure GDA0002512532920000088
在横荡方向上的分量,
Figure GDA0002512532920000089
Figure GDA00025125329200000810
航向角上的分量,
Figure GDA00025125329200000811
Figure GDA00025125329200000812
的转置。in,
Figure GDA0002512532920000084
is the neural network weight constant of the i-th UAV,
Figure GDA0002512532920000085
for
Figure GDA0002512532920000086
component in the longitudinal direction,
Figure GDA0002512532920000087
for
Figure GDA0002512532920000088
the component in the sway direction,
Figure GDA0002512532920000089
for
Figure GDA00025125329200000810
component on the heading angle,
Figure GDA00025125329200000811
for
Figure GDA00025125329200000812
transposition of .

本发明与现有技术相比,具有如下优点和有益效果:Compared with the prior art, the present invention has the following advantages and beneficial effects:

1、本发明提供的一种同构多无人艇系统的协同学习与编队控制方法,引入跟踪误差转换函数确保约束误差的有界性来满足原始的约束跟踪问题,该函数选为时变指数函数以保证目标误差始终在边界函数所规定的范围内,且误差的暂态性能(收敛速度与最大超调量)可以通过调节边界函数的参数来预先设定。1. The collaborative learning and formation control method of a homogeneous multi-unmanned boat system provided by the present invention introduces a tracking error conversion function to ensure the boundedness of the constraint error to satisfy the original constraint tracking problem, and this function is selected as a time-varying index function to ensure that the target error is always within the range specified by the boundary function, and the transient performance of the error (convergence speed and maximum overshoot) can be preset by adjusting the parameters of the boundary function.

2、本发明提供的一种同构多无人艇系统的协同学习与编队控制方法,利用RBF神经网络的非线性逼近能力来估计模型中的不确定部分,在控制器中用估计值对模型不确定部分进行补偿,并选择合适的参数估计值自适应更新率,实现编队误差的收敛。2. A collaborative learning and formation control method for a homogeneous multi-unmanned boat system provided by the present invention utilizes the nonlinear approximation capability of the RBF neural network to estimate the uncertain part in the model, and uses the estimated value in the controller to estimate the model. The uncertain part is compensated, and the adaptive update rate of the parameter estimation value is selected to achieve the convergence of the formation error.

3、本发明提供的一种同构多无人艇系统的协同学习与编队控制方法,基于确定学习理论,采用径向基神经网络获取系统动态知识,并将学到的知识以常数神经网络权值的形式存储,相同或相似的控制任务不再需要重新训练神经网络,而是对已学知识的再利用。3. A method for collaborative learning and formation control of a homogeneous multi-UAV system provided by the present invention is based on the deterministic learning theory, adopts radial basis neural network to obtain system dynamic knowledge, and uses constant neural network weights for the learned knowledge. Stored in the form of values, the same or similar control tasks no longer need to retrain the neural network, but reuse the knowledge already learned.

附图说明Description of drawings

图1为本发明实施例无人艇的分布式领导者-跟随者编队结构示意图。FIG. 1 is a schematic structural diagram of a distributed leader-follower formation of an unmanned boat according to an embodiment of the present invention.

图2为本发明实施例无人艇编队系统的无相通信拓扑图。FIG. 2 is a phaseless communication topology diagram of an unmanned boat formation system according to an embodiment of the present invention.

图3为本发明实施例无人艇的编队控制的整体控制框图。FIG. 3 is an overall control block diagram of formation control of an unmanned boat according to an embodiment of the present invention.

图4为本发明实施例一组同构无人艇编队运动的相平面图。FIG. 4 is a phase plan view of a group of isomorphic unmanned boat formation movement according to an embodiment of the present invention.

图5为本发明实施例一组同构无人艇纵向的跟踪误差eix,1(t)的示意图。FIG. 5 is a schematic diagram of longitudinal tracking errors e ix,1 (t) of a group of isomorphic unmanned boats according to an embodiment of the present invention.

图6为本发明实施例一组同构无人艇横荡方向的的跟踪误差eiy,1(t)的示意图。FIG. 6 is a schematic diagram of the tracking error e iy,1 (t) of a group of isomorphic UAVs in the sway direction according to an embodiment of the present invention.

图7为本发明实施例一组同构无人艇航向角的跟踪误差eiψ,1(t)的示意图。7 is a schematic diagram of tracking errors e iψ,1 (t) of a group of isomorphic UAV heading angles according to an embodiment of the present invention.

图8为本发明实施例估计无人艇阻尼项的神经网络权值二范数

Figure GDA0002512532920000091
的示意图。FIG. 8 is the second norm of the neural network weight for estimating the damping term of the unmanned boat according to the embodiment of the present invention
Figure GDA0002512532920000091
schematic diagram.

图9为本发明实施例无人艇纵向的的推力τix的示意图。FIG. 9 is a schematic diagram of the longitudinal thrust τ ix of the unmanned boat according to the embodiment of the present invention.

图10为本发明实施例无人艇横荡方向的的推力τiy的示意图。FIG. 10 is a schematic diagram of the thrust τ iy in the sway direction of the unmanned boat according to the embodiment of the present invention.

图11为本发明实施例无人艇转向的的推力τ的示意图。FIG. 11 is a schematic diagram of the thrust τ of the steering of the unmanned boat according to the embodiment of the present invention.

图12为本发明实施例一组同构无人艇纵向的跟踪误差eix,1(t)的示意图。12 is a schematic diagram of longitudinal tracking errors e ix,1 (t) of a group of isomorphic unmanned boats according to an embodiment of the present invention.

图13为本发明实施例一组同构无人艇横荡方向的的跟踪误差eiy,1(t)的示意图。FIG. 13 is a schematic diagram of the tracking error e iy,1 (t) of a group of isomorphic UAVs in the sway direction according to an embodiment of the present invention.

图14为本发明实施例一组同构无人艇航向角的跟踪误差eiψ,1(t)的示意图。14 is a schematic diagram of the tracking error e iψ,1 (t) of a group of isomorphic UAV heading angles according to an embodiment of the present invention.

具体实施方式Detailed ways

为了使本发明的目的、技术方案以及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步的详细说明。应当理解,此处所描述的具体实施例仅用于解释本发明,并不限于本发明。In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, and not to limit the present invention.

实施例:Example:

本实施例提供了一种同构多无人艇系统的协同学习与编队控制方法,该方法针对无人艇编队控制中保持连接和协同控制问题,提出基于分布式的领导者-跟随者结构的编队控制方法,分布式领导者-跟随者编队结构示意图如图1所示,图2为同构无人艇编队系统的无相通信拓扑图,图3为同构无人艇的编队控制的整体控制框图,所述方法具体包括以下步骤:This embodiment provides a collaborative learning and formation control method for a homogeneous multi-UAV system. The method proposes a distributed leader-follower structure based on the problem of maintaining connection and cooperative control in the formation control of UAVs. Formation control method, the schematic diagram of the distributed leader-follower formation structure is shown in Figure 1, Figure 2 is the phaseless communication topology diagram of the homogeneous unmanned boat formation system, and Figure 3 is the overall formation control of the homogeneous unmanned boat. Control block diagram, the method specifically includes the following steps:

步骤(1):在无人艇编队中每个无人艇的任务相同或相似,并且每个人无人艇所处环境相似(无人艇的通讯范围有限,编队中每个无人艇至少与一个无人艇保持通信),所以在本设计中采用具有相同结构的多个无人艇进行编队控制,建立分布式编队结构中无人艇的动态模型,并将向量形式的动态模型展开成标量形式;Step (1): The tasks of each unmanned boat in the unmanned boat formation are the same or similar, and the environment of each unmanned boat is similar (the communication range of the unmanned boat is limited, and each unmanned boat in the formation has at least One unmanned boat maintains communication), so in this design, multiple unmanned boats with the same structure are used for formation control, the dynamic model of the unmanned boat in the distributed formation structure is established, and the dynamic model in the form of a vector is expanded into a scalar form;

Figure GDA0002512532920000092
个无人艇的动态模型为:the first
Figure GDA0002512532920000092
The dynamic model of each UAV is:

Figure GDA0002512532920000101
Figure GDA0002512532920000101

上式中前三项是系统的运动学方程。其中,(xi,yi)表示第i个无人艇在大地坐标OeXeYe下的位置,xi为第i个无人艇的纵向位置,yi为第i个无人艇的横荡方向位置,ψi为第i个无人艇的航向角;ui表示第i个无人艇的纵向速度,vi表示第i个无人艇的横荡速度,ri表示第i个无人艇的转向角速度;M表示无人艇的质量矩阵,

Figure GDA0002512532920000102
表示第i个无人艇在u,v,r方向上的加速度构成的向量,C(vi)表示科氏力矩阵,其中vi=[ui,vi,ri]T为速度向量,D(vi)表示阻尼矩阵,由于多艘无人艇具有相同结构,所以每艘无人艇具有相同的M矩阵;τi=[τuiviri]T为需要设计的控制器向量,τui表示第i个无人艇纵向的推力,τvi表示第i个无人艇横荡方向的推力,τri表示第i个无人艇转向的力矩;τωi=[τωuiωviωri]T为外界时变扰动,τωui表示第i个无人艇在纵向方向受到的外部时变扰动,τωvi表示第i个无人艇在横荡方向受到的外部时变扰动,τωri表示第i个无人艇在转向角方向受到的外部时变扰动。矩阵M、C(vi)、D(vi)、J(ηi)的具体形式分别如下所示:The first three terms in the above formula are the kinematic equations of the system. Among them, ( xi , y i ) represents the position of the ith unmanned boat under the geodetic coordinates O e X e Y e , xi is the longitudinal position of the ith unmanned boat, and yi is the ith unmanned boat The sway direction position of the boat, ψ i is the heading angle of the i-th unmanned boat; u i represents the longitudinal speed of the i -th unmanned boat, vi represents the sway speed of the i -th unmanned boat, and ri means The steering angular velocity of the i-th unmanned boat; M represents the mass matrix of the unmanned boat,
Figure GDA0002512532920000102
Represents the vector formed by the acceleration of the i-th unmanned boat in the u, v, and r directions, C(vi ) represents the Coriolis force matrix, where v i =[ u i ,vi ,r i ] T is the velocity vector , D(v i ) represents the damping matrix. Since multiple unmanned boats have the same structure, each unmanned boat has the same M matrix; τ i =[τ uiviri ] T is the required design Controller vector, τ ui represents the longitudinal thrust of the ith unmanned boat, τ vi represents the thrust of the ith unmanned boat in the sway direction, τ ri represents the turning moment of the ith unmanned boat; τ ωi = [τ ωuiωviωri ] T is the external time-varying disturbance, τ ωui represents the external time-varying disturbance received by the i-th unmanned boat in the longitudinal direction, and τ ωvi represents the external time-varying disturbance received by the i-th unmanned boat in the yaw direction Time-varying disturbance, τ ωri represents the external time-varying disturbance in the steering angle direction of the ith UAV. The specific forms of the matrices M, C(v i ), D(vi ), and J(η i ) are as follows:

Figure GDA0002512532920000103
Figure GDA0002512532920000103

Figure GDA0002512532920000104
Figure GDA0002512532920000104

在本实例中,选取4个相同的无人艇动态模型,无人艇的系统参数分别为:In this example, four identical dynamic models of the unmanned boat are selected, and the system parameters of the unmanned boat are:

m11=25.8kg,m22=33.8kg,m23=m32=1.0948kg,m33=2.76kg,m 11 = 25.8 kg, m 22 = 33.8 kg, m 23 = m 32 = 1.0948 kg, m 33 = 2.76 kg,

c13(vi)=-m22ivi-m23iri,艇体长度为Li=1.225m。c 13 (vi )=-m 22i v i -m 23i ri , the hull length is Li = 1.225m .

Figure GDA0002512532920000105
Figure GDA0002512532920000105

d22(vi)=0.8612+36.2823*|vi|+0.805*|ri|,d 22 (vi )=0.8612+36.2823*|v i |+0.805*|r i | ,

d23(vi)=-0.1079+0.845*|vi|+3.45*|ri|,d 23 (vi )=-0.1079+0.845*|v i |+3.45*|r i | ,

d32(vi)=-0.1052-5.0437*|vi|-0.13*|ri|,d 32 (vi )=-0.1052-5.0437*|v i |-0.13*|r i | ,

d33(vi)=1.9-0.08*|vi|+0.75*|ri|。d 33 (vi )= 1.9−0.08 *|v i |+0.75*| ri |.

外部扰动的形式为:The form of external disturbance is:

Figure GDA0002512532920000111
Figure GDA0002512532920000111

由虚拟领导者产生的参考轨迹为:The reference trajectory produced by the virtual leader is:

η0=[40sin(0.1t),20sin(0.2t),0.1]T η 0 =[40sin(0.1t),20sin(0.2t),0.1] T

四个无人艇的初始位置为:η1(0)=[-8,0,0.1]T,η2(0)=[0,-6,0.2]T,η3(0)=[0,6,0]T,η4(0)=[8,0,0.1]T,初始速度选择为vi(0)=[0,0,0]TThe initial positions of the four UAVs are: η 1 (0)=[-8,0,0.1] T , η 2 (0)=[0,-6,0.2] T , η 3 (0)=[0 ,6,0] T , η 4 (0)=[ 8,0,0.1 ] T , and the initial velocity is chosen as vi (0)=[0,0,0] T .

将动力学方程转换为如下形式:Transform the kinetic equations into the following form:

Figure GDA0002512532920000112
Figure GDA0002512532920000112

其中v′i=J(ηi),

Figure GDA0002512532920000113
τ′i=J(ηi)M-1τi,τ′ωi=J(ηi)M-1τωi
Figure GDA0002512532920000114
Figure GDA0002512532920000115
为旋转矩阵J(ηi)的导数,J-1i)为旋转矩阵J(ηi)的逆,M-1为质量矩阵M的逆。where v′ i =J(η i ),
Figure GDA0002512532920000113
τ′ i =J(η i )M -1 τ i , τ′ ωi =J(η i )M -1 τ ωi ,
Figure GDA0002512532920000114
Figure GDA0002512532920000115
is the derivative of the rotation matrix J(η i ), J −1i ) is the inverse of the rotation matrix J(η i ), and M −1 is the inverse of the mass matrix M.

步骤(2):用无向图

Figure GDA00025125329200001119
来描述同构多无人艇编队系统中各个体间的信息交互。
Figure GDA0002512532920000116
是有限非空集合,称为顶点集,集合
Figure GDA0002512532920000117
中的每个顶点对应有相同编号的跟随者;
Figure GDA0002512532920000118
是有限集合,称之为边集,每条边对应有相同编号且能互相通信的相邻无人艇。无向图
Figure GDA00025125329200001120
的邻接矩阵A=(ail)(N)×(N)的元素ail∈{0,1}。当无人艇i可以获取无人艇j的信息时(此时j称为尾部,i称为头部),ail=1,否则ail=0。Step (2): Use an undirected graph
Figure GDA00025125329200001119
To describe the information interaction among the various entities in the homogeneous multi-UAV formation system.
Figure GDA0002512532920000116
is a finite non-empty set, called the vertex set, the set
Figure GDA0002512532920000117
Each vertex in corresponds to a follower with the same number;
Figure GDA0002512532920000118
is a finite set, called an edge set, each edge corresponds to adjacent UAVs with the same number and can communicate with each other. Undirected graph
Figure GDA00025125329200001120
The adjacency matrix A=(a il ) (N)×(N) of elements a il ∈ {0,1}. When the unmanned boat i can obtain the information of the unmanned boat j (the j is called the tail at this time, and the i is called the head), a il =1, otherwise a il =0.

进一步用拓展图

Figure GDA0002512532920000119
来描述包含领航者在内的编队系统中各成员间的信息交互。
Figure GDA00025125329200001110
为虚拟领导者,虚拟领导与部分跟随者保持通信;邻接矩阵A0=diag[a10,,aN0]T的元素ai0∈{0,1},并且
Figure GDA00025125329200001111
Figure GDA00025125329200001112
Further use of expansion diagrams
Figure GDA0002512532920000119
To describe the information exchange among the members of the formation system including the pilot.
Figure GDA00025125329200001110
is a virtual leader, which maintains communication with some of its followers; the adjacency matrix A 0 =diag[a 10 ,,a N0 ] the element a i0 ∈ {0,1} of T , and
Figure GDA00025125329200001111
Figure GDA00025125329200001112

设计一个连续可微的单调递减函数向量G(t),G(t)满足一阶可导、二阶可导,

Figure GDA00025125329200001113
(
Figure GDA00025125329200001114
为设计常数,G(0)为G(t)的初始值)。我们认为每个无人艇的通信范围有限且最大为
Figure GDA00025125329200001115
如果满足
Figure GDA00025125329200001116
Figure GDA00025125329200001117
Gx(0)为G(t)在纵向上的初始值,Gy(0)为G(t)在横荡方向的初始值,则无人艇i与无人艇k之间具有通信,其中
Figure GDA00025125329200001118
因此定义每个邻居为:
Figure GDA0002512532920000121
增广邻居集为:
Figure GDA0002512532920000122
Figure GDA0002512532920000123
Figure GDA0002512532920000124
为纵向的设计常数,
Figure GDA0002512532920000125
Figure GDA0002512532920000126
为横荡方向的设计常数,
Figure GDA0002512532920000127
Figure GDA0002512532920000128
为航向角上的设计常数。Design a continuously differentiable monotone decreasing function vector G(t), G(t) satisfies the first-order and second-order derivables,
Figure GDA00025125329200001113
(
Figure GDA00025125329200001114
is the design constant, G(0) is the initial value of G(t)). We believe that the communication range of each UAV is limited and the maximum is
Figure GDA00025125329200001115
if satisfied
Figure GDA00025125329200001116
Figure GDA00025125329200001117
G x (0) is the initial value of G(t) in the longitudinal direction, G y (0) is the initial value of G(t) in the sway direction, then there is communication between the unmanned boat i and the unmanned boat k, in
Figure GDA00025125329200001118
So define each neighbor as:
Figure GDA0002512532920000121
The augmented neighbor set is:
Figure GDA0002512532920000122
Figure GDA0002512532920000123
and
Figure GDA0002512532920000124
is the longitudinal design constant,
Figure GDA0002512532920000125
and
Figure GDA0002512532920000126
is the design constant for the sway direction,
Figure GDA0002512532920000127
and
Figure GDA0002512532920000128
is the design constant on the heading angle.

在运用无向通讯拓扑图来表示同构多无人艇之间的信息交互,使得一群通讯范围有限的无人艇在以领导-跟随者编队形式跟随给定的领导者轨迹(只有部分跟随者可获得领导者信息)时,同时每个无人艇也需要满足距离与方位角的约束条件以保证每个无人艇可以探测到其邻居的信息,并与其保持连接。定义误差为:The undirected communication topology is used to represent the information interaction between homogeneous multi-UAVs, so that a group of UAVs with limited communication range follow a given leader trajectory in the form of a leader-follower formation (only some followers). When the leader information is available), each unmanned boat also needs to meet the constraints of distance and azimuth to ensure that each unmanned boat can detect the information of its neighbors and maintain a connection with it. Define the error as:

Figure GDA0002512532920000129
Figure GDA0002512532920000129

其中ei,1=[eix,1,eiy,1,eiψ,1]T,ei,1为将无人艇与其所有邻居位置差进行转换后求和的向量,eix,1为ei,1在纵向上的分量,eiy,1为ei,1在横荡方向上的分量,eiψ,1为ei,1在航向角上的分量,aik表示第i个无人艇是否与第k个无人艇保持连接,若保持连接aik=1,否则aik=0;ξi,k=[ξix,kiy,kiψ,k]T,ξix,k、ξiy,k、ξiψ,k分别为ξi,k在纵向、横荡方向、航向角方向的分量,

Figure GDA00025125329200001210
Figure GDA00025125329200001211
xk表示第k个无人艇在大地坐标OeXeYe下纵向的位置,yk表示第k个无人艇在大地坐标OeXeYe下横荡方向的位置,ψk为第k个无人艇的航向角,Gx(t)为纵向的衰减函数,Gy(t)为横荡方向的衰减函数,Gψ(t)为航向角上的衰减函数。where e i,1 =[e ix,1 ,e iy,1 ,e iψ,1 ] T , e i,1 is the vector summed after transforming the position difference between the UAV and all its neighbors, e ix,1 is the component of e i,1 in the longitudinal direction, e iy,1 is the component of e i,1 in the sway direction, e iψ,1 is the component of e i,1 in the heading angle, a ik represents the i-th Whether the unmanned boat remains connected to the k-th unmanned boat, if the connection a ik =1, otherwise a ik =0; ξ i,k =[ξ ix,kiy,kiψ,k ] T , ξ ix,k , ξ iy,k , ξ iψ,k are the components of ξ i,k in the longitudinal direction, yaw direction, and heading angle direction, respectively,
Figure GDA00025125329200001210
Figure GDA00025125329200001211
x k represents the longitudinal position of the k-th unmanned boat under the geodetic coordinates O e X e Y e , y k represents the position of the k-th unmanned boat in the sway direction under the geodetic coordinates O e X e Y e , ψ k is the heading angle of the k-th UAV, G x (t) is the attenuation function in the longitudinal direction, G y (t) is the attenuation function in the sway direction, and G ψ (t) is the attenuation function on the heading angle.

本实例如图2所示。四个跟随者的邻居集分别为

Figure GDA00025125329200001212
Figure GDA00025125329200001213
如图4所示的是无人艇编队系统在相平面上的实际轨迹图。This example is shown in Figure 2. The neighbor sets of the four followers are
Figure GDA00025125329200001212
Figure GDA00025125329200001213
Figure 4 shows the actual trajectory of the UAV formation system on the phase plane.

进一步地,步骤(3)中,设计跟踪误差约束条件如下:Further, in step (3), the design tracking error constraints are as follows:

Figure GDA00025125329200001214
Figure GDA00025125329200001214

其中j=x,y,ψ,

Figure GDA00025125329200001215
eji(t)表示eji(t)的下界性能函数,
Figure GDA00025125329200001216
表示eji(t)的上界性能函数,
Figure GDA00025125329200001217
表示性能函数
Figure GDA00025125329200001218
的初始值,
Figure GDA00025125329200001219
表示性能函数
Figure GDA00025125329200001220
的稳态值,eji,0表示性能函数eji(t)的初始值,eji,∞表示性能函数eji(t)的稳态值,kji表示其收敛速度。因此误差跟踪转换函数设计为:where j=x, y, ψ,
Figure GDA00025125329200001215
e ji (t) represents the lower bound performance function of e ji (t),
Figure GDA00025125329200001216
represents the upper bound performance function of e ji (t),
Figure GDA00025125329200001217
Represents a performance function
Figure GDA00025125329200001218
the initial value of ,
Figure GDA00025125329200001219
Represents a performance function
Figure GDA00025125329200001220
The steady state value of , e ji,0 represents the initial value of the performance function e ji (t), e ji,∞ represents the steady state value of the performance function e ji (t), and k ji represents its convergence speed. Therefore the error tracking transfer function is designed as:

Figure GDA00025125329200001221
Figure GDA00025125329200001221

其中,zji,1表示第i个无人艇的转换误差,

Figure GDA00025125329200001222
Figure GDA00025125329200001223
表示自然底数的zji,1次幂,
Figure GDA00025125329200001224
表示自然底数的-zji,1次幂,-γji<Tji(zji,1ji)<1,
Figure GDA00025125329200001225
当且仅当zji,1=0时,Tji(zji,1ji)=0;得到如下转换误差:Among them, z ji,1 represents the conversion error of the i-th unmanned boat,
Figure GDA00025125329200001222
Figure GDA00025125329200001223
Represents the natural base z ji, the power of 1 ,
Figure GDA00025125329200001224
Represents the -z ji, 1 power of the natural base, -γ ji <T ji (z ji,1ji ) < 1,
Figure GDA00025125329200001225
T ji (z ji,1ji )=0 if and only if z ji,1 =0; the following conversion error is obtained:

Figure GDA0002512532920000131
Figure GDA0002512532920000131

在本实例中,四个无人艇的设计参数如下:In this example, the design parameters of the four UAVs are as follows:

Figure GDA0002512532920000132
Figure GDA0002512532920000133
Figure GDA0002512532920000132
Figure GDA0002512532920000133

每个无人艇的最大通信距离为17m,纵向Gx(t)、横荡方向Gy(t)、航向Gψ(t)分别为:The maximum communication distance of each unmanned boat is 17m, and the longitudinal G x (t), sway direction G y (t), and heading G ψ (t) are:

Gy(t)=(10-3)×e-0.1t+3G y (t)=(10-3)×e -0.1t +3

Gx(t)=(10-3)×e-0.1t+3G x (t)=(10-3)×e -0.1t +3

Gψ(t)=(0.4-0.2)×e-0.1t+0.2G ψ (t)=(0.4-0.2)×e -0.1t +0.2

纵向、横荡方向、航向角上预设性能函数上下界分别为:The upper and lower bounds of the preset performance functions in the longitudinal direction, the yaw direction and the heading angle are:

Figure GDA0002512532920000134
Figure GDA0002512532920000134

Figure GDA0002512532920000135
Figure GDA0002512532920000135

Figure GDA0002512532920000136
Figure GDA0002512532920000136

如图5-7分别表示纵向的跟踪误差eix,1(t)、横荡方向的跟踪误差eiy,1(t)、航向角的跟踪误差eiψ,1(t)。从图中eix,1(t)、eiy,1(t)、eiψ,1(t)变化过程知,无人艇间距的暂态波动过程始终没有越过设定的上下边界。此仿真图说明了控制方案能较好的解决碰撞避免和通信连接保持的问题。Figure 5-7 shows the longitudinal tracking error e ix,1 (t), the yaw direction tracking error e iy,1 (t), and the heading angle tracking error e iψ,1 (t). From the change process of e ix,1 (t), e iy,1 (t), and e iψ,1 (t) in the figure, it can be known that the transient fluctuation process of the UAV spacing has never crossed the set upper and lower boundaries. This simulation figure shows that the control scheme can better solve the problems of collision avoidance and communication connection retention.

步骤(4):定义滤波误差为:Step (4): Define the filter error as:

ei,f=αi,fi e i,fi,fi

zi,2=v′ii,f z i,2 =v′ ii,f

其中

Figure GDA0002512532920000141
ei,f=[eix,f,eiy,f,eiψ,f]T为滤波误差向量,eix,f为ei,f在纵向上的分量,eiy,f为ei,f在横荡方向上的分量,eiψ,f为ei,f在航向角上的分量;αi=[αixiy]T为虚拟控制输入向量,αix为αi在纵向上的分量,αiy为αi在横荡方向上的分量,α为αi在航向角上的分量;αi,f=[αix,fiy,fiψ,f]T为滤波虚拟控制输入向量,αix,f表示αix的滤波虚拟控制,αiy,f表示αiy的滤波虚拟控制,αiψ,f表示α的滤波虚拟控制;zi,2=[ziu,2,ziv,2,zir,2]T为转换速度误差,ziu,2为zi,2在纵向上的分量,ziv,2为zi,2在横荡方向上的分量,zir,2为zi,2在航向角上的分量。in
Figure GDA0002512532920000141
e i,f =[e ix,f ,e iy,f ,e iψ,f ] T is the filtering error vector, e ix,f is the vertical component of e i,f , e iy,f is e i, The component of f in the sway direction, e iψ,f is the component of e i,f in the heading angle; α i =[α ixiy ] T is the virtual control input vector, α ix is α i Component in the longitudinal direction, α iy is the component of α i in the sway direction, α is the component of α i in the heading angle; α i,f =[α ix,fiy,fiψ, f ] T is the filtering virtual control input vector, α ix,f represents the filtering virtual control of α ix , α iy,f represents the filtering virtual control of α iy , α iψ,f represents the filtering virtual control of α ; =[z iu,2 ,z iv,2 ,z ir,2 ] T is the conversion speed error, z iu,2 is the vertical component of zi,2, z iv,2 is the sway of zi ,2 The component in the direction, z ir,2 is the component of z i,2 in the heading angle.

设计虚拟控制器为:Design the virtual controller as:

Figure GDA0002512532920000142
Figure GDA0002512532920000142

其中ki,1=diag[kix,1,kiy,1,kiψ,1]为设计参数,kix,1为ki,1在纵向上的分量,kix,1为ki,1在横荡方向上的分量,kiψ,1为ki,1在航向角上的分量;Gi=diag[Gix,Giy,G],Gix为Gi在纵向上的分量,Giy为Gi在横荡方向上的分量,G为Gi在航向角上的分量,ki,k=diag[kix,k,kiy,k,kiψ,k],kix,k为ki,k在纵向上的分量,kiy,k为ki,k在横荡方向上的分量,kiψ,k为ki,k在航向角上的分量;θi,1=diag[θix,1iy,1iψ,1],θix,1为θi,1在纵向上的分量,θiy,1为θi,1在横荡方向上的分量,θiψ,1为θi,1在航向角上的分量;pi,k=diag[pix,k,piy,k,piψ,k],pix,k为pi,k在纵向上的分量,piy,k为pi,k在横荡方向上的分量,piψ,k为pi,k在航向角上的分量;hi=diag[hix,hiy,h],hix为hi在纵向上的分量,hiy为hi在横荡方向上的分量,h为hi在航向角上的分量;qi,k=diag[qix,k,qiy,k,qiψ,k],qix,k为qi,k在纵向上的分量,qiy,k为qi,k在横荡方向上的分量,qix,k为qi,k在航向角上的分量;

Figure GDA0002512532920000143
Figure GDA0002512532920000144
Figure GDA0002512532920000151
Figure GDA0002512532920000152
为eij,1的导数,
Figure GDA0002512532920000153
Figure GDA0002512532920000154
的导数,
Figure GDA0002512532920000155
Figure GDA0002512532920000156
在纵向上的分量,
Figure GDA0002512532920000157
Figure GDA0002512532920000158
在横荡方向上的分量,
Figure GDA0002512532920000159
Figure GDA00025125329200001510
在航向角上的分量。where k i,1 =diag[k ix,1 ,kiy, 1 ,k iψ,1 ] are design parameters, k ix,1 is the vertical component of k i,1 , k ix,1 is k i, 1 is the component in the sway direction, k iψ,1 is the component of k i,1 in the heading angle; G i =diag[G ix ,G iy ,G ], G ix is the longitudinal component of G i , G iy is the component of G i in the sway direction, G is the component of G i in the heading angle, k i,k =diag[k ix,k ,k iy,k ,k iψ,k ], k ix,k is the component of ki,k in the longitudinal direction, kiy, k is the component of ki,k in the sway direction, kiψ,k is the component of ki ,k in the heading angle; θ i, 1 =diag[θ ix,1iy,1iψ,1 ], θ ix,1 is the component of θ i,1 in the longitudinal direction, θ iy,1 is the component of θ i,1 in the yaw direction Component, θ iψ,1 is the component of θ i,1 on the heading angle; p i,k =diag[pi ix,k ,p iy,k ,p iψ,k ], p ix,k is p i,k Component in the longitudinal direction, p iy,k is the component of p i,k in the sway direction, p iψ,k is the component of p i,k in the heading angle; h i =diag[hi ix ,hi iy , hi ], h ix is the component of hi in the longitudinal direction, hi iy is the component of hi in the sway direction, hi is the component of hi in the heading angle; q i ,k =diag[q ix, k , q iy,k , q iψ,k ], q ix,k is the component of qi ,k in the longitudinal direction, q iy,k is the component of qi ,k in the sway direction, q ix,k is The component of q i,k on the heading angle;
Figure GDA0002512532920000143
Figure GDA0002512532920000144
Figure GDA0002512532920000151
Figure GDA0002512532920000152
is the derivative of e ij,1 ,
Figure GDA0002512532920000153
for
Figure GDA0002512532920000154
the derivative of ,
Figure GDA0002512532920000155
for
Figure GDA0002512532920000156
component in the longitudinal direction,
Figure GDA0002512532920000157
for
Figure GDA0002512532920000158
the component in the sway direction,
Figure GDA0002512532920000159
for
Figure GDA00025125329200001510
component in the heading angle.

引入动态面控制技术并设计虚拟控制器的一阶滤波器为:Introducing the dynamic surface control technology and designing the first-order filter of the virtual controller is:

Figure GDA00025125329200001511
Figure GDA00025125329200001511

其中,αi,m=πiihi -1zi,1

Figure GDA00025125329200001512
表示αfi的导数,πi=diag[π123]为滤波器时间常数矩阵,π1为πi在纵向上的分量,π2为πi在横荡方向上的分量,π3为πi在航向角上的分量;zi,1=diag[zix,1,ziy,1,ziψ,1]为转换误差,zix,1为zi,1在纵向上的分量,ziy,1为zi,1在横荡方向上的分量,ziy,1为zi,1在航向角上的分量;αi,f(0)表示滤波虚拟控制αi,f的初始值。本实例中的滤波器时间常数矩阵设计为πi=diag[0.01,0.01,0.01],ki,1=diag[0.01,0.1,1]。where α i,mii h i -1 z i,1 ,
Figure GDA00025125329200001512
Represents the derivative of α fi , π i =diag[π 123 ] is the filter time constant matrix, π 1 is the component of π i in the longitudinal direction, π 2 is the component of π i in the sway direction , π 3 is the component of π i on the heading angle; z i,1 =diag[z ix,1 ,z iy,1 ,z iψ,1 ] is the conversion error, z ix,1 is the longitudinal direction of z i,1 , ziy,1 is the component of zi ,1 in the sway direction, ziy,1 is the component of zi ,1 on the heading angle; α i,f (0) represents the filtering virtual control α i , the initial value of f . The filter time constant matrix in this example is designed as π i =diag[0.01,0.01,0.01], ki ,1 =diag[0.01,0.1,1].

步骤(5):RBF神经网络:Step (5): RBF neural network:

Figure GDA00025125329200001513
Figure GDA00025125329200001513

其中,ω(Zi)=[ωu(Zi),ωv(Zi),ωr(Zi)]T为神经网络补偿函数,ωu(Zi)为ω(Zi)在纵向上的分量,ωv(Zi)为ω(Zi)在横荡方向上的分量,ωr(Zi)为ω(Zi)在航向角上的分量;

Figure GDA00025125329200001514
为理想RBF神经网络权值,
Figure GDA00025125329200001515
为W*在纵向上的分量,
Figure GDA00025125329200001516
为W*在横荡方向上的分量,
Figure GDA00025125329200001517
为W*在航向角上的分量;S(Zi)为回归向量。本实例中高斯径向基函数神经网络
Figure GDA00025125329200001518
包含1000个节点,分布在[0,0.2]×[-4,5]中,宽度为0.8;高斯径向基函数神经网络
Figure GDA00025125329200001519
Figure GDA00025125329200001520
均包含1500个节点,都分布在[0,0.2]×[-5,5]×[-0.1,0.1]中,宽度都为0.8;此外神经网络权值初始值
Figure GDA00025125329200001521
RBF神经网络权值更新率设计如下:Among them, ω(Z i )=[ω u (Z i ), ω v (Z i ), ω r (Z i )] T is the neural network compensation function, ω u (Z i ) is ω(Z i ) in The longitudinal component, ω v (Z i ) is the component of ω(Z i ) in the sway direction, and ω r (Z i ) is the component of ω(Z i ) in the heading angle;
Figure GDA00025125329200001514
is the ideal RBF neural network weight,
Figure GDA00025125329200001515
is the component of W * in the longitudinal direction,
Figure GDA00025125329200001516
is the component of W * in the sway direction,
Figure GDA00025125329200001517
is the component of W * on the heading angle; S(Z i ) is the regression vector. Gaussian radial basis function neural network in this example
Figure GDA00025125329200001518
Contains 1000 nodes distributed in [0, 0.2] × [-4, 5] with a width of 0.8; Gaussian radial basis function neural network
Figure GDA00025125329200001519
and
Figure GDA00025125329200001520
Both contain 1500 nodes, all distributed in [0, 0.2] × [-5, 5] × [-0.1, 0.1], and the width is 0.8; in addition, the initial value of the neural network weights
Figure GDA00025125329200001521
The weight update rate of RBF neural network is designed as follows:

Figure GDA00025125329200001522
Figure GDA00025125329200001522

其中,

Figure GDA00025125329200001523
Figure GDA00025125329200001524
的导数,
Figure GDA00025125329200001525
为第i个无人艇
Figure GDA00025125329200001531
方向上的权值,
Figure GDA00025125329200001526
为第l个无人艇
Figure GDA00025125329200001532
方向上的权值,
Figure GDA00025125329200001535
为回归函数,
Figure GDA00025125329200001534
为第i个无人艇
Figure GDA00025125329200001533
方向上的速度误差,其中设计参数:Γiu=1.5,Γiv=1.5,Γir=3,
Figure GDA00025125329200001530
RBF神经网络权值二范数
Figure GDA00025125329200001528
Figure GDA00025125329200001529
为图8所示。in,
Figure GDA00025125329200001523
for
Figure GDA00025125329200001524
the derivative of ,
Figure GDA00025125329200001525
is the i-th unmanned boat
Figure GDA00025125329200001531
weights in the direction,
Figure GDA00025125329200001526
for the lth unmanned boat
Figure GDA00025125329200001532
weights in the direction,
Figure GDA00025125329200001535
is the regression function,
Figure GDA00025125329200001534
is the i-th unmanned boat
Figure GDA00025125329200001533
Velocity error in direction with design parameters: Γ iu = 1.5, Γ iv = 1.5, Γ ir = 3,
Figure GDA00025125329200001530
RBF neural network weights two norm
Figure GDA00025125329200001528
Figure GDA00025125329200001529
As shown in Figure 8.

考虑

Figure GDA0002512532920000161
Figure GDA0002512532920000162
为第i个无人艇
Figure GDA00025125329200001621
方向上的神经网络权值误差,W*为第i个无人艇
Figure GDA00025125329200001622
方向上的神经网络最优值,
Figure GDA0002512532920000163
的一阶导数为:consider
Figure GDA0002512532920000161
Figure GDA0002512532920000162
is the i-th unmanned boat
Figure GDA00025125329200001621
Neural network weight error in the direction, W * is the i-th UAV
Figure GDA00025125329200001622
The optimal value of the neural network in the direction,
Figure GDA0002512532920000163
The first derivative of is:

Figure GDA0002512532920000164
Figure GDA0002512532920000164

其中,

Figure GDA0002512532920000165
为第l个无人艇
Figure GDA00025125329200001620
方向上的神经网络权值误差。in,
Figure GDA0002512532920000165
for the lth unmanned boat
Figure GDA00025125329200001620
Neural network weight error in direction.

步骤(6):设计的编队跟踪控制器如下:Step (6): The designed formation tracking controller is as follows:

Figure GDA0002512532920000166
Figure GDA0002512532920000166

其中,τi′=[τ′iu,τ′iv,τ′ir]T为设计控制器,τ′iu为τi′在纵向上的分量,τ′iv为τi′在横荡方向上的分量,τ′ir为τi′在航向角上的分量;

Figure GDA0002512532920000167
为虚拟控制器αi,f的导数,
Figure GDA0002512532920000168
为神经网络权值的转置,hi -1为函数hi的逆。在本实例中
Figure GDA0002512532920000169
k2,2=k3,2=k4,2=diag[100,100,100];图9-11分别为无人艇纵向的推力τiu、横荡方向推力τiv、转向的的推力τir。Among them, τ i ′=[τ′ iu ,τ′ iv ,τ′ ir ] T is the design controller, τ′ iu is the component of τ i ′ in the longitudinal direction, τ′ iv is the τ i ′ in the yaw direction The component of τ′ ir is the component of τ i ′ on the heading angle;
Figure GDA0002512532920000167
is the derivative of the virtual controller α i,f ,
Figure GDA0002512532920000168
is the transpose of the neural network weights, and h i -1 is the inverse of the function h i . In this instance
Figure GDA0002512532920000169
k 2,2 =k 3,2 =k 4,2 =diag[100,100,100]; Figures 9-11 are the longitudinal thrust τ iu , the yaw direction thrust τ iv , and the steering thrust τ ir , respectively.

步骤(7):设计的基于经验的编队控制器如下:Step (7): The designed experience-based formation controller is as follows:

Figure GDA00025125329200001610
Figure GDA00025125329200001610

其中,

Figure GDA00025125329200001611
为第i个无人艇的神经网络权值常数,
Figure GDA00025125329200001612
Figure GDA00025125329200001613
在纵向上的分量,
Figure GDA00025125329200001614
Figure GDA00025125329200001615
在横荡方向上的分量,
Figure GDA00025125329200001616
Figure GDA00025125329200001617
航向角上的分量,
Figure GDA00025125329200001618
Figure GDA00025125329200001619
的转置。图12-14分别为纵向的跟踪误差eix,1(t)、横荡方向的跟踪误差eiy,1(t)、航向角的跟踪误差eiψ,1(t)。in,
Figure GDA00025125329200001611
is the neural network weight constant of the i-th UAV,
Figure GDA00025125329200001612
for
Figure GDA00025125329200001613
component in the longitudinal direction,
Figure GDA00025125329200001614
for
Figure GDA00025125329200001615
the component in the sway direction,
Figure GDA00025125329200001616
for
Figure GDA00025125329200001617
component on the heading angle,
Figure GDA00025125329200001618
for
Figure GDA00025125329200001619
transposition of . Figures 12-14 show the longitudinal tracking error e ix,1 (t), the yaw direction tracking error e iy,1 (t), and the heading angle tracking error e iψ,1 (t).

发明针对具有不确定性的多个同构全驱动无人艇的分布式同步跟踪与协同学习控制问题,提出了满足连接保持且具有协同学习的分布式无人艇编队控制方法。控制目标是运用无向通讯拓扑图来表示同构多无人艇之间的信息交互,使一群通讯范围有限的无人艇在以领导-跟随者编队形式跟随给定的领导者轨迹(只有部分跟随者可获得领导者信息)的同时,其中的每个无人艇也满足距离与方位角限制以保证每个无人艇可以探测到其邻居的信息,并与其保持连接。通过引入跟踪误差转换函数确保约束误差的有界性来满足原始的约束跟踪问题;针对相同的模型的不确定性,根据通信拓扑在神经网络权值自适应率之间来实时在线的交流分享权值信息;针对产生回归轨迹的连续非线性动态系统,确定学习可实现未知闭环系统动态的局部准确逼近,本文使用RBF神经网络为多个无人艇的同步跟踪控制设计了一种控制算法,不仅实现了闭环系统所有信号的最终一致有界,而且在稳定的控制过程中,实现了部分神经网络权值收敛到最优值以及未知闭环系统动态的局部准确逼近。学习到的知识以常值神经网络权值的方式存储,可以用来改进系统的控制性能,也可以应用到后续相同或相似的控制任务中。Aiming at the problem of distributed synchronous tracking and collaborative learning control of multiple isomorphic full-drive unmanned boats with uncertainty, the invention proposes a distributed unmanned boat formation control method that satisfies connection retention and has collaborative learning. The control goal is to use the undirected communication topology diagram to represent the information interaction between the homogeneous multi-UAVs, so that a group of UAVs with limited communication range follow the given leader trajectory in the form of leader-follower formation (only part of it). While the follower can obtain the leader information), each of the UAVs also satisfies the distance and azimuth constraints to ensure that each UAV can detect the information of its neighbors and maintain a connection with it. The original constraint tracking problem is satisfied by introducing a tracking error transfer function to ensure the boundedness of the constraint error; for the uncertainty of the same model, real-time online communication and sharing of weights between neural network weight adaptation rates according to communication topology For continuous nonlinear dynamic systems that generate regression trajectories, deterministic learning can achieve local accurate approximation of unknown closed-loop system dynamics. In this paper, RBF neural network is used to design a control algorithm for the synchronous tracking control of multiple unmanned boats, not only Finally, all signals of the closed-loop system are finally consistent and bounded, and in the stable control process, the partial neural network weights converge to the optimal value and the local accurate approximation of the unknown closed-loop system dynamics is realized. The learned knowledge is stored in the form of constant neural network weights, which can be used to improve the control performance of the system, and can also be applied to the same or similar subsequent control tasks.

以上所述实施例仅表达了本发明的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对本发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干变形和改进,这些都属于本发明的保护范围。因此,本发明专利的保护范围应以权利要求所述为准。The above-mentioned embodiments only represent several embodiments of the present invention, and the descriptions thereof are specific and detailed, but should not be construed as a limitation on the scope of the patent of the present invention. It should be pointed out that for those of ordinary skill in the art, without departing from the concept of the present invention, several modifications and improvements can also be made, which all belong to the protection scope of the present invention. Therefore, the protection scope of the patent of the present invention shall be subject to the claims.

Claims (7)

1. A collaborative learning and formation control method for a homogeneous multi-unmanned ship system is characterized by comprising the following steps:
step (1), establishing a plurality of dynamic models of unmanned boats with the same structure;
step (2), designing a tracking error constraint condition according to the safety distance constraint and the communication connection range constraint between adjacent unmanned boats;
step (3), in order to meet the preset performance requirement, designing a tracking error conversion function, and converting the tracking error to obtain a converted conversion error;
step (4), designing a virtual controller by applying a dynamic surface control technology: the derivation of the virtual controller is avoided by combining a dynamic surface control technology and a step-by-step backward controller design technology, so that the input of the controller is prevented from containing the acceleration information of neighbors;
step (5), designing the weight update rate of the RBF neural network: estimating a damping term in the unmanned ship system by applying an RBF neural network;
step (6), designing a state feedback tracking controller: a stable tracking controller is constructed by applying the Lyapunov stability theory and combining a step-by-step back-pushing design method;
step (7), utilizing the stored knowledge to complete knowledge utilization, and designing a state feedback tracking controller based on experience;
in the step (2): using undirected graphs
Figure FDA0002512532910000011
To describe the information interaction among the individuals in the unmanned boat formation system,
Figure FDA0002512532910000012
is a finite, non-empty set, called a set of vertices
Figure FDA0002512532910000013
Each vertex in the system corresponds to a follower with the same number;
Figure FDA0002512532910000014
is a finite set called as an edge set, each edge corresponds to adjacent unmanned boats with the same number and capable of communicating with each other, and an undirected graph
Figure FDA0002512532910000015
Is (a) ofil)(N)×(N)Element a ofil∈ {0,1}, when unmanned boat i is able to obtain information about unmanned boat j, when j is called tail, i is called head, ail1, otherwise ail=0;
Further using expansion diagram
Figure FDA0002512532910000016
To describe the information interaction among the members in the formation system including the pilot,
Figure FDA00025125329100000111
for a virtual leader, the virtual leader maintains communication with the partial follower; adjacency matrix A0=diag[a10,…,aN0]TElement a ofi0∈ {0,1}, and
Figure FDA0002512532910000017
Figure FDA0002512532910000018
designing a continuous differentiable monotone decreasing function vector G (t), G (t) satisfies the first-order conductibility and the second-order conductibility,
Figure FDA0002512532910000019
Figure FDA00025125329100000110
g (0) is the initial value of G (t) for the design constant; the communication range of each unmanned boat is considered to be limited and maximum
Figure FDA0002512532910000021
If it is satisfied with
Figure FDA0002512532910000022
Figure FDA0002512532910000023
Wherein
Figure FDA0002512532910000024
i≠k,Gx(0) Is an initial value in the longitudinal direction, G (t)y(0) G (t) initial value in the sway direction, then there is communication between unmanned boat i and unmanned boat k, where
Figure FDA0002512532910000025
Thus each neighbor is defined as:
Figure FDA0002512532910000026
the augmented neighbor set is:
Figure FDA0002512532910000027
Figure FDA0002512532910000028
and
Figure FDA0002512532910000029
for the design constant in the longitudinal direction,
Figure FDA00025125329100000210
and
Figure FDA00025125329100000211
for the design constant of the yaw direction,
Figure FDA00025125329100000212
and
Figure FDA00025125329100000213
is a design constant at the course angle;
when a group of unmanned boats with limited communication range follows a given leader track in a leader-follower formation mode, each unmanned boat needs to meet the constraint conditions of distance and azimuth angle to ensure that each unmanned boat can detect the information of the neighbor and keep connection with the unmanned boat; the defined error is:
Figure FDA00025125329100000214
wherein ei,1=[eix,1,eiy,1,eiψ,1]T,ei,1Vector for summing up the transformed differences between the unmanned surface vehicle and all its neighbors, eix,1Is ei,1Component in the longitudinal direction, eiy,1Is ei,1Component in the direction of the yaw, eiψ,1Is ei,1Component at heading angle, aikIndicating whether the ith unmanned ship is kept connected with the kth unmanned ship or not, and if the connection a is keptik1, otherwise aik=0;ξi,k=[ξix,kiy,kiψ,k]T,ξix,k、ξiy,k、ξiψ,kAre respectively ξi,kComponents in the longitudinal direction, the yaw direction, the course angle direction,
Figure FDA00025125329100000215
Figure FDA00025125329100000216
xkindicating the kth unmanned ship at geodetic coordinate OeXeYePosition in the lower longitudinal direction, ykIndicating the kth unmanned ship at geodetic coordinate OeXeYePosition in the lower yaw direction,. psikIs the heading angle, G, of the kth unmanned boatx(t) is the longitudinal decay function, Gy(t) attenuation function in the sway directionNumber, Gψ(t) is a decay function over the course angle;
“ξi,kand (5) carrying out vector conversion after the second step on the position difference between the ith unmanned ship and the kth unmanned ship.
2. The collaborative learning and formation control method of the isomorphic multi-unmanned ship system according to claim 1, wherein in the step (1), the dynamic model of the ith unmanned ship is:
Figure FDA00025125329100000217
the first three terms in the above equation are kinematic equations for the system, where,
Figure FDA0002512532910000031
Figure FDA0002512532910000032
n represents the total number of unmanned boats, (x)i,yi) Indicating the i-th unmanned ship at geodetic coordinate OeXeYePosition of lower, xiLongitudinal position of the ith unmanned ship, yiIs the yaw direction position of the ith unmanned ship, psiiThe course angle of the ith unmanned ship; u. ofiRepresenting the longitudinal speed, v, of the ith unmanned boatiRepresents the yaw rate, r, of the ith unmanned boatiRepresenting the steering angular velocity of the ith unmanned boat; m represents the mass matrix of the unmanned vehicle,
Figure FDA0002512532910000033
a vector C (v) representing the acceleration of the i-th unmanned ship in the u, v, r directionsi) Represents a Coriolis force matrix, wherein vi=[ui,vi,ri]TAs a velocity vector, D (v)i) Representing a damping matrix, each unmanned ship having the same M matrix since the plurality of unmanned ships have the same structure; tau isi=[τuiviri]TFor the controller vector to be designed, τuiExpressing thrust in the longitudinal direction of the i-th unmanned ship, tauviExpressing thrust in the yaw direction, τ, of the ith unmanned boatriRepresenting the moment of steering of the ith unmanned boat; tau isωi=[τωuiωviωri]TIs an external time-varying disturbance, tauωuiRepresenting the external time-varying disturbance, τ, experienced by the ith unmanned boat in the longitudinal directionωviRepresents the external time-varying disturbance, tau, received by the ith unmanned boat in the sway directionωriRepresenting the external time-varying disturbance to the ith unmanned boat in the direction of the steering angle; matrix M, C (v)i)、D(vi)、J(ηi) The specific forms of (A) and (B) are respectively as follows:
Figure FDA0002512532910000034
Figure FDA0002512532910000035
wherein m is11、m22、m23、m33Is a constant number d11(ui) Is about uiFunction of d22(vi,ri)、d23(vi,ri)、d32(vi,ri)、d33(vi,ri) Is about vi,riA function of (a); the kinetic equation is converted to the form:
Figure FDA0002512532910000036
wherein v'i=J(ηi)v,
Figure FDA0002512532910000037
τ′i=J(ηi)M-1τi,τ′ωi=J(ηi)M-1τωi
Figure FDA0002512532910000038
Figure FDA0002512532910000039
Is a rotation matrix J (η)i) Derivative of (A), J-1i) Is a rotation matrix J (η)i) Inverse of (A), M-1Is the inverse of the mass matrix M.
3. The collaborative learning and formation control method of the isomorphic multi-unmanned ship system according to claim 2, wherein in the step (3), the tracking error constraint conditions are designed as follows:
Figure FDA00025125329100000310
where j is x, y, ψ,
Figure FDA00025125329100000311
eji(t) represents eji(t) a lower bound performance function of,
Figure FDA00025125329100000312
denotes eji(t) an upper bound performance function,
Figure FDA00025125329100000313
representing a performance function
Figure FDA00025125329100000314
Is set to the initial value of (a),
Figure FDA00025125329100000315
representing a performance function
Figure FDA00025125329100000316
The steady-state value of (a) is,e ji,0representing a performance functione ji(t) initiation ofThe value of the one or more of,e ji,∞representing a performance functione jiSteady state value of (t), kjiRepresents the convergence rate thereof; the error tracking transfer function is therefore designed to:
Figure FDA0002512532910000041
wherein z isji,1Indicating the conversion error of the ith unmanned boat,
Figure FDA0002512532910000042
Figure FDA0002512532910000043
z representing a natural base numberji,1The power of the first power of the image,
Figure FDA0002512532910000044
z representing a natural base numberji,1Power of the order, -gammaji<Tji(zji,1ji)<1,
Figure FDA0002512532910000045
If and only if zji,1When equal to 0, Tji(zji,1ji) 0; the following conversion errors were obtained:
Figure FDA0002512532910000046
4. the collaborative learning and formation control method of the homogeneous multi-unmanned ship system according to claim 3, wherein in the step (4), the filter error is defined as:
ei,f=αi,fi
zi,2=v′ii,f
wherein
Figure FDA0002512532910000047
ei,f=[eix,f,eiy,f,eiψ,f]TTo filter the error vector, eix,fIs ei,fComponent in the longitudinal direction, eiy,fIs ei,fComponent in the direction of the yaw, eiψ,fIs ei,fComponent at heading angle αi=[αixiy]TTo virtually control the input vector, αixIs αiComponent in the longitudinal direction, αiyIs αiComponent in the direction of the yaw, αIs αiComponent at heading angle αi,f=[αix,fiy,fiψ,f]TFor filtering the virtual control input vector, αix,fRepresentation αixFilter virtual control of αiy,fRepresentation αiyFilter virtual control of αiψ,fRepresentation αFiltering virtual control of (3); z is a radical ofi,2=[ziu,2,ziv,2,zir,2]TTo convert the speed error, ziu,2Is zi,2Component in the longitudinal direction, ziv,2Is zi,2Component in the direction of the yaw, zir,2Is zi,2A component at a heading angle;
designing a virtual controller as follows:
Figure FDA0002512532910000048
wherein k isi,1=diag[kix,1,kiy,1,kiψ,1]To design the parameter, kix,1Is ki,1Component in the longitudinal direction, kix,1Is ki,1Component in the yaw direction, kiψ,1Is ki,1A component at a heading angle; gi=diag[Gix,Giy,G],GixIs GiComponent in the longitudinal direction, GiyIs GiComponent in the direction of the yaw, GIs GiComponent at heading angle, ki,k=diag[kix,k,kiy,k,kiψ,k],kix,kIs ki,kComponent in the longitudinal direction, kiy,kIs ki,kComponent in the yaw direction, kiψ,kIs ki,kA component at a heading angle; thetai,1=diag[θix,1iy,1iψ,1],θix,1Is thetai,1Component in the longitudinal direction, θiy,1Is thetai,1Component in the direction of the yaw, θiψ,1Is thetai,1A component at a heading angle; p is a radical ofi,k=diag[pix,k,piy,k,piψ,k],pix,kIs pi,kComponent in the longitudinal direction, piy,kIs pi,kComponent in the direction of the yaw, piψ,kIs pi,kA component at a heading angle; h isi=diag[hix,hiy,h],hixIs hiComponent in the longitudinal direction, hiyIs hiComponent in the direction of the yaw, hIs hiA component at a heading angle; q. q.si,k=diag[qix,k,qiy,k,qiψ,k],qix,kIs qi,kComponent in the longitudinal direction, qiy,kIs qi,kComponent in the direction of the yaw, qix,kIs qi,kA component at a heading angle;
Figure FDA0002512532910000051
Figure FDA0002512532910000052
Figure FDA00025125329100000515
Figure FDA0002512532910000053
is eij,1The derivative of (a) of (b),
Figure FDA0002512532910000054
is composed of
Figure FDA0002512532910000055
The derivative of (a) of (b),
Figure FDA0002512532910000056
is composed of
Figure FDA0002512532910000057
The component in the longitudinal direction is,
Figure FDA0002512532910000058
is composed of
Figure FDA0002512532910000059
The component in the direction of the yaw is,
Figure FDA00025125329100000510
is composed of
Figure FDA00025125329100000511
A component at a heading angle;
introducing a dynamic surface control technology and designing a first-order filter of a virtual controller as follows:
Figure FDA00025125329100000512
wherein,
Figure FDA00025125329100000513
Figure FDA00025125329100000514
representation αfiDerivative of (a) (# n)i=diag[π123]Is a filter time constant matrix, pi1Is piiComponent in the longitudinal direction, pi2Is piiComponent in the direction of the yaw, pi3Is piiA component at a heading angle; z is a radical ofi,1=diag[zix,1,ziy,1,ziψ,1]To convert errors, zix,1Is zi,1Component in the longitudinal direction, ziy,1Is zi,1Component in the direction of the yaw, ziy,1Is zi,1Component at heading angle αi,f(0) Representing filtering virtual controls αi,fAn initial value of (1);
aikwhether the ith unmanned ship is kept connected with the kth unmanned ship or not is judged, and if the connection a is keptik1, otherwise aik=0;ei,1The vector is obtained by converting and summing the position differences of the unmanned ship and all neighbors of the unmanned ship;
αi,m(0) is the value of the virtual controller at time zero;
Giis a decay function; k is a radical ofi,kTo design a constant vector; thetai,1For errors e in virtual controllersi,1A variable coefficient of (d); p is a radical ofi,kVector after the first step of conversion is carried out on the position difference of the ith unmanned ship and the kth unmanned ship; h isiThe reciprocal of the vector sum after the first step of conversion is carried out on the position difference of the ith unmanned ship and the kth unmanned ship; q. q.si,kIs pi,kThe transformed vector of (2); chi shapeiIs formed by ei,1、ei,1Upper bound function of and ei,1Is a lower bound function ofiThe inverse vector of (a) is the error e in the virtual controlleri,1Is measured.
5. The collaborative learning and formation control method of the isomorphic multi-unmanned ship system according to claim 4, wherein in the step (5), the RBF neural network:
ω(Zi)=D′(ηi,v′i)v′i=W*TS(Zi)+i(Zi)
wherein, ω (Z)i)=[ωu(Zi),ωv(Zi),ωr(Zi)]TAs a compensation function of the neural network, omegau(Zi) Is omega (Z)i) Component in the longitudinal direction, ωv(Zi) Is omega (Z)i) Component in the direction of the yaw, ωr(Zi) Is omega (Z)i) A component at a heading angle;
Figure FDA0002512532910000061
is the weight of an ideal RBF neural network,
Figure FDA0002512532910000062
is W*The component in the longitudinal direction is,
Figure FDA0002512532910000063
is W*The component in the direction of the yaw is,
Figure FDA0002512532910000064
is W*A component at a heading angle; s (Z)i) Is a regression vector; the RBF neural network weight updating rate is designed as follows:
Figure FDA0002512532910000065
wherein,
Figure FDA0002512532910000066
is composed of
Figure FDA0002512532910000067
The derivative of (a) of (b),
Figure FDA0002512532910000068
is the ith unmanned ship
Figure FDA00025125329100000624
The weight in the direction is given to the user,
Figure FDA0002512532910000069
is the first unmanned boat
Figure FDA00025125329100000623
Direction of rotationThe weight of the upper node is higher than the weight of the lower node,
Figure FDA00025125329100000621
in the form of a regression function,
Figure FDA00025125329100000619
is the ith unmanned ship
Figure FDA00025125329100000620
The error in the speed in the direction of the direction,
Figure FDA00025125329100000622
in order to design the parameters of the device,
Figure FDA00025125329100000616
is composed of
Figure FDA00025125329100000617
The correction term is a term that is used to correct,
Figure FDA00025125329100000618
is a cooperative adjustment coefficient;
consider that
Figure FDA00025125329100000610
Figure FDA00025125329100000611
Is the weight error of the neural network in the l direction of the ith unmanned ship, W*Is the weight of an ideal RBF neural network,
Figure FDA00025125329100000612
the first derivative of (d) is:
Figure FDA00025125329100000613
wherein,
Figure FDA00025125329100000614
is the first unmanned boat
Figure FDA00025125329100000615
Neural network weight error in direction;
i(Zi) Approximating the error for a neural network; ziInputting a vector for the neural network;
ailwhether the ith unmanned ship is kept connected with the ith unmanned ship or not is judged, and if the ith unmanned ship is kept connected with the ith unmanned ship, ail1, otherwise ail=0。
6. The collaborative learning and formation control method of the homogeneous multi-unmanned ship system according to claim 5, wherein the state feedback tracking controller designed in the step (6) is as follows:
Figure FDA0002512532910000071
wherein, taui′=[τ′iu,τ′iv,τ′ir]TTo design the controller, τ'iuIs taui'component in longitudinal direction, τ'ivIs taui'component in the yaw direction, τ'irIs taui' component at heading angle; k is a radical ofi,2=diag[kix,2,kiy,2,kiψ,2]Is a diagonal matrix, kix,2Denotes a formation controller τ'ixDesign parameter of (1), kiy,2Denotes a formation controller τ'iyDesign parameter of (1), kiψ,2Denotes a formation controller τ'The design parameters of (1);
Figure FDA0002512532910000072
is a virtual controller αi,fThe derivative of (a) of (b),
Figure FDA0002512532910000073
for transposition of weights of neural networks, hi -1As a function hiThe inverse of (1);
zi,2to convert speed errors.
7. The collaborative learning and formation control method of a homogeneous multi-unmanned ship system according to claim 6, wherein the experience-based state feedback tracking controller designed in the step (7) is as follows:
Figure FDA0002512532910000074
wherein,
Figure FDA0002512532910000075
is the weight constant of the neural network of the ith unmanned ship,
Figure FDA0002512532910000076
is composed of
Figure FDA0002512532910000077
The component in the longitudinal direction is,
Figure FDA0002512532910000078
is composed of
Figure FDA0002512532910000079
The component in the direction of the yaw is,
Figure FDA00025125329100000710
is composed of
Figure FDA00025125329100000711
The component in the angle of the heading,
Figure FDA00025125329100000712
is composed of
Figure FDA00025125329100000713
The transposing of (1).
CN201910560204.3A 2019-06-26 2019-06-26 Collaborative learning and formation control method for isomorphic multi-unmanned ship system Active CN110262494B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910560204.3A CN110262494B (en) 2019-06-26 2019-06-26 Collaborative learning and formation control method for isomorphic multi-unmanned ship system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910560204.3A CN110262494B (en) 2019-06-26 2019-06-26 Collaborative learning and formation control method for isomorphic multi-unmanned ship system

Publications (2)

Publication Number Publication Date
CN110262494A CN110262494A (en) 2019-09-20
CN110262494B true CN110262494B (en) 2020-09-22

Family

ID=67921767

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910560204.3A Active CN110262494B (en) 2019-06-26 2019-06-26 Collaborative learning and formation control method for isomorphic multi-unmanned ship system

Country Status (1)

Country Link
CN (1) CN110262494B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111103833B (en) * 2019-12-20 2023-03-07 南京邮电大学 A volume consistency controller system and design method for reaction liquid in multiple chemical reaction tanks
CN111506114B (en) * 2020-05-25 2021-05-04 北京理工大学 A method of aircraft formation control
CN112034711B (en) * 2020-08-31 2022-06-03 东南大学 Unmanned ship sea wave interference resistance control method based on deep reinforcement learning
CN112099506A (en) * 2020-09-17 2020-12-18 北京航空航天大学 Tracking control method and system for under-actuated unmanned ship time-varying formation
CN112817318B (en) * 2021-01-06 2022-02-11 上海大学 A kind of multi-unmanned boat cooperative search control method and system
CN112947462B (en) * 2021-03-02 2022-11-25 广东省智能机器人研究院 Unmanned ship group formation cooperative control method considering time-varying drift angle and attitude adjustment
CN113189979B (en) * 2021-04-02 2023-12-01 大连海事大学 Finite time control method for distributed queue of unmanned ship

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8150621B1 (en) * 2009-04-07 2012-04-03 The United States of America as represeneted by the Secretary of the Navy Command and control of autonomous surface vehicle
CN107085427B (en) * 2017-05-11 2019-06-18 华南理工大学 A formation control method of unmanned surface craft based on leader-following structure
CN108008628B (en) * 2017-11-17 2020-02-18 华南理工大学 A Preset Performance Control Method for Uncertain Underactuated Unmanned Vehicle System
CN108983612A (en) * 2018-08-08 2018-12-11 华南理工大学 A kind of underwater robot formation control method kept with default capabilities and connection

Also Published As

Publication number Publication date
CN110262494A (en) 2019-09-20

Similar Documents

Publication Publication Date Title
CN110262494B (en) Collaborative learning and formation control method for isomorphic multi-unmanned ship system
CN110196599B (en) A distributed formation control method for unmanned boats under the constraints of collision avoidance and connection retention
CN107085427B (en) A formation control method of unmanned surface craft based on leader-following structure
CN108008628B (en) A Preset Performance Control Method for Uncertain Underactuated Unmanned Vehicle System
Yan et al. Consensus formation tracking for multiple AUV systems using distributed bioinspired sliding mode control
CN108303891B (en) Based on more AUV distributed collaboration tracking and controlling methods under the disturbance of uncertain ocean current
CN108073175B (en) Under-actuated unmanned ship formation intelligent control method based on virtual ship self-adaptive planning
CN112965371B (en) Water surface unmanned ship track rapid tracking control method based on fixed time observer
CN109839934A (en) Unmanned water surface ship default capabilities tracking and controlling method based on RISE technology
CN114578819B (en) A control method for distributed formation of multiple surface ships based on artificial potential field method
CN109240289B (en) Wave glider bow information adaptive filtering method
Hao et al. Layered fully distributed formation-containment tracking control for multiple unmanned surface vehicles
Li et al. Distributed robust prescribed performance 3-D time-varying formation control of underactuated AUVs under input saturations and communication delays
CN112099506A (en) Tracking control method and system for under-actuated unmanned ship time-varying formation
CN113848887A (en) Under-actuated unmanned ship trajectory tracking control method based on MLP method
CN114035566A (en) Design method, system and device of finite-time anti-saturation controller for unmanned boat
CN109062232B (en) Distributed finite-time anti-buffering configuration inclusion control method for submarine geophone flight nodes
CN113359781B (en) Networked surface vessel tracking control method, device, equipment and storage medium
Wang et al. Distributed prescribed-time formation control for underactuated surface vehicles with input saturation: Theory and experiment
Ma et al. Non-singular fixed-time robust containment control for autonomous surface vehicles
CN118068833B (en) A docking control method for unmanned surface vessels based on preset performance
Ghommam et al. Cooperative learning‐based practical formation‐containment control with prescribed performance for heterogeneous clusters of UAV/USV
CN119439996A (en) An adaptive switching control method for unmanned surface vehicle
CN118012041A (en) Event-triggering-based multi-power positioning ship cooperative control system design
CN117519193A (en) A preset time energy-saving motion control method for unmanned boats

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant