JP7428288B1

JP7428288B1 - Plant response estimation device, plant response estimation method, and program

Info

Publication number: JP7428288B1
Application number: JP2023071309A
Authority: JP
Inventors: 吉雄丹下
Original assignee: Fuji Electric Co Ltd
Current assignee: Fuji Electric Co Ltd
Priority date: 2023-04-25
Filing date: 2023-04-25
Publication date: 2024-02-06
Anticipated expiration: 2043-04-25

Abstract

【課題】操業中のプラントの運転データから応答モデルを精度良く推定する技術を提供すること。【解決手段】本開示の一態様によるプラント応答推定装置は、制御対象プラントの運転データに基づいて、前記制御対象プラントの状態を表す状態ベクトルを作成する状態ベクトル作成部と、前記状態ベクトルを用いて、前記状態ベクトルを学習データとするか否かを判定する学習データ選別部と、前記状態ベクトルを学習データとすると判定された場合、前記状態ベクトルを学習データとして用いて、前記制御対象プラントの応答モデルを表す関数のパラメータを更新するパラメータ更新部と、を有する。【選択図】図２[Problem] To provide a technology for accurately estimating a response model from operational data of an operating plant. A plant response estimation device according to an aspect of the present disclosure includes a state vector creation unit that creates a state vector representing a state of a controlled plant based on operation data of the controlled plant; a learning data selection unit that determines whether or not the state vector is to be used as learning data; and a parameter updating unit that updates parameters of a function representing a response model. [Selection diagram] Figure 2

Description

本開示は、プラント応答推定装置、プラント応答推定方法、及びプログラムに関する。 The present disclosure relates to a plant response estimation device, a plant response estimation method, and a program.

制御対象の制御量を目標値に追従させることを目的とする制御方式として、モデル予測制御が知られている。また、モデル予測制御で用いられる線形予測モデルのパラメータを推定する手法として、逐次最小２乗法（ＲＬＳ法：Recursive Least Squares法）等が知られている。例えば、特許文献１には、操業中のプラントの運転データを用いて、ＲＬＳ法によりプラント応答モデルのパラメータを推定する技術が記載されている。 Model predictive control is known as a control method that aims to make the controlled variable of a controlled object follow a target value. Furthermore, as a method for estimating parameters of a linear prediction model used in model predictive control, the Recursive Least Squares method (RLS method) and the like are known. For example, Patent Document 1 describes a technique for estimating the parameters of a plant response model using the RLS method using operational data of a plant in operation.

特開２０２３－５３５７３号公報JP 2023-53573 Publication

しかしながら、従来技術では、プラント応答モデルのパラメータ更新に用いられる学習データとして、例えば、定常状態における一定値しか含まれない運転データが用いられ、不適切なモデルパラメータが推定されてしまうことがある。一定値や１つの周波数成分しか含まれない運転データ等は、プラントのダイナミクスの情報が欠落しており、学習データとしては不適切だが、操業中に十分生じ得る。このような不適切な運転データでの学習では、プラント応答モデルのパラメータの推定値が真値と乖離してしまう場合がある。 However, in the conventional technology, for example, operating data that includes only constant values in a steady state is used as learning data used to update parameters of a plant response model, and inappropriate model parameters may be estimated. Operation data that includes only a constant value or one frequency component lacks information on plant dynamics and is inappropriate as learning data, but it can easily occur during operation. In learning using such inappropriate operational data, the estimated values of the parameters of the plant response model may deviate from the true values.

本開示は、上記の点に鑑みてなされたもので、操業中のプラントの運転データから応答モデルを精度良く推定する技術を提供する。 The present disclosure has been made in view of the above points, and provides a technique for accurately estimating a response model from operational data of an operating plant.

本開示の一態様によるプラント応答推定装置は、制御対象プラントの運転データに基づいて、前記制御対象プラントの状態を表す状態ベクトルを作成する状態ベクトル作成部と、前記状態ベクトルを用いて、前記状態ベクトルを学習データとするか否かを判定する学習データ選別部と、前記状態ベクトルを学習データとすると判定された場合、前記状態ベクトルを学習データとして用いて、前記制御対象プラントの応答モデルを表す関数のパラメータを更新するパラメータ更新部と、を有する。 A plant response estimation device according to an aspect of the present disclosure includes a state vector creation unit that creates a state vector representing a state of the controlled plant based on operation data of the controlled plant; a learning data selection unit that determines whether the vector is to be used as learning data; and when it is determined that the state vector is to be used as learning data, the state vector is used as learning data to represent a response model of the controlled plant; and a parameter update unit that updates parameters of the function.

操業中のプラントの運転データから応答モデルを精度良く推定する技術が提供される。 A technique is provided for estimating a response model with high accuracy from operating data of a plant in operation.

第一の実施形態に係るプラント応答推定装置のハードウェア構成の一例を示す図である。1 is a diagram illustrating an example of a hardware configuration of a plant response estimation device according to a first embodiment; FIG. 第一の実施形態に係るプラント応答推定装置の機能構成の一例を示す図である。1 is a diagram illustrating an example of a functional configuration of a plant response estimation device according to a first embodiment. FIG. バッファ部の動作の一例を説明するための図である。FIG. 3 is a diagram for explaining an example of the operation of a buffer section. 第一の実施形態に係る学習データ選別処理の一例を説明するためのフローチャートである。It is a flowchart for explaining an example of learning data selection processing concerning a first embodiment. 第一の実施形態に係る拡大モデルパラメータ推定処理の一例を説明するためのフローチャートである。7 is a flowchart for explaining an example of expanded model parameter estimation processing according to the first embodiment. パラメータ変換部の動作の一例を説明するための図である。FIG. 3 is a diagram for explaining an example of the operation of a parameter conversion section. ステップ応答計算部の動作の一例を説明するための図である。FIG. 3 is a diagram for explaining an example of the operation of a step response calculation section. 第一の実施形態に係るステップ応答計算処理の一例を説明するためのフローチャートである。7 is a flowchart for explaining an example of step response calculation processing according to the first embodiment. 状態ベクトル更新の一例を説明するための図である。FIG. 3 is a diagram for explaining an example of state vector updating. 第二の実施形態に係る学習データ選別処理の一例を説明するためのフローチャートである。It is a flowchart for explaining an example of learning data selection processing concerning a second embodiment. 第三の実施形態に係る学習データ選別処理の一例を説明するためのフローチャートである。It is a flowchart for explaining an example of learning data selection processing concerning a third embodiment. 第四の実施形態に係る学習データ選別処理の一例を説明するためのフローチャートである。13 is a flowchart for explaining an example of learning data selection processing according to the fourth embodiment. 実施例における対象プラントのステップ応答を示す図である。It is a figure which shows the step response of the target plant in an Example. 学習データ選別部を用いない場合のモデルパラメータ推定結果及び制御結果を示す図（その１）である。FIG. 3 is a diagram (part 1) showing model parameter estimation results and control results when a learning data selection unit is not used. 学習データ選別部を用いない場合のモデルパラメータ推定結果及び制御結果を示す図（その２）である。FIG. 7 is a diagram (part 2) showing model parameter estimation results and control results when the learning data selection unit is not used. 学習データ選別部を用いた場合のモデルパラメータ推定結果及び制御結果を示す図（その１）である。FIG. 3 is a diagram (part 1) showing model parameter estimation results and control results when a learning data selection unit is used. 学習データ選別部を用いた場合のモデルパラメータ推定結果及び制御結果を示す図（その２）である。FIG. 7 is a diagram (part 2) showing model parameter estimation results and control results when the learning data selection unit is used.

以下、本発明の実施形態について説明する。以下の各実施形態では、操業中のプラントの運転データからプラント応答モデルを精度良く推定することができるプラント応答推定装置１０について説明する。なお、操業中とはプラントが正常に稼働し、通常の運転を行っている状態のことであり、例えば、オンライン中、運転中、運用中等と呼ばれてもよい。また、プラントとは１以上の機械、機器、装置等で構成される産業設備のことであり、プラント応答モデルを用いたモデル予測制御によって制御される制御対象である。プラントの具体例としては、例えば、石油化学プラント、食品プラント、鉄鋼プラント、発電プラント等といったものが挙げられるが、これらは一例であって、これらに限られるものではない。また、一般に、これらのプラントの制御システムでは、プラントの制御量、操作量、外乱量といった入出力データが運転データとして扱われる。 Embodiments of the present invention will be described below. In each of the following embodiments, a plant response estimation device 10 that can accurately estimate a plant response model from operation data of a plant in operation will be described. Note that "in operation" refers to a state in which the plant is operating normally and is performing normal operation, and may also be called, for example, on-line, in operation, or in operation. Furthermore, a plant is an industrial facility composed of one or more machines, devices, devices, etc., and is a control target that is controlled by model predictive control using a plant response model. Specific examples of plants include, for example, petrochemical plants, food plants, steel plants, power generation plants, etc., but these are just examples and are not limited to these. Further, in general, in these plant control systems, input/output data such as plant control amounts, operation amounts, and disturbance amounts are treated as operational data.

ここで、プラントの運転条件等によっては、例えば、定常状態が継続したり、運転データの値が上限値や下限値に偏ってしまったりすることがある。このため、定常状態における運転データや上限値・下限値に偏った運転データを学習データとして用いてプラント応答モデルのパラメータを更新した場合、それらの運転データによってパラメータが収束し、その結果、パラメータの推定値が真値と乖離してしまうことがある。このように、定常状態における運転データや上限値・下限値に偏った運転データ等は、プラント応答モデルの学習データとしては適切でない運転データであるといえる。なお、学習データとは、プラント応答モデルのパラメータ更新に用いられるデータのことである。 Here, depending on the operating conditions of the plant, for example, a steady state may continue, or the value of the operating data may be biased towards the upper limit value or the lower limit value. Therefore, if the parameters of the plant response model are updated using steady-state operating data or operating data that is biased toward the upper and lower limits as learning data, the parameters will converge based on those operating data, and as a result, the parameters will The estimated value may deviate from the true value. In this way, it can be said that operating data in a steady state, operating data biased towards upper and lower limit values, etc. are not suitable as learning data for a plant response model. Note that the learning data is data used to update parameters of the plant response model.

そこで、以下の各実施形態では、定常状態における運転データや上限値・下限値に偏った運転データ等といった適切でない運転データは学習データとしては用いずに、それ以外の運転データを学習データとして用いてプラント応答モデルのパラメータを更新する場合について説明する。これにより、以下の各実施形態に係るプラント応答推定装置１０では、精度の良いプラント応答モデルを推定することが可能となる。 Therefore, in each of the following embodiments, unsuitable operation data such as operation data in a steady state or operation data biased toward upper and lower limit values is not used as learning data, and other operation data is used as learning data. The case where the parameters of the plant response model are updated will be explained. Thereby, the plant response estimating device 10 according to each embodiment below can estimate a highly accurate plant response model.

［第一の実施形態］
まず、第一の実施形態について説明する。 [First embodiment]
First, a first embodiment will be described.

＜プラント応答推定装置１０のハードウェア構成例＞
第一の実施形態に係るプラント応答推定装置１０のハードウェア構成例を図１に示す。図１に示すように、第一の実施形態に係るプラント応答推定装置１０は、入力装置１１と、表示装置１２と、外部Ｉ／Ｆ１３と、通信Ｉ／Ｆ１４と、プロセッサ１５と、メモリ装置１６とを有する。これらの各ハードウェアは、それぞれがバス１７を介して通信可能に接続される。 <Example of hardware configuration of plant response estimation device 10>
FIG. 1 shows an example of the hardware configuration of a plant response estimation device 10 according to the first embodiment. As shown in FIG. 1, the plant response estimation device 10 according to the first embodiment includes an input device 11, a display device 12, an external I/F 13, a communication I/F 14, a processor 15, and a memory device 16. and has. Each of these pieces of hardware is communicably connected via a bus 17 .

入力装置１１は、例えば、キーボード、マウス、タッチパネル、各種物理ボタン等である。表示装置１２は、例えば、ディスプレイ、表示パネル等である。なお、プラント応答推定装置１０は、例えば、入力装置１１及び表示装置１２のうちの少なくとも一方を有していなくてもよい。 The input device 11 is, for example, a keyboard, a mouse, a touch panel, various physical buttons, or the like. The display device 12 is, for example, a display, a display panel, or the like. Note that the plant response estimation device 10 may not include at least one of the input device 11 and the display device 12, for example.

外部Ｉ／Ｆ１３は、記録媒体１３ａ等の外部装置とのインタフェースである。記録媒体１３ａとしては、例えば、ＣＤ（Compact Disc）、ＤＶＤ（Digital Versatile Disk）、ＳＤメモリカード（Secure Digital memory card）、ＵＳＢ（Universal Serial Bus）メモリカード等が挙げられる。 The external I/F 13 is an interface with an external device such as a recording medium 13a. Examples of the recording medium 13a include a CD (Compact Disc), a DVD (Digital Versatile Disk), an SD memory card (Secure Digital memory card), and a USB (Universal Serial Bus) memory card.

通信Ｉ／Ｆ１４は、プラント応答推定装置１０を通信ネットワークに接続するためのインタフェースである。プロセッサ１５は、例えば、ＣＰＵ（Central Processing Unit）等の各種演算装置である。メモリ装置１６は、例えば、ＳＳＤ（Solid State Drive）、ＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）、フラッシュメモリ等の各種記憶装置である。 Communication I/F 14 is an interface for connecting plant response estimation device 10 to a communication network. The processor 15 is, for example, various arithmetic devices such as a CPU (Central Processing Unit). The memory device 16 is, for example, various storage devices such as an SSD (Solid State Drive), a RAM (Random Access Memory), a ROM (Read Only Memory), and a flash memory.

なお、図１に示すハードウェア構成は一例であって、プラント応答推定装置１０は、他のハードウェア構成であってもよい。例えば、プラント応答推定装置１０は、複数のプロセッサ１５や複数のメモリ装置１６を有していてもよいし、図示したハードウェア以外の種々のハードウェアを有していてもよい。 Note that the hardware configuration shown in FIG. 1 is an example, and the plant response estimation device 10 may have another hardware configuration. For example, the plant response estimation device 10 may include a plurality of processors 15 and a plurality of memory devices 16, or may include various hardware other than the illustrated hardware.

＜プラント応答推定装置１０の機能構成例＞
第一の実施形態に係るプラント応答推定装置１０の機能構成例を図２に示す。図２に示すように、第一の実施形態に係るプラント応答推定装置１０は、モデルパラメータ推定部１０１と、ステップ応答計算部１０２とを有する。これら各部は、例えば、プラント応答推定装置１０にインストールされた１以上のプログラムが、プロセッサ１５等に実行させる処理により実現される。 <Example of functional configuration of plant response estimation device 10>
FIG. 2 shows an example of the functional configuration of the plant response estimation device 10 according to the first embodiment. As shown in FIG. 2, the plant response estimation device 10 according to the first embodiment includes a model parameter estimation section 101 and a step response calculation section 102. Each of these units is realized, for example, by one or more programs installed in the plant response estimating device 10 causing the processor 15 or the like to execute the process.

モデルパラメータ推定部１０１は、サンプリング周期Δ毎に、制御対象のプラント（又はその運転状態を計測するセンサ等の機器）から運転データの観測値（制御量ｙ、操作量ｕ及び外乱ｖ）を受信し、そのサンプリング周期Δに応じて、プラント応答モデルのモデルパラメータを推定する。なお、運転データは、例えば、計測データや観測データ等と呼ばれてもよい。 The model parameter estimating unit 101 receives observed values of operating data (controlled variable y, manipulated variable u, and disturbance v) from a plant to be controlled (or a device such as a sensor that measures its operating state) at every sampling period Δ. Then, model parameters of the plant response model are estimated according to the sampling period Δ. Note that the operation data may be called, for example, measurement data, observation data, or the like.

ここで、時刻をｔとすれば制御量ｙ、操作量ｕ及び外乱ｖはそれぞれｙ（ｔ）、ｕ（ｔ）及びｖ（ｔ）と表され、各サンプリング時刻ｔ_ｋ（ｋはサンプリング時刻を表すインデックスであり、定周期でインクリメントされる。）に関してｔ_ｋ＋１－ｔ_ｋ＝Δが成り立つ。なお、ｋは０以上の整数を取り得るものとし、またサンプリング時刻ｔ_ｋはｔ_１から開始するものとする。以下では、特に断らない限り、各サンプリング時刻ｔ_ｋをそのインデックスｋと同一視し、ｙ（ｋ）、ｕ（ｋ）及びｖ（ｋ）とも表すことにする。 Here, if time is t, the controlled variable y, the manipulated variable u, and the disturbance v are expressed as y(t), u(t), and v(t), respectively, and each sampling time t _k (k is the sampling time t _k+1 −t k =Δ holds true for t k+1 −t _k =Δ. It is assumed that k can take an integer greater than or equal to 0, and that the sampling time _tk starts from _t1 . In the following, unless otherwise specified, each sampling time t _k will be equated with its index k, and will also be expressed as y(k), u(k), and v(k).

また、以下では、プラント応答モデルは、パラメータθを持つプラント応答関数Ｓ_θ（ｔ）で表されるものとして、パラメータθをモデルパラメータということにする。プラント応答モデルとしては様々なプラント応答関数Ｓ_θで表現されるモデルを採用することが可能であるが、以下では、主に、ＡＲＭＡＸモデルといった多項式モデルを想定する。プラント応答モデルがＡＲＭＡＸモデルである場合、モデルパラメータθはＡＲＭＡＸモデルの係数となる。なお、プラント応答モデルとしてＡＲＭＡＸモデル以外の様々なモデルが採用可能であることは言うまでもない。 Further, in the following, the plant response model is represented by a plant response function S _θ (t) having a parameter θ, and the parameter θ is referred to as a model parameter. Although it is possible to employ various models expressed by a plant response function S _θ as the plant response model, a polynomial model such as an ARMAX model is mainly assumed below. When the plant response model is an ARMAX model, the model parameter θ becomes a coefficient of the ARMAX model. It goes without saying that various models other than the ARMAX model can be employed as the plant response model.

更に、以下では、ＡＲＭＡＸモデルで表されたプラント応答モデルに対して、制御対象プラントの定常状態を表すオフセット項（定数項）を追加したモデルを拡大プラント応答モデルと呼ぶことにする。 Further, hereinafter, a model obtained by adding an offset term (constant term) representing the steady state of the controlled plant to the plant response model represented by the ARMAX model will be referred to as an expanded plant response model.

ステップ応答計算部１０２は、モデルパラメータ推定部１０１によって推定されたモデルパラメータ（以下、モデルパラメータ推定値ともいう。）θを用いて、与えられた時刻ｔにおけるプラントのステップ応答の推定値（以下、ステップ応答推定値ともいう。）Ｓ_θ（ｔ）を計算する。なお、ステップ応答とは、操作量としてステップ信号がプラントに印加されたときの制御量のことである。 The step response calculation unit 102 uses the model parameters (hereinafter also referred to as model parameter estimated values) θ estimated by the model parameter estimation unit 101 to calculate an estimated value of the step response of the plant at a given time t (hereinafter referred to as (Also referred to as step response estimate.) Calculate S _θ (t). Note that the step response is a controlled variable when a step signal is applied to the plant as a manipulated variable.

ただし、ステップ応答計算部１０２はモデルパラメータ推定値θの利用方法の１つであって、本実施形態に係るプラント応答推定装置１０は、モデルパラメータ推定部１０１のみを有する構成であってもよい。また、本実施形態に係るプラント応答推定装置１０は、ステップ応答以外のプラント応答を計算する機能部を有する構成であってもよい。例えば、インパルス応答を計算する「インパルス応答計算部」を有する構成であってもよいし、ランプ応答を計算する「ランプ応答計算部」を有する構成であってもよい。これら以外にも、その他、本実施形態の主旨を逸脱しない範囲で様々な利用方法に対して適用可能である。 However, the step response calculation unit 102 is one of the methods of using the model parameter estimated value θ, and the plant response estimation device 10 according to this embodiment may have a configuration including only the model parameter estimation unit 101. Moreover, the plant response estimating device 10 according to the present embodiment may be configured to include a functional unit that calculates a plant response other than a step response. For example, the configuration may include an "impulse response calculation unit" that calculates an impulse response, or a configuration that includes a "lamp response calculation unit" that calculates a lamp response. In addition to these, the present invention can be applied to various other usage methods without departing from the spirit of the present embodiment.

ここで、モデルパラメータ推定部１０１には、バッファ部１１１と、状態ベクトル変換部１１２と、学習データ選別部１１３と、逐次推定計算部１１４と、パラメータ変換部１１５とが含まれる。 Here, the model parameter estimation section 101 includes a buffer section 111 , a state vector conversion section 112 , a learning data selection section 113 , a sequential estimation calculation section 114 , and a parameter conversion section 115 .

バッファ部１１１は、或る所定の期間における制御量ｙ（ｋ）、操作量ｕ（ｋ）及び外乱ｖ（ｋ）をメモリ装置１６に蓄積（バッファ）する。状態ベクトル変換部１１２は、メモリ装置１６にバッファされている制御量ｙ（ｋ）、操作量ｕ（ｋ）及び外乱ｖ（ｋ）の再サンプリングを行って、それら再サンプリングされた制御量ｙ（ｋ）、操作量ｕ（ｋ）及び外乱ｖ（ｋ）が含まれる状態ベクトルξ（ｋ）を作成する。学習データ選別部１１３は、状態ベクトルξ（ｋ）を用いて、再サンプリングされた制御量ｙ（ｋ）、操作量ｕ（ｋ）及び外乱ｖ（ｋ）を学習データとして用いるか否かを判定し、学習データとして用いる場合は更新フラグをＯＮ、学習データとして用いない場合は更新フラグをＯＦＦとする。逐次推定計算部１１４は、更新フラグがＯＮである場合、状態ベクトルξ（ｋ）を用いて、拡大プラント応答モデルのモデルパラメータである拡大モデルパラメータを推定する。パラメータ変換部１１５は、拡大モデルパラメータの推定値をモデルパラメータ推定値θに変換する。なお、状態ベクトルとは制御の分野で用いられる用語であり、制御対象やそれを含む系（システム）の状態を表すベクトルである。 The buffer unit 111 stores (buffers) the control amount y(k), the manipulated variable u(k), and the disturbance v(k) in the memory device 16 during a certain predetermined period. The state vector conversion unit 112 resamples the controlled variable y(k), the manipulated variable u(k), and the disturbance v(k) buffered in the memory device 16, and converts the resampled controlled variable y( k), a state vector ξ(k) including the manipulated variable u(k) and the disturbance v(k) is created. The learning data selection unit 113 uses the state vector ξ(k) to determine whether or not to use the resampled control amount y(k), operation amount u(k), and disturbance v(k) as learning data. However, when the data is used as learning data, the update flag is set to ON, and when the data is not used as learning data, the update flag is set to OFF. When the update flag is ON, the sequential estimation calculation unit 114 estimates expanded model parameters that are model parameters of the expanded plant response model using the state vector ξ(k). The parameter conversion unit 115 converts the estimated value of the expanded model parameter into the model parameter estimated value θ. Note that a state vector is a term used in the field of control, and is a vector that represents the state of a controlled object or a system that includes it.

＜バッファ部１１１の動作＞
サンプリング時刻ｔ_ｋにおける制御量バッファをＹ（ｋ）、操作量バッファをＵ（ｋ）、外乱バッファをＶ（ｋ）として、これらの各バッファは以下のベクトルで表されるものとする。 <Operation of buffer unit 111>
Assume that the control amount buffer at sampling time t _k is Y(k), the operation amount buffer is U(k), and the disturbance buffer is V(k), and each of these buffers is represented by the following vector.

すなわち、制御量バッファＹ（ｋ）にはｋ－Ｂ_１からｋまでの制御量ｙ、操作量バッファＵ（ｋ）にはｋ－Ｂ_２からｋまでの操作量ｕ、外乱バッファＶ（ｋ）にはｋ－Ｂ_３からｋまでの外乱ｖがそれぞれ格納されているものとする。ここで、Ｂ_１、Ｂ_２及びＢ_３はそれぞれ０以上の整数であり、制御量バッファ、操作量バッファ及び外乱バッファの大きさを決めるパラメータである。これらのＢ_１、Ｂ_２及びＢ_３はメモリ装置１６のサイズ等に応じて、適宜、その値が決定される。

That is, the control amount buffer Y(k) contains the control amount y from k-B ₁ to k, the operation amount buffer U(k) contains the operation amount u from k-B ₂ to k, and the disturbance buffer V(k). It is assumed that disturbances v from k-B ₃ to k are stored respectively. Here, B ₁ , B ₂ , and B ₃ are each integers greater than or equal to 0, and are parameters that determine the sizes of the control amount buffer, the operation amount buffer, and the disturbance buffer. The values of these B ₁ , B ₂ , and B ₃ are determined as appropriate depending on the size of the memory device 16 and the like.

このとき、バッファ部１１１は、制御量ｙ（ｋ）、操作量ｕ（ｋ）及び外乱ｖ（ｋ）を受信すると、図３に示すように、制御量ｙ（ｋ）と制御量バッファＹ（ｋ－１）から制御量バッファＹ（ｋ）、操作量ｕ（ｋ）と操作量バッファＵ（ｋ－１）から操作量バッファＵ（ｋ）、外乱ｖ（ｋ）と外乱バッファＶ（ｋ－１）から外乱バッファＶ（ｋ）にそれぞれ更新する。 At this time, when the buffer section 111 receives the controlled variable y(k), the manipulated variable u(k), and the disturbance v(k), the buffer section 111 receives the controlled variable y(k) and the controlled variable buffer Y( k-1) to the control amount buffer Y(k), the manipulated variable u(k) and the manipulated variable buffer U(k-1) to the manipulated variable buffer U(k), the disturbance v(k) and the disturbance buffer V(k- 1) to the disturbance buffer V(k).

具体的には、バッファ部１１１は、制御量バッファＹ（ｋ－１）に格納されているｙ（ｋ－Ｂ_１－１）を削除した上で、新たに観測された制御量ｙ（ｋ）を格納することで、ｙ（ｋ－Ｂ_１）からｙ（ｋ）までの制御量が格納された制御量バッファＹ（ｋ）に更新する。同様に、バッファ部１１１は、操作量バッファＵ（ｋ－１）に格納されている操作量ｕ（ｋ－Ｂ_２－１）を削除した上で、新たに観測された操作量ｕ（ｋ）を格納することで、ｕ（ｋ－Ｂ_２）からｕ（ｋ）までの操作量が格納された操作量バッファＵ（ｋ）に更新する。同様に、バッファ部１１１は、外乱バッファＶ（ｋ－１）に格納されている外乱ｖ（ｋ－Ｂ_３－１）を削除した上で、新たに観測された外乱ｖ（ｋ）を格納することで、ｖ（ｋ－Ｂ_３）からｖ（ｋ）までの外乱が格納された外乱バッファＶ（ｋ）に更新する。 Specifically, the buffer unit 111 deletes y(k-B ₁ -1) stored in the control amount buffer Y(k-1), and then deletes the newly observed control amount y(k). By storing , the control amount buffer Y(k) is updated to store the control amounts from y(k-B ₁ ) to y(k). Similarly, the buffer unit 111 deletes the manipulated variable u(k-B ₂ -1) stored in the manipulated variable buffer U(k-1), and then deletes the newly observed manipulated variable u(k). is updated to the manipulated variable buffer U(k) in which the manipulated variables from u(k-B ₂ ) to u(k) are stored. Similarly, the buffer unit 111 deletes the disturbance v(k-B ₃ -1) stored in the disturbance buffer V(k-1), and then stores the newly observed disturbance v(k). As a result, the disturbance buffer V(k) is updated to store the disturbances from v(k-B ₃ ) to v(k).

なお、制御量バッファＹ（０）、操作量バッファＵ（０）、外乱バッファＶ（０）は適切に初期化（例えば、すべて０等に初期化）されているものとする。また、上記では外乱ｖがある場合を想定して説明したが、外乱ｖがない場合は外乱バッファＶ（ｋ）がなくてもよい。 It is assumed that the control amount buffer Y(0), the operation amount buffer U(0), and the disturbance buffer V(0) have been appropriately initialized (for example, all are initialized to 0, etc.). Furthermore, although the above explanation assumes that there is a disturbance v, the disturbance buffer V(k) may not be provided if there is no disturbance v.

＜状態ベクトル変換部１１２の動作＞
再サンプリング周期をＤとする。このとき、状態ベクトル変換部１１２は、制御量バッファＹ（ｋ）、操作量バッファＵ（ｋ）及び外乱バッファＶ（ｋ）が更新されると、これらの各バッファＹ（ｋ）、Ｕ（ｋ）及びＶ（ｋ）から再サンプリング周期Ｄで再サンプリングを行って、以下の再サンプリング制御量ベクトルＹ_Ｄ（ｋ）、再サンプリング操作量ベクトルＵ_Ｄ（ｋ）及び再サンプリング外乱ベクトルＶ_Ｄ（ｋ）をそれぞれ作成する。 <Operation of state vector conversion unit 112>
Let D be the resampling period. At this time, when the control amount buffer Y(k), the operation amount buffer U(k), and the disturbance buffer V(k) are updated, the state vector conversion unit 112 converts each of these buffers Y(k) and U(k ) and V(k) at a resampling period D to obtain the following resampling control amount vector Y _D (k), resampling manipulated variable vector U _D (k), and resampling disturbance vector V _D (k ) respectively.

ここで、Ｎ、Ｍ及びＬはプラント応答モデルに応じて決定されるパラメータ（具体的には、ＮはＡＲＭＡＸモデルの制御量ｙに関する項の係数の数、Ｍは操作量ｕに関する項の係数の数、Ｌは外乱ｖに関する項の係数の数）である。また、再サンプリング制御量ベクトルＹ_Ｄ（ｋ）ではＡＲＭＡＸモデルにならい、現在のサンプリング時刻を表す要素ｙ（ｋ）がスキップされる（つまり、ｙ（ｋ）は再サンプリングされない。）。

Here, N, M, and L are parameters determined according to the plant response model (specifically, N is the number of coefficients of the term related to the controlled variable y of the ARMAX model, and M is the number of coefficients of the term related to the manipulated variable u). (L is the number of coefficients of the term related to the disturbance v). Furthermore, in the resampling control amount vector Y _D (k), the element y(k) representing the current sampling time is skipped (that is, y(k) is not resampled), following the ARMAX model.

なお、一般に、Ｎ、Ｍ及びＬの値は大きい方が多様な表現が可能で、プラント応答の高精度な予測が期待できるが、モデルパラメータθの推定のために多くの計算資源やメモリ量が必要となる。 In general, the larger the values of N, M, and L, the more diverse expressions can be made, and more accurate predictions of plant responses can be expected. It becomes necessary.

そして、状態ベクトル変換部１１２は、再サンプリング制御量ベクトルＹ_Ｄ（ｋ）、再サンプリング操作量ベクトルＵ_Ｄ（ｋ）及び再サンプリング外乱ベクトルＶ_Ｄ（ｋ）から以下により状態ベクトルξ（ｋ）を作成する。 Then, the state vector conversion unit 112 converts the state vector ξ(k) from the resampling control amount vector Y _D (k), the resampling operation amount vector U _D (k), and the resampling disturbance vector V _D (k) as follows. create.

すなわち、状態ベクトルξ（ｋ）とは、Ｙ_Ｄ（ｋ）、Ｕ_Ｄ（ｋ）及びＶ_Ｄ（ｋ）の要素に対して、オフセット項を表す要素として「１」を追加したベクトルである。ここで、オフセット項とは、上述したように、ＡＲＭＡＸモデルで定常状態を表す項として出現する定数項のことである。これにより、再サンプリング制御量ベクトルＹ_Ｄ（ｋ）、再サンプリング操作量ベクトルＵ_Ｄ（ｋ）及び再サンプリング外乱ベクトルＶ_Ｄ（ｋ）が状態ベクトルξ（ｋ）に変換されたことになる。この状態ベクトルξ（ｋ）は「拡大状態ベクトル」等と呼ばれてもよい。

That is, the state vector ξ(k) is a vector in which "1" is added as an element representing an offset term to the elements of Y _D (k), U _D (k), and V _D (k). Here, the offset term is a constant term that appears as a term representing a steady state in the ARMAX model, as described above. As a result, the resampling control amount vector Y _D (k), the resampling manipulated variable vector U _D (k), and the resampling disturbance vector V _D (k) are converted into the state vector ξ(k). This state vector ξ(k) may be called an "expanded state vector" or the like.

なお、状態ベクトルξ（０）は適切な値に初期化（例えば、零ベクトル等に初期化）されているものとする。また、上記では外乱ｖがある場合を想定して説明したが、外乱ｖがない場合は再サンプリング外乱ベクトルＶ_Ｄ（ｋ）がなくてもよい。 It is assumed that the state vector ξ(0) has been initialized to an appropriate value (for example, initialized to a zero vector, etc.). Furthermore, although the above description assumes that there is a disturbance v, the resampling disturbance vector V _D (k) may not be provided if there is no disturbance v.

＜学習データ選別部１１３の動作＞
学習データ選別部１１３は、状態ベクトルξ（ｋ）が作成されると、この状態ベクトルξ（ｋ）を用いて、再サンプリング制御量ベクトルＹ_Ｄ（ｋ）、再サンプリング操作量ベクトルＵ_Ｄ（ｋ）及び再サンプリング外乱ベクトルＶ_Ｄ（ｋ）を学習データとして用いるか否かを判定した上で、その判定結果に応じて更新フラグをＯＮ又はＯＦＦにする。これにより、学習データとして用いる運転データ（の観測値）が選別される。 <Operation of learning data selection unit 113>
When the state vector ξ(k) is created, the learning data selection unit 113 uses this state vector ξ(k) to determine the resampling control amount vector Y _D (k) and the resampling operation amount vector U _D (k ) and the resampled disturbance vector V _D (k) are determined whether to be used as learning data, and then the update flag is turned ON or OFF according to the determination result. As a result, driving data (observed values thereof) to be used as learning data are selected.

以下、或るｋに関して学習データとして用いる運転データを選別する処理（第一の実施形態に係る学習データ選別処理）について、図４を参照しながら説明する。ただし、ｋ＝０である場合は更新フラグ＝ＯＮとして処理を終了する。 Hereinafter, a process for selecting driving data to be used as learning data for a certain k (learning data selecting process according to the first embodiment) will be described with reference to FIG. 4. However, if k=0, the update flag is set to ON and the process ends.

ステップＳ１０１：まず、学習データ選別部１１３は、状態ベクトルξ（ｋ）と１時刻前の状態ベクトルξ（ｋ－１）との差のノルムｄ（ｋ）を計算する。すなわち、学習データ選別部１１３は、ｄ（ｋ）＝||ξ（ｋ）－ξ（ｋ－１）||_ｐを計算する。ここで、ｐは、ｐ＝１、ｐ＝２又はｐ＝∞のいずれかであり、||・||_１はＬ１ノルム、||・||_２はＬ２ノルム、||・||_∞はＬ∞ノルムをそれぞれ表す。以下、ｄ（ｋ）をベクトル差ノルムともいう。 Step S101: First, the learning data selection unit 113 calculates the norm d(k) of the difference between the state vector ξ(k) and the state vector ξ(k-1) one time ago. That is, the learning data selection unit 113 calculates d(k)=||ξ(k)−ξ(k−1)|| _p . Here, p is either p=1, p=2 or p=∞, ||・|| ₁ is L1 norm, ||・|| ₂ is L2 norm, ||・|| _∞ is Each represents the L∞ norm. Hereinafter, d(k) is also referred to as vector difference norm.

ステップＳ１０２：次に、学習データ選別部１１３は、ベクトル差ノルムｄ（ｋ）が予め設定された閾値γ以上であるか否かを判定する。 Step S102: Next, the learning data selection unit 113 determines whether the vector difference norm d(k) is greater than or equal to a preset threshold value γ.

ｄ（ｋ）≧γであると判定した場合（ステップＳ１０２でＹＥＳ）、学習データ選別部１１３は、ステップＳ１０３に進む。一方で、ｄ（ｋ）≧γであると判定しなかった場合（ステップＳ１０２でＮＯ）、学習データ選別部１１３は、ステップＳ１０４に進む。 If it is determined that d(k)≧γ (YES in step S102), the learning data selection unit 113 proceeds to step S103. On the other hand, if it is not determined that d(k)≧γ (NO in step S102), the learning data selection unit 113 proceeds to step S104.

ステップＳ１０３：学習データ選別部１１３は、更新フラグをＯＮにする。ここで、更新フラグとは、再サンプリング操作量ベクトルＵ_Ｄ（ｋ）及び再サンプリング外乱ベクトルＶ_Ｄ（ｋ）を学習データとして用いるか否かを表すフラグのことである。更新フラグがＯＮである場合、再サンプリング操作量ベクトルＵ_Ｄ（ｋ）及び再サンプリング外乱ベクトルＶ_Ｄ（ｋ）が学習データとして用いられて拡大モデルパラメータの更新が行われる。一方で、更新フラグがＯＦＦである場合、再サンプリング操作量ベクトルＵ_Ｄ（ｋ）及び再サンプリング外乱ベクトルＶ_Ｄ（ｋ）は学習データとして用いられず、拡大モデルパラメータの更新は行われない。 Step S103: The learning data selection unit 113 turns on the update flag. Here, the update flag is a flag indicating whether or not the resampling operation amount vector U _D (k) and the resampling disturbance vector V _D (k) are used as learning data. When the update flag is ON, the resampling operation amount vector U _D (k) and the resampling disturbance vector V _D (k) are used as learning data to update the enlarged model parameters. On the other hand, when the update flag is OFF, the resampling operation amount vector U _D (k) and the resampling disturbance vector V _D (k) are not used as learning data, and the enlarged model parameters are not updated.

ステップＳ１０４：学習データ選別部１１３は、更新フラグをＯＦＦにする。 Step S104: The learning data selection unit 113 turns off the update flag.

＜逐次推定計算部１１４の動作＞
以下では、拡大モデルパラメータをθ_ｅと表す。逐次推定計算部１１４は、学習データ選別部１１３によって更新フラグがＯＮ又はＯＦＦに設定されると、状態ベクトルξ（ｋ）を用いて、拡大モデルパラメータθ_ｅの値を推定する。すなわち、逐次推定計算部１１４は、サンプリング時刻ｔ_ｋ毎に、逐次的に拡大モデルパラメータθ_ｅの値を推定する。 <Operation of sequential estimation calculation unit 114>
In the following, the expanded model parameters are expressed as θ _e . When the update flag is set to ON or OFF by the learning data selection unit 113, the sequential estimation calculation unit 114 estimates the value of the expanded model parameter θ _e using the state vector ξ(k). That is, the sequential estimation calculation unit 114 sequentially estimates the value of the expanded model parameter θ _e at every sampling time t _k .

以下、或るｋに関して拡大モデルパラメータθ_ｅの値を推定する処理（第一の実施形態に係る拡大モデルパラメータ推定処理）について、図５を参照しながら説明する。なお、以下では、サンプリング時刻ｔ_ｋにおける拡大モデルパラメータθ_ｅの推定値をθ_ｅ（ｋ）と表す。 Hereinafter, the process of estimating the value of the expanded model parameter θ _e for a certain k (extended model parameter estimation process according to the first embodiment) will be described with reference to FIG. 5. Note that hereinafter, the estimated value of the expanded model parameter θ _e at the sampling time t _k is expressed as θ _e (k).

ステップＳ２０１：まず、逐次推定計算部１１４は、更新フラグがＯＮであるか否かを判定する。 Step S201: First, the sequential estimation calculation unit 114 determines whether the update flag is ON.

更新フラグがＯＮであると判定した場合（ステップＳ２０１でＹＥＳ）、逐次推定計算部１１４は、ステップＳ２０２に進む。一方で、更新フラグがＯＮであると判定しなかった場合（ステップＳ２０１でＮＯ）、逐次推定計算部１１４は、拡大モデルパラメータ推定処理を終了する。すなわち、更新フラグがＯＦＦである場合、逐次推定計算部１１４は、拡大モデルパラメータθ_ｅの値を推定及び更新せずに処理を終了する。 If it is determined that the update flag is ON (YES in step S201), the sequential estimation calculation unit 114 proceeds to step S202. On the other hand, if it is not determined that the update flag is ON (NO in step S201), the sequential estimation calculation unit 114 ends the expanded model parameter estimation process. That is, when the update flag is OFF, the sequential estimation calculation unit 114 ends the process without estimating or updating the value of the expanded model parameter θ _e .

ステップＳ２０２：逐次推定計算部１１４は、拡大モデルパラメータθ_ｅと共分散行列Ｐとを初期化するか否かを判定する。ここで、初期化すると判定される場合としては、例えば、初回計算時（つまり、ｋ＝１のとき）、ユーザ等により初期化指示が行われたとき等が挙げられる。 Step S202: The sequential estimation calculation unit 114 determines whether to initialize the expanded model parameter θ _e and the covariance matrix P. Here, cases in which it is determined that initialization is to be performed include, for example, at the time of first calculation (that is, when k=1), and when an initialization instruction is given by a user or the like.

拡大モデルパラメータθ_ｅと共分散行列Ｐとを初期化すると判定した場合（ステップＳ２０２でＹＥＳ）、逐次推定計算部１１４は、ステップＳ２０３に進む。一方で、拡大モデルパラメータθ_ｅと共分散行列Ｐとを初期化すると判定しなかった場合（ステップＳ２０２でＮＯ）、逐次推定計算部１１４は、ステップＳ２０４に進む。 If it is determined that the expanded model parameter θ _e and the covariance matrix P are to be initialized (YES in step S202), the sequential estimation calculation unit 114 proceeds to step S203. On the other hand, if it is not determined that the expanded model parameter θ _e and the covariance matrix P are to be initialized (NO in step S202), the sequential estimation calculation unit 114 proceeds to step S204.

ステップＳ２０３：逐次推定計算部１１４は、サンプリング時刻ｔ_ｋのインデックスｋをｋ＝１に初期化すると共に、θ_ｅ（０）＝θ_０及びＰ（０）＝Ｉと初期化する。ここで、θ_０は予め設定された拡大モデルパラメータの初期値、Ｉは予め設定された任意の行列（例えば、単位行列等）である。 Step S203: The sequential estimation calculation unit 114 initializes the index k of the sampling time _tk to k=1, and also initializes θ _e (0)=θ ₀ and P(0)=I. Here, θ ₀ is a preset initial value of the enlarged model parameter, and I is a preset arbitrary matrix (for example, a unit matrix, etc.).

ステップＳ２０４：逐次推定計算部１１４は、拡大モデルパラメータ推定値θ_ｅ（ｋ－１）と状態ベクトルξ（ｋ）と制御量ｙ（ｋ）とを用いて、予測誤差ε（ｋ）を計算する。予測誤差ε（ｋ）は、例えば、ε（ｋ）＝ｙ（ｋ）－ξ（ｋ）^Τθ_ｅ（ｋ－１）により計算される。なお、Τは転置を表す。 Step S204: The sequential estimation calculation unit 114 calculates the prediction error ε(k) using the expanded model parameter estimate θ _e (k−1), the state vector ξ(k), and the control amount y(k). . The prediction error ε(k) is calculated, for example, by ε(k)=y(k)−ξ(k) ^Τ θ _e (k−1). Note that T represents transposition.

ステップＳ２０５：逐次推定計算部１１４は、共分散行列Ｐ（ｋ－１）を以下により更新して共分散行列Ｐ（ｋ）を得る。 Step S205: The sequential estimation calculation unit 114 updates the covariance matrix P(k-1) as follows to obtain the covariance matrix P(k).

ここで、λ＞０は調整係数であり、予め設定された値である。調整係数λはその値に応じて以下の２つの使い方をすることができる。

Here, λ>0 is an adjustment coefficient and is a preset value. The adjustment coefficient λ can be used in the following two ways depending on its value.

１つ目は、０＜λ≦１とすることにより、過去の運転データを忘却するための忘却係数として用いることができる。λが０に近いほど急激に過去の運転データの影響が減少し、１に近いほど過去の運転データの影響が保持される。 First, by setting 0<λ≦1, it can be used as a forgetting coefficient for forgetting past driving data. The closer λ is to 0, the more rapidly the influence of past driving data is reduced, and the closer λ is to 1, the more the influence of past driving data is retained.

２つ目は、λ＞１とすることにより、パラメータ更新を加速させるための係数として用いることができる。λの値が大きいほどパラメータの収束を加速させることができる。 Second, by setting λ>1, it can be used as a coefficient for accelerating parameter updating. The larger the value of λ, the faster the parameter convergence can be accelerated.

ステップＳ２０６：そして、逐次推定計算部１１４は、拡大モデルパラメータ推定値θ_ｅ（ｋ－１）を以下により更新して拡大モデルパラメータ推定値θ_ｅ（ｋ）を得る。 Step S206: Then, the sequential estimation calculation unit 114 updates the expanded model parameter estimated value θ _e (k−1) as follows to obtain the expanded model parameter estimated value θ _e (k).

これにより、サンプリング時刻ｔ_ｋにおける拡大モデルパラメータ推定値θ_ｅ（ｋ）が得られる。

As a result, the expanded model parameter estimate θ _e (k) at the sampling time t _k is obtained.

＜パラメータ変換部１１５の動作＞
拡大モデルパラメータ推定値θ_ｅ（ｋ）を以下で表す。 <Operation of parameter conversion unit 115>
The expanded model parameter estimate θ _e (k) is expressed below.

ここで、θ_Ｙ（ｋ）はサンプリング時刻ｔ_ｋにおけるＡＲＭＡＸモデルの制御量ｙに関する項の係数を要素とするＮ次元ベクトル、θ_Ｕ（ｋ）は操作量ｕに関する項の係数を要素とするＭ次元ベクトル、θ_Ｖ（ｋ）は外乱ｖに関する項の係数を要素とするＬ次元ベクトル、θ_Ｃ（ｋ）はオフセット項を表すスカラー値である。

Here, θ _Y (k) is an N-dimensional vector whose elements are the coefficients of the term related to the controlled variable y of the ARMAX model at sampling time t _k , and θ _U (k) is an M vector whose elements are the coefficients of the term related to the manipulated variable u. The dimensional vector θ _V (k) is an L-dimensional vector whose elements are coefficients of terms related to the disturbance v, and θ _C (k) is a scalar value representing the offset term.

このとき、パラメータ変換部１１５は、拡大モデルパラメータ推定値θ_ｅ（ｋ）が得られると、図６に示すように、θ_Ｃ（ｋ）を除いたベクトルに変換することで、モデルパラメータ推定値θ（ｋ）を得る。これにより、以下のモデルパラメータ推定値が得られる。 At this time, when the expanded model parameter estimated value θ _e (k) is obtained, the parameter conversion unit 115 converts it into a vector excluding θ _C (k), as shown in FIG. Obtain θ(k). This yields the following model parameter estimates.

このように、制御対象プラントのステップ応答を推定する際には、拡大モデルパラメータ推定値θ_ｅ（ｋ）からθ_Ｃ（ｋ）を除くことで、モデルパラメータ推定値θ（ｋ）が得られる。この理由については、例えば、特許文献１等を参照されたい。

In this way, when estimating the step response of the controlled plant, the model parameter estimate θ(k) is obtained by removing θ _C (k) from the expanded model parameter estimate θ _e (k). For the reason for this, please refer to, for example, Patent Document 1.

＜ステップ応答計算部１０２の動作＞
以下、ステップ応答の推定に用いられるモデルパラメータ推定値（つまり、プラント応答関数Ｓ_θに設定されるモデルパラメータ推定値）θ＝θ（ｋ）のことを「モデルパラメータ設定値θ」ともいう。また、以下では、簡単のため、ステップ信号として単位ステップ信号を想定する。 <Operation of step response calculation unit 102>
Hereinafter, the model parameter estimated value θ=θ(k) used for estimating the step response (that is, the model parameter estimated value set to the plant response function S _θ ) is also referred to as the “model parameter setting value θ”. Further, in the following, for simplicity, a unit step signal is assumed as the step signal.

ステップ応答計算部１０２は、図７に示すように、モデルパラメータ設定値θと時刻ｔとが与えられると、初期時刻０から時間ｔ経過後の時刻ｔにおける単位ステップ応答Ｓ_θ（ｔ）を計算する。なお、単位ステップ応答とは、操作量ｕとして単位ステップ信号を印加した場合における応答（つまり、制御対象プラントのプラント応答モデルの出力）のことである。 As shown in FIG. 7, the step response calculation unit 102 calculates a unit step response S _θ (t) at time t after time t has elapsed from initial time 0, when model parameter setting value θ and time t are given. do. Note that the unit step response is the response when a unit step signal is applied as the manipulated variable u (that is, the output of the plant response model of the plant to be controlled).

或る時刻ｔと或るモデルパラメータ設定値θとが与えられたときに、ステップ応答Ｓ_θ（ｔ）を計算する処理（第一の実施形態に係るステップ応答計算処理）について、図８を参照しながら説明する。なお、以下では、インデックスｋ'における状態ベクトルφ（ｋ'）を以下で表すものとする。 See FIG. 8 for the process of calculating the step response S _θ (t) when a certain time t and a certain model parameter setting value θ are given (step response calculation process according to the first embodiment). I will explain while doing so. In addition, below, the state vector φ(k') at index k' shall be expressed as follows.

なお、インデックスｋ'は、サンプリング時刻ｔ_ｋのインデックスｋと同じ値を取り得る変数であるが、本処理の中でのみ利用され、インデックスｋとは独立に値が更新されることに留意されたい。

Note that index k' is a variable that can take the same value as index k at sampling time _tk , but it is used only in this process and its value is updated independently of index k. .

ステップＳ３０１：ステップ応答計算部１０２は、本処理の中でのみ利用する時刻を表すインデックスをτとして、τ＝０、ｋ'＝０と初期化すると共に、状態ベクトルφ（０）を以下のように初期化する。 Step S301: The step response calculation unit 102 initializes τ=0, k'=0, with τ as an index representing a time used only in this process, and sets the state vector φ(0) as follows. Initialize to .

すなわち、ｕ（０）のみ１、それ以外の要素は０と状態ベクトルφ（０）を初期化する。

That is, the state vector φ(0) is initialized so that only u(0) is 1 and the other elements are 0.

ステップＳ３０２：ステップ応答計算部１０２は、ｙ（ｋ'）＝φ（ｋ'）^Τθにより制御量予測値ｙ（ｋ'）を計算する。 Step S302: The step response calculation unit 102 calculates the predicted control amount value y(k') using y(k')=φ(k') ^T θ.

ステップＳ３０３：ステップ応答計算部１０２は、制御量予測値ｙ（ｋ'）を用いて、状態ベクトルφ（ｋ'）を、次のインデックスｋ'＋１における状態ベクトルφ（ｋ'＋１）に更新する。このとき、ステップ応答計算部１０２は、図９に示すように、状態ベクトルφ（ｋ'＋１）のｙ（ｋ'）には上記のステップＳ３０２で計算した制御量予測値ｙ（ｋ'）を設定し、ｙ（ｋ'－Ｎ＋１）～ｙ（ｋ'－１）には状態ベクトルφ（ｋ'）と同じ値を設定する。また、状態ベクトルφ（ｋ'＋１）のｕ（ｋ'＋１）には１を設定し、ｕ（ｋ'－Ｍ＋１）～ｕ（ｋ'）には状態ベクトルφ（ｋ'）と同じ値を設定する。更に、状態ベクトルφ（ｋ'＋１）のｖ（ｋ'＋１）には０を設定し、ｖ（ｋ'－Ｌ＋１）～ｖ（ｋ'）には状態ベクトルφ（ｋ'）と同じ値を設定する。 Step S303: The step response calculation unit 102 updates the state vector φ(k') to the state vector φ(k'+1) at the next index k'+1 using the control amount predicted value y(k'). . At this time, as shown in FIG. 9, the step response calculation unit 102 sets the control amount predicted value y(k') calculated in step S302 above to y(k') of the state vector φ(k'+1). and set the same value as the state vector φ(k') for y(k'-N+1) to y(k'-1). In addition, u(k'+1) of state vector φ(k'+1) is set to 1, and u(k'-M+1) to u(k') are set to the same value as state vector φ(k'). Set. Furthermore, v(k'+1) of state vector φ(k'+1) is set to 0, and v(k'-L+1) to v(k') are set to the same values as state vector φ(k'). Set.

ステップＳ３０４：ステップ応答計算部１０２は、時刻τをτ＋Δに更新すると共に、インデックスｋ'をｋ'＋１に更新する。 Step S304: The step response calculation unit 102 updates the time τ to τ+Δ and updates the index k' to k'+1.

ステップＳ３０５：ステップ応答計算部１０２は、τ≧ｔであるか否かを判定する。そして、τ≧ｔであると判定されなかった場合（ステップＳ３０５でＮＯ）、ステップ応答計算部１０２は、ステップＳ３０２に戻る。これにより、τ≧ｔとなるまで、ステップＳ３０２～ステップＳ３０４が繰り返し実行される。 Step S305: The step response calculation unit 102 determines whether τ≧t. If it is not determined that τ≧t (NO in step S305), the step response calculation unit 102 returns to step S302. As a result, steps S302 to S304 are repeatedly executed until τ≧t.

一方で、τ≧ｔであると判定された場合（ステップＳ３０５でＹＥＳ）、ステップ応答計算部１０２は、ステップ応答計算処理を終了する。これにより、最終的に計算された制御量予測値ｙ（ｋ'）が単位ステップ応答Ｓ_θ（ｔ）として得られる（つまり、Ｓ_θ（ｔ）＝ｙ（ｋ）が、プラント応答モデルの単位ステップ応答として得られる。）。 On the other hand, if it is determined that τ≧t (YES in step S305), the step response calculation unit 102 ends the step response calculation process. As a result, the finally calculated controlled variable predicted value y(k') is obtained as the unit step response S _θ (t) (that is, S _θ (t) = y(k) is the unit of the plant response model. (obtained as a step response).

［第二の実施形態］
次に、第二の実施形態について説明する。第二の実施形態では、学習データ選別処理が第一の実施形態と異なる。なお、第二の実施形態で特に言及しない事項については、第一の実施形態と同様としてよい。 [Second embodiment]
Next, a second embodiment will be described. In the second embodiment, the learning data selection process is different from the first embodiment. Note that matters not specifically mentioned in the second embodiment may be the same as those in the first embodiment.

そこで、以下、或るｋに関して学習データとして用いる運転データを選別する処理（第二の実施形態に係る学習データ選別処理）について、図１０を参照しながら説明する。ただし、ｋ＝０である場合は更新フラグ＝ＯＮとして処理を終了する。 Hereinafter, a process for selecting driving data to be used as learning data for a certain k (learning data selecting process according to the second embodiment) will be described with reference to FIG. 10. However, if k=0, the update flag is set to ON and the process ends.

ステップＳ４０１：まず、学習データ選別部１１３は、図４のステップＳ１０１と同様に、状態ベクトルξ（ｋ）と１時刻前の状態ベクトルξ（ｋ－１）との差のノルムｄ（ｋ）＝||ξ（ｋ）－ξ（ｋ－１）||_ｐをベクトル差ノルムとして計算する。 Step S401: First, similarly to step S101 in FIG. 4, the learning data selection unit 113 determines the norm d(k) of the difference between the state vector ξ(k) and the state vector ξ(k-1) one time before. ||ξ(k)−ξ(k−1)|| Calculate _p as the vector difference norm.

ステップＳ４０２：次に、学習データ選別部１１３は、状態ベクトルξ（ｋ）を用いて、閾値γ（ｋ）を計算する。すなわち、学習データ選別部１１３は、以下により閾値γ（ｋ）を計算する。 Step S402: Next, the learning data selection unit 113 calculates the threshold value γ(k) using the state vector ξ(k). That is, the learning data selection unit 113 calculates the threshold value γ(k) as follows.

ここで、Ｅ_ξ（ｋ）は状態ベクトルξ（１），・・・，ξ（ｋ）の平均値、Ｖ_ξ（ｋ）は状態ベクトルξ（１），・・・，ξ（ｋ）の分散、γ（ｋ）は状態ベクトルξ（１），・・・，ξ（ｋ）の標準偏差のα倍を表す。また、αは調整係数であり、予め設定された値である。

Here, E _ξ (k) is the average value of state vectors ξ (1), ..., ξ (k), and V _ξ (k) is the average value of state vectors ξ (1), ..., ξ (k). The variance γ(k) represents α times the standard deviation of the state vector ξ(1), . . . , ξ(k). Further, α is an adjustment coefficient and is a preset value.

ステップＳ４０３：次に、学習データ選別部１１３は、ベクトル差ノルムｄ（ｋ）が閾値γ（ｋ）以上であるか否かを判定する。 Step S403: Next, the learning data selection unit 113 determines whether the vector difference norm d(k) is greater than or equal to the threshold value γ(k).

ｄ（ｋ）≧γ（ｋ）であると判定した場合（ステップＳ４０３でＹＥＳ）、学習データ選別部１１３は、ステップＳ４０４に進む。一方で、ｄ（ｋ）≧γ（ｋ）であると判定しなかった場合（ステップＳ４０３でＮＯ）、学習データ選別部１１３は、ステップＳ４０５に進む。 If it is determined that d(k)≧γ(k) (YES in step S403), the learning data selection unit 113 proceeds to step S404. On the other hand, if it is not determined that d(k)≧γ(k) (NO in step S403), the learning data selection unit 113 proceeds to step S405.

ステップＳ４０４：学習データ選別部１１３は、図４のステップＳ１０３と同様に、更新フラグをＯＮにする。 Step S404: The learning data selection unit 113 turns on the update flag, similar to step S103 in FIG.

ステップＳ４０５：学習データ選別部１１３は、図４のステップＳ１０４と同様に、更新フラグをＯＦＦにする。 Step S405: The learning data selection unit 113 turns off the update flag, similar to step S104 in FIG.

［第三の実施形態］
次に、第三の実施形態について説明する。第三の実施形態では、学習データ選別処理が第一の実施形態と異なる。また、閾値γ（ｋ）の計算方法が第二の実施形態と異なる。なお、第三の実施形態で特に言及しない事項については、第一の実施形態と同様としてよい。 [Third embodiment]
Next, a third embodiment will be described. In the third embodiment, the learning data selection process is different from the first embodiment. Furthermore, the method of calculating the threshold value γ(k) is different from the second embodiment. Note that matters not specifically mentioned in the third embodiment may be the same as those in the first embodiment.

そこで、以下、或るｋに関して学習データとして用いる運転データを選別する処理（第三の実施形態に係る学習データ選別処理）について、図１１を参照しながら説明する。ただし、ｋ＝０である場合は更新フラグ＝ＯＮとして処理を終了する。 Hereinafter, a process for selecting driving data to be used as learning data for a certain k (learning data selecting process according to the third embodiment) will be described with reference to FIG. 11. However, if k=0, the update flag is set to ON and the process ends.

ステップＳ５０１：まず、学習データ選別部１１３は、ｋ＝１であるか否かを判定する。 Step S501: First, the learning data selection unit 113 determines whether k=1.

ｋ＝１であると判定した場合（ステップＳ５０１でＹＥＳ）、学習データ選別部１１３は、ステップＳ５０２に進む。一方で、ｋ＝１であると判定しなかった場合（ステップＳ５０１でＮＯ）、学習データ選別部１１３は、ステップＳ５０３に進む。 If it is determined that k=1 (YES in step S501), the learning data selection unit 113 proceeds to step S502. On the other hand, if it is not determined that k=1 (NO in step S501), the learning data selection unit 113 proceeds to step S503.

ステップＳ５０２：学習データ選別部１１３は、Ｅ_ξ（０）＝０、Ｓ_ξ（０）＝０と初期化する。ここで、Ｅ_ξ（０）は状態ベクトルのノルムの平均値の初期値、Ｓ_ξ（０）は状態ベクトルのノルムの２乗の平均値の初期値である。 Step S502: The learning data selection unit 113 initializes E _ξ (0)=0 and S _ξ (0)=0. Here, E _ξ (0) is the initial value of the average value of the norm of the state vector, and S _ξ (0) is the initial value of the average value of the square of the norm of the state vector.

ステップＳ５０３：学習データ選別部１１３は、図４のステップＳ１０１と同様に、状態ベクトルξ（ｋ）と状態ベクトルξ（ｋ－１）との差のノルムｄ（ｋ）＝||ξ（ｋ）－ξ（ｋ－１）||_ｐをベクトル差ノルムとして計算する。 Step S503: Similarly to step S101 in FIG. 4, the learning data selection unit 113 determines the norm of the difference between the state vector ξ(k) and the state vector ξ(k-1), d(k)=||ξ(k). −ξ(k−1)|| Calculate _p as the vector difference norm.

ステップＳ５０４：次に、学習データ選別部１１３は、状態ベクトルξ（ｋ）を用いて、閾値γ（ｋ）を計算する。すなわち、学習データ選別部１１３は、以下により閾値γ（ｋ）を計算する。 Step S504: Next, the learning data selection unit 113 calculates the threshold value γ(k) using the state vector ξ(k). That is, the learning data selection unit 113 calculates the threshold value γ(k) as follows.

ここで、Ｅ_ξ（ｋ）は状態ベクトルξ（１），・・・，ξ（ｋ）のノルムの平均値、Ｓ_ξ（ｋ）は状態ベクトルξ（１），・・・，ξ（ｋ）のノルムの２乗の平均値、Ｖ_ξ（ｋ）は状態ベクトルξ（１），・・・，ξ（ｋ）のノルムの分散、γ（ｋ）は状態ベクトルξ（１），・・・，ξ（ｋ）のノルムの標準偏差のα倍を表す。また、αは調整係数であり、予め設定された値である。

Here, E _ξ (k) is the average value of the norm of the state vector ξ (1), ..., ξ (k), and S _ξ (k) is the state vector ξ (1), ..., ξ (k ), V _ξ (k) is the variance of the norm of the state vector ξ(1),..., ξ(k), and γ(k) is the average value of the square of the norm of the state vector ξ(1),...・, represents α times the standard deviation of the norm of ξ(k). Further, α is an adjustment coefficient and is a preset value.

ステップＳ５０５：次に、学習データ選別部１１３は、ベクトル差ノルムｄ（ｋ）が閾値γ（ｋ）以上であるか否かを判定する。 Step S505: Next, the learning data selection unit 113 determines whether the vector difference norm d(k) is greater than or equal to the threshold value γ(k).

ｄ（ｋ）≧γ（ｋ）であると判定した場合（ステップＳ５０５でＹＥＳ）、学習データ選別部１１３は、ステップＳ５０６に進む。一方で、ｄ（ｋ）≧γ（ｋ）であると判定しなかった場合（ステップＳ５０５でＮＯ）、学習データ選別部１１３は、ステップＳ５０７に進む。 If it is determined that d(k)≧γ(k) (YES in step S505), the learning data selection unit 113 proceeds to step S506. On the other hand, if it is not determined that d(k)≧γ(k) (NO in step S505), the learning data selection unit 113 proceeds to step S507.

ステップＳ５０６：学習データ選別部１１３は、図４のステップＳ１０３と同様に、更新フラグをＯＮにする。 Step S506: The learning data selection unit 113 turns on the update flag similarly to step S103 in FIG.

ステップＳ５０７：学習データ選別部１１３は、図４のステップＳ１０４と同様に、更新フラグをＯＦＦにする。 Step S507: The learning data selection unit 113 turns off the update flag, similar to step S104 in FIG.

［第四の実施形態］
次に、第四の実施形態について説明する。第四の実施形態では、学習データ選別処理が第一の実施形態～第三の実施形態と異なる。なお、第四の実施形態で特に言及しない事項については、第一の実施形態と同様としてよい。 [Fourth embodiment]
Next, a fourth embodiment will be described. In the fourth embodiment, the learning data selection process is different from the first to third embodiments. Note that matters not specifically mentioned in the fourth embodiment may be the same as those in the first embodiment.

そこで、以下、或るｋに関して学習データとして用いる運転データを選別する処理（第四の実施形態に係る学習データ選別処理）について、図１２を参照しながら説明する。ただし、ｋ＝０である場合は更新フラグ＝ＯＮとして処理を終了する。 Hereinafter, a process for selecting driving data to be used as learning data for a certain k (learning data selecting process according to the fourth embodiment) will be described with reference to FIG. 12. However, if k=0, the update flag is set to ON and the process ends.

ステップＳ６０１：まず、学習データ選別部１１３は、ｋ＝１であるか否かを判定する。 Step S601: First, the learning data selection unit 113 determines whether k=1.

ｋ＝１であると判定した場合（ステップＳ６０１でＹＥＳ）、学習データ選別部１１３は、ステップＳ６０２に進む。一方で、ｋ＝１であると判定しなかった場合（ステップＳ６０１でＮＯ）、学習データ選別部１１３は、ステップＳ６０３に進む。 If it is determined that k=1 (YES in step S601), the learning data selection unit 113 proceeds to step S602. On the other hand, if it is not determined that k=1 (NO in step S601), the learning data selection unit 113 proceeds to step S603.

ステップＳ６０２：学習データ選別部１１３は、Ｅ_ｙ（０）＝０、Ｓ_ｙ（０）＝０と初期化する。ここで、Ｅ_ｙ（０）は制御量の平均値の初期値、Ｓ_ｙ（０）は制御量の２乗の平均値の初期値である。 Step S602: The learning data selection unit 113 initializes E _y (0)=0 and S _y (0)=0. Here, E _y (0) is the initial value of the average value of the control amount, and S _y (0) is the initial value of the average value of the square of the control amount.

ステップＳ６０３：学習データ選別部１１３は、状態ベクトルξ（ｋ）を用いて、予測誤差（の絶対値）ｄ（ｋ）を計算する。予測誤差ｄ（ｋ）は、図５のステップＳ２０４と同様に、ｄ（ｋ）＝｜ｙ（ｋ）－ξ（ｋ）^Τθ_ｅ（ｋ－１）｜により計算される。 Step S603: The learning data selection unit 113 calculates (the absolute value of) the prediction error d(k) using the state vector ξ(k). The prediction error d(k) is calculated by d(k)=|y(k)−ξ(k) ^Τ θ _e (k−1)|, similarly to step S204 in FIG.

ステップＳ６０４：次に、学習データ選別部１１３は、制御量ｙ（ｋ）を用いて、閾値γ（ｋ）を計算する。すなわち、学習データ選別部１１３は、以下により閾値γ（ｋ）を計算する。 Step S604: Next, the learning data selection unit 113 calculates the threshold value γ(k) using the control amount y(k). That is, the learning data selection unit 113 calculates the threshold value γ(k) as follows.

ここで、Ｅ_ｙ（ｋ）は制御量ｙ（１），・・・，ｙ（ｋ）の平均値、Ｓ_ｙ（ｋ）は制御量ｙ（１），・・・，ｙ（ｋ）の２乗の平均値、Ｖ_ｙ（ｋ）は制御量ｙ（１），・・・，ｙ（ｋ）の分散、γ（ｋ）は制御量ｙ（１），・・・，ｙ（ｋ）の標準偏差のα倍を表す。また、αは調整係数であり、予め設定された値である。

Here, E _y (k) is the average value of the controlled quantities y(1),..., y(k), and S _y (k) is the average value of the controlled quantities y(1),..., y(k). The average value of the squares, V _y (k) is the variance of the control amount y(1),..., y(k), and γ(k) is the control amount y(1),..., y(k). represents α times the standard deviation of Further, α is an adjustment coefficient and is a preset value.

ステップＳ６０５：次に、学習データ選別部１１３は、予測誤差ｄ（ｋ）が閾値γ（ｋ）以上であるか否かを判定する。 Step S605: Next, the learning data selection unit 113 determines whether the prediction error d(k) is greater than or equal to the threshold value γ(k).

ｄ（ｋ）≧γ（ｋ）であると判定した場合（ステップＳ６０５でＹＥＳ）、学習データ選別部１１３は、ステップＳ６０６に進む。一方で、ｄ（ｋ）≧γ（ｋ）であると判定しなかった場合（ステップＳ６０５でＮＯ）、学習データ選別部１１３は、ステップＳ６０７に進む。 If it is determined that d(k)≧γ(k) (YES in step S605), the learning data selection unit 113 proceeds to step S606. On the other hand, if it is not determined that d(k)≧γ(k) (NO in step S605), the learning data selection unit 113 proceeds to step S607.

ステップＳ６０６：学習データ選別部１１３は、図４のステップＳ１０３と同様に、更新フラグをＯＮにする。 Step S606: The learning data selection unit 113 turns on the update flag similarly to step S103 in FIG.

ステップＳ６０７：学習データ選別部１１３は、図４のステップＳ１０４と同様に、更新フラグをＯＦＦにする。 Step S607: The learning data selection unit 113 turns off the update flag, similar to step S104 in FIG.

［実施例］
以下、第一の実施形態に係るプラント応答推定装置１０の実施例について説明する。 [Example]
Hereinafter, an example of the plant response estimation device 10 according to the first embodiment will be described.

本実施例では、制御対象のプラントの拡大プラント応答モデルは次式で表されるものとする。 In this embodiment, it is assumed that the expanded plant response model of the plant to be controlled is expressed by the following equation.

ｙ（ｋ）＝ａ_１ｙ（ｋ－１）＋ａ_２ｙ（ｋ－２）＋ｂ_０ｕ（ｋ）＋ｂ_１ｕ（ｋ－１）＋ｂ_２ｕ（ｋ－２）＋ｃ_０
すなわち、制御量ｙに関しては２次元、操作量ｕに関しては３次元の要素を有するものとする。なお、本実施例では外乱ｖがないものとする。 y(k)=a ₁ y(k-1)+a ₂ y(k-2)+b ₀ u(k)+b ₁ u(k-1)+b ₂ u(k-2)+c ₀
That is, it is assumed that the control amount y has a two-dimensional element, and the manipulated variable u has a three-dimensional element. In this example, it is assumed that there is no disturbance v.

このとき、状態ベクトルξ（ｋ）及び拡大モデルパラメータθ_ｅ（ｋ）は以下で表される。 At this time, the state vector ξ(k) and the expanded model parameter θ _e (k) are expressed as follows.

また、制御量ｙの推定値はｙ_ｅｓｔ（ｋ）＝ξ（ｋ）^Τθ_ｅ（ｋ－１）で表される。

Further, the estimated value of the control amount y is expressed as y _est (k)=ξ(k) ^T θ _e (k−1).

更に、本実施例では、学習データ選別部１１３がベクトル差ノルムｄ（ｋ）を計算するときのノルムとしてはＬ１ノルムを用いると共に閾値はγ＝０．１とした。また、本実施例における制御対象プラントのステップ応答としては、図１３に示すステップ応答を想定する。 Furthermore, in this embodiment, the learning data selection unit 113 uses the L1 norm as the norm when calculating the vector difference norm d(k), and the threshold value is set to γ=0.1. Further, as the step response of the plant to be controlled in this embodiment, the step response shown in FIG. 13 is assumed.

以上の設定の下で、学習データ選別部１１３を用いない場合と学習データ選別部１１３を用いた場合とを比較した。 Under the above settings, a case where the learning data sorting section 113 is not used and a case where the learning data sorting section 113 is used were compared.

図１４は、学習データ選別部１１３を用いない場合で調整係数をλ＝０．９９９としたときのモデルパラメータ推定結果及び制御結果である。また、図１５は、学習データ選別部１１３を用いない場合で調整係数をλ＝１．１としたときのモデルパラメータ推定結果及び制御結果である。 FIG. 14 shows model parameter estimation results and control results when the learning data selection unit 113 is not used and the adjustment coefficient is λ=0.999. Further, FIG. 15 shows model parameter estimation results and control results when the learning data selection unit 113 is not used and the adjustment coefficient is set to λ=1.1.

図１４及び図１５はいずれも時刻０から１０００までは制御量ｙと操作量ｕが一定であるため、モデルパラメータも時刻０付近で初期値から変動した後、時刻１０００まで一定となっている。 In both FIGS. 14 and 15, the controlled variable y and the manipulated variable u are constant from time 0 to 1000, so the model parameters also fluctuate from their initial values near time 0 and then remain constant until time 1000.

時刻１０００以降に関しては、図１４では、モデルパラメータは変化し、時刻４０００付近まで変化がみられる。一方で、図１５では、モデルパラメータは変化していない。これは、調整係数λが大きい場合、共分散行列の更新が早まり、ゼロ付近になるためである。 As for after time 1000, in FIG. 14, the model parameters change, and changes can be seen up to around time 4000. On the other hand, in FIG. 15, the model parameters have not changed. This is because when the adjustment coefficient λ is large, the covariance matrix is updated quickly and becomes close to zero.

制御応答に関しては、図１４では、モデルパラメータが変化したことにより、制御量が目標値に追従できている。一方で、図１５では、モデルパラメータが初期に推定した値から変化していないため、モデルが正しく推定されておらず、制御量が目標値に追従できずに応答が不安定になっている。 Regarding the control response, in FIG. 14, the control amount is able to follow the target value due to the change in the model parameters. On the other hand, in FIG. 15, the model parameters have not changed from the initially estimated values, so the model is not estimated correctly, the control amount cannot follow the target value, and the response is unstable.

図１６は、学習データ選別部１１３を用いた場合で調整係数をλ＝０．９９９としたときのモデルパラメータ推定結果及び制御結果である。また、図１７は、学習データ選別部１１３を用いた場合で調整係数をλ＝１．１としたときのモデルパラメータ推定結果及び制御結果である。 FIG. 16 shows model parameter estimation results and control results when the learning data selection unit 113 is used and the adjustment coefficient is λ=0.999. Further, FIG. 17 shows model parameter estimation results and control results when the learning data selection unit 113 is used and the adjustment coefficient is set to λ=1.1.

図１６及び図１７はいずれも時刻０から１０００までは制御量ｙと操作量ｕが一定であるため、モデルパラメータも時刻０付近で初期値から変動した後、時刻１０００まで一定となっている。 In both FIGS. 16 and 17, the controlled variable y and the manipulated variable u are constant from time 0 to 1000, so the model parameters also fluctuate from their initial values near time 0 and then remain constant until time 1000.

時刻１０００以降に関しては、図１６では、モデルパラメータは変化し、時刻１５００付近で収束している。一方で、図１７では、図１５と異なり、モデルパラメータは変化し、時刻１２００付近で収束している。これは、学習データ選別部１１３の働きによって時刻０～１０００までの定常状態の運転データが更新に用いられないため共分散行列が定常状態でもゼロに収束せず、また時刻１０００付近で制御量ｙと操作量ｕが変化した運転データは更新に用いられるためパラメータ更新が加速され真値に近くなり、収束が早まったためである。 As for after time 1000, in FIG. 16, the model parameters change and converge around time 1500. On the other hand, in FIG. 17, unlike FIG. 15, the model parameters change and converge around time 1200. This is because the operation data in the steady state from time 0 to 1000 is not used for updating due to the operation of the learning data selection unit 113, so the covariance matrix does not converge to zero even in the steady state, and the control amount y This is because the operation data in which the manipulated variable u has changed is used for updating, so the parameter updating is accelerated and becomes closer to the true value, resulting in faster convergence.

制御応答に関しては、図１６及び図１７のいずれもモデルパラメータが比較的正しく推定されているため、制御量が目標値に追従できている。 Regarding the control response, since the model parameters in both FIGS. 16 and 17 are estimated relatively correctly, the control amount is able to follow the target value.

よって、調整係数λを１より大きくしてモデルパラメータの収束を早めた際に、学習データ選別部１１３の働きにより、定常状態における運転データでパラメータが収束してしまう事態を回避できているといえる。 Therefore, when the adjustment coefficient λ is made larger than 1 to speed up the convergence of the model parameters, it can be said that the operation of the learning data selection unit 113 avoids the situation where the parameters converge with the operating data in the steady state. .

以上のように、学習データ選別部１１３により、例えば、定常状態が継続するだけ等といった運転データに十分な情報が含まれない場合にはパラメータ更新が停止され、制御量ｙや操作量ｕが変化する場合にはパラメータ更新が行われる。これにより、例えば、定常状態で真値とは遠い局所的な値にパラメータが不用意に収束してしまう事態を回避しつつ、学習データを選別しない場合よりも真値に近い値に素早く収束する、という利点が生じている。 As described above, the learning data selection unit 113 stops parameter updating when the operating data does not include sufficient information, such as when a steady state continues, for example, and the control amount y and operation amount u change. If so, the parameters are updated. This allows, for example, to avoid situations where parameters inadvertently converge to local values that are far from the true value in a steady state, while converging to values closer to the true value more quickly than when the training data is not selected. , there are advantages.

なお、上記の各実施形態では、主に、プラント応答モデルのパラメータを推定する場合について説明したが、これに限られず、例えば、上記の各実施形態に係るプラント応答推定装置１０は、当該パラメータを設定したプラント応答モデルにより制御対象プラントを実際に制御する制御装置として機能してもよい。このとき、上記の各実施形態に係るプラント応答推定装置１０は、モデル予測制御等といった既知の制御手法を用いて、当該パラメータを設定したプラント応答モデルにより制御対象プラントを制御すればよい。このように、上記の各実施形態に係るプラント応答推定装置１０は、例えば、モデル予測制御等といった既知の制御手法と組み合わせることにより、制御の追従性と応答の安定化にも寄与することができる。 In each of the above embodiments, the case where the parameters of the plant response model are estimated has been mainly described, but the present invention is not limited to this. For example, the plant response estimation device 10 according to each of the above embodiments may estimate the parameters. It may also function as a control device that actually controls the plant to be controlled using the set plant response model. At this time, the plant response estimation device 10 according to each of the embodiments described above may control the plant to be controlled using a plant response model in which the parameters are set using a known control method such as model predictive control. In this way, the plant response estimation device 10 according to each of the above embodiments can contribute to control followability and response stabilization by combining with known control methods such as model predictive control, for example. .

本発明は、具体的に開示された上記の実施形態に限定されるものではなく、特許請求の範囲の記載から逸脱することなく、種々の変形や変更、既知の技術との組み合わせ等が可能である。 The present invention is not limited to the above-described specifically disclosed embodiments, and various modifications and changes, combinations with known techniques, etc. can be made without departing from the scope of the claims. be.

１０プラント応答推定装置
１１入力装置
１２表示装置
１３外部Ｉ／Ｆ
１３ａ記録媒体
１４通信Ｉ／Ｆ
１５プロセッサ
１６メモリ装置
１７バス
１０１モデルパラメータ推定部
１０２ステップ応答計算部
１１１バッファ部
１１２状態ベクトル変換部
１１３学習データ選別部
１１４逐次推定計算部
１１５パラメータ変換部 10 Plant response estimation device 11 Input device 12 Display device 13 External I/F
13a Recording medium 14 Communication I/F
15 Processor 16 Memory device 17 Bus 101 Model parameter estimation section 102 Step response calculation section 111 Buffer section 112 State vector conversion section 113 Learning data selection section 114 Sequential estimation calculation section 115 Parameter conversion section

Claims

a state vector creation unit that creates a state vector representing the state of the controlled plant based on operation data of the controlled plant;
a learning data selection unit that uses the state vector to determine whether or not the state vector is to be used as learning data;
a parameter updating unit that uses the state vector as learning data to update parameters of a function representing a response model of the controlled plant when it is determined that the state vector is to be used as learning data;
has
The learning data selection unit is
Calculate the norm of the difference between the current value and the previous value of the state vector,
Calculating the product of the standard deviation of all the state vectors created so far by the state vector creation unit and a predetermined coefficient having a value greater than 0 as a threshold;
If the norm is greater than or equal to the threshold, the operating data is neither a steady state nor an upper limit value nor a lower limit value, and the state vector is determined to be learning data;
If the norm is less than the threshold, it is determined that the state vector is not to be used as learning data because the driving data is in a steady state, an upper limit value, or a lower limit value;
The parameter update unit includes:
Using the state vector determined to be the learning data, sequentially update the covariance matrix and the estimated value of the parameter by an iterative least squares method each time it is determined to be the learning data,
Creation of the state vector by the state vector creation unit, calculation of the threshold by the learning data selection unit and determination of whether or not to use the learning data, and updating of the parameters by the parameter update unit are based on the operation. A plant response estimation device that is executed sequentially at each data sampling period .

a state vector creation unit that creates a state vector representing the state of the controlled plant based on operation data of the controlled plant;
a learning data selection unit that uses the state vector to determine whether or not the state vector is to be used as learning data;
a parameter updating unit that uses the state vector as learning data to update parameters of a function representing a response model of the controlled plant when it is determined that the state vector is to be used as learning data;
has
The learning data selection unit is
Using the current value of the state vector and the parameter, calculate an estimated value of the controlled variable of the controlled plant,
calculating a prediction error representing an error between the estimated value of the controlled variable and the current value of the controlled variable;
The product of the standard deviation of all the state vectors created up to now by the state vector creation unit and a predetermined coefficient that takes a value greater than 0 , or the product of the standard deviation of all the control variables up to now and more than 0. Calculate the product with a predetermined coefficient that takes a large value as a threshold ,
If the prediction error is greater than or equal to the threshold, determining that the state vector is to be learning data because the driving data is neither a steady state nor an upper limit value nor a lower limit value ;
If the prediction error is less than the threshold, it is determined that the state vector is not to be used as learning data because the driving data is in a steady state, an upper limit value, or a lower limit value;
The parameter update unit includes:
Using the state vector determined to be the learning data, sequentially update the covariance matrix and the estimated value of the parameter by an iterative least squares method each time it is determined to be the learning data,
Creation of the state vector by the state vector creation unit, calculation of the threshold by the learning data selection unit and determination of whether or not to use the learning data, and updating of the parameters by the parameter update unit are based on the operation. A plant response estimation device that is executed sequentially at each data sampling period .

a state vector creation procedure for creating a state vector representing the state of the controlled plant based on operational data of the controlled plant;
a learning data selection procedure that uses the state vector to determine whether or not the state vector is to be used as learning data;
If it is determined that the state vector is to be used as learning data, a parameter updating procedure of updating parameters of a function representing a response model of the controlled plant using the state vector as learning data;
The computer executes
The learning data selection procedure is as follows :
Calculate the norm of the difference between the current value and the previous value of the state vector,
Calculating the product of the standard deviation of all the state vectors created so far by the state vector creation procedure and a predetermined coefficient having a value greater than 0 as a threshold;
If the norm is greater than or equal to the threshold, the operating data is neither a steady state nor an upper limit value nor a lower limit value, and the state vector is determined to be learning data;
If the norm is less than the threshold, it is determined that the state vector is not to be used as learning data because the driving data is in a steady state, an upper limit value, or a lower limit value;
The parameter update procedure is as follows:
Using the state vector determined to be the learning data, sequentially update the covariance matrix and the estimated value of the parameter by an iterative least squares method each time it is determined to be the learning data,
Creation of the state vector by the state vector creation procedure, calculation of the threshold value and determination of whether or not to use the learning data by the learning data selection procedure, and updating of the parameters by the parameter update procedure are the same as those for the operation. A plant response estimation method that is executed sequentially at each data sampling period .

a state vector creation procedure for creating a state vector representing the state of the controlled plant based on operational data of the controlled plant;
a learning data selection procedure that uses the state vector to determine whether or not the state vector is to be used as learning data;
If it is determined that the state vector is to be used as learning data, a parameter updating procedure of updating parameters of a function representing a response model of the controlled plant using the state vector as learning data;
The computer executes
The learning data selection procedure is as follows :
Using the current value of the state vector and the parameter, calculate an estimated value of the controlled variable of the controlled plant,
calculating a prediction error representing an error between the estimated value of the controlled variable and the current value of the controlled variable;
The product of the standard deviation of all the state vectors created up to now by the state vector creation procedure and a predetermined coefficient that takes a value greater than 0 , or the product of the standard deviation of all the control variables up to now and more than 0. Calculate the product with a predetermined coefficient that takes a large value as a threshold ,
If the prediction error is greater than or equal to the threshold, determining that the state vector is to be learning data because the driving data is neither a steady state nor an upper limit value nor a lower limit value ;
If the prediction error is less than the threshold, it is determined that the state vector is not to be used as learning data because the driving data is in a steady state, an upper limit value, or a lower limit value;
The parameter update procedure is as follows:
Using the state vector determined to be the learning data, sequentially update the covariance matrix and the estimated value of the parameter by an iterative least squares method each time it is determined to be the learning data,
Creation of the state vector by the state vector creation procedure, calculation of the threshold value and determination of whether or not to use the learning data by the learning data selection procedure, and updating of the parameters by the parameter update procedure are the same as those for the operation. A plant response estimation method that is executed sequentially at each data sampling period .

a state vector creation procedure for creating a state vector representing the state of the controlled plant based on operational data of the controlled plant;
a learning data selection procedure that uses the state vector to determine whether or not the state vector is to be used as learning data;
If it is determined that the state vector is to be used as learning data, a parameter updating procedure of updating parameters of a function representing a response model of the controlled plant using the state vector as learning data;
make the computer run
The learning data selection procedure is as follows :
Calculate the norm of the difference between the current value and the previous value of the state vector,
Calculating the product of the standard deviation of all the state vectors created so far by the state vector creation procedure and a predetermined coefficient having a value greater than 0 as a threshold;
If the norm is greater than or equal to the threshold, the operating data is neither a steady state nor an upper limit value nor a lower limit value, and the state vector is determined to be learning data;
If the norm is less than the threshold, it is determined that the state vector is not to be used as learning data because the driving data is in a steady state, an upper limit value, or a lower limit value;
The parameter update procedure is as follows:
Using the state vector determined to be the learning data, sequentially update the covariance matrix and the estimated value of the parameter by an iterative least squares method each time it is determined to be the learning data,
Creation of the state vector by the state vector creation procedure, calculation of the threshold value and determination of whether or not to use the learning data by the learning data selection procedure, and updating of the parameters by the parameter update procedure are the same as those for the operation. A program that is executed sequentially at each data sampling period .

a state vector creation procedure for creating a state vector representing the state of the controlled plant based on operational data of the controlled plant;
a learning data selection procedure that uses the state vector to determine whether or not the state vector is to be used as learning data;
If it is determined that the state vector is to be used as learning data, a parameter updating procedure of updating parameters of a function representing a response model of the controlled plant using the state vector as learning data;
make the computer run
The learning data selection procedure is as follows :
Using the current value of the state vector and the parameter, calculate an estimated value of the controlled variable of the controlled plant,
calculating a prediction error representing an error between the estimated value of the controlled variable and the current value of the controlled variable;
The product of the standard deviation of all the state vectors created up to now by the state vector creation procedure and a predetermined coefficient that takes a value greater than 0 , or the product of the standard deviation of all the control variables up to now and more than 0. Calculate the product with a predetermined coefficient that takes a large value as a threshold ,
If the prediction error is greater than or equal to the threshold, determining that the state vector is to be learning data because the driving data is neither a steady state nor an upper limit value nor a lower limit value ;
If the prediction error is less than the threshold, it is determined that the state vector is not to be used as learning data because the driving data is in a steady state, an upper limit value, or a lower limit value;
The parameter update procedure is as follows:
Using the state vector determined to be the learning data, sequentially update the covariance matrix and the estimated value of the parameter by an iterative least squares method each time it is determined to be the learning data,
Creation of the state vector by the state vector creation procedure, calculation of the threshold value and determination of whether or not to use the learning data by the learning data selection procedure, and updating of the parameters by the parameter update procedure are the same as those for the operation. A program that is executed sequentially at each data sampling period .