Quantum next generation reservoir computing: an efficient quantum algorithm for forecasting quantum dynamics

1823 Accesses
2 Altmetric
Explore all metrics

A Correction to this article was published on 05 February 2025

This article has been updated

Abstract

Next Generation Reservoir Computing (NG-RC) is a modern class of model-free machine learning that enables an accurate forecasting of time series data generated by dynamical systems. We demonstrate that NG-RC can accurately predict full many-body quantum dynamics in both integrable and chaotic systems. This is in contrast to the conventional application of reservoir computing that concentrates on the prediction of the dynamics of observables. In addition, we apply a technique which we refer to as skipping ahead to predict far future states accurately without the need to extract information about the intermediate states. However, adopting a classical NG-RC for many-body quantum dynamics prediction is computationally prohibitive due to the large Hilbert space of sample input data. In this work, we propose an end-to-end quantum algorithm for many-body quantum dynamics forecasting with a quantum computational speedup via the block-encoding technique. This proposal presents an efficient model-free quantum scheme to forecast quantum dynamics coherently, bypassing inductive biases incurred in a model-based approach.

Optimal training of finitely sampled quantum reservoir computers for forecasting of chaotic dynamics

Article Open access 27 February 2025

Optimizing quantum noise-induced reservoir computing for nonlinear and chaotic time series prediction

Article Open access 07 November 2023

Predicting three-dimensional chaotic systems with four qubit quantum systems

Article Open access 20 February 2025

Discover the latest articles, news and stories from top researchers in related subjects.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Motivation and introduction

Learning quantum dynamics presents fundamental challenges in quantum physics, to which numerous machine learning techniques have been applied (Huang et al. 2023; Caro et al. 2023; Flurin et al. 2020; Carleo and Troyer 2017; Jerbi et al. 2023; Mohseni et al. 2022; Granade et al. 2012; Rodríguez et al. 2022, and the references therein). Significant efforts have been made to compactly represent many-body quantum states and efficiently parameterize their dynamics using classical learning algorithms with classical data (Carleo and Troyer 2017; Mohseni et al. 2022; Schmitt and Heyl 2020; Hartmann and Carleo 2019). On the other hand, quantum algorithms that work on a large batch of classical data typically require extreme assumptions about data loading and readout, which remain points of contention, see, for example, Aaronson (2015); Tang (2019); Matteo et al. (2020).

The two challenges spur the proposals of quantum algorithms that can learn directly from quantum data (Schatzki et al. 2021; Huang et al. 2022). Our work adds to the literature by providing a novel quantum algorithm for learning from quantum data inspired by next generation reservoir computing (NG-RC) algorithm. Importantly, this model-free paradigm of reservoir computing requires time-series of quantum data, without assuming a quantum dynamical ansatz a priori. Such approach circumvents inductive biases that could arise in learning complex quantum dynamics. We commence with a concise overview of the standard RC and the NG-RC, emphasizing the innovative utilization of NG-RC in the accurate prediction of many-body quantum dynamics. Subsequently, we address the limitations inherent to classical NG-RC and introduce a quantum algorithm designed to overcome the challenges presented by its classical counterpart.

1.1 Reservoir computing (RC)

RC is a computational paradigm in machine learning that harnesses a recurrent neural network (RNN) to learn time-series data, such as the states of a dynamical system. Even when the dynamics is complicated or chaotic, well-tuned RC can accurately forecast the future states of such dynamics up to many Lyapunov times (Pathak et al. 2017, 2018). Importantly, RC bypasses the training of an RNN by utilizing a fixed, randomly initialized RNN, called a reservoir, consisting of a large number L of hidden neurons.

Suppose that an input data $\varvec{s}_k\in \mathbb {R}^{D}$ is fed into the reservoir, where k specifies the time step of some dynamical process. The data is represented as a feature vector $\varvec{x}_k = f(\varvec{s}_k,\varvec{x}_{k-1}) \in \mathbb {R}^L$, where f is typically a highly non-linear function that represents the dynamics of the reservoir. The feature vector is then linearly transformed in the final, trainable output layer into a prediction vector $\varvec{y}'_k = W\varvec{x}_k$. In particular, the input $\varvec{s}_k$ at the kth time step and the output at the previous time step $\varvec{y}'_{k-1}$ may be fed together as inputs into the reservoir to train the feature vector $\varvec{x}_k$ for the next time step.

To prevent overfitting in a supervised learning via RC, the weight matrix W is obtained by optimizing $\varvec{y}'_k$ in the least-square sense with respect to the desired target $\varvec{y}_k$, which can be done via the Tikhonov regularization

$$\begin{aligned} W = YX^T(XX^T+\lambda I)^{-1}. \end{aligned}$$

(1)

Here, $Y=(\varvec{y}_0,\ldots ,\varvec{y}_{T-1})\in \mathbb {R}^{D\times T}$ is the target matrix, $X = (\varvec{x}_0,\ldots ,\varvec{x}_{T-1})\in \mathbb {R}^{L\times T}$ is the feature matrix, and $\lambda \ge 0$ is the regularization parameter. Here, T stands for the number of time steps of dynamics used to train the reservoir.

1.2 NG-RC

Despite the fast training protocol offered by RC, the random initialization of the reservoir presents its own problem: there is an overwhelmingly large number of hyperparameters to be optimized, and there is no consensus on how to pick an optimal reservoir. NG-RC is an alternative approach that takes advantage of the discovery that RC can be equivalently performed using a linear reservoir and a nonlinear trainable output layer (Gonon and Ortega 2020; Hart et al. 2021). The latter is in turn equivalent to a nonlinear vector autoregression (NVAR) machine (Bollt 2021). An NVAR machine predicts future observations of a time series using past observations. In particular, the underlying state space of the dynamics to be learned can be reconstructed using linear and nonlinear functions of past observations. Inspired by this correspondence, NG-RC (Gauthier et al. 2021) proposes taking m time-delay data:

$$\begin{aligned} \varvec{o}_k=\varvec{s}_{k}\oplus \varvec{s}_{k-\Delta }\oplus \varvec{s}_{k-2\Delta }\oplus \ldots \oplus \varvec{s}_{k-(m-1)\Delta } \end{aligned}$$

(2)

with step size $\Delta $ as an input, and, forgoing the need for a reservoir, directly constructing a feature vector,

$$\begin{aligned} \varvec{x}_k=\varvec{o}_k\oplus (\varvec{o}_k)^{\otimes p}, \end{aligned}$$

(3)

whose nonlinearity arises from a degree-p monomial of the m-delay data, $\Delta $, m, and p being the NG-RC hyperparameters. The feature vector is then optimized via a linear least-square regularization to predict the target dynamics.

NG-RC with $\Delta =1$, $m=2$, and $p=2$ has been used to efficiently predict the dynamics of the Lorenz attractor using only small data sets (Gauthier et al. 2021). In fact, with this set of hyperparameters, which possess minimal memory of the past and a minimal degree of non-linearity, the NG-RC can still perform very well, as we demonstrate later in this paper, in forecasting the dynamics of both integrable and chaotic Ising models. The feature vector in this case is a $(4D^2+2D)$-dimensional vector of the form

$$\begin{aligned} \varvec{x}_k = \varvec{s}_{k}\oplus \varvec{s}_{k-1} \oplus [(\varvec{s}_{k}\oplus \varvec{s}_{k-1}) \otimes (\varvec{s}_{k}\oplus \varvec{s}_{k-1})]. \end{aligned}$$

(4)

While NG-RC has been demonstrated to effectively forecast complex classical dynamics such as spatiotemporal chaos (Barbosa and Gauthier 2022) and exhibits robust predictive capabilities despite noisy input dynamics (Liu et al. 2023), its potential application to forecasting the dynamics of quantum observables and more generally of full many-body quantum states, which encompass both real and imaginary components, remains unexplored.

1.3 Learning quantum dynamics with NG-RC

Here, we demonstrate the capability of NG-RC to predict a unitary quantum dynamics using the strategy we refer to as skipping ahead. Instead of training the output of the feature vector Eq. 4 to match the target state at the next time step, $\varvec{y}_k = \varvec{s}_{k+1}$, we could train it to match the target state at the next $\tau $ steps, skipping into the future. That is, given $\{\varvec{x}_k\}_{k=0}^{T-1}$ according to Eq. 4, the next generation reservoir is trained using $\{\varvec{y}_k = \varvec{s}_{k+\tau }\}_{k=0}^{T-1}$. We found that our approach of skipping ahead yields a remarkably accurate prediction of the full quantum state into the far future, retaining the fidelity measure close to 1 in the standard example of integrable one-dimensional many-body dynamics that we now discuss.

The one-dimensional transverse-field Ising model is governed by the Hamiltonian

$$\begin{aligned} \mathcal {H}=-J\sum _{i=1}^d Z_{i}Z_{i+1} + h\sum _{i=1}^d X_{i}, \end{aligned}$$

with a periodic boundary condition, where $X_i$ (resp. $Z_i$) is the Pauli-X (resp. Z) operator acting on the ith qubit. The system can be exactly solved by mapping to the free-fermionic model via the Jordan-Wigner transformation. However, the computation of certain expectation values such as the two-point correlation function $\langle {X_iX_j}\rangle $, let alone the computation of the time-evolved state, on a classical computer becomes infeasible as the number of qubits becomes large (Bravyi and Kitaev 2002; Beenakker et al. 2004). (Note that the model can be simulated efficiently on a quantum computer by exact diagonalization (Cervera-Lierta 2018).)

We benchmark the prediction for the time evolution of a four-qubit transverse-field Ising model in the disordered phase with $J=0.5$ and $h=5$. The initial quantum state $|{0000}\rangle $ followed by a long burn-in period initializes the inputs for the predictions of unseen $\widetilde{T}=4\times 10^4$ time steps in the future. (After the burn-in period, the input states are far into the future of those from the training set.) Fig. 2 (left column) shows the results employing the skipping-ahead strategy with the time skip of $\tau =10^6$ steps into the future, whereas Fig. 2 (right column) shows the results from the conventional iterative NG-RC approach to predict successive states in the future. Notably, the skipping-ahead strategy significantly outperforms the conventional approach as reflected by the almost perfect fidelity between the predicted states and the target states, as well as the almost perfect prediction of the observables. The iterative approach’s performances drop as the number of iteration increases, as expected.

In addition, we also benchmark our skipping-ahead approach with a non-integrable, chaotic many-body dynamics in a tilted-field Ising model with five qubits. The fidelity of the predicted states remains above 0.99, showing an excellent agreement with the full many-body states into the future. The details and results are provided in Section S5.

1.4 Challenges in using NG-RC for quantum dynamics predictions

The preceding results provide a piece of evidence that NG-RC may offer an alternative route to a high-precision prediction of quantum dynamics. However, the primary computational bottleneck arises from matrix inversion in Eq. 1 (or its generalization for complex-valued data with $X^T$ replaced by $X^{\dagger }$). The Tikhonov-regularized least squares require $O(MN^2)$ matrix operations, where N and M are, respectively, the larger and the smaller dimension of the matrix. This complexity is computationally prohibitive when the data matrix is collected from many-body quantum states, since the dimension of each column vector scales exponentially with the system size. In the following section, we present an end-to-end, quantum next generation reservoir computing (QNG-RC) algorithm that does not suffer from the exponential complexity of the classical counterpart.

1.5 Related works

A significant portion of work on quantum variants of classical reservoir computing, referred to as quantum reservoir computing (QRC), utilizes quantum systems as reservoirs (substrates) (Fujii and Nakajima 2017; Martínez-Peña et al. 2024; Nokkala et al. 2021; Mujal et al. 2021; Fujii and Nakajima 2021; Kutvonen et al. 2020; Martínez-Peña et al. 2021; Mujal et al. 2023; Martínez-Peña and Ortega 2023; Kubota et al. 2023; Fry et al. 2023). However, the outputs of these QRCs remain classical in nature, namely the expectation values of some observables, which are subsequently used to predict classical dynamical systems, not the full quantum state dynamics as in this work. Other QRC proposals focus on the task to extract partial information about quantum states such as their entanglement or purity (Ghosh et al. 2019) or perform a complete tomography (Ghosh et al. 2020). Kawai and Nakagawa (2020) propose learning excited states of a Hamiltonian by measuring expectation values in the ground state, which are then subsequently optimized using classical machine learning techniques. In all three examples, the main goal is to use QRC to effectively transform final single-qubit measurements into entangled ones that contain the desired information about the quantum state of interest, thus reducing the number or the complexity of measurement settings.

Nevertheless, there exist notable works that share several characteristics with ours. Carleo and Troyer (2017) introduce a method for learning quantum states and dynamics using neural network states, circumventing inductive biases and the exponential size of classical memory needed to process quantum data. Since reservoirs can be considered as special cases of recurrent neural networks, conventional QRC cannot surpass the predictions of the best quantum neural networks. However, the training of neural networks itself is a complex task in its own right, and both RC and NG-RC offer a means to navigate this challenge. Moreover, Ref. Carleo and Troyer (2017) learn a quantum dynamics by classically processing the data obtained by sampling from the trained network, a point that diverges from the approach presented in our work. Another related contribution, highlighted in Ghosh et al. (2021) employs QRC to coherently simulate the target state of a quantum dynamics, whereas our approach shifts the methodology to that of a next generation RC.

While our skipping-ahead method offers a fast route to generate quantum states that would otherwise emerge in the far future through the natural progression of quantum dynamics, the technique should not be confused with the fast-forwarding of quantum evolution, since our method requires training samples from the equally distant past. Moreover, our prediction operates in a model-free framework, devoid of Trotterization techniques. This distinction ensures that our proposed skipping-ahead approach is fundamentally different from fast forwarding. Consequently, the constraints outlined in the no-fast-forwarding literature (Cîrstoiu et al. 2020; Berry et al. 2007; Atia and Aharonov 2017; Haah et al. 2018; Gu et al. 2021) should not apply to our prediction method. Additionally, unlike many QRC proposals that utilize near-term quantum devices as quantum reservoirs, our circuit-based QNG-RC is designed for implementation on fault-tolerant quantum devices.

2 Method and results

2.1 Overview and input assumptions

To overcome the exponential scaling of the memory cost of predicting generic quantum dynamics by means of NG-RC, we turn to our quantum algorithm for NG-RC. At the heart of NG-RC are the creation of nonlinear feature vectors via tensoring, Eq. 4, and the training to find the optimal (complex-valued) weight matrix W via linear least-square regularization. The predictions of future quantum states are then the results of applying the weight matrix to feature vectors.

Our main quantization techniques are block encoding and the quantum singular value transform (QSVT) (Gilyén et al. 2019; Martyn et al. 2021), which allow us to efficiently implement a series of linear-algebraic operations involving the input and target states that block-encodes W into a unitary gate $U_W$ as a quantum circuit (middle of Fig. 1). This constitutes the training phase. $U_W$ is then fed into the circuit of the prediction phase to generate the output state vectors (bottom of Fig. 1).

The input quantum data are fed into the circuits in the form of a coherent superposition of states at different time steps in the dynamics, for example, $|{o_k}\rangle =(|{s_{k}}\rangle |{0}\rangle +|{s_{k-1}}\rangle |{1}\rangle )/\sqrt{2}$ in the training phase. We formalize this input assumption as having the oracles

$$\begin{aligned} \mathcal {O}_0: |{0}\rangle ^{\otimes d}|{k}\rangle&\mapsto |{s_k}\rangle |{k}\rangle ,\end{aligned}$$

(5a)

$$\begin{aligned} \mathcal {O}_{-1}: |{0}\rangle ^{\otimes d}|{k}\rangle&\mapsto |{s_{k-1}}\rangle |{k}\rangle , \end{aligned}$$

(5b)

$$\begin{aligned} \mathcal {O}_{\tau }: |{0}\rangle ^{\otimes d}|{k}\rangle&\mapsto |{s_{k+\tau }}\rangle |{k}\rangle , \end{aligned}$$

(5c)

$$\begin{aligned} \widetilde{\mathcal {O}}_0: |{0}\rangle ^{\otimes d}|{\tilde{k}}\rangle&\mapsto |{\tilde{s}_{\tilde{k}}}\rangle |{\tilde{k}}\rangle , \end{aligned}$$

(5d)

$$\begin{aligned} \widetilde{\mathcal {O}}_{-1}: |{0}\rangle ^{\otimes d}|{\tilde{k}}\rangle&\mapsto |{\tilde{s}_{\tilde{k}-1}}\rangle |{\tilde{k}}\rangle , \end{aligned}$$

(5e)

where $|{s_k}\rangle $ and $|{\tilde{s}_{\tilde{k}}}\rangle $ are d-qubit input data in the training phase and in the prediction phase, respectively, and $k=0,\ldots ,T-1$ and $\tilde{k}=0,\ldots ,\widetilde{T}-1$.

To express the total query complexity of the algorithm, we denote the number of calls to the oracle $\mathcal {O}_i$ (resp. $\widetilde{\mathcal {O}}_i$) by $T_{\mathcal {O}}$ (resp. $T_{\widetilde{\mathcal {O}}}$). For convenience, the states created by the oracles are assumed to be error-free, but the total query complexity will take into account errors that occur in the block encoding processes of various matrices.

In practice, the time complexity of the algorithm would depend on the way the oracles are instantiated. For example, we may have an access to the controlled version of a unitary U that generates each data point, a common assumption in the block-encoding literature (Martyn et al. 2021). In this case, even though we could obtain the quantum state in the future $\tau $ time step by repeatedly applying U, the same state (up to an error) can be generated by a single round of the QNG-RC if it was trained for the time skip of duration $\tau $.

In general, constructing a controlled-U operation is not possible without further knowledge about U. A weaker but more natural assumption is that we are only handed each data point $|{s_k}\rangle $, in which case the uniform superposition can still be created by consuming multiple copies of each data point (Kimmel et al. 2017). However, the complexity of the algorithm in Kimmel et al. (2017) depends on the unknown overlap between the data points we wish to superpose.

2.2 Block encoding

We employ the block-encoding technique to construct the non-linear feature matrix and perform the Tikhonov regularization. Specifically, a relevant matrix A is embedded as a submatrix of a unitary gate U such that

$$\begin{aligned} \left( \langle {0}|^{\otimes a}\otimes I^{\otimes s}\right) U \left( |{0}\rangle ^{\otimes a}\otimes I^{\otimes s}\right) = A/\alpha \text { for } U = \left( \begin{array}{ll} A& * \\ *& *\end{array}\right) , \end{aligned}$$

(6)

where $|{0}\rangle ^{\otimes a}$ is the fiducial state of an a-qubit ancillary system, and $\alpha \ge \Vert {A}\Vert $ due to the unitary constraint. Definition 1 is a concrete definition of the block-encoding.

Definition 1

Suppose that A is an s-qubit operator, the parameters $\alpha ,\epsilon \in \mathbb {R}$, and $a\in \mathbb {N}$. The $(\alpha ,a,\epsilon )$-block-encoding of a $(s+a)$-qubit unitary operator U can be defined if

$$\begin{aligned} \Vert {A-\alpha \left( \langle {0}|^{\otimes a}\otimes I\right) U\left( |{0}\rangle ^{\otimes a}\otimes I\right) }\Vert \le \epsilon . \end{aligned}$$

Once the block-encoded matrix is in place, QSVT allows us to construct a degree-q polynomial approximation of essentially any well-behaved function of the singular values of A, using the number of gates U and controlled operations that are efficient in q (Gilyén et al. 2019; Martyn et al. 2021). This approach enables straightforward creation of matrix polynomials and, in particular, creation of the Moore-Penrose pseudo-inverse by inverting the singular values of A. Note that the Tikhonov regularization in the block-encoded form is also presented as a subroutine in Chakraborty et al. (2023). Our approach here, outlined in Lemma S7 in the supplementary material, is slightly distinct from theirs.

2.3 Training phase

According to the NG-RC procedure, by utilizing regularized linear optimization, we obtain an optimal weight matrix through the Tikhonov-regularized pseudo-inverse of the feature matrix X (cf. Eq. 1 with complex elements). Therefore, we begin by efficiently constructing the feature matrix X. Assuming the existence of oracles $\mathcal {O}_0$ and $\mathcal {O}_{-1}$, we can generate the linear component of the feature vector, denoted as $|{o_k}\rangle $, by introducing an additional qubit to entangle with these oracles. This process is depicted by the operator $U^{\text {lin}}$, shown as a green box in Fig. 1, which maps the state $|{0}\rangle ^{\otimes d+1}|{k}\rangle $ to $|{o_k}\rangle |{k}\rangle $. To incorporate the nonlinear component $|{o_k}\rangle \otimes |{o_k}\rangle $ into the feature vector, we must apply the operator $U^{\text {lin}}$ twice due to the constraints imposed by the no-cloning theorem. The resulting feature vector $|{x_k}\rangle $ Eq. 4 is represented by the quantum circuit $U^f$, which maps the state $|{0}\rangle ^{\otimes 2d+3}|{k}\rangle $ to $|{x_k}\rangle |{k}\rangle $, as also illustrated in Fig. 1 (middle, orange box). Note that, with this encoding method, the dimension of $|{x_k}\rangle $ is $8D^2$, wherein $4D^2+2D$ dimensions constitute the feature vector $\varvec{x}_k$ analogous to the case of classical NG-RC Eq. 4, and the remaining dimensions are for zero padding. We can coherently construct the feature matrix $X=\left( |{x_0}\rangle ,\cdots ,|{x_{T-1}}\rangle \right) \in \mathbb {C}^{8D^2\times T}$ with the block-encoding technology using quantum gates

$$\begin{aligned} \left( I^{\otimes \max (0,2d+3-t)}\otimes H^{\otimes t}\otimes I^{\otimes \max (2d+3,t)}\right) \cdot \text {SWAP}\cdot U^f, \end{aligned}$$

which is the $(\sqrt{T},\max (2d+3,t),0)$-block-encoding of the feature matrix X, illustrated in Fig. 1 (middle). Note that the dimension of X here is $\max (T,8D^2)\times \max (T,8D^2)$, which includes another zero padding to convert the submatrix X in the block-encoding unitary into a square submatrix (Gilyén et al. 2019).

Due to the requirement for matrix multiplication involving Y and the pseudo-inverse of X in Eq. 1, the size of submatrix Y in the block-encoding unitary must align with the dimension of the submatrix X. Utilizing block encoding techniques, the quantum circuit

$$\begin{aligned} \left( I^{\otimes \max (0,2d+3-t)}\otimes H^{\otimes t}\otimes I^{\otimes \max (2d+3,t)}\right) \cdot \textsc {SWAP}\cdot \mathcal {O}_\tau , \end{aligned}$$

along with pre-amplification proposed in Chakraborty et al. (2019), can effectively create the $(\sqrt{2}\Vert Y\Vert ,\max (2d+3,t)+1,\delta _Y)$-block encoding of Y, where $\delta _Y$ is the error of the pre-amplification.

By virtue of Lemma S7 in the supplementary material, the optimal regularized weight matrix Eq. 1 in the block-encoded form can be constructed with the resources stated below in Theorem 2.

Theorem 2

Let $\delta _W\in (0,1]$. Suppose that we have the $(\sqrt{T},\max (2d+3,t),0)$-block-encoding of the feature matrix X and the $(\sqrt{2}\Vert Y\Vert ,\max (2d+3,t)+1,\delta _Y)$-block encoding of training target matrix Y. Let $\kappa _X$ be the condition number of X, and

$$\begin{aligned} \kappa = \kappa _X\sqrt{\frac{\Vert {X}\Vert ^2+\lambda }{\Vert {X}\Vert ^2+\lambda \kappa _X^2}}. \end{aligned}$$

(7)

where $\lambda \ge 0$ is the regularization parameter (Chakraborty et al. 2023). Also, define

$$\begin{aligned} w=2\max (2d+3,t)+2 \end{aligned}$$

(8)

to be the number of ancilla qubits used in the block encoding of W. In a condition such that $\delta _Y\le \delta _W/(4\kappa )$, then we can construct the unitary operator $U_W$ as a $\left( \frac{2\sqrt{2}\kappa \Vert Y\Vert }{\Vert {X}\Vert +\sqrt{\lambda }},w,\delta _W\right) $-block-encoding of the weight matrix W in

$$\begin{aligned} T_W=O\left( \left( \frac{\kappa }{\Vert {X}\Vert +\sqrt{\lambda }}+\frac{1}{\Vert {Y}\Vert }\right) \log \left( \frac{\kappa \Vert {Y}\Vert }{\delta _W}\right) T_\mathcal {O}\sqrt{T}\right) \end{aligned}$$

(9)

queries, where $T_\mathcal {O}$ is the number of calls to the oracles.

In the case $t>2d+3$, $U_W$ would be a $\Big (\frac{2\sqrt{2}\kappa \Vert Y\Vert }{\Vert {X}\Vert +\sqrt{\lambda }},2t+2,$ $\delta _W\Big )$-block encoding of the weight matrix W. The query complexity Eq. 9 can be expressed more directly in terms of T and D using inequality of operator norm. Since each column in both X and Y is unit vector, implying that we have an inequality $\frac{1}{\sqrt{T}}\le \Vert {X}\Vert \le D\sqrt{T}$ and $\frac{1}{\sqrt{T}}\le \Vert {Y}\Vert \le \sqrt{DT}$, respectively. Matrix norm inequalities are detailed in Section S2 B of the supplementary material. By assuming $\lambda =O(1)$, the time complexity is expressed to

$$\begin{aligned} T_W = O\left( \kappa T\log \left( \frac{\kappa DT}{\delta _W}\right) T_\mathcal {O}\right) , \end{aligned}$$

(10)

Note that the time complexity here is difficult to compare to that of the classical NG-RC because of the unknown gate complexity of implementing $\mathcal {O}_i$. We can only say that the number of calls to $\mathcal {O}_i$ is of the order of $O\left( \kappa T\log \left( \frac{\kappa DT}{\delta _W}\right) \right) $.

2.4 Prediction phase

Now that we have the trained weight matrix in the block-encoded form, $U_W$, we can build a circuit to predict future quantum states by the skipping-ahead method. As shown in Fig. 1, the quantum circuit for the prediction phase of the QNG-RC has the same structure as that for the training phase, but with input past quantum states incorporated via the oracles $\widetilde{\mathcal {O}}_0$ and $\widetilde{\mathcal {O}}_{-1}$ defined in Eqs. 5d and 5e, respectively. The output of the circuit is the desired prediction vector $|{\tilde{s}_{\tilde{k}+\tau }}\rangle =W|{\tilde{x}_{\tilde{k}}}\rangle /\Vert {W|{\tilde{x}_{\tilde{k}}}\rangle \Vert }_2$ up to an error $\delta \in (0,1)$ due to an inaccuracy in the application of the weight matrix W. More specifically, we have an inequality

$$\begin{aligned} \delta _W\le \frac{\delta \Vert {W}\Vert }{4\kappa _W}, \end{aligned}$$

(11)

where $\delta _W$ is the error in Theorem 2, and $\kappa _W$ is the condition number of W. Employing the pre-amplification technique for block encoding applied on a quantum state in Ref. Chakraborty et al. (2023), the number of queries to produce such a $\delta $-close prediction vector is

$$\begin{aligned} O\left( \frac{\kappa \kappa _W}{\Vert {X}\Vert +\sqrt{\lambda }}\frac{\Vert {Y}\Vert }{\Vert {W}\Vert }\log \left( \frac{\kappa _W}{\delta }\right) T_W+\kappa _WT_{\widetilde{\mathcal {O}}}\right) , \end{aligned}$$

where $\kappa $ is defined in Eq. 7. What is more, the operator norm inequality allows us to rewrite the query complexity as $O\left( {\kappa \kappa _W}\log \left( \kappa _W/\delta \right) T_W+\kappa _WT_{\widetilde{\mathcal {O}}}\right) $. The technical details regarding the pre-amplification and the operator norm inequality are given in Lemma S4 and Section S2 B in the supplementary material.

Number of ancilla qubits.–The overall circuit for the prediction represents the unitary operator

$$\begin{aligned} \tilde{U}: |{0}\rangle ^{\otimes w'}|{0}\rangle ^{\otimes 2d+3}|{\tilde{k}}\rangle \mapsto |{0}\rangle ^{\otimes (w'+d+3)}|{\tilde{s}_{\tilde{k}+\tau }}\rangle |{\tilde{k}}\rangle , \end{aligned}$$

(12)

where the register $|{\tilde{k}}\rangle $ contains $\max \left( \tilde{t},2d+3\right) $ qubits. The block encoding requires in total $w'=w+\max (0,t-2d-3)$ qubits. The w given in Theorem 2 is the number of index qubits in the block-encoding (BE) register, and $\max (0,t-2d-3)$ is the number of qubits used in the zero padding in the register RD, see Fig. 1 and the corresponding caption. In the case $t>2d+3$, the number of ancilla qubits is $w'=3t-2d-1$ and $w'=4d+8$ for $t<2d+3$.

3 Conclusion and outlooks

Drawing inspiration from the paradigm of NG-RC, we develop a novel, end-to-end quantum algorithm for predicting skipped-ahead many-body quantum dynamics. The algorithm requires a time-series of quantum data with no assumption on the unitary generating the dynamics, i.e., the forecasting is model-free. The algorithm employs block encoding to efficiently handle matrix algebra subroutines, including matrix multiplication, inversion, and regularized linear optimization, which are manipulated by the QSVT. In addition to providing a quantum computational speedup and avoiding exponential resource consumption in storing many-body quantum states classically, our algorithm coherently processes and generates quantum data, thereby circumventing classical-quantum data conversion problems during encoding and readout procedures. Since the main advantage of the QNG-RC is the coherent output of far-future quantum states, it is worth investigating how the QNG-RC can be interfaced with or used alongside other algorithms. One potential application could be using the QNG-RC to output future states within a thermalized quantum ensemble, thereby functioning as a sampling algorithm. However, the specific dynamics and initial states for which the QNG-RC is most effective in such scenarios remain an open question and warrant further studies.

Both the NG-RC and the QNG-RC algorithms are heuristic in the sense that the convergence of the predictions to the true state is not guaranteed. Therefore, determining the effectiveness of this method for larger quantum systems and developing benchmarks when classical simulation is no longer feasible remains an outstanding challenge. In contrast, it is worth noting that our algorithm is not heuristic during the training stage. The “trained” optimal weight matrix is derived as a closed-form solution of the regularized least-squares optimization problem, rather than through gradient descent. As a result, the training is not susceptible to issues such as the barren plateau phenomenon that often affects the training of variational quantum circuits (McClean et al. 2018).

In the original NG-RC framework, a feature vector $\varvec{x}_i$ can be constructed with a degree-p monomial of an m-time-delay data, see Eqs. 2–3. In our quantum circuit design, though we assumed $m=2$ for simplicity, the quantum operator $U^\text {lin}$ can be slightly modified to incorporate the oracles $\{\mathcal {O}_{-j\Delta }\}_{j=0}^{m-1}$ for an m-time-delayed $|{o_k}\rangle $. A quantum circuit that encodes the nonlinear map of Fig. 1 would then involve the addition of multiple $U^\text {lin}$ and more qubits to accommodate the tensor product $(\varvec{o}_i)^{\otimes p}$. The construction of the corresponding quantum circuits of this more general form of non-linearity is elaborated in Section S3 of the supplementary material.

Lastly, the conventional application of the NG-RC to make iterative predictions, whose results are shown in Fig. 2b, can also be implemented on a quantum circuit. However, the number of gates and the error scales exponentially with the number of iterations. Nevertheless, we discuss a quantum circuit for this approach in Section S4 of the supplementary material, along with the analysis of the error propagation.

Data Availability

The codes used in this study are available upon reasonable request to the corresponding author.

Change history

31 January 2025
The original online version of this article was revised: In the Funding section of this article the grant number relating to the Program Management Unit for Human Resources & Institutional Development, Research and Innovation was incorrectly given as grant number B39G670016 and should have been grant number B39G670018.
05 February 2025
A Correction to this paper has been published: https://doi.org/10.1007/s42484-025-00245-9

References

Aaronson S (2015) Nature Physics 11:291. https://doi.org/10.1038/nphys3272
Article Google Scholar
Atia Y, Aharonov D (2017) Nature Communications 8:1572. https://doi.org/10.1038/s41467-017-01637-7
Article Google Scholar
Barbosa WAS, Gauthier DJ Chaos: An Interdisciplinary Journal of Nonlinear Science 32:093137.https://doi.org/10.1063/5.0098707
Beenakker C, DiVincenzo D, Emary C, Kindermann M (2004) Physical review letters 93:020501
Article Google Scholar
Berry DW, Ahokas G, Cleve R, Sanders BC (2007) Communications in Mathematical Physics 270:359. https://doi.org/10.1007/s00220-006-0150-x
Article MathSciNet Google Scholar
Bollt E (2021) Chaos: An Interdisciplinary Journal of Nonlinear Science 31:013108. https://doi.org/10.1063/5.0024890
Bravyi SB, Kitaev AY (2002) Annals of Physics 298:210
Article MathSciNet Google Scholar
Carleo G, Troyer M (2017) Science 355:602. https://doi.org/10.1126/science.aag2302
Article MathSciNet Google Scholar
Caro MC, Huang H-Y, Ezzell N, Gibbs J, Sornborger AT, Cincio L, Coles PJ, Holmes Z (2023) Nature Communications 14:3751. https://doi.org/10.1038/s41467-023-39381-w
Article Google Scholar
Cervera-Lierta A (2018) Quantum 2:114
Article Google Scholar
Chakraborty S, Gilyén A, Jeffery S (2019) 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019), Leibniz International Proceedings in Informatics (LIPIcs), Vol. 132, edited by Baier C, Chatzigiannakis I, Flocchini P, Leonardi S (Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany. pp 33:1–33:14. https://doi.org/10.4230/LIPIcs.ICALP.2019.33
Chakraborty S, Morolia A, Peduri A (2023) Quantum 7:988. https://doi.org/10.22331/q-2023-04-27-988
Cîrstoiu C, Holmes Z, Iosue J, Cincio L, Coles PJ, Sornborger A (2020) npj Quantum Information 6:82. https://doi.org/10.1038/s41534-020-00302-0
D’Alessio L, Kafri Y, Polkovnikov A, Rigol M (2016) Advances in Physics 65:239. https://doi.org/10.1080/00018732.2016.1198134
Article Google Scholar
Flurin E, Martin LS, Hacohen-Gourgy S, Siddiqi I (2020) Phys. Rev. X 10:011006. https://doi.org/10.1103/PhysRevX.10.011006
Article Google Scholar
Fry D, Deshmukh A, Chen SY-C, Rastunkov V, Markov V (2023) Optimizing quantum noise-induced reservoir computing for nonlinear and chaotic time series prediction. arXiv:2303.05488
Fujii K, Nakajima K (2017) Physical Review Applied 8:024030
Article Google Scholar
Fujii K, Nakajima K (2021) Quantum reservoir computing: a reservoir approach toward quantum machine learning on near-term quantum devices. In: Reservoir Computing: Theory, Physical Implementations, and Applications. Edited by Nakajima K, Fischer I (Springer Singapore, Singapore) pp 423–450. https://doi.org/10.1007/978-981-13-1687-6_18
Gauthier DJ, Bollt E, Griffith A, Barbosa WAS (2021) Nature Communications 12:5564. https://doi.org/10.1038/s41467-021-25801-2
Article Google Scholar
Ghosh S, Opala A, Matuszewski M, Paterek T, Liew TC (2020) IEEE Transactions on Neural Networks and Learning Systems 32:3148
Article Google Scholar
Ghosh S, Krisnanda T, Paterek T, Liew TC (2021) Communications Physics 4:105
Article Google Scholar
Ghosh S, Opala A, Matuszewski M, Paterek T, Liew TC (2019) npj Quantum Information 5:35
Gilyén A, Su Y, Low GH, Wiebe N (2019) In:Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing, STOC 2019 (Association for Computing Machinery, New York, NY, USA) pp 193-204. https://doi.org/10.1145/3313276.3316366
Gonon L, Ortega J-P (2020) IEEE Transactions on Neural Networks and Learning Systems 31:100. https://doi.org/10.1109/TNNLS.2019.2899649
Article MathSciNet Google Scholar
Granade CE, Ferrie C, Wiebe N, Cory DG (2012) New Journal of Physics 14:103013. https://doi.org/10.1088/1367-2630/14/10/103013
Article MathSciNet Google Scholar
Gu S, Somma RD, Şahinoğlu B (2021) Quantum 5:577. https://doi.org/10.22331/q-2021-11-15-577
Haah J, Hastings M, Kothari R, Low GH (2018) In: 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS) pp 350–360. https://doi.org/10.1109/FOCS.2018.00041
Hart AG, Hook JL, Dawes JH (2021) Physica D: Nonlinear Phenomena 421:132882. https://doi.org/10.1016/j.physd.2021.132882
Article Google Scholar
Hartmann MJ, Carleo G (2019) Phys Rev Lett 122:250502. https://doi.org/10.1103/PhysRevLett.122.250502
Article Google Scholar
Huang H-Y, Broughton M, Cotler J, Chen S, Li J, Mohseni M, Neven H, Babbush R, Kueng R, Preskill J (2022) McClean JR 376:1182. https://www.science.org/doi/pdf/10.1126/science.abn7293
Huang H-Y, Chen S, Preskill J (2023) Learning to predict arbitrary quantum processes. arXiv:2210.14894
Jerbi S, Gibbs J, Rudolph MS, Caro MC, Coles PJ, Huang H-Y, Holmes Z (2023) The power and limitations of learning quantum dynamics incoherently. arXiv:2303.12834
Karthik J, Sharma A, Lakshminarayan A (2007) Phys Rev A 75:022304. https://doi.org/10.1103/PhysRevA.75.022304
Article Google Scholar
Kawai H, Nakagawa YO (2020) Machine Learning: Science and Technology 1:045027
Google Scholar
Kimmel S, Lin CY-Y, Low GH, Ozols M, Yoder TJ (2017) npj Quantum Information 3:13. https://doi.org/10.1038/s41534-017-0013-7
Kubota T, Suzuki Y, Kobayashi S, Tran QH, Yamamoto N, Nakajima K (2023) Phys Rev Res 5:023057. https://doi.org/10.1103/PhysRevResearch.5.023057
Article Google Scholar
Kutvonen A, Fujii K, Sagawa T (2020) Scientific Reports 10:14687. https://doi.org/10.1038/s41598-020-71673-9
Article Google Scholar
Liu S, Xiao J, Yan Z, Gao J (2023) Nonlinear Dynamics 111:14295. https://doi.org/10.1007/s11071-023-08592-7
Martínez-Peña R, Ortega J-P (2023) Phys Rev E 107:035306. https://doi.org/10.1103/PhysRevE.107.035306
Article Google Scholar
Martínez-Peña R, Giorgi GL, Nokkala J, Soriano MC, Zambrini R (2021) Phys Rev Lett 127:100502. https://doi.org/10.1103/PhysRevLett.127.100502
Martínez-Peña R, Nokkala J, Giorgi GL, Zambrini R, Soriano MCa (2024) Cognitive Computation, 1
Martyn JM, Rossi ZM, Tan AK, Chuang IL (2021) PRX Quantum 2:040203. https://doi.org/10.1103/PRXQuantum.2.040203
Matteo OD, Gheorghiu V, Mosca M (2020) IEEE Transactions on Quantum Engineering. 1:1. https://doi.org/10.1109/TQE.2020.2965803
McClean JR, Boixo S, Smelyanskiy VN, Babbush R, Neven H (2018) Nature communications 9:4812
Article Google Scholar
Mohseni N, Fösel T, Guo L, Navarrete-Benlloch C, Marquardt F (2022) Quantum 6:714. https://doi.org/10.22331/q-2022-05-17-714
Mujal P, Martínez-Peña R, Giorgi GL, Soriano MC, Zambrini R (2023) npj Quantum Information 9:16. https://doi.org/10.1038/s41534-023-00682-z
Mujal P, Martínez-Peña R, Nokkala J, García-Beni J, Giorgi G, Soriano M, Zambrini R (2021) Advanced Quantum Technologies 4. https://doi.org/10.1002/qute.202100027
Nokkala J, Martínez-Peña R, Giorgi GL, Parigi V, Soriano MC, Zambrini R Communications Physics 4:53
Pathak J, Hunt B, Girvan M, Lu Z, Ott E (2018) Phys Rev Lett 120:024102. https://doi.org/10.1103/PhysRevLett.120.024102
Article Google Scholar
Pathak J, Lu Z, Hunt B, Girvan M, Ott E (2017) Chaos: An Interdisciplinary Journal of Nonlinear Science 27. https://doi.org/10.1063/1.5010300
Rodríguez LEH, Ullah A, Espinosa KJR, Dral PO, Kananenka AA (2022) Machine Learning: Science and Technology 3:045016. https://doi.org/10.1088/2632-2153/ac9a9d
Article Google Scholar
Schatzki L, Arrasmith A, Coles PJ, Cerezo M (2021) arXiv:2109.03400
Schmitt M, Heyl M (2020) Phys Rev Lett 125:100503. https://doi.org/10.1103/PhysRevLett.125.100503
Article MathSciNet Google Scholar
Tang E (2019) Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing, STOC 2019 (Association for Computing Machinery, New York, NY, USA,) pp 217-228. https://doi.org/10.1145/3313276.3316310

Download references

Acknowledgements

We thank Supanut Thanasilp, Zoë Holmes, and Dario Poletti for a helpful discussion. This research has received funding support from the NSRF via the Program Management Unit for Human Resources & Institutional Development, Research and Innovation [grant number B39G670018] and by Thailand Science research and Innovation Fund Chulalongkorn University (IND66230005). The authors acknowledge the National Science and Technology Development Agency, National e-Science Infrastructure Consortium, Chulalongkorn University and the Chulalongkorn Academic Advancement into Its 2nd Century Project, NSRF via the Program Management Unit for Human Resources & Institutional Development, Research and Innovation [grant numbers B05F650021, B37G660013] (Thailand) for providing computing infrastructure that has contributed to the research results within this paper.

Author information

Authors and Affiliations

Chula Intelligent and Complex Systems, Department of Physics, Faculty of Science, Chulalongkorn University, Bangkok, 10330, Thailand
Apimuk Sornsaeng & Thiparat Chotibut
Science, Mathematics and Technology Cluster, Singapore University of Technology and Design, 8 Somapah Road, Singapore, 487372, Singapore
Apimuk Sornsaeng
The Institute for Fundamental Study, Naresuan University, Phitsanulok, 65000, Thailand
Ninnat Dangniam

Authors

Apimuk Sornsaeng
View author publications
You can also search for this author in PubMed Google Scholar
Ninnat Dangniam
View author publications
You can also search for this author in PubMed Google Scholar
Thiparat Chotibut
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed equally.

Corresponding author

Correspondence to Thiparat Chotibut.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original online version of this article was revised: In the Funding section of this article the grant number relating to the Program Management Unit for Human Resources & Institutional Development, Research and Innovation was incorrectly given as grant number B39G670016 and should have been grant number B39G670018.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (pdf 14127 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Sornsaeng, A., Dangniam, N. & Chotibut, T. Quantum next generation reservoir computing: an efficient quantum algorithm for forecasting quantum dynamics. Quantum Mach. Intell. 6, 57 (2024). https://doi.org/10.1007/s42484-024-00188-7

Download citation

Received: 19 March 2024
Accepted: 31 July 2024
Published: 09 September 2024
DOI: https://doi.org/10.1007/s42484-024-00188-7

Quantum next generation reservoir computing: an efficient quantum algorithm for forecasting quantum dynamics

Abstract

Similar content being viewed by others

Optimal training of finitely sampled quantum reservoir computers for forecasting of chaotic dynamics

Optimizing quantum noise-induced reservoir computing for nonlinear and chaotic time series prediction

Predicting three-dimensional chaotic systems with four qubit quantum systems