The Riccati update for LQ systems has a complexity , which is the size of the state space. So even computing the otimal gains in a large-scale system is computationally challenging. In this section we show that under certain regularity and symmetry assumptions, the optimal solution of a large-scale system can be computed with low complexity.
Mean-field control
Consider a system consisting of subsystems, indexed by the set . Each subsystem has a state and a control input . The dynamics of each subsystem are given as where are the emperical mean-field of the state and control, respectively and , , , are matrices of appropriate dimensions. The noise processes , are correlated across subsystem but are assumed to be independent across time.
We use and to denote the global state and control of the system. The system incurs a per-step cost given by and a terminal cost
We are interested in identifying policies where to minimize
Note that the subsystems are weakly coupled in the dynamics and cost. A naive solution using by solving a single Riccati equation has a complexity of , which scales cubically in the number of agents.
State space decomposition
We now present a decomposition method to simplify the above optimization problem. Note that implies that where . Define Then, subtracting from , we get where .
We can think of as the “center of mass” of the system and to be the relative coordinates of subsystem wrt the center of mass. Building on this intuition, we make the following simple observation, which may be viewed as an analog of the parallel axis theorem in physics.
Lemma 29.1 We have the following:
- .
- .
The proof follows from the observation that and elementary algebra.
An immediate implication of Lemma 29.1 is that the per-step cost can be decomposed as follows: where A similar decomposition also holds for the terminal cost .
Thus, the original system is equivalent to coupled subsystems:
- a mean-field subsystem with state , control input , and per-step cost .
- auxiliary subsytems, where subsystem has state , control input , and per-step cost .
Note that the only coupling between the subsystems is through the noise. Therefore, by the argument presented in Exercise 28.3, the optimal control strategy is of the following form.
Proposition 29.1 The optimal control policy for the mean-field control system described above is given by where
The above result is significant, both for synthesis and implementation.
For synthesis, rather than solving one Riccati equation with state dimension , we need to solve two Riccati equations with dimension . Thus, the complexity of computing the optimal controller gains does not depend on the number of subsystems.
For implementation, each subsystem does not need access to the global state ; instead it just needs access to the mean-field in addition to its local state .
Network coupled subsystems
Notes
The results for mean-field control are adapted from Arabneydi and Mahajan (2016). The discussion above is restricted to the simplest setting of homogenous subsystems. Generalization to hetrogeneous subsystems and infinite horizon settings are also presented in Arabneydi and Mahajan (2016). A similar result for is also presented in Elliott et al. (2013).
The results for network coupled subsystems are adapted from Gao and Mahajan (2022).
Arabneydi, J. and Mahajan, A. 2016. Linear quadratic mean field teams: Optimal and approximately optimal decentralized solutions. Available at:
https://arxiv.org/abs/1609.00056v2.
Elliott, R., Li, X., and Ni, Y.-H. 2013. Discrete time mean-field stochastic linear-quadratic optimal control problems.
Automatica 49, 11, 3222–3233. DOI:
10.1016/j.automatica.2013.08.017.
Gao, S. and Mahajan, A. 2022. Optimal control of network-coupled subsystems: Spectral decomposition and low-dimensional solutions.
IEEE Transactions on Control of Network Systems 9, 2, 657–669. DOI:
10.1109/tcns.2021.3124259.