29  Large scale systems

Updated

January 6, 2025

Keywords

LQR, mean-field control

Summary

The Riccati update for LQ systems has a complexity O(dx3), which dx is the size of the state space. So even computing the otimal gains in a large-scale system is computationally challenging. In this section we show that under certain regularity and symmetry assumptions, the optimal solution of a large-scale system can be computed with low complexity.

29.1 Mean-field control

Consider a system consisting of N subsystems, indexed by the set N:={1,,N}. Each subsystem iN has a state xtiRdx and a control input utiRdu. The dynamics of each subsystem are given as (1)xt+1i=Axti+Buti+Dx¯t+Eu¯t+wti where (2)x¯t:=1NiNxtiandu¯t:=1NiNuti are the emperical mean-field of the state and control, respectively and A, B, D, E are matrices of appropriate dimensions. The noise processes {wti}t1, iN are correlated across subsystem but are assumed to be independent across time.

We use xxt:=(xt1,,xtN) and uut:=(ut1,,utN) to denote the global state and control of the system. The system incurs a per-step cost given by (3)c(xxt,uut)=x¯tQ¯x¯t+u¯tR¯u¯t+1NiN[(xti)Qxti+(uti)Ruti] and a terminal cost (4)cT(xxT)=x¯TQ¯Tx¯T+1NiN(xTi)QTxTi

We are interested in identifying policies g=(g1,,gT1) where uut=gt(xxt) to minimize (5)J(g)=Eg[t=1T1c(xxt,uut)+cT(xxT)]

Weakly coupled dynamics and cost

Note that the subsystems are weakly coupled in the dynamics and cost. A naive solution using by solving a single Riccati equation has a complexity of O(dx3N3), which scales cubically in the number of agents.

29.1.1 State space decomposition

We now present a decomposition method to simplify the above optimization problem. Note that (1) implies that (6)x¯t=(A+D)x¯t+(B+E)u¯t+w¯t where w¯t=(iNwti)/N. Define x˘ti=xtix¯tandu˘ti=utiu¯t. Then, subtracting (6) from (1), we get (7)x˘ti=Ax˘ti+Bu˘ti+w˘ti where w˘ti=wtiw¯t.

We can think of x¯t as the “center of mass” of the system and x˘ti to be the relative coordinates of subsystem i wrt the center of mass. Building on this intuition, we make the following simple observation, which may be viewed as an analog of the parallel axis theorem in physics.

Lemma 29.1 We have the following:

  1. 1NiN(xti)Qxti=x¯tQx¯t+1NiN(x˘ti)Qx˘ti.
  2. 1NiN(uti)Quti=u¯tQu¯t+1NiN(u˘ti)Qu˘ti.
Proof

The proof follows from the observation that iNx˘ti=0 and elementary algebra.

An immediate implication of is that the per-step cost can be decomposed as follows: (8)c(xxt,uut)=c¯(x¯t,u¯t)+1NiNc˘i(x˘ti,u˘ti) where c¯(x¯t,u¯t)=x¯t(Q+Q¯)x¯t+u¯t(R+R¯)u¯t,c˘i(x˘ti,u˘ti)=1NiN[(x˘ti)Qx˘ti+(u˘ti)Qu˘ti]. A similar decomposition also holds for the terminal cost ct(xxt,uut).

Thus, the original system is equivalent to N+1 coupled subsystems:

  • a mean-field subsystem with state x¯t, control input u¯t, and per-step cost c¯(x¯t,u¯t).
  • N auxiliary subsytems, where subsystem iN has state x˘ti, control input u˘ti, and per-step cost c˘(x˘ti,u˘ti).

Note that the only coupling between the subsystems is through the noise. Therefore, by the argument presented in , the optimal control strategy is of the following form.

Proposition 29.1 The optimal control policy for the mean-field control system described above is given by ut=K¯tx¯t+K˘t(xtix¯t) where

  • K¯1:T1=LQRT(A+D,B+E,Q+Q¯,R+R¯;QT+Q¯T)
  • K˘1:T1=LQRT(A,B,Q,R;QT).
Significance of the result

The above result is significant, both for synthesis and implementation.

For synthesis, rather than solving one Riccati equation with state dimension nN, we need to solve two Riccati equations with dimension dx. Thus, the complexity of computing the optimal controller gains does not depend on the number N of subsystems.

For implementation, each subsystem does not need access to the global state xxt; instead it just needs access to the mean-field x¯t in addition to its local state xti.

29.2 Network coupled subsystems

Notes

The results for mean-field control are adapted from Arabneydi and Mahajan (). The discussion above is restricted to the simplest setting of homogenous subsystems. Generalization to hetrogeneous subsystems and infinite horizon settings are also presented in Arabneydi and Mahajan (). A similar result for N is also presented in Elliott et al. ().

The results for network coupled subsystems are adapted from Gao and Mahajan ().

close all nutshells