- Traditional stochastic methods primarily assume random noise.
- 🚫 They often fail to account for parametric uncertainty or unmodeled dynamics.
- Iterative Learning Control (ILC) falls under the general category of adaptive control. It adapts to unknown models to achieve optimal control.
- 🛡️ Robust control offers a more conservative approach. It aims to fit all possible models, often trading off robustness against optimality.
- Linear Quadratic Gaussian (LQG) control can be highly fragile in the presence of model uncertainty.
Consider the following system dynamics:
Here, the disturbance ( w_n ) could represent constant parameters, time-varying non-smooth forces, or even model errors. We make an assumption that:
The optimization problem can be formulated as:
This is a min-max optimization problem, which is typically challenging to solve.
For non-linear systems, one can use ( H_\infty ) control, which is a generalized form of LQG. Local approximation solutions are also viable.
Minimax DDP is another variation of LQR tailored for robust control. The approach uses local/linear/quadratic Taylor expansion to iteratively find a locally-optimal trajectory and feedback policy.
Given:
The action-value function can be expressed as:
Using the Bellman equation (cost-to-go function):
From the gradient:
The solution can be derived as:
$$ \begin{align*} \Rightarrow \Delta u &= -d -K\Delta x\ d&=(G_{uu}-\underbrace{G_{uw}G^{-1}{ww}G{wu}}{\text{robust term}})^{-1}(g_u-\underbrace{G{uw}G^{-1}{ww}g_w}{\text{robust term}})\ K&=(G_{uu}-\underbrace{G_{uw}G^{-1}{ww}G{wu}}{\text{robust term}})^{-1}(G{ux}-\underbrace{G_{uw}G^{-1}{ww}G{wu}}_{\text{robust term}})) \end{align*} $$
For the adversary:
$$ \begin{align*} \Delta w &= -e - L\Delta x \ e &= (G_{ww} - G_{wu} G^{-1}{uu} G{uw})^{-1} (g_w - G_{wu} G^{-1}{uu} g_u) \ L &= (G{ww} - G_{wu} G^{-1}{uu} G{uw})^{-1} (G_{wx} - G_{wu} G^{-1}{uu} G{ux}) \end{align*} $$
The new value function is:
$$ \begin{align*} V_{n-1}(x+\Delta x) &= V_{n-1}(x) + p_{n-1}^T \Delta x + \frac{1}{2} \Delta x^T P_{n-1} \Delta x \ p_{n-1} &= g_x - G_{xu} G^{-1}{uu} g_u - G{xw} G_{ww}^{-1} g_w \ P_{n-1} &= G_{xx} - G_{xu} G^{-1}{uu} G{ux} - G_{xw} G_{ww}^{-1} G_{wx} \end{align*} $$
A significant difference with standard DDP is the requirement for ( \text{dim}(x) + \text{dim}(u) ) number of positive eigenvalues and ( \text{dim}(w) ) negative eigenvalues.
For the quadratic programming, the equation should be:
Here, a larger ( ||w|| ) means more robustness. As ( ||w|| ) approaches infinity, the problem converges to the standard DDP.
Robust control, particularly through the use of Minimax DDP, provides a powerful framework for handling uncertainties in control systems. By considering both the control and disturbance variables, it offers a more conservative and reliable approach compared to traditional stochastic methods.
The inclusion of robust terms in the equations ensures that the system can handle uncertainties in both the model parameters and external disturbances. This makes it suitable for real-world applications where perfect knowledge of the system is often unattainable.
The mathematical formulation and iterative methods used in Minimax DDP allow for the fine-tuning of the trade-off between robustness and optimality, providing a flexible tool for control system designers.