Statistical Results on Filtering and Epi-convergence for Learning-Based Model Predictive Control
Sastry, S. Shankar
Technical Report Identifier: EECS-2011-153
December 17, 2011
Abstract: Learning-based model predictive control (LBMPC) is a technique that provides deterministic guarantees on robustness, while statistical identification tools are used to identify richer models of the system in order to improve performance. This technical note provides a result that elucidates the reasons for the choice of measurement model used with LBMPC, and it gives proofs concerning the stochastic convergence of LBMPC. The first part of this note discusses simultaneous state estimation and statistical identification (or learning) of unmodeled dynamics, for dynamical systems that can be described by ordinary differential equations (ODE's). The second part provides proofs concerning the epi-convergence of different statistical estimators that can be used with the LBMPC technique. In particular, we prove results on the statistical properties of a nonparametric estimator that we have designed to have the correct deterministic and stochastic properties for numerical implementation when used in conjunction with LBMPC.