UC BERKELEY
EECS technical reports

Statistical Results on Filtering and Epi-convergence for Learning-Based Model Predictive Control

Authors:
Aswani, Anil
Gonzalez, Humberto
Sastry, S. Shankar
Tomlin, Claire
Technical Report Identifier: EECS-2011-153
December 17, 2011
EECS-2011-153.pdf

Abstract: Learning-based model predictive control (LBMPC) is a technique that provides deterministic guarantees on robustness, while statistical identification tools are used to identify richer models of the system in order to improve performance. This technical note provides a result that elucidates the reasons for the choice of measurement model used with LBMPC, and it gives proofs concerning the stochastic convergence of LBMPC. The first part of this note discusses simultaneous state estimation and statistical identification (or learning) of unmodeled dynamics, for dynamical systems that can be described by ordinary differential equations (ODEs). The second part provides proofs concerning the epi-convergence of different statistical estimators that can be used with the LBMPC technique. In particular, we prove results on the statistical properties of a nonparametric estimator that we have designed to have the correct deterministic and stochastic properties for numerical implementation when used in conjunction with LBMPC.