Peter Lehmann
(Advisor: Prof. Theodorou)
will defend a master’s thesis entitled,
Maximum Entropy Sequential Quadratic Programming
On
Monday, April 21 at 10:00 a.m.
Coda C1315 Grant Park
756 W Peachtree St NW
Abstract
Trajectory optimization (TO) plays a critical role across a broad spectrum of scientific and engineering fields, including robotics, energy and power systems, economics, and biomechanics. Through the lens of optimal control, TO often involves solving constrained optimization problems with non-convex objective function, nonlinear state and actuation constraints, and nonlinear dynamics. A widely used technique to solve such problems is Sequential Quadratic Programming (SQP). This method iteratively solves a series of quadratic subproblems, each of which relies on first- and second-order approximations of the constraints and cost around a nominal trajectory. However, since each subproblem only captures local information, SQP can suffer from converging to local minima. To overcome this limitation, this thesis proposes Maximum Entropy Sequential Quadratic Programming (MESQP). The proposed method leverages a stochastic policy and introduces an entropy-based regularization into the objective function. This regularization encourages exploration during optimization and enables the formulation of MESQP under both unimodal and multimodal policy representations. By embedding this stochastic policy into the SQP framework, the resulting MESQP algorithm creates a scheme of alternating optimization and sampling steps, yielding the ability to overcome local minima. To evaluate the method’s efficacy, this framework is compared experimentally with regular SQP and Maximum Entropy Differential Dynamic Programming (MEDDP) across several TO tasks.
Committee
• Prof. Evangelos Theodorou – School of Aerospace Engineering (advisor)
• Prof. Kyriakos Vamvoudakis – School of Aerospace Engineering
• Prof. Lu Gan – School of Aerospace Engineering