aligator
0.10.0
A primal-dual augmented Lagrangian-type solver for nonlinear trajectory optimization.
|
A proximal, augmented Lagrangian-type solver for trajectory optimization. More...
#include <aligator/solvers/proxddp/solver-proxddp.hpp>
Classes | |
struct | AlmParams |
struct | LinesearchVariant |
Public Types | |
using | Scalar = _Scalar |
using | Problem = TrajOptProblemTpl<Scalar> |
using | Workspace = WorkspaceTpl<Scalar> |
using | Results = ResultsTpl<Scalar> |
using | StageFunctionData = StageFunctionDataTpl<Scalar> |
using | DynamicsData = DynamicsDataTpl<Scalar> |
using | CostData = CostDataAbstractTpl<Scalar> |
using | StageModel = StageModelTpl<Scalar> |
using | StageData = StageDataTpl<Scalar> |
using | CallbackPtr = shared_ptr<CallbackBaseTpl<Scalar>> |
using | CallbackMap = boost::unordered_map<std::string, CallbackPtr> |
using | ConstraintStack = ConstraintStackTpl<Scalar> |
using | CstrSet = ConstraintSetTpl<Scalar> |
using | TrajOptData = TrajOptDataTpl<Scalar> |
using | LinesearchOptions = typename Linesearch<Scalar>::Options |
Public Member Functions | |
ALIGATOR_DYNAMIC_TYPEDEFS (Scalar) | |
SolverProxDDPTpl (const Scalar tol=1e-6, const Scalar mu_init=0.01, const std::size_t max_iters=1000, VerboseLevel verbose=VerboseLevel::QUIET, StepAcceptanceStrategy sa_strategy=StepAcceptanceStrategy::LINESEARCH_NONMONOTONE, HessianApprox hess_approx=HessianApprox::GAUSS_NEWTON) | |
std::size_t | getNumThreads () const |
void | setNumThreads (const std::size_t num_threads) |
Scalar | getDualTolerance () const |
void | setDualTolerance (const Scalar tol) |
Manually set desired dual feasibility tolerance. | |
Scalar | tryLinearStep (const Problem &problem, const Scalar alpha) |
Try a step of size \(\alpha\). | |
Scalar | tryNonlinearRollout (const Problem &problem, const Scalar alpha) |
Policy rollout using the full nonlinear dynamics. The feedback gains need to be computed first. This will evaluate all the terms in the problem into the problem data, similar to TrajOptProblemTpl::evaluate(). | |
Scalar | forwardPass (const Problem &problem, const Scalar alpha) |
void | updateLQSubproblem () |
void | setup (const Problem &problem) |
Allocate new workspace and results instances according to the specifications of problem . | |
void | cycleProblem (const Problem &problem, shared_ptr< StageDataTpl< Scalar > > data) |
bool | run (const Problem &problem, const std::vector< VectorXs > &xs_init={}, const std::vector< VectorXs > &us_init={}, const std::vector< VectorXs > &vs_init={}, const std::vector< VectorXs > &lams_init={}) |
Run the numerical solver. | |
bool | innerLoop (const Problem &problem) |
Perform the inner loop of the algorithm (augmented Lagrangian minimization). | |
void | computeInfeasibilities (const Problem &problem) |
Compute the primal infeasibility measures. | |
void | computeCriterion () |
Compute stationarity criterion (dual infeasibility). | |
bool | computeMultipliers (const Problem &problem, const std::vector< VectorXs > &lams, const std::vector< VectorXs > &vs) |
ALIGATOR_INLINE Scalar | mudyn () const |
ALIGATOR_INLINE Scalar | mu () const |
ALIGATOR_INLINE Scalar | mu_inv () const |
void | updateGains () |
Update primal-dual feedback gains (control, costate, path multiplier) | |
callbacks | |
void | registerCallback (const std::string &name, CallbackPtr cb) |
Add a callback to the solver instance. | |
void | clearCallbacks () noexcept |
Remove all callbacks from the instance. | |
const CallbackMap & | getCallbacks () const |
void | removeCallback (const std::string &name) |
auto | getCallbackNames () const |
CallbackPtr | getCallback (const std::string &name) const |
void | invokeCallbacks (Workspace &workspace, Results &results) |
Invoke callbacks. | |
Protected Member Functions | |
void | updateTolsOnFailure () noexcept |
void | updateTolsOnSuccess () noexcept |
ALIGATOR_INLINE void | setAlmPenalty (Scalar new_mu) noexcept |
Set dual proximal/ALM penalty parameter. | |
void | initializeRegularization () noexcept |
void | increaseRegularization () noexcept |
A proximal, augmented Lagrangian-type solver for trajectory optimization.
This class implements the Proximal Differential Dynamic Programming algorithm, a variant of the augmented Lagrangian method for trajectory optimization. The paper "PROXDDP: Proximal Constrained Trajectory Optimization" by Jallet et al (2023) is the reference [1] for this implementation.
Definition at line 36 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::Scalar = _Scalar |
Definition at line 38 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::Problem = TrajOptProblemTpl<Scalar> |
Definition at line 40 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::Workspace = WorkspaceTpl<Scalar> |
Definition at line 41 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::Results = ResultsTpl<Scalar> |
Definition at line 42 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::StageFunctionData = StageFunctionDataTpl<Scalar> |
Definition at line 43 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::DynamicsData = DynamicsDataTpl<Scalar> |
Definition at line 44 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::CostData = CostDataAbstractTpl<Scalar> |
Definition at line 45 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::StageModel = StageModelTpl<Scalar> |
Definition at line 46 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::StageData = StageDataTpl<Scalar> |
Definition at line 47 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::CallbackPtr = shared_ptr<CallbackBaseTpl<Scalar>> |
Definition at line 48 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::CallbackMap = boost::unordered_map<std::string, CallbackPtr> |
Definition at line 49 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::ConstraintStack = ConstraintStackTpl<Scalar> |
Definition at line 50 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::CstrSet = ConstraintSetTpl<Scalar> |
Definition at line 51 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::TrajOptData = TrajOptDataTpl<Scalar> |
Definition at line 52 of file solver-proxddp.hpp.
using aligator::SolverProxDDPTpl< _Scalar >::LinesearchOptions = typename Linesearch<Scalar>::Options |
Definition at line 53 of file solver-proxddp.hpp.
aligator::SolverProxDDPTpl< _Scalar >::SolverProxDDPTpl | ( | const Scalar | tol = 1e-6, |
const Scalar | mu_init = 0.01, | ||
const std::size_t | max_iters = 1000, | ||
VerboseLevel | verbose = VerboseLevel::QUIET, | ||
StepAcceptanceStrategy | sa_strategy = StepAcceptanceStrategy::LINESEARCH_NONMONOTONE, | ||
HessianApprox | hess_approx = HessianApprox::GAUSS_NEWTON ) |
aligator::SolverProxDDPTpl< _Scalar >::ALIGATOR_DYNAMIC_TYPEDEFS | ( | Scalar | ) |
|
inline |
Definition at line 217 of file solver-proxddp.hpp.
void aligator::SolverProxDDPTpl< _Scalar >::setNumThreads | ( | const std::size_t | num_threads | ) |
|
inline |
Definition at line 220 of file solver-proxddp.hpp.
|
inline |
Manually set desired dual feasibility tolerance.
Definition at line 222 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::tryLinearStep | ( | const Problem & | problem, |
const Scalar | alpha ) |
Try a step of size \(\alpha\).
Scalar aligator::SolverProxDDPTpl< _Scalar >::tryNonlinearRollout | ( | const Problem & | problem, |
const Scalar | alpha ) |
Policy rollout using the full nonlinear dynamics. The feedback gains need to be computed first. This will evaluate all the terms in the problem into the problem data, similar to TrajOptProblemTpl::evaluate().
Scalar aligator::SolverProxDDPTpl< _Scalar >::forwardPass | ( | const Problem & | problem, |
const Scalar | alpha ) |
void aligator::SolverProxDDPTpl< _Scalar >::updateLQSubproblem | ( | ) |
void aligator::SolverProxDDPTpl< _Scalar >::setup | ( | const Problem & | problem | ) |
Allocate new workspace and results instances according to the specifications of problem
.
problem | The problem instance with respect to which memory will be allocated. |
void aligator::SolverProxDDPTpl< _Scalar >::cycleProblem | ( | const Problem & | problem, |
shared_ptr< StageDataTpl< Scalar > > | data ) |
bool aligator::SolverProxDDPTpl< _Scalar >::run | ( | const Problem & | problem, |
const std::vector< VectorXs > & | xs_init = {}, | ||
const std::vector< VectorXs > & | us_init = {}, | ||
const std::vector< VectorXs > & | vs_init = {}, | ||
const std::vector< VectorXs > & | lams_init = {} ) |
Run the numerical solver.
problem | The trajectory optimization problem to solve. |
xs_init | Initial trajectory guess. |
us_init | Initial control sequence guess. |
vs_init | Initial path multiplier guess. |
lams_init | Initial co-state guess. |
bool aligator::SolverProxDDPTpl< _Scalar >::innerLoop | ( | const Problem & | problem | ) |
Perform the inner loop of the algorithm (augmented Lagrangian minimization).
void aligator::SolverProxDDPTpl< _Scalar >::computeInfeasibilities | ( | const Problem & | problem | ) |
Compute the primal infeasibility measures.
void aligator::SolverProxDDPTpl< _Scalar >::computeCriterion | ( | ) |
Compute stationarity criterion (dual infeasibility).
void aligator::SolverProxDDPTpl< _Scalar >::registerCallback | ( | const std::string & | name, |
CallbackPtr | cb ) |
Add a callback to the solver instance.
|
inlinenoexcept |
Remove all callbacks from the instance.
Definition at line 284 of file solver-proxddp.hpp.
|
inline |
Definition at line 286 of file solver-proxddp.hpp.
|
inline |
Definition at line 287 of file solver-proxddp.hpp.
|
inline |
Definition at line 288 of file solver-proxddp.hpp.
|
inline |
Definition at line 296 of file solver-proxddp.hpp.
|
inline |
Invoke callbacks.
Definition at line 305 of file solver-proxddp.hpp.
bool aligator::SolverProxDDPTpl< _Scalar >::computeMultipliers | ( | const Problem & | problem, |
const std::vector< VectorXs > & | lams, | ||
const std::vector< VectorXs > & | vs ) |
Compute the merit function and stopping criterion dual terms: first-order Lagrange multiplier estimates, shifted and projected constraints.
|
inline |
Definition at line 320 of file solver-proxddp.hpp.
|
inline |
Definition at line 323 of file solver-proxddp.hpp.
|
inline |
Definition at line 324 of file solver-proxddp.hpp.
|
inline |
Update primal-dual feedback gains (control, costate, path multiplier)
|
inlineprotectednoexcept |
Definition at line 331 of file solver-proxddp.hpp.
|
inlineprotectednoexcept |
Definition at line 337 of file solver-proxddp.hpp.
|
inlineprotectednoexcept |
Set dual proximal/ALM penalty parameter.
Definition at line 344 of file solver-proxddp.hpp.
|
inlineprotectednoexcept |
Definition at line 351 of file solver-proxddp.hpp.
|
inlineprotectednoexcept |
Definition at line 361 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::inner_tol_ |
Subproblem tolerance.
Definition at line 120 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::prim_tol_ |
Desired primal feasibility (for each outer loop)
Definition at line 122 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::target_tol_ = 1e-6 |
Solver tolerance \(\epsilon > 0\). When sync_dual_tol is false, this will be the desired primal feasibility, where the dual feasibility tolerance is controlled by SolverProxDDPTpl::target_tol_dual.
Definition at line 126 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::mu_init = 0.01 |
Definition at line 137 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::reg_min = 1e-10 |
Definition at line 141 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::reg_max = 1e9 |
Definition at line 142 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::reg_init = 1e-9 |
Definition at line 143 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::reg_inc_k_ = 10. |
Definition at line 144 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::reg_inc_first_k_ = 100. |
Definition at line 145 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::reg_dec_k_ = 1. / 3. |
Definition at line 146 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::preg_ = reg_init |
Definition at line 147 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::preg_last_ = 0. |
Definition at line 148 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::inner_tol0 = 1. |
Definition at line 152 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::prim_tol0 = 1. |
Definition at line 153 of file solver-proxddp.hpp.
Logger aligator::SolverProxDDPTpl< _Scalar >::logger {} |
Definition at line 156 of file solver-proxddp.hpp.
VerboseLevel aligator::SolverProxDDPTpl< _Scalar >::verbose_ |
Solver verbosity level.
Definition at line 159 of file solver-proxddp.hpp.
LQSolverChoice aligator::SolverProxDDPTpl< _Scalar >::linear_solver_choice = LQSolverChoice::SERIAL |
Choice of linear solver.
Definition at line 161 of file solver-proxddp.hpp.
HessianApprox aligator::SolverProxDDPTpl< _Scalar >::hess_approx_ = HessianApprox::GAUSS_NEWTON |
Type of Hessian approximation. Default is Gauss-Newton.
Definition at line 163 of file solver-proxddp.hpp.
LinesearchOptions aligator::SolverProxDDPTpl< _Scalar >::ls_params |
Linesearch options, as in proxsuite-nlp.
Definition at line 165 of file solver-proxddp.hpp.
MultiplierUpdateMode aligator::SolverProxDDPTpl< _Scalar >::multiplier_update_mode = MultiplierUpdateMode::NEWTON |
Type of Lagrange multiplier update.
Definition at line 167 of file solver-proxddp.hpp.
LinesearchMode aligator::SolverProxDDPTpl< _Scalar >::ls_mode = LinesearchMode::PRIMAL |
Linesearch mode.
Definition at line 169 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::dual_weight = 1.0 |
Weight of the dual variables in the primal-dual linesearch.
Definition at line 171 of file solver-proxddp.hpp.
RolloutType aligator::SolverProxDDPTpl< _Scalar >::rollout_type_ = RolloutType::NONLINEAR |
Type of rollout for the forward pass.
Definition at line 173 of file solver-proxddp.hpp.
AlmParams aligator::SolverProxDDPTpl< _Scalar >::bcl_params |
Parameters for the BCL outer loop of the augmented Lagrangian algorithm.
Definition at line 175 of file solver-proxddp.hpp.
StepAcceptanceStrategy aligator::SolverProxDDPTpl< _Scalar >::sa_strategy_ |
Step acceptance mode.
Definition at line 177 of file solver-proxddp.hpp.
bool aligator::SolverProxDDPTpl< _Scalar >::force_initial_condition_ = true |
Force the initial state \( x_0 \) to be fixed to the problem initial condition.
Definition at line 181 of file solver-proxddp.hpp.
std::size_t aligator::SolverProxDDPTpl< _Scalar >::max_refinement_steps_ |
Definition at line 183 of file solver-proxddp.hpp.
Scalar aligator::SolverProxDDPTpl< _Scalar >::refinement_threshold_ = 1e-13 |
Definition at line 185 of file solver-proxddp.hpp.
std::size_t aligator::SolverProxDDPTpl< _Scalar >::max_iters |
Definition at line 186 of file solver-proxddp.hpp.
std::size_t aligator::SolverProxDDPTpl< _Scalar >::max_al_iters = 100 |
Definition at line 187 of file solver-proxddp.hpp.
uint aligator::SolverProxDDPTpl< _Scalar >::rollout_max_iters |
Definition at line 188 of file solver-proxddp.hpp.
Workspace aligator::SolverProxDDPTpl< _Scalar >::workspace_ |
Definition at line 190 of file solver-proxddp.hpp.
Results aligator::SolverProxDDPTpl< _Scalar >::results_ |
Definition at line 191 of file solver-proxddp.hpp.
std::unique_ptr<gar::RiccatiSolverBase<Scalar> > aligator::SolverProxDDPTpl< _Scalar >::linearSolver_ |
LQR subproblem solver.
Definition at line 193 of file solver-proxddp.hpp.
FilterTpl<Scalar> aligator::SolverProxDDPTpl< _Scalar >::filter_ |
Definition at line 194 of file solver-proxddp.hpp.
LinesearchVariant aligator::SolverProxDDPTpl< _Scalar >::linesearch_ |
Linesearch function.
Definition at line 196 of file solver-proxddp.hpp.