Artificial Intelligence And Data Science  ·  PDF Available  ·  Non-peer-reviewed Preprint

Dynamical System Optimization

Emo Todorov  ·  Published 2025-06-10

Abstract

We develop an optimization framework centered around a core idea: once a (parametric) policy is specified, control authority is transferred to the policy, resulting in an autonomous dynamical system. Thus we should be able to optimize policy parameters without further reference to controls or actions, and without directly using the machinery of approximate Dynamic Programming and Reinforcement Learning. Here we derive simpler algorithms at the autonomous system level, and show that they compute the same quantities as policy gradients and Hessians, natural gradients, and proximal methods. Analogs to approximate policy iteration and off-policy learning are also available. Since policy parameters and other system parameters are treated uniformly, the same algorithms apply to behavioral cloning, mechanism design, system identification, and learning of state estimators. Tuning of generative AI models is not only possible, but is conceptually closer to the present framework than to Reinforcement Learning.
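
The core idea in the abstract can be illustrated with a toy example. The sketch below is not the paper's algorithm; it assumes a hypothetical scalar linear system x' = a*x + b*u with policy u = -theta*x, substitutes the policy to obtain the autonomous system x' = (a - b*theta)*x, and differentiates the accumulated cost with respect to theta by standard forward sensitivity propagation. All names and constants (closed_loop_step, step_cost, a, b, r, the horizon, the step size) are illustrative assumptions.

```python
def closed_loop_step(x, theta, a=0.9, b=0.5):
    """Autonomous dynamics F(x, theta): the policy u = -theta * x is already substituted."""
    return (a - b * theta) * x


def step_cost(x, theta, r=0.1):
    """Running cost x^2 + r*u^2, written only in terms of state and parameter."""
    return (1.0 + r * theta * theta) * x * x


def cost_and_grad(theta, x0=1.0, horizon=20, a=0.9, b=0.5, r=0.1):
    """Total cost over the horizon and its exact derivative with respect to theta."""
    x, s = x0, 0.0  # s tracks the sensitivity dx_t/dtheta
    cost, grad = 0.0, 0.0
    for _ in range(horizon):
        cost += step_cost(x, theta, r)
        # Total derivative of the running cost: (dl/dx) * s + dl/dtheta
        grad += 2.0 * (1.0 + r * theta * theta) * x * s + 2.0 * r * theta * x * x
        # Sensitivity recursion: s' = (dF/dx) * s + dF/dtheta, using the current x
        s = (a - b * theta) * s - b * x
        x = closed_loop_step(x, theta, a, b)
    return cost, grad


def descend(theta, lr=0.002, iters=50):
    """Plain gradient descent on theta (assumed small step size for this toy problem)."""
    for _ in range(iters):
        _, g = cost_and_grad(theta)
        theta -= lr * g
    return theta


if __name__ == "__main__":
    theta = descend(0.0)
    print(theta, cost_and_grad(theta)[0])
```

Note that after the substitution, neither the cost nor the gradient computation refers to the control u; theta is treated like any other parameter of the dynamical system, which is the uniform treatment the abstract describes.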


