Variable time support in create_objective_function. #205
moorepants wants to merge 1 commit into csu-hmc:master
Conversation
Does this mean that create_objective_function will soon be able to handle variable h? :-)
Only if I can figure it out.
I feel like you know it already! It would surely avoid the ugly expressions like the ones in my ball/disc simulation.
Just to make sure I understand it correctly:
A necessary condition for a minimum is that $\nabla_{(x,\, u,\, t_f)} J = 0$.
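To make the extra term explicit (my own notation, not necessarily what opty uses internally): if the discretized cost is $J = h \sum_i f_i^2$ and the node spacing $h$ is itself a free variable, then the gradient components the solver needs include

$$
\frac{\partial J}{\partial f_i} = 2 h f_i, \qquad
\frac{\partial J}{\partial h} = \sum_i f_i^2 ,
$$

i.e. the free time variable contributes its own nonzero entry to the gradient.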
Yes, I was starting to implement your suggestion, but I think it was missing taking the derivative with respect to h.
I would expect that you can just add it to the objective_grad computation, i.e. to the list of symbols w.r.t. which the Jacobian is computed. However, I don't remember testing the solution in depth, but it is nice to see that you have already written some tests.
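A minimal sketch of that idea with SymPy (the symbols and the tiny two-node objective are made up for illustration; they are not opty's internals):

```python
import sympy as sp

# Illustrative only: a two-node discretized objective J = h*(f1**2 + f2**2),
# where h is the free node spacing.
h, f1, f2 = sp.symbols('h f1 f2')
J = h*(f1**2 + f2**2)

# Differentiate w.r.t. the free variables *including* h so the dJ/dh
# term is not dropped from the gradient.
free_syms = (f1, f2, h)
grad = [sp.diff(J, s) for s in free_syms]
print(grad)  # [2*f1*h, 2*f2*h, f1**2 + f2**2]
```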
```python
def expected_obj(free):
    # The specified inputs occupy free[2*self.N:-1]; the node spacing h is free[-1].
    f = free[2*self.N:-1]
    return free[-1]*np.sum(f**2)
```

```python
def expected_obj_grad(free):
    f = free[2*self.N:-1]
    grad = np.zeros_like(free)
    # d(obj)/d(f_i) = 2*h*f_i for each input value and d(obj)/d(h) = sum(f**2).
    grad[2*self.N:-1] = 2.0*free[-1]*free[2*self.N:-1]
    grad[-1] = np.sum(f**2)
    return grad
```
You should take into account that this is backward Euler, so the first term falls out; see `test_backward_single_input`.
If `free` is `[x(t), v(t), f1(t), f2(t), c, k, m, h]`, then the objective should be `(f1_vals[1:]**2 + f2_vals[1:]**2).sum() * h_val`.
Similarly, the gradient should be a stack of `zeros(2*N + 1)`, `2*h_val*f1_vals[1:]`, `[0]`, `2*h_val*f2_vals[1:]`, `[0, 0, 0, (f1_vals[1:]**2 + f2_vals[1:]**2).sum()]`.
P.S. I quickly wrote out these expressions on my phone, so I would advise checking them.
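Writing that out as a sketch (my reading of the layout above, so please double check the slicing):

```python
import numpy as np

def expected_obj(free, N):
    # free = [x (N), v (N), f1 (N), f2 (N), c, k, m, h]
    f1_vals = free[2*N:3*N]
    f2_vals = free[3*N:4*N]
    h_val = free[-1]
    # Backward Euler: the first node does not contribute.
    return (f1_vals[1:]**2 + f2_vals[1:]**2).sum()*h_val

def expected_obj_grad(free, N):
    f1_vals = free[2*N:3*N]
    f2_vals = free[3*N:4*N]
    h_val = free[-1]
    return np.hstack([np.zeros(2*N + 1),        # x, v, and the first f1 node
                      2.0*h_val*f1_vals[1:],
                      [0.0],                    # the first f2 node
                      2.0*h_val*f2_vals[1:],
                      [0.0, 0.0, 0.0],          # c, k, m
                      [(f1_vals[1:]**2 + f2_vals[1:]**2).sum()]])  # dJ/dh
```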
I don't think I assumed any specific integration routine in the manually created objective functions.
> I don't think I assumed any specific integration routine in the manually created objective functions.
I have a basic question:
In the current create_objective_function(...) there is a distinction in how obj and obj_grad are formed depending on the integration method.
But, for example, in the examples-gallery simulation plot_pendulum_swing_up_variable_duration, the gradient is formed the way I would naively expect it to be formed.
Can the gradient always be formed as in plot_pendulum_swing_up_variable_duration, or is that only valid for midpoint Euler, or is it a (good) approximation, if I understood #30 and #31 correctly?
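To illustrate what I mean by the distinction, here is my own minimal sketch of three possible quadratures for a cost like $\int f^2 \, dt$ (not opty's actual code):

```python
import numpy as np

def obj_naive(f_vals, h_val):
    # Plain rectangle sum over all nodes, the way I would write it "naively".
    return h_val*np.sum(f_vals**2)

def obj_backward_euler(f_vals, h_val):
    # Backward Euler quadrature: the first node drops out.
    return h_val*np.sum(f_vals[1:]**2)

def obj_midpoint(f_vals, h_val):
    # Midpoint quadrature: average adjacent nodes before squaring.
    return h_val*np.sum(((f_vals[:-1] + f_vals[1:])/2.0)**2)
```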
Thanks for any explanations!
The examples are a little loose, but it probably does not matter too much in the objective calculation because both integration methods have about the same minima.
Thanks!
Does this mean one could calculate obj and obj_grad 'naively' without committing a large error?
The gradient of the objective has to be the valid gradient of that objective, within some numerical tolerance. But your choice of how to approximate the integral in the objective does not really matter much, as there are numerous valid integration methods.
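A generic way to convince yourself (not specific to opty): compare the hand-coded gradient against a central finite difference of the same objective.

```python
import numpy as np

def check_gradient(obj, obj_grad, free, eps=1e-6, atol=1e-6):
    """Compare an analytic gradient with a central finite difference."""
    grad_fd = np.zeros(len(free))
    for i in range(len(free)):
        up, down = free.copy(), free.copy()
        up[i] += eps
        down[i] -= eps
        grad_fd[i] = (obj(up) - obj(down))/(2.0*eps)
    return np.allclose(obj_grad(free), grad_fd, atol=atol)
```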
Thanks!
So the gradient as calculated in the plot_pendulum_swing_up_variable_duration simulation surely is a valid gradient, then?