<div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">On Thu, Aug 18, 2016 at 8:55 AM, Munson, Todd <span dir="ltr"><<a href="mailto:tmunson@mcs.anl.gov" target="_blank">tmunson@mcs.anl.gov</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><br>
Just to add some more thoughts to this argument. Currently TAO uses a<br>
model where we have methods to get the gradient and Hessian of the<br>
objective function.<br>
<br>
For quadratic problems we really want the Hessian matrix H and the<br>
vector c so that the objective function is x'*H*x + c'*x.<br>
<br>
Because we do not have any specialized code for quadratic problems,<br>
we end up using the gradient of the objective and the Hessian<br>
matrix to back out the constant c.<br>
<br>
c = grad - Hx<br>
<br>
where grad = Hx + c.<br>
<br>
This relies on cancellation of Hx in order to get an accurate value<br>
for the vector c, which can be a problem, especially if we start<br>
far from a solution.<br></blockquote><div><br></div><div>What about c = grad 0?</div><div><br></div><div> Matt</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
You may have the same problem with linear ODE if you are relying<br>
on a residual and jacobian evaluation pair to back out the<br>
constant right-hand side vector.<br>
<br>
Its probably too complicated for users, but what one might like is<br>
to have the user separate the objective function, for example, and<br>
provide, c, H, and a nonlinear part f. Then I can assemble the<br>
objective function internally as<br>
<br>
c'*x + x'*H*x + f(x)<br>
<br>
while retaining correct values for c and H when I need them.<br>
<br>
The user could continue as always by putting everything into f(x),<br>
but then they would have to use a nonlinear solver.<br>
<br>
I'd do the same for constraints:<br>
<br>
b + A*x + F(x)<br>
<br>
This also allows the methods to do internal type checking to ensure<br>
the problem passed into the methods has the correct signature.<br>
E.g. for quadratic programs the nonlinear functions are NULL.<br>
<span class="HOEnZb"><font color="#888888"><br>
Todd.<br>
</font></span><div class="HOEnZb"><div class="h5"><br>
> On Aug 16, 2016, at 7:03 PM, Barry Smith <<a href="mailto:bsmith@mcs.anl.gov">bsmith@mcs.anl.gov</a>> wrote:<br>
><br>
><br>
>> On Aug 16, 2016, at 6:09 PM, Munson, Todd <<a href="mailto:tmunson@mcs.anl.gov">tmunson@mcs.anl.gov</a>> wrote:<br>
>><br>
>><br>
>> Is there a reason to put flags into PETSc for linear or nonlinear equations? In PETSc<br>
>> there is a natural difference between linear (KSP) and nonlinear (SNES) that is<br>
>> reflected in the objects.<br>
><br>
> The flags are for TS ODE solvers (not KSP or SNES); they can come into play for two reasons<br>
><br>
> 1) whether Jacobian entries can be changed inside the solver or not (for nonlinear ODES they can usually be changed since the user provides new values at each Jacobian update; for linear the user won't normally be putting in the same values again so if the solver changes the values it has to have some way to recover the values; often but not always the changes are only on the diagonal). Note that this reason is JUST a question of optimizing memory and code (do you make a copy of matrices or not?), not correctness of results.<br>
><br>
> 2) There may also be specialized ODE integrator only worked for a linear problem. (I don't know if we have any of those implemented or not). This is not just a question of optimizing the solver, it is a question of correctness. If I try to solve a nonlinear problem with a linear only solver I will get a wrong or meaningless answer.<br>
><br>
> Similarly I think for TAO both reasons for wanting to know properties of the problem exist.<br>
>><br>
>> Any thoughts on splitting up the TAO methods in a similar way and distinguish at<br>
>> least between between quadratic and general nonlinear problems, rather than<br>
>> putting in flags. This would be consistent with the rest of PETSc.<br>
>><br>
>> For example, instead of having a single "bound" directory, having something<br>
>> like a "bco" directory for generic nonlinear bound-constrained optimization<br>
>> and a "bqp" directory for bound constrained quadratic programs.<br>
>><br>
>> I'd do the same for "unconstrained" into "uco" and "uqp".<br>
>><br>
>> General constraints then become more cumbersome with too many combinations.<br>
>> Could probably get by with generic "lp", "qp", and "nlp" though.<br>
><br>
> If you did this for the TAO user API then one could argue it should be done for TS also. But I don't think it should be done for TS.<br>
><br>
> Two reasons for not having a separate API for linear and nonlinear problems is<br>
><br>
> 1) I may want to use a "nonlinear" ODE integrator (that is an integrator that works for nonlinear problems also) on my linear problem and I shouldn't need to have ifs if my code to switch to use it. That is I don't want ugly code like<br>
><br>
> if (using nonlinear integrator) {<br>
> TSSetIJacobian(..)<br>
> } else {<br>
> TSSetConstantMatrix(...)<br>
> }<br>
> Now I could also have my "nonlinear integrators" support the TSSetConstantMatrix() API but that leads to more complex implementations that are more work to maintain. Why not just have something like<br>
> TSSetIJacobian(ts, J,TSConstantJacobianFunction()<wbr>) when you are doing a linear ODE?<br>
><br>
> 2) I may spend 3 months writing my linear ODE problem using the linear interface and then realize I have to add a nonlinear term, now I have to change my app use the nonlinear interface of TS. This is why Matt argues we should just publicly support the SNES api even for linear problems.<br>
><br>
> Dealing with changing or not changing matrices has another issue we have not dealt with properly in PETSc (or TAO). When the matrix entries are set. For linear ODE problems we kind of assume the matrix entries are set by the user before they ever call the TS code (and many users think this is a natural way to proceed), so essentially they compute the J matrix up front and then pass it to the TS which uses it. For nonlinear problems, of course, we cannot set the matrix entries up front, they MUST be set in the callback that TS calls to compute the matrix entries. But if the matrix structure is provided internally by a DM then the user does not have direct access to the matrix before calling TSSolve() so even for linear problems the code logic more naturally wants the (constant) Jacobian to be computed at the first timestep computation (and never again) rather than up front. Using the Matt Knepley consistency/simplicity argument we should just always use the callback approach and not even "support" the "linear only" approach. Then comes the nasty linear "flag" to indicate 1) if the user intends to provide the matrix only once and 2) to indicate the problem has special (linear) structure that only certain solvers will work correctly on.<br>
><br>
> I'm rambling a bit but I need to in order to make clear these kind of design decisions are not straightforward.<br>
><br>
> In terms of organizing the implementation code and the names (TaoType) of the methods I think spitting them into categories makes good sense. But never with your horrible abbreviations that no one but you would be able to remember! We are not using Fortran IV with 6 char limits.<br>
><br>
> Before doing anything let's try to generate a table that includes all (most of?) the cases.<br>
><br>
> The function being optimized is linear, quadratic, or general (another case is separable).<br>
><br>
> The equality constraints are linear?, quadratic? general? or something else?<br>
><br>
> The inequality constraints are simple bounds? linear? quadratic? general? something else?<br>
><br>
> Is there a fourth (or even more) axis of problem properties?<br>
><br>
> Is this level of categories enough to organize the directories and naming of TAO solvers?<br>
><br>
> Barry<br>
><br>
><br>
><br>
>><br>
>> The methods would then explicitly mention the type of optimization problem<br>
>> with, for example, TAO_UCO_LMVM, TAO_BCO_TRON, and TAO_BQP_GPCG.<br>
>><br>
>> Thoughts?<br>
>><br>
>> Todd.<br>
>><br>
>><br>
>>> On Aug 16, 2016, at 4:32 PM, Oxberry, Geoffrey Malcolm <<a href="mailto:oxberry1@llnl.gov">oxberry1@llnl.gov</a>> wrote:<br>
>>><br>
>>> Definitely flags for linear problems would be helpful for TAO. Once there is an example up, I'd be happy to add that to the SQPTR pull request.<br>
>>><br>
>>> ______________________________<wbr>__________<br>
>>> From: <a href="mailto:petsc-dev-bounces@mcs.anl.gov">petsc-dev-bounces@mcs.anl.gov</a> [<a href="mailto:petsc-dev-bounces@mcs.anl.gov">petsc-dev-bounces@mcs.anl.gov</a><wbr>] on behalf of Barry Smith [<a href="mailto:bsmith@mcs.anl.gov">bsmith@mcs.anl.gov</a>]<br>
>>> Sent: Monday, August 15, 2016 6:18 PM<br>
>>> To: Munson, Todd<br>
>>> Cc: petsc-dev<br>
>>> Subject: Re: [petsc-dev] coding style<br>
>>><br>
>>>> On Aug 15, 2016, at 8:08 PM, Munson, Todd <<a href="mailto:tmunson@mcs.anl.gov">tmunson@mcs.anl.gov</a>> wrote:<br>
>>>><br>
>>>><br>
>>>> Since we are only doing diagonal modifications to the matrix in<br>
>>>> my case, should I simply create a matnest in my part of the<br>
>>>> code and apply the diagonal modification in the matnest<br>
>>>> without directly modifying any of the matrices?<br>
>>>> That might be cleaner for the users.<br>
>>><br>
>>> It will be dependent on the solver how much of the matrix you need retain the parts you have modified.<br>
>>><br>
>>> MatNest isn't the right thing here. I think you can do MatGetDiagonal() to keep the valid diagonal entries then MatDiagonalSet() to put back the valid entries.<br>
>>><br>
>>> Barry<br>
>>><br>
>>><br>
>>>><br>
>>>> Todd.<br>
>>>><br>
>>>>> On Aug 15, 2016, at 7:55 PM, Barry Smith <<a href="mailto:bsmith@mcs.anl.gov">bsmith@mcs.anl.gov</a>> wrote:<br>
>>>>><br>
>>>>><br>
>>>>>> On Aug 15, 2016, at 7:42 PM, Munson, Todd <<a href="mailto:tmunson@mcs.anl.gov">tmunson@mcs.anl.gov</a>> wrote:<br>
>>>>>><br>
>>>>>><br>
>>>>>> Got it. I will get all the code fragments in the developer doc that<br>
>>>>>> are not meant to be deliberately wrong fixed tomorrow; at least<br>
>>>>>> that code will be compliant... :)<br>
>>>>>><br>
>>>>>> Then its onto fixing bqpip.<br>
>>>>>><br>
>>>>>> Was there a final say on the Hessian/Jacobian matrix when they are<br>
>>>>>> modified? Were we adding a "dirty" bit or was I changing my code<br>
>>>>>> to not modify those matrices?<br>
>>>>><br>
>>>>> Changing email to petsc-dev since this is a general PETSc/TAO issue.<br>
>>>>><br>
>>>>> This is why we don't wait months to fix something :-) I have completely forgot the context you are asking about.<br>
>>>>><br>
>>>>> I think the general issue comes up when a problem is linear and the user just wants to set and forget the matrices while when the problem is nonlinear the user needs to reset values into the matrix anyways so it is fine for you to change the matrix internally. Unless there is a way for the user to explicitly indicate the system is linear your code cannot know if it needs to make a copy of the matrix.<br>
>>>>><br>
>>>>> In TS we have typedef enum {TS_LINEAR,TS_NONLINEAR} TSProblemType; and TSSetProblemType() that can be used to indicate if the matrices can be changed or not (though I don't think we use it in this way, we have an open issue <a href="https://bitbucket.org/petsc/petsc/issues/135/memory-optimization-for-ts" rel="noreferrer" target="_blank">https://bitbucket.org/petsc/<wbr>petsc/issues/135/memory-<wbr>optimization-for-ts</a><br>
>>>>><br>
>>>>> I think you should start by having a flag that indicates if it is safe to modify the matrices associated with the TAO object and if not safe keep a backup copy.<br>
>>>>><br>
>>>>> Barry<br>
>>>>><br>
>>>>><br>
>>>>><br>
>>>>>><br>
>>>>>> Todd.<br>
>>>>>><br>
>>>>>>> On Aug 15, 2016, at 6:09 PM, Barry Smith <<a href="mailto:bsmith@mcs.anl.gov">bsmith@mcs.anl.gov</a>> wrote:<br>
>>>>>>><br>
>>>>>>><br>
>>>>>>> Todd,<br>
>>>>>>><br>
>>>>>>>> On Aug 15, 2016, at 10:26 AM, Satish Balay <<a href="mailto:balay@mcs.anl.gov">balay@mcs.anl.gov</a>> wrote:<br>
>>>>>>>><br>
>>>>>>>> Todd,<br>
>>>>>>>><br>
>>>>>>>> Forwarding this to Barry.<br>
>>>>>>>><br>
>>>>>>>> Satish<br>
>>>>>>>><br>
>>>>>>>> On Mon, 15 Aug 2016, Munson, Todd wrote:<br>
>>>>>>>><br>
>>>>>>>>><br>
>>>>>>>>> I'm going through the developer documentation. There seems to be<br>
>>>>>>>>> a few things missing from the coding style.<br>
>>>>>>>>><br>
>>>>>>>>> For example, in function prototypes, is it:<br>
>>>>>>>>><br>
>>>>>>>>> PetscErrorCode MyFunction(char *,const int *,int);<br>
>>>>>>>>><br>
>>>>>>>>> or<br>
>>>>>>>>><br>
>>>>>>>>> PetscErrorCode MyFunction(char*,const int*,int)<br>
>>>>>>><br>
>>>>>>> The second one because the preference is to not have unnecessary space.<br>
>>>>>>><br>
>>>>>>>>><br>
>>>>>>>>><br>
>>>>>>>>> Same thing in the definitions:<br>
>>>>>>>>><br>
>>>>>>>>> PetscErrorCode MyFunction(char *c,const int *i,int j)<br>
>>>>>>>>> {<br>
>>>>>>>>> }<br>
>>>>>>>>><br>
>>>>>>>>> or<br>
>>>>>>>>><br>
>>>>>>>>> PetscErrorCode MyFunction(char* c,const int* i,int j)<br>
>>>>>>><br>
>>>>>>> The * should always be attached to the variable.<br>
>>>>>>>>> {<br>
>>>>>>>>> }<br>
>>>>>>>>><br>
>>>>>>>>> The same questions apply with * replaced by [].<br>
>>>>>>>>><br>
>>>>>>>>><br>
>>>>>>>>> Next is in the types for variables. Are pointers grouped<br>
>>>>>>>>> separately or together with the type:<br>
>>>>>>>>><br>
>>>>>>>>> int *i,j,*k;<br>
>>>>>>>>><br>
>>>>>>>>> or<br>
>>>>>>>>><br>
>>>>>>>>> int *i,*k;<br>
>>>>>>>>> int j;<br>
>>>>>>>>><br>
>>>>>>>>> If grouped together, should they be ordered with pointers first<br>
>>>>>>>>> and the others later?<br>
>>>>>>>>><br>
>>>>>>>>> int *i,*k,j;<br>
>>>>>>><br>
>>>>>>> There is no preference on the above; the pointers do not have to be on their own line.<br>
>>>>>>>>><br>
>>>>>>>>> On the next topic, do you use<br>
>>>>>>>>><br>
>>>>>>>>> char *blah;<br>
>>>>>>>>><br>
>>>>>>>>> or<br>
>>>>>>>>><br>
>>>>>>>>> char* blah;<br>
>>>>>>>>><br>
>>>>>>>>> when there is only one of them?<br>
>>>>>>><br>
>>>>>>> * belongs on the variable<br>
>>>>>>><br>
>>>>>>>>><br>
>>>>>>>>> I have the same issues with the typedefs. Sometimes there is<br>
>>>>>>>>><br>
>>>>>>>>> typedef struct _EH* EH;<br>
>>>>>>>>><br>
>>>>>>>>> and other times<br>
>>>>>>>>><br>
>>>>>>>>> typedef struct _EH *EH;<br>
>>>>>>><br>
>>>>>>> Here the * belongs as in the first case<br>
>>>>>>>>><br>
>>>>>>>>><br>
>>>>>>>>> My personal thought is to be consistent and use char * with a space<br>
>>>>>>>>> everywhere, but you tell me. I will then go and fix the developer<br>
>>>>>>>>> documentation to be consistent with the style guide as a test of<br>
>>>>>>>>> my branch management.<br>
>>>>>>>>><br>
>>>>>>>>> Todd.<br>
>>>>>>>>><br>
>>>>>>>>><br>
>>>>>>>><br>
>>>>>>><br>
>>>>>><br>
>>>>><br>
>>>><br>
>>><br>
>><br>
><br>
<br>
</div></div></blockquote></div><br><br clear="all"><div><br></div>-- <br><div class="gmail_signature" data-smartmail="gmail_signature">What most experimenters take for granted before they begin their experiments is infinitely more interesting than any results to which their experiments lead.<br>-- Norbert Wiener</div>
</div></div>