Al momento, stai seguendo questa domanda
- Vedrai gli aggiornamenti nel tuofeed del contenuto seguito.
- Potresti ricevere delle e-mail a seconda delle tuepreferenze per le comunicazioni.
can I pass these nonlinear constraints to lsqnonlin?
 denote a function of two variables and
 denote a function of two variables and  the parameters of the optimization problem
 the parameters of the optimization problem  
  , fmust be convex at all iterations. Otherwise, it is purely a matter of luck that the identification works.
, fmust be convex at all iterations. Otherwise, it is purely a matter of luck that the identification works.

Risposta accettata
 as well, although that will be a simpler, linear constraint in E.
 as well, although that will be a simpler, linear constraint in E. 44 Commenti
 can be easily implemented in A*x<b.
 can be easily implemented in A*x<b.
 or did you refer to something else?
 or did you refer to something else? , which is the balance of linear momentan (Newton's law). It is a purely mechanical boundary-value-problem and I solve it by means of the finite-element-method. P is a second-order tensor (stress tensor, 3x3 matrix) which can be derived from the strain energy density fas follows:
, which is the balance of linear momentan (Newton's law). It is a purely mechanical boundary-value-problem and I solve it by means of the finite-element-method. P is a second-order tensor (stress tensor, 3x3 matrix) which can be derived from the strain energy density fas follows: , where Cis a SPD second order tensor (strain tensor, 3x3 matrix). fis a scalar valued function (strain energy density) chracterizing the constitutive behavior of the material under consideration. If we assume the material to be isotropic (same response regardless of the loading direction), then we can express $f$ in terms of (some of) the principal invariants of $C$, i.e., $f(C) = f(I_1(C), I_2(C))$. Renaming the invariants to x and y, we end up with
, where Cis a SPD second order tensor (strain tensor, 3x3 matrix). fis a scalar valued function (strain energy density) chracterizing the constitutive behavior of the material under consideration. If we assume the material to be isotropic (same response regardless of the loading direction), then we can express $f$ in terms of (some of) the principal invariants of $C$, i.e., $f(C) = f(I_1(C), I_2(C))$. Renaming the invariants to x and y, we end up with  as I defined it in my question. From the physics we know that fmust vanish for zero deformation, which is the case for
 as I defined it in my question. From the physics we know that fmust vanish for zero deformation, which is the case for  or
 or  , where it should also attain its global minimum. f should have no other stationary points. Also we know that
, where it should also attain its global minimum. f should have no other stationary points. Also we know that  based on the properties of C and
 based on the properties of C and  since energy can not be negative. Also, $f$ should go to infinity if either x or y approach infinity and is ideally convex. However,
 since energy can not be negative. Also, $f$ should go to infinity if either x or y approach infinity and is ideally convex. However,  are resonable upper bounds for most physical applications. That are the main properties we know about f.
 are resonable upper bounds for most physical applications. That are the main properties we know about f. and fit a quite small number of material parameters (here
 and fit a quite small number of material parameters (here  ) to some measurements. However, this approximation a priori assumes a particular shape of fand it turns out that this global approximation is not really satisfactory which is why people try to find better approximations of fto make it a good model for nearly all loading cases of the above boundary-value-problem. My idea is/was to discretize f(locally) over the space of invariants (x and y), distribute some support points in that space, and find the values of fat those support points (the
) to some measurements. However, this approximation a priori assumes a particular shape of fand it turns out that this global approximation is not really satisfactory which is why people try to find better approximations of fto make it a good model for nearly all loading cases of the above boundary-value-problem. My idea is/was to discretize f(locally) over the space of invariants (x and y), distribute some support points in that space, and find the values of fat those support points (the  ) with optimization. This works excellent in a one-dimensional invariant space, but as I told you, not so good in two-dimensional invariant space because the principal invariants of a SPD tensor show some correlation; In other words, there are some support points (x_a, y_a) that I do not really "touch" when I solve my boundary-value-problem, in particular the ones at the boundary. I highly doubt it works to identify the parameters on those support points with a purey local approximation.
) with optimization. This works excellent in a one-dimensional invariant space, but as I told you, not so good in two-dimensional invariant space because the principal invariants of a SPD tensor show some correlation; In other words, there are some support points (x_a, y_a) that I do not really "touch" when I solve my boundary-value-problem, in particular the ones at the boundary. I highly doubt it works to identify the parameters on those support points with a purey local approximation.  (the ones at the "most activated" support points) with optimization, and computing the
 (the ones at the "most activated" support points) with optimization, and computing the  at the boundary based on the optimized
 at the boundary based on the optimized  and the convexity property.
 and the convexity property. " That seems to be the same as Laplacian(f)=0."
But f is not a second order tensor in the Laplacian.
" You've restated here that f should be convex, but still haven't explained why. You also haven't explained why this convexity needs to be enforced at all iterations."
f(C) should be polyconvex, which is often weakened by requiring f(x,y) to be convex, where x and y are the first two principal invariants of C. I solve the pde with finite elements. Since it is nonlinear in the displacement field U, a Newton scheme is applied where the increments are obtained by solving the linear system K*deltaU = -R
If f(x,y) is convex, it can be guaranteed that the pde has a unique solution and that K is PSD. We can not guarantee that if f(x,y) is not convex. If the convexity is not enforced at all iterations, I often see that K is rank deficient, i.e. I can not solve the pde.
Strengthened, I think you mean,.
Yes.
Ah well. We've already discussed what I think about solving PDEs inside objective functions... One question. Is the converse true? If you have an f that solves the PDEs will necessarily be convex or polyconvex?
It is on my list for the future to implement the pde as a nonlinear constraint.
No. If f solves the pde, f must not necessarily be convex/polyconvex. Does this help?
Also, were the information that I provided helpful to make a suggestion for a different set of basis functions / different approximation of f? You mentioned that there are also convex basis functions or semi-local approximation strategies.
You often have great ideas, I really hope also here once again :)
You would have to solve a potentially non-sparse linear system to find coefficients E_ij giving a certain set of f(xi,yj) values. That means that the number of basis functions you could have might have to be somewhat modest.
I thought the approximation f = sum N_ij implies that we have ad many basis functions as there are nodal points. So what do you mean by "number of basis functions..." modest?

 , where the N_i are the local finite element basis functions that I use so far. The blue dots are the sampled points when I solve the pde. You can clearly see the correlation between x and y which makes it nearly impossible to identfiy all E_i. Replacing my N_i with your proposed
, where the N_i are the local finite element basis functions that I use so far. The blue dots are the sampled points when I solve the pde. You can clearly see the correlation between x and y which makes it nearly impossible to identfiy all E_i. Replacing my N_i with your proposed

Let me finally clarify a few points regarding
N_i(x,y) = E_i*log( 1 + exp(a_i*(x-x_i)) + exp(b_i*(y-y_i)) )/log(3)
1. The purpose of the factor 1/log(3) is to make sure that N_i(x_i,y_i)=E_i ? Is that necessari at all given that the basis is not interpolating?
2. What is the purpose of log(...), i.e. why not simple
N_i(x,y) = E_i*( 1 + exp(a_i*(x-x_i)) + exp(b_i*(y-y_i)) )
(1)That would make the basis function separable in x,y. Not sure that would be a good basis unless f(x,y) is also separable in x and y
Most of the f(x,y) that I am working on are indeed of the form f(x,y)=h(x)+g(y). Given that, you think
N_i(x,y) = E_i*log( 1 + exp(a_i*(x-x_i)) + exp(b_i*(y-y_i)) )
is appropriate?
you will want to use log1p for small values of the operands, a_i(x-x_i)<=33 and b_i(y-y_i)<=33 and you will need to use an appropriate Taylor approximation when either of the exponential terms exp(a_i*(x-x_i)), exp(b_i*(y-y_i)) overflows.*
I think x_i,y_i,E_i<=10 are reasonable upper bounds, so overflow should not be a matter of concern. Why in particular 33 (a_i*(x-x_i)<=33)? The examples on your link use logp1(1+x) for quite small values of x.

 . So it is probably a trade-off: if I do not situate a basis function at every support point, I have to introduce the additional parameters on the exponentials to make the basis more flexible. If I have a higher density of basis functions, I can probably set them to one. Makes intuitively sense?
. So it is probably a trade-off: if I do not situate a basis function at every support point, I have to introduce the additional parameters on the exponentials to make the basis more flexible. If I have a higher density of basis functions, I can probably set them to one. Makes intuitively sense?Più risposte (1)

89 Commenti

 where the s_i are known shifts which can be thought of as the "locations" of the basis functions. The function F(x) is also continuous and defined on some possibly larger domain  [Fmin,Fmax].  You want these functions to agree at certain discrete sample points,
 where the s_i are known shifts which can be thought of as the "locations" of the basis functions. The function F(x) is also continuous and defined on some possibly larger domain  [Fmin,Fmax].  You want these functions to agree at certain discrete sample points,

 
 


 is defined on the domain
 is defined on the domain  and
 and  on the domain
 on the domain  as indicated by the two dashed lines in the picture,
 as indicated by the two dashed lines in the picture,  are the linear interpolating kernels. Clearly, we have
 are the linear interpolating kernels. Clearly, we have  . This is the terminology you introduced some comments before.
. This is the terminology you introduced some comments before.  , but how would I determine
, but how would I determine  and
 and  . We said
. We said  is not defined there.
 is not defined there.the convexity conditions E(i+1)-2*E(i)+E(i-1) do not have to be satisfied at all i. It is sufficient for this to hold for i=2,...n-1
But if we just have E(n)>=E(n-1), convexity at the boundary can not be guaranteed. Why do you think the constraints for i=2,...,n-1 are sufficient?
So there is reason to think that all 3*n parameters could be estimated, assuming you have 3*n sample points
So I should have at least 3n sample points to determine the 3n parameters? Did you mean that?
There is no other constraint involving E(n) that I've proposed.
You did not indeed. But I thought E(n)-E(n-1) would be necessary in addition to ensure convexity. But I do not need this constraint?
May it be that E(n)-E(n-1) along with E(n)-2E(n-1)+E(n-2)>=0 makes f non-convex?




I can modify my example as below to get poor accuracy again, even with your actual tolerances.
As you said, setting the tolerances to 1e-12 is probably a good choice. I think I read in a different chat that one should not go below 1e-14. Of course, as your examples shows, one could create cases where this still may not be sufficient.
Even in my example with (x-1)^4, the objective is not perfectly flat anywhere, but you can still see that it is flat enough to cause early termination.
The fact that the optimizer pulls away from ground truth, along with exitflag=2, suggests that my objective is "quite" flat around ground truth, right?
where D_g and D_h are the D parameters (from our earlier discussion) of g and h
I discretized g and h with five parameters each. Do you mean D_g and D_h is the solution that I got in the noisy case? Would the associated Jacobian not be zero then if D_g and D_h are const numbers?
Makes totally sense what you say. One point:
so the second derivatives will merely be encouraged to be small in magnitude rather required to have some exact value, as with constraints.
My motivation to multiply A by 1e4 was because some components 1e3*A*E were still greater than zero. So I increased the exponent by one. What I want to say is that with c, c*A*E, I do influence the sign and magnitude simultaneously, right?
And is it really that this factor somehow sets the derivatives to an exact value (...required to have some exact value, as with constraints)? I mean is there not also an merit function internally whose goal is to decrease the constraint violations?
It's hard to know what's really happening, since I don't know which of lsqnonlin's algorithms you are running.
I use lsqnonlin's interior point algorithm which, If I am not mistaken, calls fmincon's interior-point algorithm internally. Given that, do you know what the scaling of the A matrix affects qualitatively? Just magnitude or sign, or both?

Therefore, your steps will always be guided more by the constraints than the objective.
This makes qualitatively sense to me, but I do not see why this translates into small steps that can even trigger the stop tolerance?
Vedere anche
Categorie
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!Si è verificato un errore
Impossibile completare l'azione a causa delle modifiche apportate alla pagina. Ricarica la pagina per vedere lo stato aggiornato.
Seleziona un sito web
Seleziona un sito web per visualizzare contenuto tradotto dove disponibile e vedere eventi e offerte locali. In base alla tua area geografica, ti consigliamo di selezionare: .
Puoi anche selezionare un sito web dal seguente elenco:
Come ottenere le migliori prestazioni del sito
Per ottenere le migliori prestazioni del sito, seleziona il sito cinese (in cinese o in inglese). I siti MathWorks per gli altri paesi non sono ottimizzati per essere visitati dalla tua area geografica.
Americhe
- América Latina (Español)
- Canada (English)
- United States (English)
Europa
- Belgium (English)
- Denmark (English)
- Deutschland (Deutsch)
- España (Español)
- Finland (English)
- France (Français)
- Ireland (English)
- Italia (Italiano)
- Luxembourg (English)
- Netherlands (English)
- Norway (English)
- Österreich (Deutsch)
- Portugal (English)
- Sweden (English)
- Switzerland
- United Kingdom(English)
Asia-Pacifico
- Australia (English)
- India (English)
- New Zealand (English)
- 中国
- 日本Japanese (日本語)
- 한국Korean (한국어)





