Is the Proof for the Nearest Point in a Cone Valid?

kaosAD
Messages
33
Reaction score
0
I encountered a problem in a book with a proof given. But I am a bit skeptic about it. I hope someone can help shed some light.

Let \{g_{i}\} be a set of vectors and imagine a cone defined as K = \left\{v \,\bigg|\, v =-\sum_{i}\lambda_{i}g_{i}, \textup{ where }\lambda_{i}\geq 0 \ , \forall i \right\}.

Let f \notin K and let u \in K be the closest point to f. Obviously u is the projected point of f onto K. The objective is to prove that if d = u - f, then g_{i}^\top d \leq 0, \, \forall i. (Note that d \neq 0.)

The proof given is by contradiction: Suppose that is not true, that is, \hat{g}_{i}^\top d = s_{i} for some scalar s_{i} > 0, \, \forall i, where \hat{g}_{i}= g_{i}/\|g_{i}\|. It is not difficult to see that (u-s_{i}\hat{g}_{i}) \in K, \, \forall i, i.e., it remains in the cone even by small or large perturbation on the vector u. Now, we shall show the perturbed point has smaller distance. Indeed this is the case since for any i,

<br /> \begin{align*}<br /> \|(u-s_{i}\hat{g}_{i})-f \|^{2} &amp;= \|(u-f)-s_{i}\hat{g}_{i}\|^{2}= \|(u-f)\|^{2}-2 s_{i}\hat{g}_{i}^\top (u-f)+s_{i}^{2}\|\hat{g}_{i}\|^{2} \\ <br /> &amp;= \|d\|-2s_{i}\hat{g}_{i}^\top d+s_{i}^{2} \\ <br /> &amp;= \|d\|-2s_{i}^{2}+s_{i}^{2} \\ <br /> &amp;= \|d\|-s_{i}^{2}\leq \|d\|, <br /> \end{align*}<br />

which contradicts with the assumption that u is the nearest point in K to f -- done!.

All looks good, however if I let \hat{g}_{i}^\top d = t_{i} for which the scalar t_{i}&lt; 0,\, \forall i but sufficiently close to 0 such that (u-t_{i}\hat{g}_{i}) \in K for any i, then using the same derivation I arrive at \|(u-t_{i}\hat{g}_{i})-f \|^{2}= \|d\|-t_{i}^{2}\leq \|d\| too! This means it can contradict even for the case g_{i}^\top d &lt; 0. I now question the validity of this proof. I welcome your comment.
 
Last edited:
Physics news on Phys.org
Firstly, the negation of (for all i) is (there exists an i).

Secondly nothing states that the condition of g_i^Td<=0 for all i implies that this determines u uniquely. Thus given such a u with this condition, there may be points closer and lying in the cone. And if there isn't a closer point you won't be able to find things sufficiently close to zero.
 
Last edited:
matt grime said:
Firstly, the negation of (for all i) is (there exists an i).

You mean in the definition of K? But you can't change that.

matt grime said:
Secondly nothing states that the condition of g_i^Td<=0 for all i implies that this determines u uniquely. Thus given such a u with this condition, there may be points closer and lying in the cone. And if there isn't a closer point you won't be able to find things sufficiently close to zero.

Yes, I agree with you that nothing states about the implication but
since K is a cone which is closed and convex, u \in K exists and must be a unique point.
 
Last edited:
No, I do not mean the definition of K. You are doing a proof by contradiction, so what is the negation of the statement you're trying to prove? It is not what you wrote.

And you still haven't justified that in your 'second argument' that you can actually choose things as you claim you can. Just write down a simple example and work out where you go wrong. (For example there is nothing to stop you picking 1-d things, for example the cone {x : x=>1, x in R} and f=0, u=1)
 
Right you have the point there:there might not be any point (u - t_i \hat{g}_i) \in K such that it satisfies \hat{g}^\top d = t_i &lt; 0. This means the book cannot also claim that the point (u - s_i \hat{g}_i) \in K satisfyng \hat{g}^\top d = s_i &gt; 0 always exist.
 
Last edited:
It can because of the definition of K. (I admit I've not thought to carefully about this or your attempted counter example, but it is clear that what the line of argument is approximately: that all things are greater than or equal to zero, so if something is not strictly negative, then it is positive, and, say, 1/2 of a positive number is positive again, so in K as well. You really ought to work through an example to see what is going on. It is quite simple, I believe: if z is in K, then so is z+r_ig_i for any positive z_i by the definition of K.)
 
Last edited:
Thread 'Derivation of equations of stress tensor transformation'
Hello ! I derived equations of stress tensor 2D transformation. Some details: I have plane ABCD in two cases (see top on the pic) and I know tensor components for case 1 only. Only plane ABCD rotate in two cases (top of the picture) but not coordinate system. Coordinate system rotates only on the bottom of picture. I want to obtain expression that connects tensor for case 1 and tensor for case 2. My attempt: Are these equations correct? Is there more easier expression for stress tensor...
Back
Top