Math Challenge - February 2020

In summary: I am holding out for the more general solution. In summary, this conversation covers a variety of solved problems and open questions in mathematics. The topics discussed include limits, polynomial interpolation, arc-length parameterization, Green's formula, the Wirtinger inequality, uniform convergence, matrix multiplication, permutations, integrals, continuous bilinear forms, the Hilbert space, chess puzzles, and geometric angle calculations.
  • #71
a)
By Green's theorem:
$$\oint_C Pdx+Qdy = \int\int \left(\frac{\partial Q}{\partial x}-\frac{\partial P}{\partial y}\right)dxdy$$
we have that if the integrand on the right-hand side is equal to 1, we get the formula for the area. One of the functions for which this is satisfied is ##P=0##, ##Q=x##. Then we rewrite the theorem:
$$\oint_C xdy = \int\int dxdy = A$$
Now we use the parametrization given in the exercise ##dy = y'(t)dt##, so we have:
$$\int_0^{2\pi} x(t)y'(t)dt = A$$
Squaring the relation and using Cauchy-Schwarz in integral form, gives:
$$A^2 = \left(\int_0^{2\pi} x(t)y'(t)dt\right)^2 \leq \int_0^{2\pi} (x(t))^2dt\int_0^{2\pi}(y'(t))^2dt$$
This proves the inequality.
b) Assume we translated the curve so that ##\int_0^{2\pi} x(t)dt##. Then Wirtinger's inequality holds.
Before using it, we will observe the property of arc-length parametrization that follows from its definition. Namely:
$$dt = \sqrt{dx^2 + dy^2} = \sqrt{(x'(t))^2 + (y'(t))^2}dt$$
from which we conclude ##(x'(t))^2 + (y'(t))^2 = 1##.
Now we apply Wirtinger's inequality to the one we had in a):
$$\int_0^{2\pi} (x(t))^2dt\int_0^{2\pi}(y'(t))^2dt \leq \int_0^{2\pi} (x'(t))^2dt\int_0^{2\pi}(y'(t))^2dt$$
We denote now:
$$I_x = \int_0^{2\pi} (x'(t))^2dt \qquad I_y = \int_0^{2\pi} (y'(t))^2dt$$
We then use the AM-GM inequality, remembering that both of the numbers above are nonnegative:
$$\left(\frac{I_x+I_y}{2}\right)^2 \geq I_xI_y$$
But also, integrating the relation ##(x'(t))^2 + (y'(t))^2 = 1## over ##t##, we find:
$$I_x + I_y = 2\pi$$
Then we combine the two expressions above:
$$I_xI_y \leq \pi^2$$
Substituting this into Wirtinger's inequality, we find finally:
$$A^2 \leq \int_0^{2\pi} (x'(t))^2dt\int_0^{2\pi}(y'(t))^2dt \leq \pi^2$$
From here we obtain the relation that's needed by taking square root of both side.
c)
We examine the conditions where Wirtinger inequality turns into equality, and also conditions of equality for AM-GM and Cauchy-Schwarz that we used.
Wirtinger's inequality equality condition:
$$x'(t) = a\sin(x+\phi)$$
for ##a## and ##\phi## constant.
AM-GM inequality equality condition:
$$I_x = I_y$$
Cauchy-Schwarz inequality equality condition:
$$x(t) = cy'(t)$$
for ##c## a constant.
The first condition, along with the condition ##(x'(t))^2 + (y'(t))^2 = 1##, gives:
$$|a|=1 \qquad y'(t) = acos(x+\phi)$$
the first condition coming from the fact that ##|x(t)|## has to be less than or equal to 1.
The second condition is then trivial, of form ##0=0##.
The third condition gives ##x(t) = \int x'(t)dt = -acos(x+\phi) = cy'(t) \Rightarrow c=-1##.
So Wirtinger's equality condition implies other conditions trivially, and this conditions implies the equation of a circle. Hence, ##A=\pi## can only happen if ##C## is a circle.
 
  • Like
Likes fresh_42 and Infrared
Physics news on Phys.org
  • #72
Antarres said:
Even if ##f## is aperiodic, it must be of such behavior that at infinity it becomes a periodic function of period ##\frac{1}{k}##, otherwise ##f_n## wouldn't converge even pointwise.
I think this is your key claim that I wasn't able to find a clear proof of in your writing. Could you give a rigorous argument for this? There is a very simple, direct argument that doesn't require any casework.
Also, I didn't see where you used uniform instead of just pointwise convergence.

Your answer to number 3 is good! This is my favorite argument for the isoperimetric inequality, and I thought I had to share :)
 
  • #73
Hmm, well I think my case argument was based on classifying cases by how many accumulation points can ##f_n## have, since if it doesn't converge pointwise, it won't converge uniform either. But since I didn't find a way to make it more rigorous, although I didn't find any counterexample to it, I found a simpler ##\epsilon-\delta## type of argument, which is probably much clearer.

Let's see if the continuity of ##f## along with uniform convergence of ##f_n## implies that ##f## is uniformly continuous.
First choose ##\epsilon>0##. Then choose ##N\in\mathbb{N}## such that for all ##n\geq N## and all ##x##, ##|f_n(x)-g(x)|<\frac{\epsilon}{2}##.
Then, we find ##\delta## such that(for ##n\geq N##):
$$|x-y| <\delta \Rightarrow |f_n(x)-f_n(y)|=|f_n(x) - g(x)+g(x)-f_n(y)| \leq |f_n(x)-g(x)|+|g(x)-f_n(y)|<\epsilon$$
This ##\delta## is independent of ##x## and ##y##(because the boundary we obtained using uniform convergence doesn't depend on ##x## or ##y##), so we find that functions ##f_n(x)## are uniformly continuous for ##n\geq N##, or in other words ##f## is uniformly continuous for ##x>N##(since ##f_n(x) = f(x+n)##).
Now we divide the positive ##x##-axis into overlapping intervals: ##[0,2N]## and ##[N,\infty]##.
By what we derived above, ##f## is proven to be uniformly continuous on the second interval. But ##[0,2N]## is closed and ##f## is continuous on it, hence ##f## is uniformly continuous on it.
So this proves that ##f## is uniformly continuous on the whole domain.

Now we turn to proving uniform continuity of ##g##. For ##\epsilon>0##, choose ##N## such that for ##n\geq N##: ##|f_n(x) - g(x)| <\frac{\epsilon}{3}##, for all ##x##.
Also from uniform continuity of ##f##, we find ##\delta## such that for all ##x## and ##y## in the domain and any ##n\geq N##:
$$|x-y|<\delta \Rightarrow |f_n(x)-f_n(y)|<\frac{\epsilon}{3}$$
Then:
$$|x-y|<\delta \Rightarrow |g(x)-g(y)| = |g(x) - f_n(x) + f_n(x) - f_n(y)+f_n(y)-g(y)|\\ \leq |g(x)-f_n(x)| + |f_n(x)-f_n(y)| + |f_n(y)-g(y)|<\epsilon$$
The boundary is by construction independent of ##x## and ##y##, so ##g## is also uniformly continuous.
 
  • Like
Likes Infrared
  • #74
Great, this is much clearer!

I think you could do it your original way too: ##g(x+1)=\lim_{n\to\infty}f(x+1+n)=\lim_{n\to\infty}f(x+n+1)=g(x)##, so ##g## is periodic with period dividing ##1##. It's continuous by uniform convergence, so uniformly continuous.
 
  • Like
Likes Antarres
  • #75
Oh true, that's exactly how I wanted to form it, but I used a lot of words to formulate it, since for some reason I didn't feel comfortable writing it in limit form. Thanks for the remark, it would've shortened my argument immensely if I just wrote it that way.
 
  • Like
Likes Infrared
  • #76
The idea for the proof is as follows. Basically for chosen functional ##F##, we want to find a decomposition of vectors of ##\mathcal{H}## to the ones belonging to its kernel, and the ones belonging to the orthogonal complement of the kernel, where orthogonal complement is defined with respect to the bilinear form ##\beta##. This will allow us to make a decomposition of a vector such that actions of the bilinear form and functional on it coincide, so we find the relation we need. We construct the proof, by first inspecting the property of orthogonal complement.

I will assume here that ##\beta(x,x) = 0## is equivalent to ##x=0##, since that wasn't mentioned, but if we take that all vectors have non-zero norm, we exclude the zero vector(cause the form is bilinear) from the space, which shouldn't happen.

Denote ##K = \ker F## which is a subset of ##\mathcal{H}##(a closed subset, by continuity of ##F##), we assert that ##K^\perp## is one-dimensional(if we exclude the trivial case where kernel would take up the whole space, for which choice ##f^\dagger=0## would suffice). Suppose that it isn't, that there are two independent non-zero vectors ##v_1## and ##v_2## in ##\mathcal{H}##, that belong to ##K^\perp##. Then we have that ##F(v_1)v_2 - F(v_2)v_1## is a non-zero vector from ##K^\perp##, but by linearity of ##F## we have that
$$F(F(v_1)v_2-F(v_2)v_1) = 0$$
which is a contradiction as it would indicate ##F(v_1)v_2 - F(v_2)v_1 \in K##. We can extend this argument to any number of dimensions.
Assume ##K^\perp## is spanned by ##n## vectors. Then we create sum of form:
$$v=\sum_i a_i F(y_1)F(y_2)\dots y_i F(y_{i+1})\dots F(y_n) $$
such that ##\sum_i a_i=0##.
Then ##F(v)=0##, similar to above so its a contradiction.
So ##K^\perp## is one-dimensional.
Let's find a vector ##w\in K^\perp## such that ##F(w) = \beta(w,w)##.
Choose:
$$w= \frac{yF(y)}{\beta(y,y)}$$
for some non-zero ##y\in K^\perp##. By substitution we see that it satisfies the relation above that we wanted to impose. We can span ##K^\perp## with ##w##.
That means that any vector from ##\mathcal{H}## can be decomposed in the form ##v = u + kw##, where ##k## is a scalar, ##u\in K## and ##w \in K^\perp##. It follows that:
$$F(x) = kF(w) = k\beta(w,w) = \beta(w,u + kw) = \beta(w,x)$$
where we used ##\beta(u,w)=0## - the orthogonal complement definition, and ##F(u) = 0## cause ##u\in K##. Therefore we constructed ##w\in\mathcal{H}## such that ##F(x) = \beta(w,x)##, so in our formula ##w=f^\dagger##.

Let's prove uniqueness. Suppose there are ##f_1^\dagger## and ##f_2^\dagger## that both satisfy the identity ##F(x) = \beta(f,x)## for all ##x\in\mathcal{H}##.
Then, take ##x = f_1^\dagger - f_2^\dagger##. It follows that:
$$F(x) = \beta(f_1^\dagger, f_1^\dagger-f_2^\dagger)$$
but also:
$$F(x) = \beta(f_2^\dagger,f_1^\dagger-f_2^\dagger)$$
Subtracting the identities above we find:
$$\beta(f_1^\dagger - f_2^\dagger, f_1^\dagger-f_2^\dagger) = 0 \Rightarrow f_1^\dagger = f_2^\dagger.$$
The uniqueness is thus proved.
 
Last edited:
  • #77
QuantumQuest said:
What I ask for, is an approach / solution using some sort of limits. I say "using calculus" in the wording of the question, which is not clear enough but if I make it totally clear, I'll give the way of the solution ;)
Maybe like this?
$$\begin{align*}
\int_a^b\,f(x)dx&=\lim_{n\to\infty}\sum_{k=0}^{n}\,f\left(a+k\frac{b-a}{n}\right)\frac{b-a}{n}\\
&=\lim_{n\to\infty}\sum_{k=0}^{n}\,f\left((a+d)-d+k\frac{(b+d)-(a+d)}{n}\right)\frac{(b+d)-(a+d)}{n}\\
&=\int_{a+d}^{b+d}f(x-d)dx
\end{align*}$$
 
Last edited:
  • Like
Likes QuantumQuest
  • #78
Antarres said:
The idea for the proof is as follows. Basically for chosen functional ##F##, we want to find a decomposition of vectors of ##\mathcal{H}## to the ones belonging to its kernel, and the ones belonging to the orthogonal complement of the kernel, where orthogonal complement is defined with respect to the bilinear form ##\beta##. This will allow us to make a decomposition of a vector such that actions of the bilinear form and functional on it coincide, so we find the relation we need. We construct the proof, by first inspecting the property of orthogonal complement.

I will assume here that ##\beta(x,x) = 0## is equivalent to ##x=0##, since that wasn't mentioned, but if we take that all vectors have non-zero norm, we exclude the zero vector(cause the form is bilinear) from the space, which shouldn't happen.

Denote ##K = \ker F## which is a subset of ##\mathcal{H}##(a closed subset, by continuity of ##F##), we assert that ##K^\perp## is one-dimensional(if we exclude the trivial case where kernel would take up the whole space, for which choice ##f^\dagger=0## would suffice). Suppose that it isn't, that there are two independent non-zero vectors ##v_1## and ##v_2## in ##\mathcal{H}##, that belong to ##K^\perp##. Then we have that ##F(v_1)v_2 - F(v_2)v_1## is a non-zero vector from ##K^\perp##, but by linearity of ##F## we have that
$$F(F(v_1)v_2-F(v_2)v_1) = 0$$
which is a contradiction as it would indicate ##F(v_1)v_2 - F(v_2)v_1 \in K##. We can extend this argument to any number of dimensions.
Assume ##K^\perp## is spanned by ##n## vectors. Then we create sum of form:
$$v=\sum_i a_i F(y_1)F(y_2)\dots y_i F(y_{i+1})\dots F(y_n) $$
such that ##\sum_i a_i=0##.
Then ##F(v)=0##, similar to above so its a contradiction.
So##K^\perp## is one-dimensional.
Let's find a vector ##w\in K^\perp## such that ##F(w) = \beta(w,w)##.
Choose:
$$w= \frac{yF(y)}{\beta(y,y)}$$
for some non-zero ##y\in K^\perp##. By substitution we see that it satisfies the relation above that we wanted to impose. We can span ##K^\perp## with ##w##.
That means that any vector from ##\mathcal{H}## can be decomposed in the form ##v = u + kw##, where ##k## is a scalar, ##u\in K## and ##w \in K^\perp##. It follows that:
$$F(x) = kF(w) = k\beta(w,w) = \beta(w,u + kw) = \beta(w,x)$$
where we used ##\beta(u,w)=0## - the orthogonal complement definition, and ##F(u) = 0## cause ##u\in K##. Therefore we constructed ##w\in\mathcal{H}## such that ##F(x) = \beta(w,x)##, so in our formula ##w=f^\dagger##.

Let's prove uniqueness. Suppose there are ##f_1^\dagger## and ##f_2^\dagger## that both satisfy the identity ##F(x) = \beta(f,x)## for all ##x\in\mathcal{H}##.
Then, take ##x = f_1^\dagger - f_2^\dagger##. It follows that:
$$F(x) = \beta(f_1^\dagger, f_1^\dagger-f_2^\dagger)$$
but also:
$$F(x) = \beta(f_2^\dagger,f_1^\dagger-f_2^\dagger)$$
Subtracting the identities above we find:
$$\beta(f_1^\dagger - f_2^\dagger, f_1^\dagger-f_2^\dagger) = 0 \Rightarrow f_1^\dagger = f_2^\dagger.$$
The uniqueness is thus proved.
At first sight it looks as if you had proven Riesz' representation theorem. However, the task is a different one. We want to know its generalization to any continuous bilinear form which is bounded on the diagonal from below, but which doesn't necessarily define orthogonality. Riesz' representation theorem is useful for the proof, but not equivalent to the statement.
 
  • Like
Likes Antarres
  • #79
fresh_42 said:
11. b.) In how many different ways can eight queens be placed on a chessboard, such that no queen threatens another? Two solutions are not different, if they can be achieved by a rotation or by mirroring of the board. (For this part there is no proof required.)

Is the answer 17? I consider an arrangement to be "different" or "unique" if it cannot be obtained as a rotation (by 90, 180 or 270 degrees) or as a mirror image (reflect across horizontal or vertical center line of chessboard) of any other arrangement in the unique set. The following are 17 unique arrangements I get:
  1. [1, 5, 8, 6, 3, 7, 2, 4]
  2. [1, 6, 8, 3, 7, 4, 2, 5]
  3. [1, 7, 4, 6, 8, 2, 5, 3]
  4. [1, 7, 5, 8, 2, 4, 6, 3]
  5. [2, 4, 6, 8, 3, 1, 7, 5]
  6. [2, 5, 7, 1, 3, 8, 6, 4]
  7. [2, 5, 7, 4, 1, 8, 6, 3]
  8. [2, 6, 1, 7, 4, 8, 3, 5]
  9. [2, 6, 8, 3, 1, 4, 7, 5]
  10. [2, 7, 3, 6, 8, 5, 1, 4]
  11. [2, 7, 5, 8, 1, 4, 6, 3]
  12. [3, 1, 7, 5, 8, 2, 4, 6]
  13. [3, 5, 2, 8, 1, 7, 4, 6]
  14. [3, 5, 8, 4, 1, 7, 2, 6]
  15. [3, 6, 2, 5, 8, 1, 7, 4]
  16. [3, 6, 2, 7, 1, 4, 8, 5]
  17. [3, 6, 2, 7, 5, 1, 8, 4]
Each array or list above is to be interpreted as denoting an arrangement with each element in the list denoting the column number (occupied on the chessboard) of the queen placed in the row number corresponding to the index of that element of that list. So for example, arrangement [3, 6, 2, 7, 5, 1, 8, 4] means that in row 1 of the chessboard, the queen is placed in column 3, in row 2, the queen is in column 6 and so on till in row 8, the queen is in column 4.

If the answer is correct, I can share the script I wrote to find the solution, if that will help.
 
  • #80
17 is too many, but reflections along the diagonals are considered equal. Do you have the number without equivalence, i.e. how many possibilities at all?

Remember that I only need the numbers, regardless how you found them.
 
  • #81
fresh_42 said:
17 is too many, but reflections along the diagonals are considered equal. Do you have the number without equivalence, i.e. how many possibilities at all?

Remember that I only need the numbers, regardless how you found them.

When not considering rotations and mirror images as being equivalent, I got 92 different ways to place 8 queens on 8x8 chessboard such that no 2 queens attack each other. I had not considered the possibility of reflection along diagonals in the solution I had posted.
 
  • Like
Likes Infrared and fresh_42
  • #82
92 is correct. Fundamental solutions are less than 17.
 
  • Informative
Likes Not anonymous
  • #83
fresh_42 said:
92 is correct. Fundamental solutions are less than 17.

Thanks. Will look at the arrangements from the earlier solution again after sometime and find duplicates arising from diagonal reflections
 
  • #84
fresh_42 said:
92 is correct. Fundamental solutions are less than 17.

Is the answer 12? The following are the unique arrangements when diagonal reflections also to be treated as non-unique.
  1. [1, 5, 8, 6, 3, 7, 2, 4]
  2. [1, 6, 8, 3, 7, 4, 2, 5]
  3. [2, 4, 6, 8, 3, 1, 7, 5]
  4. [2, 5, 7, 1, 3, 8, 6, 4]
  5. [2, 5, 7, 4, 1, 8, 6, 3]
  6. [2, 6, 1, 7, 4, 8, 3, 5]
  7. [2, 6, 8, 3, 1, 4, 7, 5]
  8. [2, 7, 3, 6, 8, 5, 1, 4]
  9. [2, 7, 5, 8, 1, 4, 6, 3]
  10. [3, 5, 2, 8, 1, 7, 4, 6]
  11. [3, 5, 8, 4, 1, 7, 2, 6]
  12. [3, 6, 2, 5, 8, 1, 7, 4]
The arrangements in my earlier solution that are now considered non-unique because they are diagonal reflections of some other arrangement in the final solution are:
[1, 7, 4, 6, 8, 2, 5, 3]
[1, 7, 5, 8, 2, 4, 6, 3]
[3, 1, 7, 5, 8, 2, 4, 6]
[3, 6, 2, 7, 1, 4, 8, 5]
[3, 6, 2, 7, 5, 1, 8, 4]
 
  • Like
Likes etotheipi, fresh_42 and Infrared
  • #85
Not anonymous said:
Is the answer 12? The following are the unique arrangements when diagonal reflections also to be treated as non-unique.
  1. [1, 5, 8, 6, 3, 7, 2, 4]
  2. [1, 6, 8, 3, 7, 4, 2, 5]
  3. [2, 4, 6, 8, 3, 1, 7, 5]
  4. [2, 5, 7, 1, 3, 8, 6, 4]
  5. [2, 5, 7, 4, 1, 8, 6, 3]
  6. [2, 6, 1, 7, 4, 8, 3, 5]
  7. [2, 6, 8, 3, 1, 4, 7, 5]
  8. [2, 7, 3, 6, 8, 5, 1, 4]
  9. [2, 7, 5, 8, 1, 4, 6, 3]
  10. [3, 5, 2, 8, 1, 7, 4, 6]
  11. [3, 5, 8, 4, 1, 7, 2, 6]
  12. [3, 6, 2, 5, 8, 1, 7, 4]
The arrangements in my earlier solution that are now considered non-unique because they are diagonal reflections of some other arrangement in the final solution are:
[1, 7, 4, 6, 8, 2, 5, 3]
[1, 7, 5, 8, 2, 4, 6, 3]
[3, 1, 7, 5, 8, 2, 4, 6]
[3, 6, 2, 7, 1, 4, 8, 5]
[3, 6, 2, 7, 5, 1, 8, 4]
Very good!
 
  • #86
fresh_42 said:
11. Answer the following questions:
a.) How many knights can you place on a ##n\times m## chessboard such that no two attack each other?
There are three cases. ##r## is the required number.
CASE 1: When any of ##m## & ##n## ##=1##
##r=n/m##
(##r=n## for ##m=1## and vice versa)
CASE 2: When any of ##m## & ##n## ##=2##
(let ##q=## number equal to 2 & ##p=## other number)
1. ##p=4x+0##
##r=\frac{pq}2##
2. ##p=##else
##r=(\lfloor\frac p2\rfloor+1)\times q##
CASE 3: When both ##m## & ##n## ##\gt2##
##r=\lceil\frac{mn}2\rceil##
 
  • #87
mmaismma said:
There are three cases. ##r## is the required number.
CASE 1: When any of ##m## & ##n## ##=1##
##r=n/m##
(##r=n## for ##m=1## and vice versa)
CASE 2: When any of ##m## & ##n## ##=2##
(let ##q=## number equal to 2 & ##p=## other number)
1. ##p=4x+0##
##r=\frac{pq}2##
2. ##p=##else
##r=(\lfloor\frac p2\rfloor+1)\times q##
CASE 3: When both ##m## & ##n## ##\gt2##
##r=\lceil\frac{mn}2\rceil##
Let me sort this out. ##r## be the number of knights on an ##n \times m## board.

You already correctly had the cases ##m=1## in which we get ##r=n## and ##m>2## with ##r=\lceil\frac{mn}2\rceil##. Now only ##m=2## was left to do.

If ##m=2## and ##n=4k## then you say ##r=n##, since ##m=q=2## and thus ##q/2=1\, , \,p=r=n.##
This is correct.

If ##m=2## and ##n=4k+s## with ##s=1,2,3## then you get ##r=(\lfloor\frac p2\rfloor+1)\times q## which is in the variables of the problem ##r=(\lfloor\frac n2\rfloor+1)\times 2##.
This is correct, too.

The arrangement in case of two rows is
1581663447288.png
 
  • Like
Likes mmaismma
  • #88
In order to calculate the Legendre symbol ##\left(\frac{a}{p}\right)##, we are going to use Euler's criterion:
$$\left(\frac{a}{p}\right) \equiv a^{\frac{p-1}{2}} \pmod{p}$$

Observing that ##\lambda_{a,p}## is a permutation, we will decompose it into cycles. We observe that any cycle that is a part of the permutation must be of form: ##(x, ax, a^2x, a^3x,\dots, a^kx)##, where ##k## is the smallest natural number such that ##a^{k+1} \equiv 1 \pmod{p}##, and every number in the cycle is taken modulo ##p##.
Such a number ##k## exists and is smaller than ##p-1##, since we have only ##p-1## distinct elements in ##\mathbb{Z}_p^\times##.
We also observe that all cycles in our permutation must be of same length, because this permutation is induced by left multiplication. That is, if ##k## is the smallest number such that ##a^{k+1} \equiv 1 \pmod{p}##, then there can't be another number ##l## satisfying the same property(because only one of them can be considered 'smallest'), and so the cycle will always be terminated by ##a^kx##, for every ##x##, and fixed ##a##.

We conclude that, if we take ##k## to be the length of a cycle corresponding to number ##a##(cycles then terminate with ##a^{k-1}x##), that there are ##\frac{p-1}{k}## cycles in the permutation ##\lambda_{a,p}##. Every cycle of length ##k## has the signature ##(-1)^{k-1}##. Therefore we have:
$$\text{sgn}(\lambda_{a,p}) = \left((-1)^{k-1}\right)^\frac{p-1}{k}$$
The above expression is equal to ##1## for odd ##k##, and equal to ##(-1)^\frac{p-1}{k}## for even ##k##.
Now also, for even ##k##:
$$a^{\frac{p-1}{2}} = (a^{\frac{k}{2}})^{\frac{p-1}{k}} \equiv (-1)^\frac{p-1}{k} \pmod{p}$$
The last equality above follows because if ##a^k \equiv 1 \pmod{p}## then ##a^\frac{k}{2} \equiv \pm 1 \pmod{p}##. But also we can't have ##a^\frac{k}{2} \equiv 1 \pmod{p}## since ##k## is the smallest positive integer such that this relation holds. Hence it must be ##a^\frac{k}{2} \equiv -1 \pmod{p}##.
For odd ##k##, we have:
$$a^\frac{p-1}{2} = (a^k)^\frac{p-1}{2k} \equiv 1 \pmod{p}$$
Hence we have that, by Euler's criterion we mentioned at the beginning:
$$\left(\frac{a}{p}\right) = \text{sgn}(\lambda_{a,p})$$
QED
 
  • #89
Antarres said:
In order to calculate the Legendre symbol ##\left(\frac{a}{p}\right)##, we are going to use Euler's criterion:
$$\left(\frac{a}{p}\right) \equiv a^{\frac{p-1}{2}} \pmod{p}$$

Observing that ##\lambda_{a,p}## is a permutation, we will decompose it into cycles. We observe that any cycle that is a part of the permutation must be of form: ##(x, ax, a^2x, a^3x,\dots, a^kx)##, where ##k## is the smallest natural number such that ##a^{k+1} \equiv 1 \pmod{p}##, and every number in the cycle is taken modulo ##p##.
Such a number ##k## exists and is smaller than ##p-1##, since we have only ##p-1## distinct elements in ##\mathbb{Z}_p^\times##.
We also observe that all cycles in our permutation must be of same length, because this permutation is induced by left multiplication. That is, if ##k## is the smallest number such that ##a^{k+1} \equiv 1 \pmod{p}##, then there can't be another number ##l## satisfying the same property(because only one of them can be considered 'smallest'), and so the cycle will always be terminated by ##a^kx##, for every ##x##, and fixed ##a##.

We conclude that, if we take ##k## to be the length of a cycle corresponding to number ##a##(cycles then terminate with ##a^{k-1}x##), that there are ##\frac{p-1}{k}## cycles in the permutation ##\lambda_{a,p}##. Every cycle of length ##k## has the signature ##(-1)^{k-1}##. Therefore we have:
$$\text{sgn}(\lambda_{a,p}) = \left((-1)^{k-1}\right)^\frac{p-1}{k}$$
The above expression is equal to ##1## for odd ##k##, and equal to ##(-1)^\frac{p-1}{k}## for even ##k##.
Now also, for even ##k##:
$$a^{\frac{p-1}{2}} = (a^{\frac{k}{2}})^{\frac{p-1}{k}} \equiv (-1)^\frac{p-1}{k} \pmod{p}$$
The last equality above follows because if ##a^k \equiv 1 \pmod{p}## then ##a^\frac{k}{2} \equiv \pm 1 \pmod{p}##. But also we can't have ##a^\frac{k}{2} \equiv 1 \pmod{p}## since ##k## is the smallest positive integer such that this relation holds. Hence it must be ##a^\frac{k}{2} \equiv -1 \pmod{p}##.
For odd ##k##, we have:
$$a^\frac{p-1}{2} = (a^k)^\frac{p-1}{2k} \equiv 1 \pmod{p}$$
Hence we have that, by Euler's criterion we mentioned at the beginning:
$$\left(\frac{a}{p}\right) = \text{sgn}(\lambda_{a,p})$$
QED
I was working on this, and took the same approach. Unfortunately, I didn’t realize that k-cycles have the signature ##(-1)^{k-1}##, and not ##(-1)^k##. D’oh!
 
  • #90
Antarres said:
In order to calculate the Legendre symbol ##\left(\frac{a}{p}\right)##, we are going to use Euler's criterion:
$$\left(\frac{a}{p}\right) \equiv a^{\frac{p-1}{2}} \pmod{p}$$

Observing that ##\lambda_{a,p}## is a permutation, we will decompose it into cycles. We observe that any cycle that is a part of the permutation must be of form: ##(x, ax, a^2x, a^3x,\dots, a^kx)##, where ##k## is the smallest natural number such that ##a^{k+1} \equiv 1 \pmod{p}##, and every number in the cycle is taken modulo ##p##.
Such a number ##k## exists and is smaller than ##p-1##, since we have only ##p-1## distinct elements in ##\mathbb{Z}_p^\times##.
We also observe that all cycles in our permutation must be of same length, because this permutation is induced by left multiplication. That is, if ##k## is the smallest number such that ##a^{k+1} \equiv 1 \pmod{p}##, then there can't be another number ##l## satisfying the same property(because only one of them can be considered 'smallest'), and so the cycle will always be terminated by ##a^kx##, for every ##x##, and fixed ##a##.

We conclude that, if we take ##k## to be the length of a cycle corresponding to number ##a##(cycles then terminate with ##a^{k-1}x##), that there are ##\frac{p-1}{k}## cycles in the permutation ##\lambda_{a,p}##. Every cycle of length ##k## has the signature ##(-1)^{k-1}##. Therefore we have:
$$\text{sgn}(\lambda_{a,p}) = \left((-1)^{k-1}\right)^\frac{p-1}{k}$$
The above expression is equal to ##1## for odd ##k##, and equal to ##(-1)^\frac{p-1}{k}## for even ##k##.
Now also, for even ##k##:
$$a^{\frac{p-1}{2}} = (a^{\frac{k}{2}})^{\frac{p-1}{k}} \equiv (-1)^\frac{p-1}{k} \pmod{p}$$
The last equality above follows because if ##a^k \equiv 1 \pmod{p}## then ##a^\frac{k}{2} \equiv \pm 1 \pmod{p}##. But also we can't have ##a^\frac{k}{2} \equiv 1 \pmod{p}## since ##k## is the smallest positive integer such that this relation holds. Hence it must be ##a^\frac{k}{2} \equiv -1 \pmod{p}##.
For odd ##k##, we have:
$$a^\frac{p-1}{2} = (a^k)^\frac{p-1}{2k} \equiv 1 \pmod{p}$$
Hence we have that, by Euler's criterion we mentioned at the beginning:
$$\left(\frac{a}{p}\right) = \text{sgn}(\lambda_{a,p})$$
QED
This result is called Zolotarev's Lemma. You used but haven't explicitly stated that ##k+1=\operatorname{ord}a##. It's a bit easier to read if you start with that definition and the remark that the cycles are disjoint.

Hint for problem 10:
Consider ##B(f)(g)=\beta(f,g)##, and use Riesz representation theorem to establish an isometry ##T## between ##\mathcal{H}^*## and ##\mathcal{H}##. Then apply Banach's fixed-point theorem on the function ##Q(f)=f-\lambda (T(B(f))-T(f))## for a suitable ##\lambda##.
 
  • Like
Likes Antarres
  • #91
I don't know if @fresh_42 didn't want the fact that ##\mathbb{Z}_p^*## is cyclic to be used for problem 9, but the result follows easily from it. If ##a=b^2\in\mathbb{Z}_p^*## is a square, then ##\lambda_a=\lambda_{b}\circ\lambda_{b}## is even. On the other hand, if ##a## is not a square, then it is an odd power of a generator, and hence also a generator. So ##\lambda_a## is a ##(p-1)##-cycle, which is odd.
 
  • #92
Infrared said:
I don't know if @fresh_42 didn't want the fact that ##\mathbb{Z}_p^*## is cyclic to be used for problem 9, but the result follows easily from it. If ##t## is a generator, and ##a\in\mathbb{Z}_p^*## is a square, then ##a=t^{2k}## for some integer ##k##, so ##\lambda_a=\lambda_{t^k}\circ\lambda_{t^k}## is even. On the other hand, if ##a=t^{2k+1}##, then ##a## is also a generator, so ##\lambda_a## is a ##(p-1)##-cycle and hence odd.
It does no harm to give an elementary proof.
 
  • #93
QuantumQuest said:
Well done @etotheipi. It's a very nice shortcut way for this particular case. However, there is a more general way we can solve problems like this. I won't tell what is it but if anyone else wants to try the problem and come up with a correct solution, he / she'll also get credit.

Can you explain what you mean by problems like this? ##A## is a specific matrix, which is its own inverse. What sort of generalisation are you looking for?
 
  • #94
PeroK said:
Can you explain what you mean by problems like this? ##A## is a specific matrix, which is its own inverse. What sort of generalisation are you looking for?

Well, there is some specific procedure we can follow, a relation coming out of it and then the classic way we treat problems like this.
 
Last edited:
  • #95
QuantumQuest said:
Well, there is some specific procedure we can follow, a relation coming out of it and then the classic way we treat problems like this.
That's all very well, but I have no idea what the question is asking! Problems like what? Perhaps no one has solved it because no one knows what you're driving at.
 
  • #96
In question 1) is the power of n inside the cosine or outside it?
 
  • #97
PeroK said:
That's all very well, but I have no idea what the question is asking! Problems like what? Perhaps no one has solved it because no one knows what you're driving at.

I'm not saying anything cryptic. When I say problems like this I literally mean it: problems that ask to calculate some power(s) of a matrix. So, in standard curricula for mathematicians, there is a standard way to treat problems like this - the solution given so far in the challenge is absolutely acceptable but there is some more formal way to tackle the whole thing ;)
 
  • #98
skanskan said:
In question 1) is the power of n inside the cosine or outside it?
Outside
 
  • #99
Since the solution I came up with is relatively simple and ignores @fresh_42’s hint, I feel like I must be completely wrong.

Consider the operator ##\mathrm{B_1}: \mathcal{H}\rightarrow\mathcal{H}## defined by ##\mathrm{B_1}(f)=(\beta(f,\cdot))^*##. The question essentially is asking for a proof that ##\mathrm{B_1}## is a bijection.
Now, ##\beta(f,f)## bounded below implies that ##\mathrm{B_1}## is, too:
$$\beta(f,f)=\langle\mathrm{B_1}f,f\rangle\leq\|\mathrm{B_1}f\|\|f\|$$
by Cauchy-Schwarz;
$$\Rightarrow\|\mathrm{B_1}f\|\|f\|\geq C\|f\|^2\\
\Rightarrow\|\mathrm{B_1}f\|\geq C\|f\|; \|f\|\neq 0.$$
When ##\|f\|=0## the final inequality is trivially true. Additionally, the same can be applied to the map ##\mathrm{B}_2(f)=(\beta(\cdot,f))^*=\mathrm{B}_1^*(f)##.

Now, we use the following fact: a continuous linear map is bounded below iff it is injective with closed range.
This means that ##\mathrm{B_1}## and ##\mathrm{B_2}## are injective with closed range. By the closed range theorem, ##\mathrm{im(B_1)=ker(B_1^*)^\perp=ker(B_2)^\perp}##. But since ##\mathrm{B_2}## is injective, ##\mathrm{ker(B_2)}## is trivial, and so ##\mathrm{im(B_1)}=\mathcal{H}##. So ##\mathrm{B_1}## is injective and surjective, proving the original result.

I tried to use the hint, and was able to apply the fixed point theorem for ##\lambda\in(1,2)##, but didn’t know what to do after that.
 
Last edited:
  • #100
suremarc said:
Since the solution I came up with is relatively simple and ignores @fresh_42’s hint, I feel like I must be completely wrong.

Consider the operator ##\mathrm{B_1}: \mathcal{H}\rightarrow\mathcal{H}## defined by ##\mathrm{B_1}(f)=(\beta(f,\cdot))^*##. The question essentially is asking for a proof that ##\mathrm{B_1}## is a bijection.
Now, ##\beta(f,f)## bounded below implies that ##\mathrm{B_1}## is, too:
$$\beta(f,f)=\langle\mathrm{B_1}f,f\rangle\leq\|\mathrm{B_1}f\|\|f\|$$
by Cauchy-Schwarz;
$$\Rightarrow\|\mathrm{B_1}f\|\|f\|\geq C\|f\|^2\\
\Rightarrow\|\mathrm{B_1}f\|\geq C\|f\|; \|f\|\neq 0.$$
When ##\|f\|=0## the final inequality is trivially true. Additionally, the same can be applied to the map ##\mathrm{B}_2(f)=(\beta(\cdot,f))^*=\mathrm{B}_1^*(f)##.

Now, we use the following fact: a continuous linear map is bounded below iff it is injective with closed range.
This means that ##\mathrm{B_1}## and ##\mathrm{B_2}## are injective with closed range. By the closed range theorem, ##\mathrm{im(B_1)=ker(B_1^*)^\perp=ker(B_2)^\perp}##. But since ##\mathrm{B_2}## is injective, ##\mathrm{ker(B_2)}## is trivial, and so ##\mathrm{im(B_1)}=\mathcal{H}##. So ##\mathrm{B_1}## is injective and surjective, proving the original result.

I tried to use the hint, and was able to apply the fixed point theorem for ##\lambda\in(1,2)##, but didn’t know what to do after that.
I'm a bit skeptical about your use of duality. E.g. why is ##\operatorname{im}B_1 =\mathcal{H}\,?## Didn't you use ##\mathcal{H}=\left(\mathcal{H}^*\right)^*## with ##\beta## instead of the inner product? In other words: why does ##B_1## land in the space you claim it does? You have only added an asterix.

I know that boundedness and continuity is equivalent. Can you reference the theorem you used to make the range closed, not only dense? I think this is the crucial point. My proof does those things in smaller steps by using Banach to actually define the specific image ##f^\dagger## we are looking for.

The whole theorem is a generalization of Riesz, so we have to be careful with duality, i.e. rigorously separate the inner product and the given bilinear form. The simple notation with an asterix needs a bit of an explanation.
 
  • #101
@fresh_42 I think I have the solution using the hint. That hint is quite a good idea, it's probably useful to remember in general.
First, we use Riesz representation theorem to define a mapping ##T: \mathcal{H}^* \rightarrow \mathcal{H}##, which defines the mapping from the dual space of linear functionals to Hilbert space:
$$F(x) = (T(F),x)$$
for every ##x \in \mathcal{H}##, where ##(\cdot,\cdot)## is the inner product. The theorem itself states that the vector ##T(F)## exists and is unique, thus defining the mapping above. Taking into account that inner product is linear(in a real Hilbert space), we find that ##T## is also linear, namely:
$$(T(F+G),x) = (F+G)(x) = F(x)+G(x) = (T(F)+T(G),x)$$
Also, we define the mapping ##B##:
$$B(f)(h) = \beta(f,h)$$
which is possible for continuous bilinear forms. ##B## is linear by definition.
Now we induce the norm on our Hilbert space from the inner product, and metric from the norm naturally:
$$\lVert x \rVert_\mathcal{H} \equiv \sqrt{(x,x)} \qquad d_\mathcal{H}(x,y) \equiv \lVert x-y\rVert_\mathcal{H}$$
We define the mapping which was hinted(there's a typo in the hint though, I assume):
$$Q(f) \equiv f -\lambda(T(B(f)) - T(F))$$
where ##\lambda## is a free parameter.
If it is possible to find ##\lambda## such that this function has a fixed point ##x##, then we have for that point:
$$Q(x) = x \Leftrightarrow T(B(x)) = T(F)$$
and this would be equivalent to the solution of our exercise, since by Riesz we certainly have unique vectors ##T(B(f))## and ##T(F)## such that, for some ##f \in \mathcal{H}##:
$$(\forall h \in \mathcal{H})( F(h) = (T(F),h) \wedge \beta(f,h) = (T(B(f)),h))$$
but the above then proves that we have a unique vector ##f## such that ##\beta(f,h) = F(h)## for all ##h \in \mathcal{H}##. hence it proves the theorem.

So, we aim to prove that ##Q(f)## has a unique fixed point, which we will prove using Banach fixed point theorem. We first want ##Q(f)## to be a contraction.
A contraction is a function ##f## on a metric space, such that:
$$d(f(x),f(y)) \leq q d(x,y)$$
for every two elements ##x## and ##y## and for some number ##q \in [0,1)##.
Using the naturally induced norm, we have:
$$\lVert Q(f) - Q(g)\rVert^2 = \lVert f -g + \lambda(T(B(f-g)) \rVert^2 \stackrel{f-g=u}{=} \lVert u-\lambda T(B(u)) \rVert^2 = (u-\lambda T(B(u)),u-\lambda T(B(u))) = \lVert u\rVert^2 - 2\lambda(T(B(u)),u) + \lambda^2\lVert T(B(u))\rVert^2$$
At this point, we use the coercivity of ##\beta##, keeping in mind that we have:
$$\beta(f,h) = (T(B(f)),h) \wedge \beta(f,f) \geq C\lVert f\rVert^2$$
It follows that:
$$(T(B(u),u) = \beta(u,u) \geq C\lVert u\rVert^2$$
$$\lVert T(B(u))\rVert^2 \stackrel{Riesz}{=} \lVert B(u)\rVert^2 \leq M\lVert u\rVert^2$$
where ##M## is a positive number in the definition of boundedness of ##\beta##(since ##\beta## is continuous).
Substituting, we obtain the inequality:
$$\lVert Q(f) - Q(g)\rVert^2 \leq \lVert f-g\rVert^2 -2\lambda C\lVert f-g\rVert^2 +\lambda^2M^2\lVert f-g\rVert^2 = (1-2\lambda C + \lambda^2M^2) \lVert f-g\rVert^2$$
Now all that is left is to prove that we can pick ##\lambda##, such that the above coefficient ##(1-2\lambda C+\lambda^2M^2) \in (0,1)##. We keep in mind that trivially ##C\leq M##(because ##C\lVert u\rVert^2 \leq \beta(u,u) \leq M\lVert u\rVert^2## for any ##u## by definition).
We consider the following inequalities:
$$1 - 2\lambda C + \lambda^2M^2 <1 \Leftrightarrow 0<\lambda < \frac{2C}{M^2}$$
$$C<M \Rightarrow 1 - 2\lambda C + \lambda^2M^2> 0 $$
From this we see that the acceptable choice of ##\lambda##, would be ##0<\lambda<\frac{2C}{M^2}##. This value asserts that ##Q(f)## is a contraction, hence by Banach fixed point theorem, there is a unique fixed point of ##Q(f)##, which implies existence and uniqueness of ##f \in \mathcal{H}## such that for all ##h \in \mathcal{H}##:
$$F(h) = \beta(f,h)$$
as we have shown above when we defined ##Q##. This finishes our proof. Banach's theorem even gives algorithm of how we would arrive at this vector(as a limit of infinite power of ##Q## on an arbitrary vector ##f## from our Hilbert space, that is as: ##\lim_{n\rightarrow \infty} Q(Q(\dots Q(f)))## where ##Q## is applied in a composition ##n## times).
 
Last edited:
  • #102
I guess it's time to give both of you - @suremarc and @Antarres - the credit for solving the problem, since you both have had a correct strategy. Since this problem - Lemma of Babuška-Lax-Milgram - is a bit confusing, I'll add my proof. There was nothing wrong with either of yours, but for all other readers who might have had difficulties to follow:

If we define a continuous function ##B(f)(g):=\beta(f,g)## then Riesz' representation theorem gives us an isometric isomorphism ##T\, : \,\mathcal{H}^* \longrightarrow \mathcal{H}## such that for every ##B(f)\in \mathcal{H}^*## there is a unique ##T(B(f))## such that ##\|B(f)\|=\|T(B(f))\|## and
$$
B(f)(g)=\langle T(B(f)) ,g \rangle_\mathcal{H} = \beta(f,g)\quad \forall \,\, g\in \mathcal{H} \quad (*)
$$
or generally ##f^*(g)=\langle T(f^*),g \rangle_\mathcal{H}\,\, \forall \,\, g\in \mathcal{H} \quad (*)##

The functionals ##B(f)## are bounded since ##\beta## is continuous, i.e. ##\|B\|## is a finite real number. We get from our lower bound
\begin{align*}
C\|f\|^2&\leq |\beta(f,f)|=\langle T(B(f)) ,f \rangle_\mathcal{H}\\
&\leq \|T(B(f))\|\cdot\|f\|= \|B(f)\|\cdot\|f\| \leq \|B\|\cdot \|f\|^2
\end{align*}
hence ##0 < \dfrac{C}{\|B\|} \leq 1.## We now define the function
$$
Q(f):=f-k\cdot \left( T(B(f)) - T(F) \right)
$$
on ##\mathcal{H}## with a real number ##k\in \mathbb{R}-\{0\}.## A vector ##f^\dagger \in \mathcal{H}## is a fixed point of ##Q## iff ##T(B(f^\dagger)) - T(F)=0\,.## In general we have for all ##g\in \mathcal{H}##
\begin{align*}
T(B(f)) - T(F) \stackrel{(**)}{=} 0 &\Longleftrightarrow F(g)\stackrel{(**)}{=}B(f)(g) =\beta(f ,g)\stackrel{(*)}{=}\langle T(B(f)) ,g \rangle_\mathcal{H}\\
&\Longleftrightarrow F(g) \stackrel{(*)}{=} \langle T(F),g \rangle_\mathcal{H} \stackrel{(**)}{=} \langle T(B(f)) ,g \rangle_\mathcal{H}\\
&\Longleftrightarrow \langle T(B(f))-T(F),g\rangle_\mathcal{H} \stackrel{(**)}{=} 0
\end{align*}
again by Riesz' representation theorem and the equations above. As ##g\in \mathcal{H}## is arbitrary, we may set ##g:=T(B(f^\dagger ))-T(F)## for a fixed point of ##Q## and get ##\|T(B(f^\dagger ))-T(F)\|^2=0## hence
$$
B(f^\dagger)=\beta(f^\dagger,-)=F
$$
which has to be shown. Thus all what's left to show is, that such a unique fixed point ##f^\dagger## of ##Q## exists, which we will prove with Banach's fixed point theorem.
\begin{align*}
\|Q(f)-Q(g)\|^2 &= \| f-k(T(B(f))-T(F))-g+k(T(B(g))-T(F)) \|^2\\
&= \langle (f-g)-kT(B(f-g)),(f-g)-kT(B(f-g)) \rangle_\mathcal{H}\\
&\stackrel{(*)}{=}l, \|f-g\|^2 -2k\, \langle T(B(f-g)) , f-g\rangle_\mathcal{H} +k^2\,\|T(B(f-g))\|^2 \\
&\stackrel{(*)}{=} \|f-g\|^2 -2k \,\beta(f-g,f-g) +k^2\, \|B(f-g)\|^2 \\
&\leq \|f-g\|^2 -2k\, C\,\|f-g\|^2 + k^2\, \|B\|^2\,\|f-g\|^2 \\
&= \|f-g\|^2 \left( 1-2k\,C+k^2\,\|B\|^2 \right) \\
&\stackrel{\text{set }k:=C/\|B\|^2}{=} \|f-g\|^2 \left(1 - \dfrac{C^2}{\|B\|^2}\right)
\end{align*}
We have seen that ##\dfrac{C}{\|B\|} \in (0,1]## hence ##q:=1 - \dfrac{C^2}{\|B\|^2} \in [0,1)## and ##\|Q(f)-Q(g)\|^2=q\,\|f-g\|^2## and the statement follows from Banach's fixed point theorem.
 
Last edited:
  • Like
Likes Antarres and suremarc
  • #103
archaic said:
$$\begin{align*}
\lim_{n\to\infty}\cos\left(\frac{t}{\sqrt{n}}\right)^n&=\lim_{n\to\infty}e^{n\ln\left(\cos\left(\frac{t}{\sqrt{n}}\right)\right)}\\
&=\lim_{n\to\infty}e^{n\ln\left(1-\frac{t^2}{2n}+o(\frac{1}{n})\right)}\\
&=\lim_{n\to\infty}e^{n\left(-\frac{t^2}{2n}+o(\frac{1}{n})\right)}\\
&=e^{-\frac{t^2}{2}}
\end{align*}$$
Tried to PM the solver but couldn't. I have some questions about the steps here, particularly how the cosine was changed into the algebraic expression?
 
  • Like
Likes archaic
  • #104
Hsopitalist said:
Tried to PM the solver but couldn't. I have some questions about the steps here, particularly how the cosine was changed into the algebraic expression?
https://en.wikipedia.org/wiki/Taylor_series
 
  • #105
Oh man, thank you!
 
  • Like
Likes archaic

Similar threads

2
Replies
61
Views
11K
2
Replies
42
Views
8K
2
Replies
64
Views
14K
3
Replies
80
Views
7K
2
Replies
61
Views
9K
Replies
33
Views
8K
3
Replies
100
Views
9K
2
Replies
67
Views
9K
3
Replies
102
Views
9K
3
Replies
93
Views
12K
Back
Top