Solving Problem 2.4 in Ballentine: Nonnegativeness Derivation

In summary, the conversation discusses solving Problem 2.4 in Ballentine and the attempt to use an orthonormal basis to represent a 2x2 state operator. The conversation also explores the use of eigenvalues to solve the problem and why the method does not work for higher dimensional matrices. The main point is that for dim##V > 2,## there could be one or more negative eigenvalues, which violates Ballentine's equation (2.12).
  • #1
EE18
112
13
Misplaced Homework Thread
I am trying to solve Problem 2.4 in Ballentine:
Screen Shot 2023-03-07 at 10.41.08 AM.png

I note in my attempt below to what (2.6) and (2.7) refer.

My attempt thus far is as follows:
A ##2 \times 2## state operator can be represented in a particular orthonormal ##\beta = \{\phi_i\}## as below, where we have enforced trace normalization (2.6) and self-adjointness (2.7) (and have yet to enforce nonnegativeness),
$$[\rho]_{\beta} = \begin{bmatrix}
a & b \\
b^* & (1-a)
\end{bmatrix}$$
Now enforcing ##Tr{\rho^2}## and using the basis independence of the trace we obtain
$$Tr{\rho^2} = Tr{[\rho]_{\beta}^2} = Tr{ \begin{bmatrix}
a & b \\
b^* & (1-a)
\end{bmatrix}^2} = a^2 +2|b|^2+ (1-a)^2 \leq 1$$
with ##a \in \mathbb{R}##.

Now for an arbitrary ##u## in our space we may expand ##u = \sum_i c_i {\phi_i}## so we can immediately compute
$$(u,\rho u) = \begin{bmatrix}
c_1^* & c_2^*
\end{bmatrix}\begin{bmatrix}
a & b \\
b^* & (1-a)
\end{bmatrix}\begin{bmatrix}
c_1 \\ c_2
\end{bmatrix} = \begin{bmatrix}
c_1^* & c_2^*
\end{bmatrix}
\begin{bmatrix}
ac_1 + bc_2 \\ b^*c_1+(1-a)c_2 \end{bmatrix}$$
$$=c_1^*(ac_1 + bc_2)+ c_2^*(b^*c_1+(1-a)c_2) = |{c_1}|^2a +2\textrm{Re}(c_1^*c_2b) + (1-a)^2|{c_2}|^2$$
but I can't seem to see how to go further here. It seems like I have to use my aforementioned inequality but I can't see how. Any help would be greatly appreciated.
 
Physics news on Phys.org
  • #2
For this type of problem, it's often more efficient to work with the eigenvalues directly, rather than a generic matrix. For this problem, we are given that:
$$Tr(\rho^2) ~\le 1 ~,~~~~ Tr(\rho) ~=~ 1 ~,~~~~ \rho = \rho^\dagger ~.$$For the 2D case, there are 2 eigenvalues, ##\rho_1## and ##\rho_2##, say, hence the eigenvalues of ##\rho^2## are the squares of these.

Self-adjointness of ##\rho## implies both the ##\rho_i## are real, hence ##\,\rho_i^2 \ge 0##.

The trace of a matrix is the sum of its eigenvalues, so we have 2 conditions:
$$\rho_1 + \rho_2 ~=~ 1 ~,~~~~ \rho^2_1 + \rho^2_2 ~\le~ 1 ~.$$Squaring the 1st equation gives $$\rho_1^2 + \rho_2^2 + 2 \rho_1 \rho_2 ~=~ 1 ~,$$and using this in conjunction with the 2nd equation implies... what?

I leave it to you to figure out the rest of the proof, including the follow-on of why it doesn't work for higher dimensional matrices. :oldbiggrin:
 
  • Like
Likes DrClaude, vanhees71 and EE18
  • #3
strangerep said:
For this type of problem, it's often more efficient to work with the eigenvalues directly, rather than a generic matrix. For this problem, we are given that:
$$Tr(\rho^2) ~\le 1 ~,~~~~ Tr(\rho) ~=~ 1 ~,~~~~ \rho = \rho^\dagger ~.$$For the 2D case, there are 2 eigenvalues, ##\rho_1## and ##\rho_2##, say, hence the eigenvalues of ##\rho^2## are the squares of these.

Self-adjointness of ##\rho## implies both the ##\rho_i## are real, hence ##\,\rho_i^2 \ge 0##.

The trace of a matrix is the sum of its eigenvalues, so we have 2 conditions:
$$\rho_1 + \rho_2 ~=~ 1 ~,~~~~ \rho^2_1 + \rho^2_2 ~\le~ 1 ~.$$Squaring the 1st equation gives $$\rho_1^2 + \rho_2^2 + 2 \rho_1 \rho_2 ~=~ 1 ~,$$and using this in conjunction with the 2nd equation implies... what?

I leave it to you to figure out the rest of the proof, including the follow-on of why it doesn't work for higher dimensional matrices. :oldbiggrin:
Thank you so much for the detailed response. If possible, I came up with a demonstration of why it doesn't work for ##\dim V > 2## but it's really ugly:

In the case of 3 or more dimensions (for arbitrary dimension consider a state operator with 3 nonzero eigenvalues) we see that we can follow the proof up to the point ##Tr{\rho^2} = \rho_1^2+ \rho_2^2 + \rho_3^2 \leq 1 = \rho_1^2+ \rho_2^2 \rho_3^2 +2\rho_1 \rho_2 +2\rho_1 \rho_3 +2\rho_3 \rho_2## which implies ##\rho_1 \rho_2 +\rho_1 \rho_3 +\rho_3 \rho_2 \geq 0##. We can imagine obeying this constraint with one negative eigenvalue and two positive eigenvalues such that the positive eigenvalues "outweigh" the negative. Consider ##\rho_1 = \rho_2 -1/10 = 1/2## and ##\rho_3 = -1/10##. Then we have ##\rho_1 \rho_2 +\rho_1 \rho_3 +\rho_3 \rho_2 \geq 0## and ##Tr{\rho} = 1##. If we then take the eigenvector corresponding to that negative eigenvalue we see that the expectation value is negative.

Would you be able to suggest a nicer proof? Thank you again!
 
  • #4
You only need to recognize that, for dim##V > 2,## there could be 1 or more negative eigenvalues while still satisfying the input constraints. That alone is enough to violate Ballentine's eq(2.12), i.e., that ##\rho_n \ge 0## for all ##n##.
 
  • Like
Likes vanhees71

FAQ: Solving Problem 2.4 in Ballentine: Nonnegativeness Derivation

What is the main goal of Problem 2.4 in Ballentine?

The main goal of Problem 2.4 in Ballentine is to demonstrate the nonnegativeness of a certain operator or expression within the context of quantum mechanics. This typically involves proving that the expectation value of a specific operator is always greater than or equal to zero, ensuring that it represents a physically meaningful quantity.

What are the key steps involved in deriving the nonnegativeness in Problem 2.4?

The key steps generally involve expressing the operator in a form that allows for the application of mathematical inequalities, such as the Cauchy-Schwarz inequality. This often includes rewriting the operator in terms of its eigenvalues and eigenvectors, and then showing that these components satisfy the required nonnegative conditions.

Why is nonnegativeness important in quantum mechanics?

Nonnegativeness is crucial in quantum mechanics because it ensures that certain physical quantities, such as probabilities and expectation values of observables, remain physically meaningful. For example, probabilities must be nonnegative and sum to one, and the expectation value of an observable must reflect the actual possible outcomes of measurements.

What mathematical tools are commonly used in solving Problem 2.4?

Common mathematical tools include linear algebra techniques, such as eigenvalue decomposition, and inequalities like the Cauchy-Schwarz inequality. Additionally, concepts from functional analysis and operator theory may be employed to rigorously demonstrate the nonnegativeness of the operator in question.

Can you provide a specific example or application of the nonnegativeness derivation from Problem 2.4?

A specific example might involve the derivation of the nonnegativeness of the density matrix in quantum mechanics. This ensures that the density matrix, which represents the statistical state of a quantum system, yields probabilities that are always nonnegative, thereby maintaining the consistency and physical interpretability of the quantum system's description.

Back
Top