- #1
Master1022
- 611
- 117
- Homework Statement
- Given the semidefinite primal program below, find its dual
- Relevant Equations
- Inner product
Hi,
I was working through the following optimization problem, and am getting stuck on how to get to the dual problem that is being presented.
Question:
Find the dual problem for the semidefinite primal problem below:
[tex] min_{X} tr(C^T X) [/tex]
[tex] \text{subject to} AX = B [/tex]
[tex] X \succeq 0 [/tex]
(the answer is given as:
[tex] max_{Z} -tr(B^T X) [/tex]
[tex] \text{subject to} A^T Z + C \succeq 0 [/tex]
Attempt:
I need to start by re-writing the constraints in the required form:
[tex] AX = B \rightarrow AX - B = 0 [/tex]
[tex] X \succeq 0 \rightarrow - X \preceq 0 [/tex]
Now we can write the Lagrangian function with some parameters ##Z## and ##Y##:
[tex] L = tr(C^T X) + \langle AX - B, Z \rangle + \langle -X, Y \rangle [/tex]
where ## \langle P, Q \rangle = tr(P^T Q) ## for matrix constraints
[tex] L = tr(C^T X) + \langle AX, Z \rangle - \langle B, Z \rangle - \langle X, Y \rangle [/tex]
[tex] L = tr(C^T X) + tr((AX)^T Z) - tr(B^T Z ) - tr(X^T Y) \rightarrow tr \left( C^T X + X^T A^T Z - X^T Y \right) - tr(B^T Z) [/tex]
Then I can use the fact that: ## tr(A^T B) = tr(B^T A) ## and can write ## tr(C^T X) = tr(X^T C) ##
[tex] L = tr \left( X^T (C + A^T Z - Y) \right) - tr(B^T Z) [/tex]
This is one point where I don't fully understand the logic. Now I want to 'remove' dependence on ##X## from the above expression by thinking about the value of ## C + A^T Z - Y ##. If ## C + A^T Z - Y \neq 0 ##, then we could pick ## X ## to be something to make the minimum ## -\infty ## (although I don't fully understand why this isn't ideal). Otherwise, if ## C + A^T Z - Y = 0 ##, then the minimum becomes: ## - tr(B^T Z) ##
This leads us to the dual problem:
[tex] max_{Z} -tr(B^T X) [/tex]
[tex] \text{subject to} A^T Z + C - Y \succeq 0 [/tex]
I am left with a few questions:
1. I don't know why there is a ##Y## in my expression (from inequality parameter), which isn't present in the answer?
2. Do I need to include the constraints on the inequality parameters?
3. (from above) why do we want to remove ##X## dependence from the expression such that min doesn't go to ##-\infty##?
Thanks in advance
I was working through the following optimization problem, and am getting stuck on how to get to the dual problem that is being presented.
Question:
Find the dual problem for the semidefinite primal problem below:
[tex] min_{X} tr(C^T X) [/tex]
[tex] \text{subject to} AX = B [/tex]
[tex] X \succeq 0 [/tex]
(the answer is given as:
[tex] max_{Z} -tr(B^T X) [/tex]
[tex] \text{subject to} A^T Z + C \succeq 0 [/tex]
Attempt:
I need to start by re-writing the constraints in the required form:
[tex] AX = B \rightarrow AX - B = 0 [/tex]
[tex] X \succeq 0 \rightarrow - X \preceq 0 [/tex]
Now we can write the Lagrangian function with some parameters ##Z## and ##Y##:
[tex] L = tr(C^T X) + \langle AX - B, Z \rangle + \langle -X, Y \rangle [/tex]
where ## \langle P, Q \rangle = tr(P^T Q) ## for matrix constraints
[tex] L = tr(C^T X) + \langle AX, Z \rangle - \langle B, Z \rangle - \langle X, Y \rangle [/tex]
[tex] L = tr(C^T X) + tr((AX)^T Z) - tr(B^T Z ) - tr(X^T Y) \rightarrow tr \left( C^T X + X^T A^T Z - X^T Y \right) - tr(B^T Z) [/tex]
Then I can use the fact that: ## tr(A^T B) = tr(B^T A) ## and can write ## tr(C^T X) = tr(X^T C) ##
[tex] L = tr \left( X^T (C + A^T Z - Y) \right) - tr(B^T Z) [/tex]
This is one point where I don't fully understand the logic. Now I want to 'remove' dependence on ##X## from the above expression by thinking about the value of ## C + A^T Z - Y ##. If ## C + A^T Z - Y \neq 0 ##, then we could pick ## X ## to be something to make the minimum ## -\infty ## (although I don't fully understand why this isn't ideal). Otherwise, if ## C + A^T Z - Y = 0 ##, then the minimum becomes: ## - tr(B^T Z) ##
This leads us to the dual problem:
[tex] max_{Z} -tr(B^T X) [/tex]
[tex] \text{subject to} A^T Z + C - Y \succeq 0 [/tex]
I am left with a few questions:
1. I don't know why there is a ##Y## in my expression (from inequality parameter), which isn't present in the answer?
2. Do I need to include the constraints on the inequality parameters?
3. (from above) why do we want to remove ##X## dependence from the expression such that min doesn't go to ##-\infty##?
Thanks in advance