In the case of quadratic programming, the theory developed is applicable to a class of ... program. A final application of the gauge duality theory is to linear programming, where the gauge ...... arithmetic and the geometric mean. Let s, t, and u ...
Dual Gauge Programs, with Applications to Quadratic Programming and The Minimum-Norm Problem by Robert M. Freund Working Paper #1726-85
Revised August 1986
Abstract A gauge function f(.) positively homogeneous and
is a nonnegative convex function that satisfies f(O)=O.
are specific instances of a gauge function.
Norms and pseudonorms This paper presents a
gauge duality theory for a gauge program, which minimizing the value of a gauge function f(.)
is
is the problem of
over a convex set.
The gauge dual program is also a gauge program, unlike the standard Lagrange dual.
We present sufficient conditions on f(-)
the existence of optimal
solutions to the gauge
dual, with no duality gap.
qualifications on the constraints.
The
gauge dual
programs,
program is
than the standard dual,
Keywords:
and
independent of any
to the minimum lp norm problem.
shown to provide a smaller duality gap
in a certain sense discussed
in the text.
Gauge function, norm, quadratic program, Lagrange dual,
Running Header: , , l 11,
its
The theory is applied to a class
duality.
_ll,
program and
These sufficient conditions are
relatively weak and are easy to verify, and are
of convex quadratic
that ensure
Dual Gauge Programs.
Introduction A gauge function
function f(.):RnRu{(+
=}
is a nonnegative
convex
that is positively homogeneous and satisfies
f(O)=O.
and pseudonorms are specific instances of a gauge function. program
is
defined as an optimization
P:
f(.)
is
the form
quadratic
to Mx > b
a gauge
programming fall
function.
Many problems
into this category,
programming,
in mathematical
including strictly convex
linear programming,
problem on a polyhedron (see Luenberger
and the minimum norm
[9]).
Duality for programs
similar to P have been studied by Eisenberg [2], recently been
generalized by Gwinner
instances where explicit Lagrange programs
A gauge
minimize f(x) subject
where
problem of
Norms
like
P can be stated,
[7].
whose work has most
Glassey [6]
has examined
duals of convex homogeneous
without
reference to primal
variables. This
paper presents a gauge duality theory for gauge programs
that contrasts, but particular,
is related to,
the gauge dual D of P
the Lagrange dual
of P.
In
is also a gauge program, unlike
its
Lagrange dual.
are
feasible values of the primal and dual objective functions,
z.v > D.
The gauge duality theory states that if
z and v then
1, with equality only if z and v are optimal values of P and
This inequality is analagous to the weak duality relationship z
> v for the Lagrange dual.
We present sufficient
that ensure that optimal
solutions
and that z-v=1
solutions.
for these
1
conditions on f(.)
to the dual gauge programs exist These sufficient conditions are
III
relatively weak and are
easy to verify.
They are independent of any
qualification on the constraints of the gauge program, unlike the Slater condition In the case applicable to than the dual
for Lagrange duality, for example. of quadratic
a class
programming, the theory developed
of quadratic
programs that
class of strictly convex quadratic
is equivalent
program different
is slightly broader
programs.
The gauge
(by a monotone transformation) to a quadratic from the
Lagrange dual.
Another application of the gauge duality theory is problem of minimizing the The gauge dual certain
is
is shown
lp norm of a vector over a polyhedron.
to provide a smaller duality gap
sense discussed in the text)
hence provides a better for feasible values of
to the
than the standard dual,
lower bound on the primal the dual,
(in a
than does
and
objective value,
the standard dual
program. A final application of the gauge duality theory programming, where the gauge dual
is different
is to linear
(but equivalent to)
the standard linear programming dual. In order 1 reviews
to lay the groundwork for the ensuing theory,
basic polarity properties
of gauge functions.
presents the gauge duality theory, which theorem, and necessary conditions valid.
Section
Section 2
includes a weak duality
for a strong duality theorem to
be
The duality theory of Section 2 is generalized to gauge
programs with nonlinear constraints
in Section 3.
Section 2 is applied to selected mathematical Section 4. programming,
This
section first
The theory of
programming problem in
discusses convex quadratic
followed by a duality analysis of the minimum
problem, for which strictly convex quadratic programming
2
lp norm
is a
special case.
The discussion shows that the duality gap for the
gauge dual is in a certain sense smaller than that of the standard dual.
Section 4 concludes with an analysis of linear programming in
the context of gauge duality theory.
3
III
1.
Preliminaries Let R denote
the set
numbers and
real
positively homogeneous
f(ax)
(i.e.,
= af(x)
and pseudonorms are gauge functions.
A
let R=Ru{(+}.
convex, nonnegative,
is
f(-)
f(.):Rn-R is called a gauge if
function
Norms
of
for a > 0),
and f(O)=O.
A gauge function need
not be symmetric and can take on the value +,
unlike a pseudonorm
or a norm. An example
of a gauge function
f(x)=f(xl,
x2,
( Note
that f(x)
that
f(x)
= 0
x3)
=
all x =
if 2x1-x2-x3 =
1 (Xl-x3),(xl-x2)12
(+
otherwise. the plane
is finite only on for
is
In
(a,a,a).
{xER 3 12xl-x 2 -x
this example,
3
f(.)
= 0),
and
is
symmetric. For any convex set C c by C
=
{yeRnlyTx
1 for all
containing the origin,
the polar of C,
Rn,
xC}.
is a gauge
function,
and
is a closed convex set
C
(see Rockafellar
[12],
p.
If
121).
if C is defined by
C = then f(-)
is defined
= C if and only if C is a closed
and C
convex set containing the origin f(.)
denoted C,
f(x)
{(xRnl
(1)
< 1)
can be represented by f(x)
where by convention, closed function and only if
(i.e.,
the origin, function,
({
Furthermore, f(.)
level sets of f(.)
Also,
the function
(2)
> OxEuC).
= +=O.
inf
all of the
C is a closed set.
that contains closed gauge
we denote
= inf
are closed)
for any closed convex set C f(.)
defined by
called the closed gauge function
corresponding to C.
4
is a
(2)
is a
if
For any gauge function =
fo(y)
f(.),
inf
Then f(.)
is a closed gauge.
(ylfo(y)
1),
olyTx
< vf(x) for all x).
If C =
1},
(xlf(x)
whereby f(.) is closed,
if f(.)
is
polar function f(.) by
closed,
since
then fOO(x) =
C
then C
(3) =
is closed.
f(x),
because C ° O
=
C.
The following summarizes the above statements: Remark
1 (see Rockafellar
f(.)ofO(.)
[12],
p.
129).
induces a one-to-one symmetric correspondence in the
of all closed gauges on R n .
class
The polarity operation
the origin are polar to functions are polar
Two closed convex sets containing
each other
if and only if
their gauge
X]
to each other.
We also have: Remark
2.
If f(.)
is a closed gauge function,
then f(.)
and f(.)
can be written as: f(x) =
sup yTx yECO
and
In the case when f(x) IX{{q,
=
fO(y)
that yTx < Holder
where 1/p +
=
fO(y)
xHIp,
1/q =
the lp norm,
1
p
defined by x E Rn and w
defined by f(x,w)
= g(x)
and
fo(y,z) where y E Rn and z
go(y)
if
z =
0
+*
if
z
0
=
Rm.
6
is
E
R,
a gauge
then
2.
Dual Gauge Programs Consider
the following nonlinear program:
P:
to
subject where f(.)
is
z = f(x)
minimize
Mx > b The Lagrange dual of P
a closed gauge function.
is
formulated as
{f(x) -
supremum {infimum >
XT(Mx-b))),
x
to
which can be simplified
-
supremum {bTX X > O
f*(MTX))},
or LD:
maximize
v = bTX
subject to
fo(MTX)
0 Dual
programs for classes of programs
developed by Eisenberg how to construct problems.
All
[2]
explicit duals
[7].
Glassey
[6] has
shown
for such
problem for which
is convex and positively homogeneous
need not be nonnegative,
to be finite-valued).
P have been
(with no primal variables)
three authors work with a primal
the objective function f(.) (f(.)
and Gwinner
that include
as in our
primal, but
is restricted
When applied to a gauge program P, however,
the dual programs that each author develops is the program LD above. function with the polar
If we exchange the objective constraint in LD, D:
we obtain a dual gauge program: minimize subject
v =
fo(MTX)
to bTX =
1
X > 0
7
gauge
III
Together, Note
the pair
that both the primal
problems,
and
to primal appears
(P) and dual
programs,
(D) are minimization
their objective functions
are polar gauge
functions.
variables X are restricted to be nonnegative and correspond
The dual
side
P and D constitute dual gauge
constraints.
in the dual
(RHS) of
in
The constraint matrix M in
the primal
the objective function, and the right-hand
the primal appears
in the equality constraint in the
dual. The definition of other
types
constraint
of of
the gauge dual program D can be extended to
linear constraints P is Mix
= ()
bi,
in the then in
ith variable Xi to be unrestricted
standard way. the dual D,
in sign
(less
If
the
ith
we require
than or equal
the to
zero.) Note
that the dual of the dual
write the dual
in
to:
y-MTX = 0 1
X > 0 = f(y).
s with constraints
To see this,
v = g(y,X)
bTX =
where g(y,X)
the primal.
the format:
minimize subject
is
Then
(i),
(i)
(x)
(ii)
(t)
(iii)
(s).
if we associate the variables x,
(ii),
(iii),
and
is: minimize
z = gO(x,bt-Mx+s)
subject to:
t = 1 s
>
0.
8
t,
and
the dual of this program
Thus
fOO(x) =
gO(x,w) =
However,
f(x) when w=O, and gO(x,w) =
+.
O.
if w
the last program can be written as z = f(x)
minimize
Mx-s =
subject to:
b
s >O is precisely the primal
which
P.
The vector of variables x if x
said to be feasible i.e.
satisfy the linear constraints,
(X)
otherwise x
(X)
= +),
(fO(MTX)
If x
infeasible.
is
then x
feasible and f(x) If P
is
(X)
Mx > b
is feasible but
(X)
for P (D)
is a strongly feasible program.
We have the following preliminary duality result for dual
gauge
programs. Theorem 1.
Let z
respectively.
and v*
respectively,
then zv >
v*
=
+~.
is essentially infeasible,
i.e.,
z
=
+o.
= O,
then D is essentially
(iii)
If v*
= O,
then P
(iv)
If x and X are feasible solutions values z and v, optimal
respectively,
solutions of P and D,
If x and X are
and
(iii)
f(x)
= zv,
and zv =
1, then x and X are
respectively.
by Remark 3.
then we have
This result shows
follow by contradiction from
9
1.
for P and D with objective
strongly feasible for P and D,
1 = XTb < XTMx < fo(MTX) (ii)
1, and hence z*v* > i.e.,
If z*
and
(D),
infeasible,
(ii)
(i),
and
strongly feasible for P and D, with objective
values z and v,
PROOF:
(P)
Then:
If x and X are
(i)
be optimal values of
(i).
(iv)
follows
III
from z.
(i)
is achieved by
bound on z*, and
by noting that 1/v is a lower
[XI (i)
Assertion value
of
corresponds to the standard duality result that the less
is
the max program
Assertions
equals min, then both are optimal.
f(x)
>
0 for any x,
the min
of
if max
(iii)
and
is
Because f(.)
= 0,
the program P
the same way that a standard
in
limit,
program would have a value of -, i.e.,
If z*
whereby z* > O.
has achieved its absolute lower
infeasible,
(ii)
in the standard theory.
correspond to unbounded cases a gauge,
to the value
corresponds to the result that
(iv)
program, and assertion
than or equal
and hence the dual program is
= +.
v*
In order to prove a strong duality theorem for the dual gauge programs their the
optimal values,
it is necessary
A convex set S c Rn satisfies
A gauge
both C and C C =
(xERnlf(x)
the projection property if all
S are closed convex sets,
transformation A:Rn4Rm, set.
to impose some qualifications on
We proceed as follows.
function f(.).
projections of
and that P and D both attain
that z*v*=l
P and D that asserts
(z
RmIz=Ax for
E
function f(.)
satisfies
i.e.,
if for any is
some xES)
a closed convex
the projection qualification
satisfy
the projection
property, where C is
< 1).
For notational
convenience,
assumed to be related by relations
linear
(1) and
(2)
f(-)
if
given by
and C will
be
of the previous section,
for the remainder of this paper. f(-)
satisfies the projection
f(.)
satisfies the projection
Note that by definition, qualification if and only if qualification.
If
C and C
are convex and compact,
the projection qualification, and hence projection qualification
for
1
1.
sets UC and X are convex and have no points in common.
There thus exists a hyperplane satisfies the closed.
if
{uIU=yTx for some yECO}.
by the projection qualification.
polyhedron such that UC n X =
PROOF:
= sup
= yTx for some yECO}
a closed interval,
qualification and
= yTx for some yCO.
finite, then f(x) We
However,
projection
that separates them.
qualification, all
Furthermore, X is polyhedron.
Because
projections
f(.)
of uC are
Consequently, according to
remark 5 in the appendix, there exists a hyperplane that separates from X and does that yTx must be
not meet UC.
< a for all xEUC,
Therefore, there exists
and yTx
a
>
[X]
We can now prove:
11 aa ·--
for all xEX.
positive, whereby by scaling we can presume
completing the proof.
-Ls(B
is a
__
UC
(y,a)E(Rn,R) such Since x=O EUC, a
it is equal
to
1,
11
Theorem 2. and
let
Assume that f(.)
z* and v*
satisfies
be the optimal
the projection qualification,
values of P and D,
respectively.
Then: (i)
If P and D are both
strongly feasible, then
z*v* =1;
and
the optimal values of P and D are achieved for some x*
(ii)
and
*.
P is
essentially infeasible and v*=O
v*=O;
is
(i.e.
z*=*)
if and only if
achieved for some feasible X*
if P is
infeasible. (iii)
D is essentially z*=O;
and
z*=O
infeasible
(i.e.
v*=o)
if and only if
is achieved for some feasible x*
if D is
infeasible. Let X = {xERnlMx > b}.
PROOF:
(i)
then 0
< z*
< +=
x
f(x)
=
E
X,
Lemma
and 0
/v*.
1, there
< +.
< v*
Assume
exists yRn
so does
convex set, since yTx y=MTX,
XTb
fO(MTX)
=
i.e.
(1/v*)C,
1.
x.
and
0 with
Thus P achieves
= f(x*)
Regarding
= l/v*.
its minimum at some (ii),
For the
the
"if"
"only if"
Then for any finite
, whereby there exists a vector yRn as described in
12
Now
and
a contradiction.
assume that P is essentially infeasible. UC
(-a,d],
shows that D achieves
establishing
the statement has been
or
is feasible for D,
its minimum at some feasible point x*, A parallel argument
(1/v*)C n X=¢,
therefore (yTxlxE(l/v*)C) is a closed
satisfy Mx > b,
Now X = X/XTb
for some
But since C satisfies the projection
a closed interval
(1/XTb) fO(y)
Then
that satisfies yTx < 1 for all
> 1 for all x that >
We must now show that
the contrary.
and yTx > 1 for all feasible property,
If P and D are strongly feasible,
part o f part, > 0,
Lemma
Let X and X be constucted as above.
1.
for D and f(MTX) < If
1/U,
P is infeasible,
exists X > 0 with MTX This completes The proof Theorem
= 0 and
(iii)
1/u for any U > 0,
i.e.
v*=O.
then by a theorem of the alternative, there
the proof of of
0,
qualification is
then UC
shown
stemming from an that
to guarantee the
(i) of Theorem 2 asserts
qualification is sufficient optimum z* where 0
< z*
The proof
x/2) and
_I
The
open separation.
The rather stronger condition that the dual
qualification is not sufficient
can
of Theorem
for a gauge program to attain
(ii)
to be sufficient in this case.
and
"open separation"
can be openly separated from X.
sufficient
Although part
z*=O.
need not be differentiable,
(see the appendix), which states
for some
2 indicates,
the feasible region
the feasible region of P.
essentially on arguments
and indirectly
of Theorem
a constraint qualification is not needed, because is polyhedral.
6]
_____
(2).
an Let Thus
its
III
we have
It
0
xO,
X2=0
+o=
x 1 70,
X2=0
+00
X2
=
f(xl,x2)
that both C and C°
the feasible
Let
< 0 {(Y1,Y2)ER2ly2
=
to derive C
is straightforward
verify
> O
X2
x2/(2x 2 )
satisfy the projection qualification.
region X of P be defined by X =
-2
= X1
which
= 1/(2x2),
/(2x2)
However,
z*=0.
Thus
spirit of
follows.
z*
and v*
(rel
there
to
(xl,x 2 )
for
condition for strong duality in P and D in is given below.
We
define X =
proceed as {xERnJAx > b),
so
so X'
is
raylike. We now must show that set that does not contain
if X is a nonempty closed, the origin,
then X"=X.
yTx > 1 for all yX', whereby xX", and Then there
exists an element z of X"
Because X is closed and (z)
strictly separates with the property then by and so
convex, from X,
Let xeX.
so X c X".
that
Suppose XX".
(y,a) E (Rn,
for all xX, and yTz < a.
yTz > 1=a, a contradiction.
This
Thus a < O.
Also yTz < a
Because OX and X is nonempty, X'
yTx
> 0 for every xX, (y + ey)Tx
Therefore
(y + ey)
(y + ey)Tz > yTz < a
is
for every e >
1 for all e > 0,
.
and so yTz
> 0,
Because
for every xX.
This
in
turn implies
that
> O,
which contradicts [X]
3 implies that the
dual of NGD
closed, and X is a closed,
contain the origin.
yX',
is nonempty.
< O.
Lemma f(-)
EX'
> 0,
for all xX.
yTx > 1 for every xX. > 1 for any
R),
Because X is raylike,
xX and all e > 1, and hence yTx > 0
Then,
If a
being the case,
yT(ex) > a for all
Let y be any element of X'.
Then
there exists a hyperplane that
rescaling we can assume that a=1.
< O.
raylike
is not contained in X.
and so there exists
that yTx > a
convex,
Of
course,
convex,
if X does
17
is precisely NGP whenever raylike
set that does not
contain the origin,
then
li
z*=O,
X'-$,
and v*=,
primal and dual,
where
z* and v*
are the optimal values of the
respectively.
The following result is analogous to Theorem 1: Theorem 1A (Weak Duality).
Let
NGP and NGD,
Then
(i)
respectively.
z* and v*
be optimal
values for
If x and y are strongly feasible for NGP and NGD, with objective values z and v, hence
respectively, the zv > 1, and
z*v* > 1.
(ii)
If z*=O,
then NGD is essentially infeasible,
(iii)
If v*=O,
then NGP
(iv)
If x and y are strongly feasible
is
essentially infeasible,
objective values
z and v,
then x and y are
optimal
i.e.
v*=+".
i.e.
z*=+o
for NGP and NGD with
respectively,
and zv =
1,
solutions of NGP and NGD,
respectively. PROOF:
If x and y are strongly feasible
1 < yTx < fo(y) f(x) = (ii),
(iii),
and
(iv)
zv,
by Remark 3.
follow from
To see how to obtain the from NGP and NGD, X =
let
a raylike
satisfying bTX
Y = (yERnly=MTX for some X > 0
(i),
The
(xeRnjMx > b}
> be for some Furthermore, > 1,
satisfying bTX
and
and
> 1}. X'
=
and define =
1),
and note that X'
linearly constrained gauge program P
is equivalent to the program P:
Define X =
> 1) = (ERnMx
raylike and contains Y.
result shows
[X]
set that contains X.
(yERnly=MTX for some X > 0
is
This
then
linearly constrained problems P and D
P be as given.
(xeRnlxEeX for some
Then X is
(i).
for NGP and NGD,
minimize f(x) subject xEX
18
because even though the feasible region of P contains the feasible region of P, every point xEX has at least as large an objective function value as a corresponding point xX.
The nonlinear gauge dual
of P is D:
minimize fO(MT\) subject to bTX > 1 >0
However, because f(.) is
homogeneous, we can restrict our .attention
to those X > 0 for which bTX = 1, obtaining the equivalent program: minimize fo(MTX)
D:
to bTX = 1
subject
>
0
linear gauge dual.
which is the
We have the following strong duality result for the nonlinear Let E = {xERnlf(x)
0,
then we can presume a = 1 by rescaling if necessary.
Then yX'
19
III
and v*
f(y) =
0, which Similarly,
0 for all XEX.
xz*C, and hence for all xC.
properly separate X from z*C.
that a > 0,
for NGD.
for every xX, there exists
turn means yTx =
that yTx =
l/z*.
is optimal
E z*C,
Thus yT(6x
X.
is attained
is attained in the primal,
If z* so
(z*/f(xO))xO
(1-6)x E
This in
< 0.
Thus H does not
xO
Because
6 > 1 such that implies yTx
C,
rel int
1 and y
=
Then ytxO > 0 because xO
= 0, because OEz*C.
because x
Also,
sup yTx < a/z* XEZ*C
(/z*)
since vz* > 1, we have z*v* =
However, If a
sup yTx = xEC
in
This contradiction the dual and
then the above proof
z*v* =
1.
is still valid,
long as z*C and X can be properly separated by a hyperplane H.
z*C and X cannot be properly separated, appendix,
there
exists XE
(rel
int
X)n(rel
int z*C and OEz*C, there exists
xE rel
whereby x
(z*/6)C, and so f(x)
then by Theorem 6 of
can be properly separated, and so v*
6 > 1 such that
is attained in
the
Because
int z*C).
a contradiction.
< z*,
If
6x + (1-6)OEz*C, Thus z*C and X
the dual,
and
z*v*=1.
A parallel primal.
argument
establishes that z*
is attained in
the
[X]
The nonlinear gauge duality theory parallels the gauge duality theory for linear constraints. there
It
is only natural
then to examine if
is a parallel duality construction that extends the Lagrange-
type dual LD to handle nonlinear constraints. and does not
contain the origin,
is closed,
NGP can be written as
f(x)
minimize subject
If X
to
yTx
> g(y)
20
for all y E cone X',
convex,
where cone X' g(y)
= max
=
{yeRnly = aw for some a > 0 and wX') and
(a > Oy = aw for
The above program
some wX').
suitable format so that Gwinner's dual
[7]
is in a
can be constructed, which
is: g(y)
maximize
GwD:
subject
yTx < f(x) for all x
to
y E cone X' Because be
if f(y)
yTx < f(x) for all x if and only
0 and
is feasible for NLD.
dual nonlinear gauge programs NGP and NGD make up a neat
theory in terms of
polar
explicit dual variables constraints
functions and antipolar sets.
However,
the
y do not directly correspond to primal
(though they do indirectly),
mention of constraints per se
and there is no formal
in either the primal
special
case of the general nonlinear theory is
theory,
instances
of which will be seen
or the dual.
One
of course the linear
in the next
section.
An open
issue regarding nonlinear gauge duality is under what circumstances can the
nonlinear gauge dual program be written
a finite number of
constraints whose
for every primal constraint?
21
variables
explicitly in terms of include a multiplier
li
4.
Applications In this section we explore a number of mathematical
models that correspond to a gauge program, programming,
problems
involving the
Many of the applications will minimize x subject where f(-) that of P.
However,
including convex quadratic
lp norm,
involve programs
and
linear programming.
of the form:
f(Nx+d)
to
is a gauge.
programming
Ax > b
Note that this format does not conform to
it
is equivalent to:
g(x,s)
minimize x,s subject to
where g(x,s)
= f(s).
Ax -Nx + Is
Furthermore,
> b = d
f(-)
qualification if and only if g(.,.)
satisfies
does.
the projection
The gauge dual
of this
program is:
minimize
g(ATX-NTU,
)
X,U
subject to
+ dT
bTX
X> But gO(y,t)
= f(t) when y=O,
U
=
and gO(y,t)
last program becomes minimize subject to
fO(u)
subject
ATX-NTU bTX+dTU
to
X>
22
1
0
0
= 0 = 1
= +
for y #
O;
thus this
A.
Convex Quadratic Programming The standard convex quadratic program QP:
minimize subject
(1/2)xTQx
is given by
+ qTx
Ax > b
to
where Q is a symmetric positive semi-definite matrix.
into the form Q = MTM for some square
the matrix Q can be factored matrix M. lies can
If M
is nonsingular
in the row space of
M,
(i.e.,
subject which
is positive definite),
then q = MTs for some vector sERn,
or
if q
and QP
(1/2)xTMTMx + sTMx to
Ax > b
is equivalent to the gauge GP:
minimize subject
where f(.)
= #'U 2 ,
program
lMx+sU 2 to
Ax > b
and so f(.)
satisfies
Note that QP and GP are equivalent identical
and their objective
transformation.
the projection qualification.
in that their
constraints are
functions differ by a strictly monotone
In examining the duality properties of QP and GP,
first study the case where Q is
the case when Q is positive Q is
Q
be written as minimize
will
Furthermore,
positive definite,
followed by
semi-definite.
Positive Definite When Q is positive definite the Lagrange dual bTX -
maximize X
of QP
is
(1/2)(XTA-qT)Q-1(ATX-q) (LQP),
subject
to
see Dorn [1].
X
> 0
The guage dual
11(MT)-IATX1{
minimize
of GP,
on the other hand,
is
2
(GD), subject
to
(bT+qTQ-lAT)X X > 0
= 1
23
we
II'
to
is equivalent
which
the quadratic program
minimize
XTAQ-1ATX
subject to
(bT+qTQ-1AT)X = 1 X >O
(D).
Note that both LQP and GD are strictly convex quadratic programs. constraints of LQP consist of the nonnegativity conditions
X > 0, whereas
the single equality constraint (bT+qTQ-1AT)X = 1.
GD also includes The
The
following theorem shows
the relationship between the gauge dual
programs GD or GD and the Lagrange dual LQP: Theorem 3. (i)
is a solution to
If
*
a)
t*
0
dual
LQP,
b) (ii)
is positive definite,
If Q
t*
GD and
t* = X*TAQ-1ATX*,
if and only if X = X*/t* solves
t
0 if and
only if
\*
then
the Lagrange
infeasible.
= 0 if and only if QP is
is a solution to LQP and t = bTX
If a)
then
+ qTQ-1ATX, then
= X/t solves
the guage
dual
GD, b)
t = 0 if and only if QP has a solution x to Ax > b satisfying Mx + s = O,
i.e.,
if and only if GD is
infeasible. PROOF:
The proof follows
Tucker conditions
The
for GD and LQP.
direct substitution. Q
from an examination of the Karush-Kuhntransformations follow from
[X]
is Positive Semi-Definite We now turn our attention to the broader case,
semi-definite and q lies some vector sRn.
in the row space
In this case,
of M,
whence q = MTs for
the Lagrange dual
24
when Q is positive
of QP
is
bTX -
maximize
(1/2)xTMTMx
',x
(LQP'),
subject
to
as in Dorn [1].
ATX -
MTMx = MTs X> O
The gauge dual of GP is
minimize X,u
null 2
subject to
-ATX + MTu = bTX
+ sTU
=
0
(GD'),
1
X > 0
which is equivalent to the quadratic program minimize
uTu
X, u subject
-ATX + MTU = 0
to
bTX
+
sTU
=
(GD')
1
X > 0
Analogous to Theorem 3, Theorem 4 demonstrates the relationship between the two different dual quadratic programs LQP' and GD'. Theorem 4.
If Q is positive semi-definite, Q = MTM and q = MTs for
some s ERn,
then
(i)
(X*,u*)
constitute an optimal solution to the gauge dual
GD' if and only if there exists x*, t* such that
(ii)
(a)
-ATX*+MTU*=O
(b)
bTX*+sTu*=l
(c)
X*
(d)
Ax* > bt*
(e)
U*=Mx*+st*
(f)
X*TAx*=X*Tbt*
> 0
t*=O if and only if QP is infeasible. only if
t$*O if and
=X*/t*, x=x*/t* constitute a solution to the
Lagrange dual LQP'.
25
III
(iii)
(X,x) constitute an optimal solution to the Lagrange dual LQP'
if and only if there exists z such that
(a)
X > 0
(b)
AT\-MTM=MTs
(c)
Az > b
(d)
MTMx=MTMz
(e)
XTAz=XTb
XTb+sTs+sTMz=o if and only if QP has a solution Ax > b,
(iv)
Mx+s=O, i.e.
if and only if GD' is infeasible.
XTb+sTs+sTMz~o if and only if (X*,U*) solves the gauge dual t*
D'
with multipliers x*,
t*
given by:
= 1/(XTb+sTs+sTMz)
X* = Xt* U*
=
x*
= zt*.
The conditions
(Mz+s)t* [X]
(i) and (iii)
of this theorem are simply the Karush-
Kuhn-Tucker conditions, and the transformations in
(ii)
and (iv)
follow from direct substitution. B.
Programs with the l If f(x)
=
xllp,
Norm
1 < p
d
(GPp)
26
f(y) =
if
The gauge dual
of GPp, derived using P and D, nBTXq
minimize
subject
is
to
CTX
=
0-
dTX
=
1
(GDq)
X > 0 Because f(x) results
of Theorem
Note
=
Hxlp satisfies the projection qualification, 2 are valid
solution s),
which is a derived equivalent of
i-I
instance of GPp, by setting
C
M
is a more general
When pl, p,
, d =
s
program than QP.
(l/p)
p
w
X,Z
The Lagrange
and p = 2.
the program GPp is equivalent to
minimize
subject
the
(when Q is positive semi-definite and q=MTs has a
can be cast as an
B
Thus GPp
for GPp and GDq.
that the program GP,
quadratic program QP
the
(LPp) to
dual of
Bw+Cz > this
d
program
is
maximize XTd-(1/q)UBTn q q subject
to
= 0
CT
(LDq)
X> 0 The gauge dual GDq and the Lagrange dual LDq bear a relationship that generalizes the case of quadratic 4.
This
programming in Theorems 3 and
relationship is demonstrated below.
notation xP,
where xERn,
In the theorem,
denotes the vector whose
(sign xj)(Ixjlp).
27
the
jth component
is
If pl and p,
Theorem 5.
If X*
(i)
t*
then
is an optimal
solution
to the gauge
dual GDq and
= \*T(B)(BTX*)q-1, then
(a)
t'*O if
and only if
=X*/t*
solves the Lagrange
dual LDq. (b)
t*=O if
and only if GPp
If X is an optimal
(ii)
(a)
XTd~O
is infeasible.
solution to
the Lagrange dual LDq, then
if and only if X*=X/XTd solves the gauge
dual GDq. (b)
XTd=o
if and only if GPp has an optimal
with value The proof of
this
0,
Cz > d has a solution.
i.e.
theorem follows
solution [X]
from examining the Karush-Kuhn-
Tucker conditions and substituting in the transformations as given. Although
the programs GPp and LPp are equivalent
objective functions differ by a monotone dual GDq of GPp will yield a better the
optimal
To see this, LPp, and
let
let
(w,z)
and
in GPp and LPp,
X = X/(dTX)
h =
dTX -
values of
X and X in
the gauge
lower bound on
the Lagrange dual
LDq for LPp. to GPp and
be the corresponding objective values of =
wllp,
' =
(l/p)
P Uwlp
Let X >
.
(w,z)
0 be any
LDq with a positive objective value, and hence
is a feasible solution to GDq. (l/q)JJBTx\
larger)
be any strongly feasible solution
namely
feasible solution to
transformation),
(i.e.,
solution to GPp than will
(their
p
p
Let g =
be the corresponding objective
BTXUq, and function
the programs GDq and LDq, respectively.
Then
1/g and h each represent a lower bound on the optimal primal objective values for GPp and LPp, and the values numbers
greater than or equal
the respective dual pairs
g and
/h are
to one that measure the duality gap in
of programs,
28
as a ratio
of the primal
objective
function value
to the dual objective
function value.
We
have: Lemma This
3.
Under
the above assumptions,
lemma states
ratio)