Convex Optimization
Contents
10.1. Convex Optimization#
Main references for this section are [6, 9, 17].
10.1.1. Convex Optimization Problems#
Definition 10.1 (Convex optimization problem general form)
Let
A mathematical optimization problem of the form
is known as a convex optimization problem.
In other words, a convex optimization problem minimizes a convex function over a convex set. We can write the optimal value of the minimization problem as
The set of optimal points for the minimization problem is
Note
is the objective function (or cost function) being minimized. is the optimization variable. is the feasible set. Any is a feasible point.The problem is feasible if
.If
, then the problem is infeasible.If the problem is infeasible then
.If the problem is feasible then
.If
, then the problem is unbounded below.We say that
is an optimal point for the minimization problem if .If
is nonempty, then we say that the optimal value is attained at some point.It is possible that
is finite and yet the optimal set is empty.
Remark 10.1 (Closedness of the feasible set)
Some authors prefer to define convex optimization as the minimization of a convex function over a closed and convex set.
We have not insisted that
be a closed set in the general definition of convex optimization problem.The closedness of
is an additional property which can provide additional guarantees on the existence of an optimal point. We shall introduce it whenever needed in the discussion below.Recall from Theorem 9.170 that a closed and convex set
is an intersection of all the halfspaces that contain it. Let be the set of halfspaces that contains .Then, each half space can be written as
Thus,
is equivalent to for every .This gives us the motivation to consider the feasible set in the form of intersection of sublevel sets of convex functions.
Majority of convex optimization problems can be transformed into a functional form where the constraints are expressed in the form of sublevel sets of convex functions and the level sets of affine functions.
10.1.1.1. Convex Optimization Standard Form#
Definition 10.2 (Convex optimization problem standard form)
Let
with optimization variable
The objective function
is a convex function.The inequality constraint functions
are convex functions for .The equality constraint functions
are affine functions for .
Observation 10.1 (Discussion on the convex optimization standard form)
The domain of the problem is given by
for every since are affine functions.Thus,
By definition
are convex for . Hence is convex.The feasible set
is given byThen,
are sublevel sets of convex functions hence they are also convex sets.By Theorem 4.209, the level sets of affine functions are affine sets.
Thus,
is an affine set. Hence, it is a convex set.Thus,
is an intersection of convex sets.Thus,
is a convex set.We note that we can rewrite the standard form as
to match with Definition 10.1.
Remark 10.2 (Adding closedness of objective and constraint functions)
Suppose we add an additional requirement that
the function
Recall from Definition 3.64 that a function is closed if all its sublevel sets are closed.
In particular, this means that the domain of a closed function is also closed.
Thus,
is a closed set.Then,
are closed sets since are closed functions.Since
is finite dimensional, hence affine sets are closed. Hence is closed for every .Thus,
is an intersection of closed and convex sets.Thus,
is a closed and convex set.Recall from Theorem 3.92 that a function is closed if and only if it has a closed epigraph.
Also, recall from Theorem 3.119 that a function is closed if and only if it is l.s.c. (lower semicontinuous).
Further, recall from Theorem 9.172 that every convex function has a closure which is l.s.c. (hence closed) given by its lower semicontinuous hull and the closure of a convex function is also convex.
Thus, if any of the
for is convex but not a closed function, we can replace it by its lower semicontinuous hull to convert it to a closed convex function. This way, be can bring such problems to the standard form for convex optimization.
Remark 10.3 (Comparison of two forms)
We have seen two forms of describing convex optimization problems.
Definition 10.1 describes it as minimizing a convex function over a convex set.
Definition 10.2 describes it in the form of minimizing a convex objective function with convex inequality constraints and affine equality constraints.
As seen in comments above, the second form can be converted into the first form.
The first form is more general. It is very useful in establishing the theoretical properties of convex optimization problems.
The second form is more useful from practical perspective. Almost all real life convex optimization problems can be reduced to this form. It is easier to describe algorithms to solve convex optimization problems in terms of this form.
In the sequel, we will liberally use either form for proving theoretical results and developing algorithms.
Definition 10.3 (Concave function maximization problem)
Let
with optimization variable
The objective function
is a concave function.The inequality constraint functions
are convex functions for .The equality constraint functions
are affine functions for .
Remark 10.4 (Concave maximization as a convex optimization problem)
We note that maximizing
which is a convex optimization problem in standard form. With an abuse of notation, we shall call a concave function maximization program also a convex optimization program.
10.1.1.2. Proper Convex Functions#
It is useful to extend these definitions to proper convex functions as objective functions and/or inequality constraint functions.
Definition 10.4 (Optimization problem general form for proper convex functions)
Let
A mathematical optimization problem of the form
is known as a convex optimization problem for proper convex functions.
In other words, this problem minimizes a proper convex function over a convex set. We can write the optimal value of the minimization problem as
The set of feasible points is given by
The set of optimal points for the minimization problem is
Definition 10.5 (Convex optimization problem standard form for proper convex functions)
Let
with optimization variable
The objective function
is a proper convex function.The inequality constraint functions
are proper convex functions for .The equality constraint functions
are affine functions for .
The reader can check that the definitions of
problem domain, feasible set, optimal value,
optimal solutions/points, etc. can be easily
extended by using the effective domains of
10.1.1.3. Local and Global Optima#
Theorem 10.1 (Local minimum is a global minimum in convex optimization)
Let
In other words,
Proof. Since
In other words,
Let
be any point such that .We need to show that
.Let
be such that .Since
is convex, hence as and is their convex combination.Thus,
.By the local optimality condition
We used the fact that
is convex.Cancelling and rearranging the terms, we get
.Since
, hence it reduces to .Thus,
is indeed globally optimal.
Corollary 10.1 (Local minimum is a global minimum in for proper convex functions)
Let
In other words,
Proof. We note that the feasible set is
If
, then there are no feasible points. Hence, no local/global optima.We now consider the case where
.Following the argument in Theorem 10.1, if
is locally optimal for over , then it is globally optimal for over .Consequently, it is globally optimal for
over as for every .
The argument can be modified to show that if
Theorem 10.2 (Local minimum is strict global minimum for strictly convex functions)
Let
In other words,
Proof.
There exists
Let
be any point such that .Let
be such that .Since
is convex, hence as and is their convex combination.Thus,
.Note that
. Thus, is distinct from and and lies in the line segment between them.By the local optimality condition
We used the fact that
is strictly convex.Cancelling and rearranging the terms, we get
.Since
, hence it reduces to .Thus,
is indeed strictly globally optimal.
Corollary 10.2 (Local minimum is strict global minimum for proper strictly convex functions)
Let
In other words,
10.1.1.4. Optimal Sets#
The optimal sets of a convex optimization problem are also convex.
Theorem 10.3 (Optimal set is convex for a convex optimization problem)
Let
Let the optimal set (the set of optimal points) be given by
Then,
Proof. If
Let
.Then,
.Let
.Let
.Since
is convex, hence . Thus, .Since
is convex, henceBut
.Thus,
.Combining, the two inequalities, we get
.Thus,
.Thus, for every
and every , .Thus,
is indeed convex.
Theorem 10.4 (Optimal points for minimization of strictly convex functions)
Let
has at most one optimal point.
Proof. Let the optimal value for the minimization of
Let the optimal set (the set of optimal points) be given by
By Theorem 10.1,
is a convex set.If
is empty or a singleton, there is nothing more to prove.For contradiction, assume that there are two distinct points
.We have
.Let
.Thus, it is a convex combination of
and .By convexity of
, . Thus, .By strict convexity of
This contradicts the fact that
is the optimal value for the minimization problem.Thus,
must be either empty or a singleton.Thus, the minimization problem has at most one optimal point.
10.1.2. Concave Objective Functions#
Theorem 10.5 (Minimization of concave function and relative interior)
Let
Let the set of optimal points for the minimization problem be given by
If
Proof. Assume that
Let
be any vector distinct from .Since
, hence due to Theorem 9.144, there exists such that . is a point behind on the line from to which belongs to .Letting
we can rewrite it asThus,
is a convex combination of and .By concavity of
, we haveSince
is an optimal point over and , hence and .Thus,
This can be true only if
which in turn implies that .Thus,
must be constant over .
Corollary 10.3 (Linear functions attain minimum only at boundary points)
Let
In other words,
Proof. We note that every linear functional is also concave.
Let
.Then, there exists
such that .Since
is linear, hence cannot be constant over .Thus, by Theorem 10.5, it cannot attain a minimum at
.