9.10. Function Operations#

In this section we discuss various operations which generate new functions from a given function, a pair of functions or a family of functions. We discuss if the properties like convexity and closedness are preserved under these function operations. Such operations include, scaling, sum, composition, pointwise supremum, partial minimization, etc..

Main references for this section are [9, 17].

9.10.1. Scaling and Addition of Convex Functions#

Here we show some basic results about operations that preserve convexity

Theorem 9.101 (Nonnegative multiplication)

If f is convex, then so is αf for all α0.

Proof. Assume f is convex. Note that domf=domαf. Thus, domαf is convex since domf is convex.

Now, let x,ydomf and t[0,1]. Then

f(tx+(1t)y)tf(x)+(1t)f(y)αf(tx+(1t)y)α(tf(x)+(1t)f(y))α0(αf)(tx+(1t)y)t(αf)(x)+(1t)(αf)(y)α0.

Thus, αf is convex for every α0.

Theorem 9.102 (Convex function sum)

If f and g are convex, then so is f+g with dom(f+g)=domfdomg.

Proof. We discussed earlier that dom(f+g)=domfdomg as f+g is defined only for the points where both f and g are defined.

Recall from Theorem 9.25 that intersection of convex sets is convex. Thus, dom(f+g) is convex since domf and domg are both convex.

Now let x,ydom(f+g) and t[0,1].

  1. Since f is convex, hence

    f(tx+(1t)y)tf(x)+(1t)f(y).
  2. Since g is convex, hence

    g(tx+(1t)y)tg(x)+(1t)g(y).
  3. Adding the two inequalities, we get:

    f(tx+(1t)y)+g(tx+(1t)y)tf(x)+(1t)f(y)+tg(x)+(1t)g(y)(f+g)(tx+(1t)y)t(f+g)(x)+(1t)(f+g)(y).
  4. Thus, f+g is convex.

Theorem 9.103 (Conic combinations of convex functions)

If f1,,fn are convex, then for any t1,,tn0, the conic combination of functions given by:

f=t1f1++tnfn

is also convex.

The conic combinations are also called nonnegative weighted sums.

Proof. Due to Theorem 9.101, tifi are convex since fi are convex and ti0.

Due to Theorem 9.102, sum of two convex functions is convex.

It can be easily shown by mathematical induction that sum of n convex functions is convex too.

Thus, f is convex.

Theorem 9.104

The set of convex functions in the vector space of real valued functions form a convex cone.

Note that the set of real valued functions forms a vector space over R with the standard definitions of function scalar multiplication and function addition. We are examining the convexity of the set of functions under this vector space.

Proof. Since every conic combination of convex functions is convex, hence the set of convex functions is a convex cone.

Theorem 9.105 (Concave function sum)

If f and g are concave, then so is f+g with dom(f+g)=domfdomg.

Proof. We proceed as follows:

  1. Let f and g be concave.

  2. f and g are convex.

  3. By Theorem 9.102, (f)+(g)=(f+g) is convex.

  4. Thus, f+g is concave.

Theorem 9.106 (Conic combinations of cave functions)

If f1,,fn are concave, then for any t1,,tn0, the conic combination of functions given by:

f=t1f1++tnfn

is also concave.

Proof. We proceed as follows:

  1. If f1,,fn are concave then f1,,fn are convex.

  2. By Theorem 9.103, (t1f1)++(tnfn) is convex.

  3. Thus, (t1f1++tnfn) is convex.

  4. Thus, t1f1++tnfn is concave.

9.10.2. Composition with Nondecreasing Functions#

Theorem 9.107 (Composition with a convex nondecreasing function preserves convexity)

Let V be a real vector space. Let f:V(,] be a proper convex function. Let g:R(,] be a convex function which is nondecreasing.

Then, their composition h=gf given by:

h(x)=g(f(x))xV

with g()=, is convex on V.

Proof. We note that domh=domfdomg. Since both f and g are convex, hence domf and domg are convex and hence domh is convex.

Choose any x,yV and t(0,1).

  1. We need to show that

    h((1t)x+ty)(1t)h(x)+th(y).
  2. If xdomf, then h(x)=g(f(x))=. Similarly, if ydomf, then h(y)=g(f(y))=.

  3. In either case, the convexity inequality will be satisfied trivially.

  4. Let us now consider the case where x,ydomf.

  5. By convexity of f, we have:

    f((1t)x+ty)(1t)f(x)+tf(y).
  6. Note that (1t)f(x)+tf(y) is a convex combination of f(x) and f(y).

  7. Since g is nondecreasing, hence if rs then g(r)g(s).

  8. By applying g on both sides of the previous inequality, we obtain

    h((1t)x+ty)=g(f((1t)x+ty))g((1t)f(x)+tf(y))(1t)g(f(x))+tg(f(y))=(1t)h(x)+th(y).
  9. Thus, h is convex.

Example 9.50 (Exponential of a convex function)

Recall from Example 9.42 that the exponential function ex is convex.

Then, for any convex f, h(x)=ef(x) is convex due to Theorem 9.107.

Example 9.51 (Power of a nonnegative convex function)

Let f:VR be a proper convex and nonnegative function.

Then, f(x)=|f(x)|.

Let p1 and consider the function g(x)=|x|p. By Example 9.31, g is convex.

Then, h=gf given by

h(x)=g(f(x))=|f(x)|p=f(x)p

is also convex.

Example 9.52 (Power of a norm)

Let :VR be a norm on the real vector space V. Let p1.

Then, the p-th power of the norm given by

h(x)=xpxV

is convex.

  1. The norm is a convex and nonnegative function.

  2. g(x)=|x|p is convex.

  3. By Theorem 9.107, p is convex.

Example 9.53 (Reciprocal of a concave function)

Let g be a concave function. Then, h(x)=1g(x) is convex on the set C={x|g(x)>0}.

  1. Since g is concave, hence f=g is convex.

  2. For all xC, f(x)<0.

  3. Consider the function

    ϕ(x)={1x if x<0 if x0.
  4. It is easy to see that ϕ is convex with the domain R++ (Example 9.44).

  5. We can see that, h=ϕf.

  6. Since f is convex and negative, h is also convex with domh=CR++.

Example 9.54 (Scaling and translation of a convex function)

Let f:VR be a proper convex function. Let m0 and cR.
Let g:RR be given by

g(x)=mx+c.

Then, g is convex (in fact affine) and nondecreasing.

Thus, h=gf=mf+c is also a proper convex function due to Theorem 9.107.

9.10.3. Composition with Affine Mapping#

Theorem 9.108 (Affine transformations preserve convexity)

Let V and W be real vector spaces. Let T:VW be a linear transformation. Let bW. Let f:WR be a function. Define g:VR as:

g(x)=f(T(x)+b)

with domg={xV|T(x)+bdomf}.

If f is convex, then so is g. If f is concave, then so is g.

Proof. Assume f is convex.

  1. If we define A:VW as A(x)=T(x)+b then domg=A1(domf).

  2. A, so defined, is an affine transformation.

  3. By Theorem 9.32, domg is convex since domf is convex.

  4. Let x,ydomg.

  5. g(x)=f(T(x)+b). Define u=T(x)+b.

  6. g(y)=f(T(y)+b). Define v=T(y)+b.

  7. By definition, u,vdomf.

  8. Let t[0,1].

  9. Since f is convex, hence

    f(tu+(1t)v)tf(u)+(1t)f(v).
  10. Now

    g(tx+(1t)y)=f(T(tx+(1t)y)+b)=f(tT(x)+(1t)T(y)+(t+(1t))b)=f(t(T(x)+b)+(1t)(T(y)+b))=f(tu+(1t)v)tf(u)+(1t)f(v)=tg(x)+(1t)g(y).
  11. Thus, g satisfies the convexity defining inequality (9.1).

  12. Thus, g is convex.

A similar argument shows that if f is concave then so is g.

9.10.4. Sum of Functions#

Theorem 9.109 (Sum of convex functions)

Let V be a real vector space. Let f1:V(,] and f2:V(,] be proper convex functions. Then, the function f=f1+f2 is a proper convex function.

We note that since both f1 and f2 are proper, hence neither attains a value of , thus undesired sums like are avoided.

Proof. Convexity of the domain

  1. Since f1 is convex, hence domf1 is a convex set.

  2. Since f2 is convex, hence domf2 is a convex set.

  3. By definition of function sum domf=domf1domf2.

  4. Intersection of convex sets is convex.

  5. Hence, domf is convex.

We note that f(x)< if and only if f1(x)< and f1(x)<.

If domf=, then by Observation 9.4, f is convex. We are left with the case where domf is not empty.

Convexity inequality

  1. Let x,ydomf. Let t(0,1).

  2. Then, x,ydomf1 as well as x,ydomf2.

  3. By convexity of f1

    f1(tx+(1t)y)tf1(x)+(1t)f1(y).
  4. By convexity of f2

    f2(tx+(1t)y)tf2(x)+(1t)f2(y).
  5. Summing these inequalities, we get:

    f(tx+(1t)y)tf(x)+(1t)f(y).

Corollary 9.7 (Sum of multiple convex functions)

Let V be a real vector space. Let f1,,fk:V(,] be proper convex functions. Then, the function f=f1++fk is a proper convex function.

Proof. We note that domf=i=1kdomfk. domf is convex since it is an intersection of convex sets. The proof of convexity is a simple application of mathematical induction.

  1. Let gr=f1++fr for r2,,k.

  2. By Theorem 9.109, g2=f1+f2 is a proper convex function.

  3. Assume that gr is a proper convex function for some r.

  4. Then, gr+1=gr+fr+1.

  5. Since both gr and fr+1 are proper convex, hence gr+1 is also a proper convex function.

Theorem 9.110 (Conic combination of convex functions)

Let V be a real vector space. Let f1,,fk:V(,] be proper convex functions. Let r1,,rk0.

Then, the function f=r1f1++rkfk is a proper convex function. f is a conic combination of f1,,fk.

Proof. By Example 9.54, rifi are proper convex functions.

By Corollary 9.7, their sum is a proper convex function.

Example 9.55 (Restricting the domain of a convex function)

Let f:VR be a convex function with domf=V.

Let CV be a convex set. Let IC be the indicator function for the set C given by

IC(x)={0 if xC if xC.

Consider the proper function h=f+IC given by

h(x)=f(x)+IC(x)={f(x) if xC if xC.

By Theorem 9.109, h is a proper convex function. domh=domfdomIC=VC=C.

Thus, h restricts the effective domain of f to C.

9.10.5. Construction from VR#

One way to construct convex functions on V is from convex sets in VR.

Theorem 9.111 (Construction from convex sets of VR)

Let V be a real vector space. Let FVR be a convex subset of VR. Let f:V(,] be defined as

f(x)inf{tR|(x,t)F}.

Then, f is a proper convex function on V.

We note that f(x)= if there is no tR such that (x,t)F. This is consistent since inf=.

Proof. Convexity of domain of f

  1. We note that for some vV, if there is no tR such that (v,t)F, then f(v)= and vdomf.

  2. Thus, if vdomf, then there exists tR such that (v,t)F.

  3. Let x,ydomf and t(0,1).

  4. Then, there exists (x,p)F and (y,q)F for some p,qR.

  5. Since F is convex, hence

    t(x,p)+(1t)(y,q)=(tx+(1t)y,tp+(1t)q)F.
  6. But then, f(tx+(1t)y)tp+(1t)qR.

  7. Thus, tx+(1t)ydomf.

  8. Thus, domf is convex.

Convexity inequality in the domain

  1. Let x,ydomf and t(0,1).

  2. Let X={vR|(x,v)F}.

  3. Let Y={vR|(y,v)F}.

  4. By definition of f, f(x)=infX and f(y)=infY.

  5. Let z=tx+(1t)y.

  6. Let Z={vR|(z,v)F}.

  7. We have f(z)=infZ.

  8. Since F is convex, hence for every pX and every qY, the point (tx+(1t)y,tp+(1t)q)=(z,tp+(1t)q)F.

  9. Thus, for every pX and every qY, tp+(1t)qZ.

  10. But then, infZtp+(1t)q for every pX and every qY.

  11. Thus, taking infimum on the R.H.S. over X and Y,

    infZinfpXinfqY(tp+(1t)q)=tinfX+(1t)infY.
  12. In other words

    f(tx+(1t)y)=f(z)tf(x)+(1t)f(y).
  13. Thus, f is convex.

We note that Fepif. Although, Fepif and can be easily extended to become epif.

9.10.6. Pointwise Supremum#

9.10.6.1. Two Functions#

Theorem 9.112 (Pointwise maximum of two convex functions)

Let f1 and f2 be convex functions. Define their pointwise maximum as

f(x)=max{f1(x),f2(x)}

with domf=domf1domf2. Then, f is convex.

Proof. domf is an intersection of convex sets. Hence, it is convex.

Let x,ydomf and t[0,1].

f(tx+(1t)y)=max{f1(tx+(1t)y),f2(tx+(1t)y)}max{tf1(x)+(1t)f1(y),tf2(x)+(1t)f2(y)}tmax{f1(x),f2(x)}+(1t)max{f1(y),f2(y)}=tf(x)+(1t)f(y).

Thus, f is convex.

9.10.6.2. Multiple Functions#

Theorem 9.113 (Pointwise maximum of multiple convex functions)

Let f1,f2,,fn be convex functions. Define their pointwise maximum as

f(x)=max{f1(x),,fn(x)}

with domf=domf1domfn. Then, f is convex.

Proof. The result has been proved for the base case of 2 functions in Theorem 9.112.

Assume that it is true for n functions. We can easily show it true for n+1 functions since

max{f1(x),,fn+1(x)}=max{max{f1(x),,fn(x)},fn+1(x)}.

Thus, by principle of mathematical induction, the result is true for all n.

9.10.6.3. Family of Functions#

Theorem 9.114 (Pointwise supremum of a family of convex functions)

Let I be an index set. Let {fi:VR}iI be a family of convex functions. Define their pointwise supremum as

f(x)=sup{fi(x)}iI

with

domf=iIdomfi.

Then, f is convex. Moreover,

epif=iIepifi.

Consequently, if fi are closed and convex for every iI, then f is closed and convex.

Proof. We shall first verify the epigraph equality.

  1. Let (x,t)epif.

  2. Then, f(x)t.

  3. Thus, fi(x)t for all iI since f(x)=sup{fi(x)}iI.

  4. Thus, (x,t)epifi for all iI.

  5. Thus, epifiIepifi.

Now, for the converse:

  1. Let (x,t)iIepifi.

  2. Thus, (x,t)epifi for all iI.

  3. Thus, fi(x)t for all iI.

  4. Taking the supremum over iI on the L.H.S., we obtain:

    sup{fi(x)}iI=f(x)t.
  5. Thus, (x,t)epif.

  6. Thus, iIepifiepif.

Combining the two, we get:

epif=iIepifi.
  1. Since fi are convex functions, hence epifi are convex sets due to Theorem 9.71.

  2. Thus, epif is a convex set due to Theorem 9.26.

  3. But then, f is convex again due to Theorem 9.71.

Closedness

  1. If fi are closed for every iI, then epifi is closed for every iI.

  2. Consequently, epif is closed, since its an intersection of closed sets.

  3. Hence f is a closed function.

9.10.6.4. Largest Entry#

Example 9.56 (Convexity of the largest entry in a vector)

Let h:RnR be defined as

h(x)=max{x1,,xn}.

If we introduce coordinate functions fi:RnR as

fi(x)=xi

then, we can write h as

h(x)=max{f1(x),,fn(x)}.

Each of the coordinate functions is linear hence convex. Thus, h is a maximum of convex functions. Hence h is convex.

9.10.6.5. Sum of k Largest Entries#

Example 9.57 (Sum of k largest entries in a vector)

Consider a vector xRn with x=(x1,,xn).

  1. Let x[i] denote the i-th largest entry of x.

  2. In particular, x[1]=max{x1,,xn} is the largest entry of x.

  3. Then,

    x[2]=max({x1,,xn}{x[1]})

    is the second largest entry of x.

  4. Similarly,

    x[k]=max({x1,,xn}{x[1],,x[k1]})

    is the k-th largest entry of x.

  5. In other words, if we sort the entries of x in descending order, we can pick the largest entries one by one.

We can introduce functions gi:RnR as

gi(x)=x[i].

As shown in Example 9.56, g1 is convex. However, g2,,gn are not convex.

We now introduce a function which computes the sum of k largest entries. Let hk:RnR be given by

hk(x)=i=1kx[i].

Then, h is convex. To see this, we proceed as follows:

  1. There are N=(nk) ways of choose k indices from the set 1,,n.

  2. Let Il={i1l,,ikl} be one of the N ways to choose k indices for l=1,,N.

  3. Let Fl:RnR be given by

    Fl(x)=xi1l++xikl

    for l=1,,N.

  4. Then, Fl is a linear (hence convex) function for every l=1,,N.

  5. We can now see that

    hk(x)=max{F1(x),,FN(x)}.
  6. Thus, hk is a maximum of N convex functions.

  7. Hence h is convex.

9.10.6.6. Piecewise Linear Function#

Definition 9.54 (Piecewise linear function)

Let a1,,anV. Let b1,,bnR.

A function f:VR given by:

f(x)=max{x,ai+bi}i1,,n

is called a piecewise linear or piecewise affine function.

Theorem 9.115

Piecewise linear functions are convex.

Proof. Each of the functions fi(x)=x,ai+bi is affine functionals. Thus, fi are convex (Theorem 9.68). f is a pointwise maximum of n convex functions. Hence, f is convex (Theorem 9.113).

9.10.7. Projection#

Theorem 9.116 (Projection for extended valued functions)

Let V and W be finite dimensional real normed linear spaces. Let f:VWR be a convex function. Let gx:WR be defined by

gx(y)=f(x,y)
  1. If f is convex, then gx is convex.

  2. If f is proper and f(x,y)< for some y, then gx is proper.

  3. If f is closed, then gx is closed.

Proof. Convexity

  1. Let y1,y2W and t[0,1].

  2. Then

    gx(ty1+(1t)y2)=f(x,ty1+(1t)y2)=f(tx+(1t)x,ty1+(1t)y2)=f(t(x,y1)+(1t)(x,y2))tf(x,y1)+(1t)f(x,y2)=tgx(y1)+(1t)g(y2).
  3. Hence g is convex.

Properness

  1. Since f is proper, hence gx> for every y.

  2. Since f(x,y)< for some y, hence gx(y)< for the same y.

  3. Hence gx is proper.

Closedness

  1. Let Gt=sublevel(gx,t) for any tR.

  2. Let Ft=sublevel(f,t) for any tR.

  3. By hypothesis, Ft is closed for every t.

  4. Consider a converging sequence {yk} of Gt with limyk=y.

  5. Then gx(yk)t for every k.

  6. Thus f(x,yk)t for every k.

  7. Hence {(x,yk)} is a converging sequence of Ft.

  8. Since Ft is closed, hence {(x,yk)} converges in Ft.

  9. Hence (x,y)Ft.

  10. Hence f(x,y)t.

  11. Hence gx(y)t.

  12. Hence yGt.

  13. Hence Gt is closed.

  14. Hence all sublevel sets of gx are closed.

  15. Hence gx is a closed function.

9.10.8. Partial Minimization#

9.10.8.1. Real Valued Convex Functions#

Theorem 9.117 (Partial minimization)

Let V and W be real vector spaces. Let f:VWR be a convex function with CD=domf. Define a function g:VR as

g(x)infyDf(x,y),xC.

Assume that the infimum is finite for every xC. Then, g is convex with C=domg.

Note that it is possible that g(x) is finite for some xC and yet there is no yD such that (x,y)CD. In other words, we only assume that the infimum is finite at every xC. We don’t require that the infimum is attained at some point (x,y)CD.

Proof. Since f is convex, hence domf is convex. Thus, CD is convex. By Theorem 9.41, C and D are convex.

  1. Since the infimum is finite for every xC, hence domg=C.

  2. Thus, domg is a convex subset of V.

  3. Let x1,x2C and t(0,1).

  4. Take any r>0.

  5. By definition of the infimum, there exist some y1,y2D such that

    f(x1,y1)g(x1)+r,f(x2,y2)g(x2)+r.
  6. Let (x,y)=t(x1,y1)+(1t)(x2,y2).

  7. Since f is convex, hence

    f(x,y)tf(x1,y1)+(1t)f(x2,y2)t(g(x1)+r)+(1t)(g(x2)+r)=tg(x1)+(1t)g(x2)+r.
  8. By definition of g

    g(x)f(x,y)tg(x1)+(1t)g(x2)+r.
  9. Since this inequality is valid for every r>0, hence

    g(x)tg(x1)+(1t)g(x2)

    holds true for every x1,x2C and t(0,1).

  10. Thus, g is convex.

Example 9.58 (Convexity of the distance from a convex set function)

Let CV be a convex set. The set distance function dC:VR is defined by

dC(x)=inf{xy|yC}.

We define a function f:VVR as

f(x,y)=xy.

Since the norm is convex, hence f is convex over VV. In particular f is convex over the subset VC.

Then, by Theorem 9.117, dC is convex.

9.10.8.2. Proper Convex Functions#

Theorem 9.118 (Partial minimization for proper convex functions)

Let V and W be real vector spaces. Let f:VW(,] be a proper convex function satisfying the following property:

For every xV, there exists yW for which f(x,y)<.

Let g:V[,) be defined by

g(x)infyWf(x,y).

Then g is convex.

Proof. We first mention that since for every xV, there exists at least one y such that f(x,y)<, hence g(x)< for every x. This is why, the range of g is [,). Although it still leaves open the possibility that g(x)= for some xV.

We now show that g satisfies the convexity inequality.

  1. Let x1,x2V and t(0,1).

  2. We consider the two cases

    1. Both g(x1),g(x2)>.

    2. At least one of them equals . Without loss of generality, assume that g(x1)=.

Case 1: g(x1),g(x2)>

  1. Choose some ϵ>0.

  2. By the infimum property, there exist y1,y2W such that

    f(x1,y1)g(x1)+ϵ;f(x2,y2)g(x2)+ϵ.
  3. Since f is convex, hence

    f(tx1+(1t)x2,ty1+(1t)y2)tf(x1,y1)+(1t)f(x2,y2)t(g(x1)+ϵ)+(1t)(g(x2)+ϵ)=tg(x1)+(1t)g(x2)+ϵ.
  4. By the definition of g, we have

    g(tx1+(1t)x2)f(tx1+(1t)x2,ty1+(1t)y2).
  5. Thus we have

    g(tx1+(1t)x2)tg(x1)+(1t)g(x2)+ϵ.
  6. Since this inequality is valid for every ϵ>0, hence

    g(tx1+(1t)x2)tg(x1)+(1t)g(x2).
  7. Thus g is convex for this case.

Case 2: g(x1)=.

  1. To show that g is convex, we need to show that g(tx1+(1t)x2)=.

  2. Choose any MR.

  3. Since g(x1)=, there exists y1W such that

    f(x1,y1)M.
  4. By hypothesis in the theorem, there exists y2W such that

    f(x2,y2)<.

    In other words, f(x2,y2) is finite.

  5. Using the convexity of f, we have

    f(tx1+(1t)x2,ty1+(1t)y2)tf(x1,y1)+(1t)f(x2,y2)tM+(1t)f(x2,y2).
  6. By definition of g, we have

    g(tx1+(1t)x2)tM+(1t)f(x2,y2).
  7. Since the inequality holds for any MR and the quantity f(x2,y2) is finite, hence taking the limit M on the R.H.S., we get

    g(tx1+(1t)x2)=.
  8. Thus g is convex for this case too.

Combining the two cases, g is convex.

9.10.8.3. Extended Valued Convex Functions#

Theorem 9.119 (Partial minimization for extended valued convex functions)

Let V and W be real vector spaces. Let f:VWR be a convex function. Let g:VR be defined by

g(x)infyWf(x,y).

Then g is convex.

Proof. We proceed as follows.

  1. If g(x) is everywhere then epig is empty, and g is convex.

  2. Hence assume that epig.

  3. If domg is singleton, then also g is convex and we are done.

  4. Hence assume the case where domg is not a singleton.

  5. Let (xa,ra) and (xb,rb) belong to epig.

  6. Hence g(xa)ra and g(xb)rb.

  7. By definition of g, we must have sequences {uk} and {vk} of W so that

    f(xa,uk)g(xa) and f(xb,vk)g(xb).
  8. Pick some t[0,1].

  9. By definition of g

    g(txa+(1t)xb)f(txa+(1t)xb,tuk+(1t)vk)k.
  10. By convexity of f

    f(txa+(1t)xb,tuk+(1t)vk)tf(xa,uk)+(1t)f(xb,vk)k.
  11. Hence we have

    g(txa+(1t)xb)tf(xa,uk)+(1t)f(xb,vk)k.
  12. Taking limit k on the R.H.S., we obtain

    g(txa+(1t)xb)tg(xa)+(1t)g(xb)tra+(1t)rb.
  13. Hence the point (txa+(1t)xb,tra+(1t)rb)epig.

  14. Hence epig is convex.

  15. Hence g is convex.

9.10.9. Partial Minimization and Closedness#

The closedness of a function doesn’t imply the closedness of its partial minimization. The problem is that the projection operation of a closed set to a subspace doesn’t always preserve the closedness. In this subsection, we review specific conditions under which the closedness of a function is guaranteed after partial minimization.

The results in this section draw heavily on Recession Cones. The reader is encouraged to familiarize themselves with the concepts of recession cones and lineality spaces of convex sets.

We first establish the relationship between the sublevel sets of the original function and the sublevel sets of its partial minimization.

9.10.9.1. Sublevel Sets#

Lemma 9.3 (Partial minimization and sublevel-sets)

Let V and W be Euclidean spaces. Let f:VWR be a convex function. Let g:VR be defined by

g(x)infzWf(x,z).

Let Gt denote the set sublevel(g,t)={xV|g(x)t}. Then

Gt=k=1{xV| there exists (x,z)VW with f(x,z)tk}

where {tk} is any nonincreasing sequence with tkt.

We further note that the set on the R.H.S. is the projection on V of the set sublevel(f,tk) where the projection is given by (x,z)x.

Proof. tkt means that

  1. tk>t for every k.

  2. tk+1tk for every k.

  3. For every ϵ>0, there exists a k such that tkt+ϵ.

Let Xt denote the set

Xt={xV| there exists (x,z)VW with f(x,z)t}.

We first show that Gtk=1Xtk.

  1. Let xGt.

  2. Then g(x)t.

  3. Thus infzWf(x,z)t.

  4. Hence for every ϵ>0, there exists z such that f(x,z)t+ϵ.

  5. Since tk>t for every k, hence for every k, there exists z such that f(x,z)tk.

  6. Hence xXtk for every k.

  7. Hence xk=1Xtk.

  8. Hence Gtk=1Xtk.

Now for the converse,

  1. Let xk=1Xtk.

  2. Then for every k, xXtk.

  3. Hence for every k, there exists zk such that f(x,zk)tk.

  4. Also, g(x)f(x,zk)tk for every k.

  5. Taking the infimum on the R.H.S., we get g(x)t.

  6. Hence xGt.

  7. Hence k=1XtkGt.

Combining, we get

Gt=k=1Xtk.

We see that the sublevel set of g is an infinite intersection of projections of a nested sequence of sublevel sets of f. If the projections are closed, then the sublevel set of g is also closed. Thus, to show that g is closed, it is sufficient to show that the projections of the sublevel sets of f on V are closed.

9.10.9.2. Closedness Conditions I#

Theorem 9.120 (Partial minimization and closedness I)

Let V and W be Euclidean spaces. Let f:VW(,] be a proper, closed and convex function.

Let g:VR be defined by

g(x)infzWf(x,z).

Assume that there exists a vector x~V and a scalar γ such that the set

{z|f(x~,z)γ}

is nonempty and compact. Then g is proper, closed and convex. Furthermore, for each xdomg, the set of points that attain the infimum of f(x,) over W is nonempty and compact.

Proof. Due to Theorem 9.119, g is convex. However, this result alone doesn’t guarantee that g is proper or closed.

We shall denote the sublevel sets of f as

Vt=sublevel(f,t)={(x,z)|f(x,z)t}.

We first establish that for any nonempty sublevel set Vt, there is no nonzero direction of recession of the form (0,y).

  1. By hypothesis Vγ is nonempty since there exists a z such that f(x~,z)γ.

  2. By Definition 9.68, all nonempty sublevel sets of f have the same recession cone given by Rf.

  3. Let (0,y)Rf be a direction of recession of f.

  4. Then (0,y) is also a direction of recession of Vγ.

  5. For any (x~,z)Vγ, such a direction of recession will satisfy

    f(x~,z+αy)γα0.
  6. Since, by hypothesis, the set {z|f(x~,z)γ} is compact, hence the previous statement can be true only if y=0.

  7. Thus there is no nonzero direction of recession of Vγ of the form (0,y).

  8. Since all sublevel sets share the same recession cone, hence for any nonempty sublevel set Vt, there is no nonzero direction of recession of the form (0,y).

We now show that g is a closed function. For this, we shall show that the projection of every sublevel set of f to V is closed.

  1. If Vt is an empty sublevel set of f then its projection to V is also an empty set which is a closed set.

  2. Now let Vt be any nonempty sublevel set of f.

  3. Let p:VWV denote the projection operator given by

    p(x,y)=x.
  4. Note that p is a linear operator.

  5. The nullspace of this projection operator is the set of vectors of the form (0,y).

  6. Hence the nullspace of p doesn’t contain any nonzero direction of recession of Vt.

  7. Also, RVt(nullp)={(0,0)}LVt since (0,0) always belongs to the lineality space of Vt.

  8. Hence, due to Theorem 9.193, the projection of Vt to V under p is closed.

  9. By Lemma 9.3, a sublevel set of g is an infinite intersection of projections of a nested sequence of sublevel sets of f.

  10. Since projection of every nonempty sublevel set of f is closed, hence an intersection of any sequence of such projections is closed.

  11. Thus every sublevel set of g is closed.

  12. Hence g is closed.

We introduce a function hx:W(,] as

hx(z)=f(x,z)zW.
  1. For every xdomg, there exists zW such that f(x,z)<.

  2. Hence for every xdomg, the function hx is proper, closed and convex due to Theorem 9.116.

  3. We can see that

    g(x)=infzWhx(z).
  4. By our previous argument, the function hx has no nonzero direction of recession.

  5. Hence the recession cone of every nonempty sublevel set of hx is {0}.

  6. Then due to Theorem 9.181, every nonempty sublevel set is closed and bounded, hence compact.

  7. Then due to Weierstrass theorem (Theorem 8.9), the set of minimizers of hx is nonempty and compact (since one of the sublevel sets is nonempty and bounded).

We now show that g is proper.

  1. g(x~)=infzWf(x~,z).

  2. By hypothesis, there exist zW such that f(x~,z)γ.

  3. Hence g(x~)γ<.

  4. Let xdomg.

  5. We have shown that the set of minimizers of hx is nonempty and compact.

  6. Hence g(x)> for every xdomg.

  7. Hence g is proper.

9.10.9.3. Closedness Conditions II#

Theorem 9.121 (Partial minimization and closedness II)

Let V and W be Euclidean spaces. Let f:VW(,] be a proper, closed and convex function.

Let g:VR be defined by

g(x)infzWf(x,z).

Assume that there exists a vector x~V and a scalar γ such that the set

K={z|f(x~,z)γ}

is nonempty and its recession cone is equal to its lineality space. Then g is proper, closed and convex. Furthermore, for each xdomg, the set of points that attain the infimum of f(x,) over W is nonempty.

Proof. The proof is along similar lines. Due to Theorem 9.119, g is convex.

We shall denote the sublevel sets of f as

Vt=sublevel(f,t)={(x,z)|f(x,z)t}.
  1. By hypothesis Vγ is nonempty since there exists a z such that f(x~,z)γ.

  2. Let (0,y)Rf be a direction of recession of f.

  3. Then (0,y) is also a direction of recession of Vγ.

  4. For any (x~,z)Vγ, such a direction of recession will satisfy

    f(x~,z+αy)γα0.
  5. Thus for every zK and α0, we have z+αyK.

  6. Then y is a direction of recession of K.

  7. By hypothesis RK=LK.

  8. Hence y is also a direction of recession of K.

  9. Thus For any (x~,z)Vγ,

    f(x~,zαy)γα0.
  10. Thus, (0,y) is also a direction of recession for Vγ due to Theorem 9.179.

  11. Hence (0,y)Rf.

  12. Since both (0,y)Rf and (0,y)Rf, hence (0,y)Lf.

  13. Hence (0,y) is a direction along with f(x~,) is constant.

  14. Hence for any nonempty sublevel set Vt of f, a direction of recession of the form (0,y) is also in the lineality space of Vt.

We now show that g is closed.

  1. Let Vt be any nonempty sublevel set of f.

  2. Let p be the projection operator as defined in the proof of Theorem 9.119.

  3. Then nullp={(0,y)|yW}.

  4. If (0,y)RVt then (0,y)LVt by the previous argument.

  5. Hence RVtnullpLVt.

  6. Hence, due to Theorem 9.193, the projection of Vt to V under p is closed.

  7. Hence g is closed.

As before, we define hx:W(,] as a projection of f by fixing x.

  1. For every xdomg, hx is proper, closed and convex.

  2. By the previous argument Rhx=Lhx.

  3. Following the arguments of Theorem 10.26, the set of minimizers of hx is nonempty.

  4. Hence g(x) is finite for every xdomg.

  5. In particular g(x~)< since K is nonempty.

  6. Hence g is proper.