9.15. Directional Derivatives#

This section deals with the subject of directional derivatives for convex functions.

9.15.1. Convex Real Functions#

For real functions $f : R \to R$ , several useful results are available.

Remark 9.13 (Domain of a convex real function)

The domain of a convex real function is an interval.

Let $f : R \to R$ be convex. Then $dom f$ is convex. But every convex subset of real line is an interval.

Now let $I = dom f$ be an interval.
We say $a = inf I$ as the left end point of $I$ .
We say $b = sup I$ as the right end point of $I$ .
$a$ and $b$ may or may not belong to $I$ .
If both $a$ and $b$ belong to $I$ , then $I$ is a closed interval.
If neither $a$ nor $b$ belongs to $I$ , then $I$ is an open interval.

9.15.1.1. Characterization#

Theorem 9.199 (Characterization of real convex functions)

Let $f : R \to R$ be a real function with $dom f = I$ which is an interval (closed or open or semi-open). Let $a$ and $b$ be the left and right endpoints of the interval $I$ .

The following are equivalent.

$f$ is convex over $I$ .
For every $x_{1}, x_{2}, x_{3} \in I$ with $x_{1} < x_{2} < x_{3}$ ,

$\frac{f (x_{2}) - f (x_{1})}{x_{2} - x_{1}} \leq \frac{f (x_{3}) - f (x_{1})}{x_{3} - x_{1}} .$
For every $x_{1}, x_{2}, x_{3} \in I$ with $x_{1} < x_{2} < x_{3}$ ,

$\frac{f (x_{2}) - f (x_{1})}{x_{2} - x_{1}} \leq \frac{f (x_{3}) - f (x_{2})}{x_{3} - x_{2}} .$
For every $x_{1}, x_{2}, x_{3} \in I$ with $x_{1} < x_{2} < x_{3}$ ,

$\frac{f (x_{3}) - f (x_{1})}{x_{3} - x_{1}} \leq \frac{f (x_{3}) - f (x_{2})}{x_{3} - x_{2}} .$

Proof. (1) $⟹$ (2) Assume that $f$ is convex.

Let

$α = \frac{x_{3} - x_{2}}{x_{3} - x_{1}}, β = \frac{x_{2} - x_{1}}{x_{3} - x_{1}} .$
Then, $α + β = 1$ and $α, β \in (0, 1)$ .
Also, verify that

$x_{2} = α x_{1} + β x_{3} .$
Thus,

$\begin{aligned} f (x_{2}) \leq α f (x_{1}) + β f (x_{3}) \\ ⟺ f (x_{2}) \leq \frac{x_{3} - x_{2}}{x_{3} - x_{1}} f (x_{1}) + \frac{x_{2} - x_{1}}{x_{3} - x_{1}} f (x_{3}) \\ ⟺ (x_{3} - x_{1}) f (x_{2}) \leq (x_{3} - x_{2}) f (x_{1}) + (x_{2} - x_{1}) f (x_{3}) \\ ⟺ (x_{3} - x_{1}) f (x_{2}) \leq ((x_{3} - x_{1}) - (x_{2} - x_{1})) f (x_{1}) + (x_{2} - x_{1}) f (x_{3}) \\ ⟺ (x_{3} - x_{1}) (f (x_{2}) - f (x_{1})) \leq (x_{2} - x_{1}) (f (x_{3}) - f (x_{1})) \\ ⟺ \frac{f (x_{2}) - f (x_{1})}{x_{2} - x_{1}} \leq \frac{f (x_{3}) - f (x_{1})}{x_{3} - x_{1}} . \end{aligned}$

(2) $⟹$ (1)

Let $x_{1}, x_{3} \in I$ and $t \in (0, 1)$ .
WLOG, assume that $x_{1} < x_{3}$ .
Let $α = t$ and $β = (1 - t)$ .
Let $x_{2} = α x_{1} + β x_{3}$ .
Then, $x_{1} < x_{2} < x_{3}$ .
From the hypothesis, we have

$\frac{f (x_{2}) - f (x_{1})}{x_{2} - x_{1}} \leq \frac{f (x_{3}) - f (x_{1})}{x_{3} - x_{1}} .$
Using the previous argument backwards, this implies

$f (x_{2}) \leq α f (x_{1}) + β f (x_{3}) = t f (x_{1}) + (1 - t) f (x_{3}) .$
Thus, $f$ is convex.

(2) $⟺$ (3)

Pick any $x_{1}, x_{2}, x_{3} \in I$ with $x_{1} < x_{2} < x_{3}$ .
By hypothesis (2)

$\begin{aligned} \frac{f (x_{2}) - f (x_{1})}{x_{2} - x_{1}} \leq \frac{f (x_{3}) - f (x_{1})}{x_{3} - x_{1}} \\ ⟺ & (x_{3} - x_{1}) (f (x_{2}) - f (x_{1})) \leq (x_{2} - x_{1}) (f (x_{3}) - f (x_{1})) \\ ⟺ & (x_{3} - x_{1}) f (x_{2}) \leq ((x_{3} - x_{1}) - (x_{2} - x_{1})) f (x_{1}) + (x_{2} - x_{1}) f (x_{3}) \\ ⟺ & ((x_{3} - x_{2}) + (x_{2} - x_{1})) f (x_{2}) \leq ((x_{3} - x_{2}) f (x_{1}) + (x_{2} - x_{1}) f (x_{3}) \\ ⟺ & (x_{3} - x_{2}) (f (x_{2}) - f (x_{1})) \leq (x_{2} - x_{1}) (f (x_{3}) - f (x_{2})) \\ ⟺ & \frac{f (x_{2}) - f (x_{1})}{x_{2} - x_{1}} \leq \frac{f (x_{3}) - f (x_{2})}{x_{3} - x_{2}} . \end{aligned}$

(2) $⟺$ (4)

Pick any $x_{1}, x_{2}, x_{3} \in I$ with $x_{1} < x_{2} < x_{3}$ .
By hypothesis (2)

$\begin{aligned} \frac{f (x_{2}) - f (x_{1})}{x_{2} - x_{1}} \leq \frac{f (x_{3}) - f (x_{1})}{x_{3} - x_{1}} \\ ⟺ & (x_{3} - x_{1}) (f (x_{2}) - f (x_{1})) \leq (x_{2} - x_{1}) (f (x_{3}) - f (x_{1})) \\ ⟺ & (x_{3} - x_{1}) f (x_{2}) \leq (x_{3} - x_{2}) f (x_{1}) + ((x_{2} - x_{3}) + (x_{3} - x_{1})) f (x_{3}) \\ ⟺ & (x_{3} - x_{1}) (f (x_{2}) - f (x_{3})) \leq (x_{3} - x_{2}) (f (x_{1}) - f (x_{3})) \\ ⟺ & (x_{3} - x_{1}) (f (x_{3}) - f (x_{2})) \geq (x_{3} - x_{2}) (f (x_{3}) - f (x_{1})) \\ ⟺ & (x_{3} - x_{2}) (f (x_{3}) - f (x_{1})) \leq (x_{3} - x_{1}) (f (x_{3}) - f (x_{2})) \\ ⟺ & \frac{f (x_{3}) - f (x_{1})}{x_{3} - x_{1}} \leq \frac{f (x_{3}) - f (x_{2})}{x_{3} - x_{2}} . \end{aligned}$

9.15.1.2. One Sided Derivatives#

Recall from Definition 2.83 that if $f$ is defined over $[a, b)$ , then the right hand derivative is defined as:

f_{+}^{'} (a) = lim_{x \to a^{+}} \frac{f (x) - f (a)}{x - a} = lim_{h ↓ 0} \frac{f (a + h) - f (a)}{h}

if the limit exists. Similarly, if $f$ is defined over $(c, a]$ , then the left hand derivative is defined as:

f_{-}^{'} (a) = lim_{x \to a^{-}} \frac{f (x) - f (a)}{x - a} = lim_{h ↑ 0} \frac{f (a + h) - f (a)}{h} = lim_{r ↓ 0} \frac{f (a) - f (a - r)}{r}

if the limit exists.

We introduce two helper functions

s_{+} (x, h) = \frac{f (x + h) - f (x)}{h}

and

s_{-} (x, h) = \frac{f (x) - f (x - h)}{h}

where $h > 0$ .

Then

f_{+}^{'} (x) = lim_{h ↓ 0} s_{+} (x, h)

and

f_{-}^{'} (x) = lim_{h ↓ 0} s_{-} (x, h) .

An interesting property of convex functions is that the one sided derivatives always exist. On the real line, there are only two directions to move; left and right. The one sided derivatives play the role of directional derivatives on the real line.

Lemma 9.4 (Monotonicity of $s_{+}$ and $s_{-}$ for convex functions)

Let $f : R \to R$ with $dom f = I$ be convex. Let $a$ and $b$ be the left and right endpoints of the interval $I$ . Then $s_{+}$ and $s_{-}$ as functions of $h$ are monotonic.

$s_{+} (x, h)$ is a nondecreasing function of $h$ .
$s_{-} (x, h)$ is a nonincreasing function of $h$ .

Proof. Monotonicity of $s_{+}$ .

Let $x \in I$ such that $x \neq b$ .
Let $h_{1}, h_{2} > 0$ such that $h_{1} < h_{2}$ and $x + h_{2} \in I$ .
Consider the three points $x < x + h_{1} < x + h_{2}$ .
Then

$\frac{f (x + h_{1}) - f (x)}{h_{1}} \leq \frac{f (x + h_{2}) - f (x)}{h_{2}} .$
Hence $s_{+} (x, h_{1}) \leq s_{+} (x, h_{2})$ .
Hence $s_{+}$ is a nondecreasing function of $h$ .

The argument for monotonicity of $s_{-}$ is similar.

Observation 9.6 (One sided derivatives as infimum/supremum)

Due to the monotonicity of $s_{+}$ over $h$ , we have

f_{+}^{'} (x) = lim_{h ↓ 0} s_{+} (x, h) = inf_{h > 0} s_{+} (x, h) .

Similarly, due to the monotonicity of $s_{+}$ over $h$ , we have

f_{-}^{'} (x) = lim_{h ↓ 0} s_{-} (x, h) = sup_{h > 0} s_{-} (x, h) .

Theorem 9.200 (Real convex functions and one sided derivatives)

Let $f : R \to R$ with $dom f = I$ be convex. Let $a$ and $b$ be the left and right endpoints of the interval $I$ . Then, for every $x \in int I = (a, b)$ , the left hand derivative $f_{-}^{'} (x)$ and the right hand derivative $f_{+}^{'} (x)$ exist.

If $a \in I$ , then the right hand derivative $f_{+}^{'} (a)$ exists. If $b \in I$ , then the left hand derivative $f_{-}^{'} (b)$ exists.

If $a \in I$ , we define $f_{-}^{'} (a) = - \infty$ . If $b \in I$ , we define $f_{+}^{'} (b) = \infty$ .

Proof. We are given that $f$ is convex over $(a, b)$ .

Let $x \in int I$ .
Then there exists $r > 0$ such that $(x - r, x + r) \subseteq I$ .
For $h > 0$ , define

$F (h) = \frac{f (x + h) - f (x)}{h} .$
Let $0 < h_{1} < h_{2}$ such that $h_{2} < r$ .
Let $x_{1} = x$ , $x_{2} = x + h_{1}$ , $x_{3} = x + h_{2}$ .
Since $f$ is convex, hence by Theorem 9.199

$\frac{f (x_{2}) - f (x_{1})}{x_{2} - x_{1}} \leq \frac{f (x_{3}) - f (x_{1})}{x_{3} - x_{1}} .$
But that means

\frac{f (x + h_{1}) - f (x)}{h_{1}} \leq \frac{f (x + h_{2}) - f (x)}{h_{2}} .

Thus, whenever $h_{1} < h_{2}$ (up to $h_{2} < r$ ), $F (h_{1}) \leq F (h_{2})$ .
Thus, $F$ is a nondecreasing (monotone) function of $h$ in some interval $(0, δ)$ where $δ < r$ .
Then, $f_{+}^{'} (x) = lim_{h ↓ 0} F (h)$ exists.

A similar argument shows that $f_{-}^{'} (x)$ also exists. Similar arguments apply for the one sided derivatives at the end points.

9.15.1.3. Continuity#

Theorem 9.201 (Convex real function is continuous)

Let $f : R \to R$ be a real convex function with $dom f = I$ . Let $a$ and $b$ be the left and right endpoints of the interval $I$ . Then,

$f$ is continuous at every $x \in (a, b)$ .
If $a \in I$ , then $f$ is continuous from the right at $a$ .
If $b \in I$ , then $f$ is continuous from the left at $b$ .

In other words, $f$ is continuous on $I$ .

Proof. We proceed as follows.

Let $x \in (a, b)$ .
By Theorem 9.200, the one sided derivatives $f_{+}^{'} (x)$ and $f_{-}^{'} (x)$ exist.
Then, by limit arithmetic

$lim_{h \to 0^{-}} (f (x + h) - f (x)) = (lim_{h \to 0^{-}} \frac{f (x + h) - f (x)}{h}) (lim_{h \to 0^{-}} h) = 0.$
Similarly,

$lim_{h ↓ 0} (f (x + h) - f (x)) = (lim_{h ↓ 0} \frac{f (x + h) - f (x)}{h}) (lim_{h ↓ 0} h) = 0.$
Thus,

$lim_{h \to 0^{-}} (f (x + h) - f (x)) = lim_{h ↓ 0} (f (x + h) - f (x)) = 0.$
Thus, $f$ is continuous at $x$ .
Since $x$ was arbitrary, hence $f$ is continuous on $(a, b)$ .

Now consider the case where $a \in I$ .

By Theorem 9.200, the one sided derivative $f_{+}^{'} (a)$ exists.
Then, by limit arithmetic

$lim_{h \to 0^{-}} (f (a + h) - f (a)) = (lim_{h \to 0^{-}} \frac{f (a + h) - f (a)}{h}) (lim_{h \to 0^{-}} h) = 0.$
Hence $f$ is continuous from the right at $a$ .

A similar argument holds for continuity from the left at $b$ .

9.15.1.4. Properties of One Sides Derivatives#

Theorem 9.202 (Properties of one-sided derivatives)

Let $f : R \to R$ be a real convex function with $dom f = I$ . Let $a$ and $b$ be the left and right endpoints of the interval $I$ .

We have $f_{-}^{'} (x) \leq f_{+}^{'} (x)$ for every $x \in I$ .
If $x \in int I$ then both $f_{-}^{'} (x)$ and $f_{+}^{'} (x)$ are finite.
If $x, z \in I$ and $x < z$ , then $f_{+}^{'} (x) \leq f_{-}^{'} (z)$ .
The functions $f_{-}^{'}, f_{+}^{'} : R \to \overset{―}{R}$ are nondecreasing over $I$ .
The function $f_{+}^{'}$ is right-continuous at every interior point of $I$ . If $a \in I$ then $f_{+}^{'}$ is right-continuous at $a$ .
The function $f_{-}^{'}$ is left-continuous at every interior point of $I$ . If $b \in I$ then $f_{-}^{'}$ is left-continuous at $b$ .
The function $f_{+}^{'}$ is upper-semicontinuous at every $x \in I$ .
The function $f_{-}^{'}$ is lower-semicontinuous at every $x \in I$ .

Proof. (1)

If $a \in I$ , then by convention $f_{-}^{'} (a) = - \infty$ . Hence $f_{-}^{'} (a) \leq f_{+}^{'} (a)$ .
If $b \in I$ , then by convention $f_{+}^{'} (b) = \infty$ . Hence $f_{-}^{'} (b) \leq f_{+}^{'} (b)$ .
Now let $x \in int I$ .
Then there is $r > 0$ such that $(x - r, x + r) \in I$ .
Pick any $h > 0$ such that $h < r$ .
Then, using the three points $x - h, x, x + h$ , we have

$\frac{f (x) - f (x - h)}{h} \leq \frac{f (x + h) - f (x)}{h}$

due to Theorem 9.199.
Taking the limit $h ↓ 0$ , we see that

$f_{-}^{'} (x) \leq f_{+}^{'} (x)$

holds true for every $x \in int I$ .

(2)

Let $x \in int I$ .
Let $h > 0$ such that $(x - h, x + h) \subseteq I$ .
Then we have

$f_{+}^{'} (x) \leq \frac{f (x + h) - f (x)}{h} < \infty .$
Similarly, we have

$- \infty < \frac{f (x) - f (x - h)}{h} \leq f_{-}^{'} (x) .$
By (1), we have

$- \infty < f_{-}^{'} (x) \leq f_{+}^{'} (x) < \infty .$
Hence both are finite at interior points of $I$ .

(3)

Let $y = \frac{x + z}{2}$ .
Due to Theorem 9.199, we have

$\frac{f (y) - f (x)}{y - x} \leq \frac{f (z) - f (y)}{z - y} .$
We also have

$f_{+}^{'} (x) \leq \frac{f (y) - f (x)}{y - x} and \frac{f (z) - f (y)}{z - y} \leq f_{-}^{'} (z) .$
Combining, we get $f_{+}^{'} (x) \leq f_{-}^{'} (z)$ .

(4)

Let $x, z \in I$ such that $x < z$ .
From (3), we have $f_{+}^{'} (x) \leq f_{-}^{'} (z)$ .
From (1), we have $f_{-}^{'} (z) \leq f_{+}^{'} (z)$ .
Combining, we have $f_{+}^{'} (x) \leq f_{+}^{'} (z)$ .
Hence $f_{+}^{'}$ is nondecreasing.
Similarly, $f_{-}^{'} (x) \leq f_{+}^{'} (x) \leq f_{-}^{'} (z)$ .
Hence $f_{-}^{'}$ is nondecreasing.

(5)

Pick any $x \in I$ such that $x \neq b$ (if $b \in I$ ).
Then $x < b$ .
We can pick $h > 0$ and $r > 0$ such that $x + h + r < b$ .
Then $f_{+}^{'} (x + h) \leq \frac{f (x + h + r) - f (x + h)}{r}$ .
We established in Theorem 9.201 that $f$ is continuous.
Taking the limit $h ↓ 0$ , we obtain

$lim_{h ↓ 0} f_{+}^{'} (x + h) \leq \frac{f (x + r) - f (x)}{r} .$

This is valid since $f$ is continuous.
Now taking the limit $r ↓ 0$ on the R.H.S., we obtain

$lim_{h ↓ 0} f_{+}^{'} (x + h) \leq f_{+}^{'} (x) .$
Since $f_{+}^{'}$ is nondecreasing by claim (4), hence

$f_{+}^{'} (x) \leq lim_{h ↓ 0} f_{+}^{'} (x + h) .$
Together, we must have

$f_{+}^{'} (x) = lim_{h ↓ 0} f_{+}^{'} (x + h) .$
Hence $f_{+}^{'}$ is right continuous at $x$ .

(6)

An argument similar to (5) shows that $f_{-}^{'}$ is left continuous at every $x \in I$ except for $x = a$ (if $a \in I$ ).

(7) Upper semicontinuity of $f_{+}^{'}$

We need to show that for every $ϵ > 0$ there exists $r > 0$ such that

$f_{+}^{'} (y) < f_{+}^{'} (x) + ϵ for every y \in (x - r, x + r) \cap I .$
Pick some $ϵ > 0$ .
Consider any $x \in int I$ .
By (6) $f_{+}^{'}$ is right continuous at $x$ .
Hence there exists $r_{1} > 0$ such that for every $y \in [x, x + r_{1})$ , we have

$| f_{+}^{'} (y) - f_{+}^{'} (x) | < ϵ .$
By (4), $f_{+}^{'}$ is nondecreasing. Hence for every $y \in [x, x + r_{1})$

$| f_{+}^{'} (y) - f_{+}^{'} (x) | = f_{+}^{'} (y) - f_{+}^{'} (x) .$
Hence for every $y \in [x, x + r_{1})$ , we have

$f_{+}^{'} (y) - f_{+}^{'} (x) < ϵ ⟺ f_{+}^{'} (y) < f_{+}^{'} (x) + ϵ .$
Now, let $r = min (x - a, r_{1})$ .
By monotonicity of $f_{+}^{'}$ , for every $y \in (x - r, x)$

$f_{+}^{'} (y) \leq f_{+}^{'} (x) .$
Hence for every $y \in (x, - r, x)$

$f_{+}^{'} (y) < f_{+}^{'} (x) + ϵ .$
Combining the two, for every $y \in (x - r, x + r)$ , we have

$f_{+}^{'} (y) < f_{+}^{'} (x) + ϵ .$
Hence $f_{+}^{'}$ is u.s.c. at $x$ .
Now, if $a \in I$ then let $x = a$ .
By right continuity of $f_{+}^{'}$ at $a$ , there exists $r > 0$ such that for every $y \in [a, a + r)$ , we have

$f_{+}^{'} (y) < f_{+}^{'} (a) + ϵ .$
Also $(a - r, a + r) \cap I = [a, a + r)$ .
Hence $f_{+}^{'}$ is u.s.c. at $a$ .
If $b \in I$ , then by convention $f_{+}^{'} (b) = \infty$ .
Hence $f_{+}^{'}$ is u.s.c. at $b$ .

(8) Lower semicontinuity of $f_{-}^{'}$

The argument is similar to (7).

9.15.2. Directional Derivatives of Proper Functions#

Definition 9.70 (Directional derivative)

Let $f : V \to (- \infty, \infty]$ be a proper function with $S = dom f$ . Let $x \in int S$ . The directional derivative at $x$ in the direction $d \in V$ is defined by

f^{'} (x; d) ≜ lim_{α ↓ 0} \frac{f (x + α d) - f (x)}{α}

provided the limit exists.

We say that $f$ is directionally differentiable at $x$ if it is directionally differentiable in every direction at $x$ .

The directional derivative is a scalar quantity ( $\in R$ ) if it is defined (i.e., the limit exists). When we say that

f^{'} (x; d) = lim_{t ↓ 0} \frac{f (x + t d) - f (x)}{t},

we mean that $f$ is defined over a set ${v | v = x + t d, 0 < t < t_{max}}$ and for every $ϵ > 0$ , there exists $δ > 0$ such that

| \frac{f (x + t d) - f (x)}{t} - f^{'} (x; d) | < ϵ whenever 0 < t < δ .

Since $x \in int S$ , hence there exists $r > 0$ such that $B (x, r) \subseteq S$ .

With $v \in B (x, r)$ , we need $‖ t d ‖ < r$ . Thus, a $t_{max} = \frac{r}{‖ d ‖}$ is a suitable range of allowed values for $t$ . Accordingly, $0 < δ < t_{max}$ can be chosen.

Remark 9.14 (Directional derivative for zero vector)

If $d = 0$ then, $f^{'} (x; d) = 0$ .

We can see this from the fact that

f^{'} (x; 0) = lim_{α ↓ 0} \frac{f (x + α 0) - f (x)}{α} = 0.

A useful result is for computing the directional derivative of a function which is the pointwise maximum of a finite number of proper functions.

We recall from Theorem 3.22 that the interior of a finite intersection of sets is the intersection of their interiors. This is useful in identifying the interior of the domain for a pointwise maximum of a finite set of functions.

9.15.3. Differentiability#

9.15.3.1. Differentiability of Proper Functions#

Definition 9.71 (Differentiability of proper functions)

Let $f : V \to (- \infty, \infty]$ be a proper function with $S = dom f$ . Let $x \in int S$ . $f$ is said to be differentiable at $x$ if there exists $g \in V^{*}$ such that:

(9.5)#

lim_{h \to 0} \frac{f (x + h) - f (x) - ⟨ h, g ⟩}{‖ h ‖} = 0.

The unique vector $g$ satisfying this condition is called the gradient of $f$ at $x$ and is denoted by $\nabla f (x)$ .

If $f$ is differentiable at some $x \in int S$ , then there is a simple formula to connect the gradient and the directional derivatives.

9.15.3.2. Gradient and Directional Derivatives#

Theorem 9.203 (Gradient and directional derivatives)

Let $f : V \to (- \infty, \infty]$ be a proper function with $S = dom f$ . Let $x \in int S$ . Assume that $f$ is differentiable at $x$ . Then, for any $d \in V$ ,

f^{'} (x; d) = ⟨ d, \nabla f (x) ⟩ .

In other words, the directional derivative is the projection of the gradient in the specified direction.

Proof. For $d = 0$ , the equality is obvious. We shall consider the case where $d \neq 0$ .

Since $f$ is differentiable at $x$ , hence

lim_{h \to 0} \frac{f (x + h) - f (x) - ⟨ h, \nabla f (x) ⟩}{‖ h ‖} = 0.

In particular, if we take the limit of $h$ along the direction of $d$ as $t d$ where $t > 0$ and $t \to 0^{+}$ , then

$lim_{t \to 0^{+}} \frac{f (x + t d) - f (x) - ⟨ t d, \nabla f (x) ⟩}{‖ t d ‖} = 0.$
Splitting the terms, we get

$lim_{t \to 0^{+}} \frac{f (x + t d) - f (x)}{‖ t d ‖} - lim_{t \to 0^{+}} \frac{⟨ d, \nabla f (x) ⟩}{‖ d ‖} = 0.$
Multiplying with $‖ d ‖$ and simplifying, we get:

$lim_{t \to 0^{+}} \frac{f (x + t d) - f (x)}{t} - lim_{t \to 0^{+}} ⟨ d, \nabla f (x) ⟩ = 0.$
Thus,

$f^{'} (x; d) = lim_{t \to 0^{+}} \frac{f (x + t d) - f (x)}{t} = ⟨ d, \nabla f (x) ⟩ .$

9.15.3.3. Gradient in $R^{n}$ #

Remark 9.15 (Gradient in $R^{n}$ )

It is imperative to compare the definition of gradients in this section with Definition 5.1 (differentiability of functions from $R^{n}$ to $R^{m}$ ) and the notion of the gradient as defined in Definition 5.4.

To better develop our understanding of gradients, let us examine the gradient in the Euclidean space $R^{n}$ . The standard basis is given by $B = {e_{1}, \dots, e_{n}}$ which are the coordinate unit vectors. The standard inner product is given by the dot product

⟨ x, y ⟩ = y^{T} x = x^{T} y .

A vector $x \in R^{n}$ is written as

x = \sum_{i = 1}^{n} x_{i} e_{i} .

The individual coordinates are obtained via

x_{i} = ⟨ e_{i}, x ⟩ = ⟨ x, e_{i} ⟩ = x^{T} e_{i} \forall i \in [1, \dots, n] .

Let $f : R^{n} \to (- \infty, \infty]$ be a proper function. Let $S = dom f$ . Let $x \in int S$ . Assume that $f$ is differentiable at $x$ . Let $g = \nabla f (x)$ . Let

g = \sum_{i}^{n} g_{i} e_{i} .

Following the notation in Observation 5.1, the derivative of $f$ at $x$ , denoted by $D f (x)$ is given by

lim_{h \to 0} \frac{‖ f (x + h) - f (x) - D f (x) h ‖_{2}}{‖ h ‖_{2}} = 0.

We don’t have to check for $x + h \in dom f$ as $f$ is a proper function with a value of $\infty$ at points outside its effective domain.

Compare this with (9.5). For $f : R^{n} \to R$ , $D f (x)$ is a row vector. If we let $\tilde{g} = D f (x)^{T}$ , then

D f (x) h = {\tilde{g}}^{T} h = ⟨ h, \tilde{g} ⟩ .

Then, the definition of $\tilde{g}$ in the limit above is exactly the same as $g$ in (9.5). Thus, $\tilde{g} = g$ . We can see that our definition of gradient coincides with the definition in Definition 5.4 for $R^{n}$ with the dot product as standard inner product.

Now consider the components of $g$ .

g_{i} = ⟨ e_{i}, g ⟩ = ⟨ e_{i}, \nabla f (x) ⟩ .

By Theorem 9.203, the directional derivative in the direction $e_{i}$ is given by

f^{'} (x; e_{i}) = ⟨ e_{i}, \nabla f (x) ⟩ = ⟨ e_{i}, g ⟩ = g_{i} .

Thus,

\frac{\partial f (x)}{\partial x_{i}} = ⟨ e_{i}, \nabla f (x) ⟩ .

The partial derivatives of $f$ at $x$ along the standard basis vectors are identical to the directional derivatives of $f$ .

\begin{array}{r} \nabla f (x) = [\begin{array}{c} \frac{\partial f (x)}{\partial x_{1}} \\ ⋮ \\ \frac{\partial f (x)}{\partial x_{n}} \end{array}] = [\begin{array}{c} ⟨ e_{1}, \nabla f (x) ⟩ \\ ⋮ \\ ⟨ e_{n}, \nabla f (x) ⟩ \end{array}] . \end{array}

Then, for an arbitrary direction $d = \sum_{i = 1}^{n} e_{i}$ , the directional derivative becomes

f^{'} (x; d) = ⟨ d, \nabla f (x) ⟩ = \nabla f (x)^{T} d = \sum_{i = 1}^{n} \frac{\partial f (x)}{\partial x_{i}} d_{i} .

Recall from Definition 9.70, that the directional derivative is independent on the choice of the inner product. This is also clear from the expression $\sum_{i = 1}^{n} \frac{\partial f (x)}{\partial x_{i}} d_{i}$ as the partial derivatives are independent of the choice of the inner product.

However, this means that the gradient itself must depend on the choice of inner product. If $⟨ \cdot, \cdot ⟩_{a}$ and $⟨ \cdot, \cdot ⟩_{b}$ are two different inner products defined on $R^{n}$ , then the gradients of $f$ at $x$ w.r.t. the two inner products, denoted by $\nabla_{a} f (x)$ and $\nabla_{b} f (x)$ must satisfy the relationship

f^{'} (x; d) = ⟨ d, \nabla_{a} f (x) ⟩_{a} = ⟨ d, \nabla_{b} f (x) ⟩_{b} \forall d \in V .

In the following, we shall assume that $\nabla f (x)$ denotes the gradient w.r.t. the dot product.

Consider the inner product given by

⟨ x, y ⟩_{H} = x^{T} H y

where $H \in R^{n \times n}$ is a symmetric positive definite matrix.

Then,

\begin{aligned} (\nabla_{H} f (x))_{i} & = \nabla_{H} f (x)^{T} e_{i} & coordinate in standard basis \\ = \nabla_{H} f (x)^{T} (H H^{- 1}) e_{i} & H is invertible \\ = \nabla_{H} f (x)^{T} H (H^{- 1} e_{i}) \\ = ⟨ H^{- 1} e_{i}, \nabla_{H} f (x) ⟩_{H} & by definition of this inner product \\ = f^{'} (x; H^{- 1} e_{i}) & directional derivative w.r.t. this inner product \\ = \nabla f (x)^{T} H^{- 1} e_{i} & directional derivative w.r.t. dot product \\ = (H^{- 1} \nabla f (x))^{T} e_{i} & H is symmetric . \end{aligned}

Thus,

\nabla_{H} f (x) = H^{- 1} \nabla f (x) .

Thus, the gradient w.r.t. the inner product $⟨ \cdot, \cdot ⟩_{H}$ is the scaled version of the standard gradient where the scaling factor is $H^{- 1}$ .

9.15.3.4. Gradient in $R^{m \times n}$ #

Remark 9.16 (Gradient in $R^{m \times n}$ )

We next look at the vector space of real matrices. The standard basis is a family of unit matrices ${E_{i j}}_{1 \leq i \leq m, 1 \leq j \leq n}$ where $E_{i j}$ has the $(i, j)$ -th entry as 1 and other entries as 0.

The standard inner product is given by

⟨ X, Y ⟩ = tr (Y^{T} X) \forall X, Y \in R^{m \times n} .

Let $f : R^{m \times n} \to R$ be a proper function. Let $S = dom f$ . Let $X \in int S$ . Assume that $f$ is differentiable at $X$ .

The gradient is given by

\partial f (X) = {(\frac{\partial f (X)}{\partial x_{i j}})}_{i j} .

The directional derivative for some direction $D \in R^{m \times n}$ is given by

f (X; D) = ⟨ D, \partial f (X) ⟩ = tr (\partial f (X)^{T} D) .

Consider the inner product given by

⟨ X, Y ⟩_{H} = tr (X^{T} H Y)

where $H \in R^{m \times m}$ is a symmetric positive definite matrix.

Then,

\begin{aligned} (\nabla_{H} f (X))_{i j} & = tr (\nabla_{H} f (X)^{T} E_{i j}) & coordinate in standard basis \\ = tr (\nabla_{H} f (X)^{T} (H H^{- 1}) E_{i j}) & H is invertible \\ = tr (\nabla_{H} f (X)^{T} H (H^{- 1} E_{i j})) \\ = ⟨ H^{- 1} E_{i j}, \nabla_{H} f (X) ⟩_{H} & by definition of this inner product \\ = f^{'} (X; H^{- 1} E_{i j}) & directional derivative w.r.t. this inner product \\ = tr (\nabla f (X)^{T} H^{- 1} E_{i j}) & directional derivative w.r.t. standard inner product \\ = (H^{- 1} \nabla f (X))^{T} E_{i j} & H is symmetric . \end{aligned}

Thus,

\nabla_{H} f (X) = H^{- 1} \nabla f (X) .

9.15.4. Proper Convex Functions#

9.15.4.1. Existence of Directional Derivatives#

An important property of directional derivatives is that if $f$ is a proper convex function then $f$ is directionally differentiable at every $x \in int dom f$ .

Theorem 9.204 (Existence of directional derivatives for convex functions.)

Let $f : V \to (- \infty, \infty]$ be a proper convex function with $S = dom f$ . Let $x \in int S$ . Then, for any $d \in V$ , the directional derivative $f^{'} (x; d)$ exists.

Proof. This is a consequence of the directional differentiability of the scalar convex functions.

Define the convex function $F : R \to R$ as

$F (t) = f (x + t d) .$
Let $I = dom F$ .
Then $I$ is an interval of values for which $x + t y \in S$ .
Since $x \in int S$ , hence $t = 0 \in int I$ .
We now note that

$f^{'} (x; d) = lim_{t ↓ 0} \frac{f (x + t d) - f (x)}{t} = lim_{t ↓ 0} \frac{F (t) - F (0)}{t} = F_{+}^{'} (0) .$

It is the right hand derivative of $F$ at $t = 0$ .
By Theorem 9.200, $F_{+}^{'} (0)$ exists.
Hence $f^{'} (x; d)$ exists for every $x \in int S$ and every $d \in V$ .

Observation 9.7 (Relation between the directional derivatives in opposite directions)

We can see that

f^{'} (x; - d) = lim_{t ↓ 0} \frac{f (x - t d) - f (x)}{t} = lim_{t ↓ 0} \frac{F (- t) - F (0)}{t} = - lim_{r ↑ 0} \frac{F (r) - F (0)}{r} = - F_{-}^{'} (0) .

Hence

F_{-}^{'} (0) = - f^{'} (x; - d) .

By Theorem 9.202

F_{-}^{'} (0) \leq F_{+}^{'} (0) .

Hence

- f^{'} (x; - d) \leq f^{'} (x; d) \forall d \in V .

Observation 9.8 (Directional derivative as infimum)

Let $f : V \to (- \infty, \infty]$ be a proper convex function with $S = dom f$ . Let $x \in int S$ . Then, for any $d \in V$ ,

f^{'} (x; d) = inf_{t > 0} \frac{f (x + t d) - f (x)}{t} .

This follows from the fact that $F (t) = f (x + t d)$ is convex and due to Observation 9.6,

f^{'} (x; d) = F_{+}^{'} (0) = inf_{t > 0} \frac{F (t) - F (0)}{t} = inf_{t > 0} \frac{f (x + t d) - f (x)}{t} .

9.15.4.2. Upper Semicontinuity#

The next result generalizes the upper semicontinuity property of the right hand derivatives of real convex functions.

Theorem 9.205

Let $f : V \to (- \infty, \infty]$ be a proper convex function with $dom f = S$ . Assume that $S$ is an open subset of $V$ . Let ${f_{k}}$ be a sequence of proper convex functions $f_{k} : V \to (- \infty, \infty]$ with $dom f_{k} = S$ with the property that

lim_{k \to \infty} f_{k} (x_{k}) = f (x)

holds true for every $x \in S$ and every sequence ${x_{k}}$ of $S$ that converges to $x$ . Then for any $x \in S$ and any direction $d \in V$ and any sequences ${x_{k}}$ of $S$ and ${d_{k}}$ of $V$ converging to $x$ and $d$ respectively, we have

\underset{k \to \infty}{lim sup} f_{k}^{'} (x_{k}; d_{k}) \leq f^{'} (x; d) .

Furthermore if $f$ is differentiable over $S$ , then $f$ is also continuously differentiable over $S$ .

Proof. Limit superior

Choose any $ϵ > 0$ .
By definition of the directional derivative, there exists a $t > 0$ such that

$\frac{f (x + t d) - f (x)}{t} < f^{'} (x; d) + ϵ .$
Due to Observation 9.8, for every $k$ and every $t > 0$ , we have

$f_{k}^{'} (x_{k}; d_{k}) \leq \frac{f_{k} (x_{k} + t d_{k}) - f (x_{k})}{t} .$
Now

$lim_{k \to \infty} \frac{f_{k} (x_{k} + t d_{k}) - f (x_{k})}{t} = \frac{f (x + t d) - f (x)}{t} .$
Hence for sufficiently large $k$ , we have

$f_{k}^{'} (x_{k}; d_{k}) \leq \frac{f_{k} (x_{k} + t d_{k}) - f (x_{k})}{t} < f^{'} (x; d) + ϵ .$
By taking the limit superior on the L.H.S. as $k \to \infty$ , we have

$\underset{k \to \infty}{lim sup} f_{k}^{'} (x_{k}; d_{k}) \leq f^{'} (x; d) + ϵ .$
Since this is valid for every $ϵ > 0$ , hence we must have

$\underset{k \to \infty}{lim sup} f_{k}^{'} (x_{k}; d_{k}) \leq f^{'} (x; d)$

as desired.

Continuous differentiability

We are given that $f$ is differentiable over $S$ .
Then $f$ is also continuous over $S$ .
Let $x \in S$ .
Let ${x_{k}}$ be a sequence of $S$ converging to $x$ .
Let $d \in V$ be any nonzero direction.
Due to Theorem 9.203, for every $k$ ,

$f^{'} (x_{k}; d) = ⟨ d, \nabla f (x_{k}) ⟩ .$
Hence

$\begin{aligned} \underset{k \to \infty}{lim sup} ⟨ d, \nabla f (x_{k}) ⟩ & = \underset{k \to \infty}{lim sup} f^{'} (x_{k}; d) \\ \leq f^{'} (x; d) \\ = ⟨ d, \nabla f (x) ⟩ . \end{aligned}$
By replacing $d$ with $- d$ in the previous argument, we have

$\begin{aligned} - \underset{k \to \infty}{lim inf} ⟨ d, \nabla f (x_{k}) ⟩ & = \underset{k \to \infty}{lim sup} ⟨ - d, \nabla f (x_{k}) ⟩ \\ = \underset{k \to \infty}{lim sup} f^{'} (x_{k}; - d) \\ \leq f^{'} (x; - d) \\ = - ⟨ d, \nabla f (x) ⟩ . \end{aligned}$
Hence

$\underset{k \to \infty}{lim inf} ⟨ d, \nabla f (x_{k}) ⟩ \geq ⟨ d, \nabla f (x) ⟩ .$
Thus we have

$\underset{k \to \infty}{lim sup} ⟨ d, \nabla f (x_{k}) ⟩ \leq \underset{k \to \infty}{lim inf} ⟨ d, \nabla f (x_{k}) ⟩ .$
But then this must be an equality. Hence

$lim_{k \to \infty} ⟨ d, \nabla f (x_{k}) ⟩ = ⟨ d, \nabla f (x) ⟩ .$
Since this is valid for every nonzero direction $d \in V$ , hence we must have

$lim_{k \to \infty} \nabla f (x_{k}) = \nabla f (x) .$
Hence $\nabla f$ is continuous at every $x \in S$ .
Hence $f$ is continuously differentiable at every $x \in S$ .

9.15.4.3. Directional Derivatives Map#

The existence of directional derivatives in all directions allows us to consider a mapping from a direction $d \in V$ to the directional derivative of $f$ in this direction at $x$ . We can define a directional derivative map parameterized by $x \in S$ as:

g_{x} (d) ≜ f^{'} (x; d) = lim_{α ↓ 0} \frac{f (x + α d) - f (x)}{α} .

We shall refer to such maps by $d \mapsto f^{'} (x; d)$ .

Theorem 9.206 (Convexity and homogeneity of $d \mapsto f^{'} (x; d)$ )

Let $f : V \to (- \infty, \infty]$ be a proper convex function with $S = dom f$ . Let $x \in int S$ . Then, the function $d \mapsto f^{'} (x; d)$ is convex and nonnegative homogeneous.

Nonnegative homogeneity: For any $t \geq 0$ and $d \in V$ ,

f^{'} (x; t d) = t f^{'} (x; d) .

Proof. Convexity

Let $d_{1}, d_{2} \in V$ and $t \in (0, 1)$ .
Let $d = t d_{1} + (1 - t) d_{2}$ .
Then,

$\begin{aligned} f^{'} (x; d) & = f^{'} (x; t d_{1} + (1 - t) d_{2}) \\ = lim_{α ↓ 0} \frac{f (x + α [t d_{1} + (1 - t) d_{2}]) - f (x)}{α} \\ = lim_{α ↓ 0} \frac{f (t x + α t d_{1} + (1 - t) x + α (1 - t) d_{2}) - f (x)}{α} \\ = lim_{α ↓ 0} \frac{f (t (x + α d_{1}) + (1 - t) (x + α d_{2})) - f (x)}{α} \\ \leq lim_{α ↓ 0} \frac{t f (x + α d_{1}) + (1 - t) f (x + α d_{2}) - t f (x) - (1 - t) f (x)}{α} \\ = t lim_{α ↓ 0} \frac{f (x + α d_{1}) - f (x)}{α} + (1 - t) lim_{α ↓ 0} \frac{f (x + α d_{2}) - f (x)}{α} \\ = t f^{'} (x; d_{1}) + (1 - t) f^{'} (x; d_{2}) . \end{aligned}$

We used the convexity property of $f$ in this derivation.
Thus, $f^{'} (x; d)$ is convex.

Nonnegative homogeneity

For $t = 0$ ,

$f^{'} (x, 0 d) = f^{'} (x, 0) = 0 = 0 f^{'} (x; d) .$

Thus, the homogeneity property is trivial for $t = 0$ .
Now consider $t > 0$ .
Then,

$\begin{aligned} f^{'} (x; t d) & = lim_{α ↓ 0} \frac{f (x + α t d) - f (x)}{α} \\ = t lim_{α ↓ 0} \frac{f (x + α t d) - f (x)}{α t} \\ = t f^{'} (x; d) . \end{aligned}$
Thus, $f^{'} (x; d)$ is nonnegative homogeneous.

9.15.4.4. As Linear Underestimator#

Directional derivatives are a linear underestimator for convex functions.

Theorem 9.207 (Directional derivative as linear underestimator)

Let $f : V \to (- \infty, \infty]$ be a proper convex function with $S = dom f$ . Let $x \in int S$ . Then, for every $y \in S$

f (y) \geq f (x) + f^{'} (x; y - x) .

Proof. Note that

\begin{aligned} f^{'} (x; y - x) & = lim_{α ↓ 0} \frac{f (x + α (y - x)) - f (x)}{α} \\ = lim_{α ↓ 0} \frac{f ((1 - α) x + α y) - f (x)}{α} \\ \leq lim_{α ↓ 0} \frac{(1 - α) f (x) + α f (y) - f (x)}{α} \\ = lim_{α ↓ 0} \frac{α (f (y) - f (x))}{α} \\ = lim_{α ↓ 0} (f (y) - f (x)) = f (y) - f (x) . \end{aligned}

Thus,

f (y) \geq f (x) + f^{'} (x; y - x) .

9.15.5. Pointwise Maximum of Finite Set of Functions#

9.15.5.1. Directional Derivative#

Theorem 9.208 (Directional derivative of a maximum of functions)

Let $f_{1}, f_{2}, \dots, f_{m} : V \to (- \infty, \infty]$ be proper functions. Let $f : V \to (- \infty, \infty]$ be defined as

f (x) = max {f_{1} (x), \dots, f_{m} (x)}

with $dom f = ⋂_{i = 1}^{m} dom f_{i}$ .

Let $x \in int dom f = ⋂_{i = 1}^{m} int dom f_{i}$ and $d \in V$ . Assume that $f^{'} (x; d)$ exists for every $i \in 1, \dots, m$ .

Let $I (x) = {i \in 1, \dots, m | f_{i} (x) = f (x)}$ be the set of indices of functions whose value at $x$ equals $f (x)$ . Then,

f^{'} (x; d) = max_{i \in I (x)} f^{'} (x; d) .

In other words, the directional derivative of a pointwise maximum of functions equals the maximum of directional directives of functions which attain the pointwise maximum at a specific point.

Proof. The key idea here is that for computing the directional derivative $f^{'} (x; d)$ , only those functions are relevant for which $f_{i} (x) = f (x)$ . We need to show this first.

Since $x \in int dom f$ , there exists $B (x, r)$ such that $f$ and $f_{i}$ are all defined over this open ball.
Let $s = \frac{r}{‖ d ‖}$ .
For every $i \in 1, \dots, m$ , let $g_{i} : R \to R$ be defined as

$g_{i} (t) = f_{i} (x + t d)$

with $dom g_{i} = [0, s)$ . $‖ s d ‖ = r$ . Thus, $x + t d \in B (x, r)$ . Hence, $g_{i}$ are well defined.
Then,

$\begin{aligned} lim_{t \to 0 +} g_{i} (t) & = lim_{t \to 0 +} f_{i} (x + t d) \\ = lim_{t \to 0 +} [(f_{i} (x + t d) - f_{i} (x)) + f_{i} (x)] \\ = lim_{t \to 0 +} [t \frac{f_{i} (x + t d) - f_{i} (x)}{t} + f_{i} (x)] \\ = 0 \cdot f_{i}^{'} (x; d) + f_{i} (x) \\ = f_{i} (x) = g_{i} (0) . \end{aligned}$

We used the fact that $f_{i}^{'} (x; d)$ exists for every $f_{i}$ .
Thus, $g_{i}$ is continuous from the right at $t = 0$ for every $i \in 1, \dots, m$ .
Let $i \in I (x)$ and $j \notin I (x)$ .
Then, $f_{i} (x) > f_{j} (x)$ . Alternatively $g_{i} (0) > g_{j} (0)$ .
Since $g_{i}, g_{j}$ are continuous from the right, hence there exists $ϵ_{i j} > 0$ such that $g_{i} (t) > g_{j} (t)$ for every $t \in [0, ϵ_{i j}]$ .
Minimizing $ϵ_{i j}$ over all pairs of $i \in I (x)$ and $j \notin I (x)$ , there exists $ϵ > 0$ such that for any $i \in I (x)$ and $j \notin I (x)$ ,

$f_{i} (x + t d) = g_{i} (t) > g_{j} (t) = f_{j} (x + t d) \forall t \in [0, ϵ] .$

We can now compute the directional derivative.

For every $t \in [0, ϵ]$ ,

$f (x + t d) = max_{i = 1, \dots, m} f_{i} (x + t d) = max_{i \in I (x)} f_{i} (x + t d) .$
Consequently, for any $t \in (0, ϵ]$

$\begin{aligned} \frac{f (x + t d) - f (x)}{t} & = \frac{max_{i \in I (x)} f_{i} (x + t d) - f (x)}{t} \\ = \frac{max_{i \in I (x)} (f_{i} (x + t d) - f_{i} (x))}{t} \\ = max_{i \in I (x)} \frac{f_{i} (x + t d) - f_{i} (x)}{t} . \end{aligned}$

We used the fact that $f_{i} (x) = f (x)$ for every $i \in I (x)$ .
Taking the limit $t ↓ 0$ ,

$\begin{aligned} f^{'} (x; d) & = lim_{t ↓ 0} \frac{f (x + t d) - f (x)}{t} \\ = lim_{t ↓ 0} max_{i \in I (x)} \frac{f_{i} (x + t d) - f_{i} (x)}{t} \\ = max_{i \in I (x)} lim_{t ↓ 0} \frac{f_{i} (x + t d) - f_{i} (x)}{t} \\ = max_{i \in I (x)} f_{i}^{'} (x; d) . \end{aligned}$

9.15.5.2. Finite Set of Convex Functions Case#

Theorem 9.209 (Directional derivative of pointwise maximum of convex functions)

Let $f_{1}, f_{2}, \dots, f_{m} : V \to (- \infty, \infty]$ be proper convex functions. Let $f : V \to (- \infty, \infty]$ be defined as

f (x) = max {f_{1} (x), \dots, f_{m} (x)}

with $dom f = ⋂_{i = 1}^{m} dom f_{i}$ .

Let $x \in int dom f$ and $d \in V$ . Then,

f^{'} (x; d) = max_{i \in I (x)} f^{'} (x; d) .

where $I (x) = {i \in 1, \dots, m | f_{i} (x) = f (x)}$ .

Proof. Since $f_{i}$ are proper convex, hence their pointwise maximum $f$ is proper convex.

By Theorem 9.204, the directional derivatives $f^{'} (x; d)$ and $f_{i}^{'} (x; d)$ for $i = 1, \dots, m$ exist.

By Theorem 9.208,

f^{'} (x; d) = max_{i \in I (x)} f^{'} (x; d) .

where $I (x) = {i \in 1, \dots, m | f_{i} (x) = f (x)}$ .

Topics in Signal Processing

Directional Derivatives

Contents

9.15. Directional Derivatives#

9.15.1. Convex Real Functions#

9.15.1.1. Characterization#

9.15.1.2. One Sided Derivatives#

9.15.1.3. Continuity#

9.15.1.4. Properties of One Sides Derivatives#

9.15.2. Directional Derivatives of Proper Functions#

9.15.3. Differentiability#

9.15.3.1. Differentiability of Proper Functions#

9.15.3.2. Gradient and Directional Derivatives#

9.15.3.3. Gradient in $R^{n}$ #

9.15.3.4. Gradient in $R^{m \times n}$ #

9.15.4. Proper Convex Functions#

9.15.4.1. Existence of Directional Derivatives#

9.15.4.2. Upper Semicontinuity#

9.15.4.3. Directional Derivatives Map#

9.15.4.4. As Linear Underestimator#

9.15.5. Pointwise Maximum of Finite Set of Functions#

9.15.5.1. Directional Derivative#

9.15.5.2. Finite Set of Convex Functions Case#

Topics in Signal Processing

Directional Derivatives

Contents

9.15. Directional Derivatives#

9.15.1. Convex Real Functions#

9.15.1.1. Characterization#

9.15.1.2. One Sided Derivatives#

9.15.1.3. Continuity#

9.15.1.4. Properties of One Sides Derivatives#

9.15.2. Directional Derivatives of Proper Functions#

9.15.3. Differentiability#

9.15.3.1. Differentiability of Proper Functions#

9.15.3.2. Gradient and Directional Derivatives#

9.15.3.3. Gradient in Rn#

9.15.3.4. Gradient in Rm×n#

9.15.4. Proper Convex Functions#

9.15.4.1. Existence of Directional Derivatives#

9.15.4.2. Upper Semicontinuity#

9.15.4.3. Directional Derivatives Map#

9.15.4.4. As Linear Underestimator#

9.15.5. Pointwise Maximum of Finite Set of Functions#

9.15.5.1. Directional Derivative#

9.15.5.2. Finite Set of Convex Functions Case#

9.15.3.3. Gradient in $R^{n}$ #

9.15.3.4. Gradient in $R^{m \times n}$ #