Separation Theorems
Contents
9.12. Separation Theorems#
Separation theorems provide us ways to find a separating hyperplane between two disjoint convex sets. Hyperplanes require the notion of an inner product.
Throughout this section, we assume that
9.12.1. Types of Separating Hyperplanes#
Definition 9.59 (Separating hyperplane)
Let
Let
where
If
or
holds true.
We can also say that the two sets
Any choice of
This definition allows for some degenerate possibilities where
both
Definition 9.60 (Proper separation)
Let
Proper separation still allows for the possibility that
parts of
If
or
holds true.
Also, there exists at least one
We can also say that the two sets
and
Any choice of
If a convex set has a nonempty interior, then it cannot be contained in a hyperplane. Two disjoint sets one of which has a nonempty interior can indeed be properly separated.
Definition 9.61 (Strong separation)
Let
Under these assumptions, one set lies in the interior of
We can also say that the two sets
Any choice of
Strong separation is called as strict separation in [9].
Remark 9.9
Separation is translation invariant.
For any
9.12.2. Disjoint Affine and Convex Sets#
Separation between sets can be discussed at several levels. We start with a simple case where an affine set and a relatively open convex set are disjoint.
Theorem 9.160 (Disjoint affine and relatively open convex sets)
Let
Proof. Consider the case where
If for contradiction,
is not contained in one of the open halfspaces of .Then,
since is the boundary of the halfspaces.It will contradict our hypothesis that
.Thus,
must be in one of the open halfspaces of .
Now, assume that
Without loss of generality, assume that
The proof is constructive. We iteratively construct
subspaces of increasing dimension which contain
Let
where .Let
be the initial subspace.Let
be the number of iterations (if is a hyperplane, no iterations are needed).For
:Replace
by a subspace such that and .Let
We have arrived with a hyperplane
such that .
Towards this, let us look at the procedure to
construct
Let
be the orthogonal complement of .Pick any 2D subspace
of .Pick a line
in which passes through origin but doesn’t intersect with ; i.e. .Construct the subspace
. The direct sum is valid since .Note that
and since as well as .
We now elaborate the procedure for finding the line
We have
as a linear subspace with and . .Since
, it contains a two dimensional subspace .Consider the set
.The set
has some properties. is convex since it is the sum of two convex sets.Since
, hence .Since
, hence .Note that due to Theorem 9.157:
since
is relatively open and is affine, hence relatively open.Thus,
is relatively open.
Consequently, the set
also has some properties. is convex since it is the intersection of two convex sets.Since
, hence, . is relatively open.If
is empty, then is relatively open vacuously.Otherwise, by Theorem 9.155:
Thus,
implies that is relatively open.
We now seek a line
passing through the origin that doesn’t intersect .We note that
.Suppose for contradiction that
but .Let
.Then,
.Also,
.Thus,
.This means that
.In other words,
which contradicts our assumption.
How to choose
? There are three possibilities: is empty. is contained in a line; i.e., . and .
cannot be larger than since and is affine.If
is empty, then any line in passing through origin will do.If
is a line.If
passes through the origin, we can take a line perpendicular to passing through origin. In this case, . Since , hence .If
is a line that doesn’t pass through , we can simply take a line that is parallel to and passes through . In this case . Consequently, .
If
is the entire 2D subspace .Consider the set
. (specifically, for ). is a convex cone containing but not containing . is relatively open since is relatively open.A boundary ray of
doesn’t intersect with since is relatively open.Take
to be any of the two boundary rays of and extend it to a line passing through .
9.12.3. Proper Separation Characterization#
Theorem 9.161 (Characterization of proper separation)
Let
and
Proof. Assume that there is some vector
Since
satisfies the second strict inequality, hence . Otherwise, the second inequality cannot hold.Choose any
such thatThen, the set
is a hyperplane.Let
and .Clearly,
and .The second strict inequality ensures that both
and cannot be contained in .For contradiction, assume
andThen
and .Thus, the second inequality will not hold.
Thus,
and are properly separated.
Now, assume that
Let
be the specification of the said hyperplane.Let
and .Without loss of generality, assume that
and .Then,
for every .Thus,
.Similarly,
for every .Thus,
.Thus, the first inequality is satisfied.
Since
properly separates and , hence either or .If
, then there exists such that .Consequently,
.If
, then there exists such that .Consequently,
.Combining,
must be true.
9.12.3.1. Proper Separation : Set and Point#
Remark 9.10 (Proper separation between a set and a point)
Let
Then
Hence,
We now consider a hyperplane
And let
We can clearly see that
. . is contained entirely in the closed half-space . . is not contained entirely in .
Thus, if a set and a point are properly separated, then there exists a hyperplane that passes through the point, contains the set in one of its half-spaces but does not fully contain the set itself.
9.12.4. Separating Hyperplane Theorems#
Theorem 9.162 (Separating hyperplane theorem I)
Let
In other words, two sets are properly separated if and only if their relative interiors are disjoint.
Proof. Let
By Theorem 9.142,
and are nonempty since and are nonempty convex sets.By Theorem 9.157,
and it is nonempty.Note that,
if and only if .If
then there exists such that .
Now assume that
Thus,
.Consider the affine set
.Clearly,
and is relatively open.By Theorem 9.160, there exists a hyperplane containing
such that is a subset of one of its associated open halfspaces.Let
be this hyperplane given by . Since the hyperplane contains , hence it is a subspace.By Theorem 9.149,
.Thus,
is contained in the corresponding closed halfspace.Without loss of generality, assume that
is contained in the nonnegative halfspace .Thus,
.This means that
Thus,
Since
, there exists , such that .Thus,
.But then,
Thus,
Then, by Theorem 9.161,
and are properly separated.
Now, for the converse, assume that
and
From the first inequality, we get that
.From the second inequality, we get that
.Thus, there exists a hyperplane
given by such that the corresponding nonnegative closed halfspace contains .Note that
Since
, hence . but .Thus, by Theorem 9.156,
.Thus,
.Thus,
.
We are done.
Corollary 9.10 (Separating hyperplane theorem II)
Let
Then, there exists a hyperplane that properly separates them.
Proof. Since
This result is stronger than proposition 2.4.2 in [9].
Disjointness of convex sets is not enough for strong separation as their closures might meet.
Theorem 9.163 (Separating hyperplane theorem III)
Let
In other words, a point can be properly separated from a convex set if and only if it doesn’t belong to its relative interior.
Proof. .
We form a set
.Then
(Theorem 9.133).By Theorem 9.162,
and are properly separated if and only ifIn other words,
In other words,
.
9.12.5. Strong Separation Characterization#
Theorem 9.164 (Characterization of strong separation)
Let
In other words,
Proof. Let
By Theorem 4.51,
Thus,
Assume that
Then, there exists an
such that and .Since
, hence .Then, for any
and and every ,as
would mean thatNote that,
.Then,
must be true for every and .For contradiction, assume that
for some and .Let
.Let
.Note that
and .Thus,
.Then,
.Then,
.This contradicts the condition that
for every , and .
Thus,
.
Conversely, assume that
Let
where .Then,
.Then,
.For contradiction, assume that
.Let
.Then, there exists
and such that .And, there exists
and such that .Then,
.Thus,
.Thus,
This contradictions our hypothesis that
.
Note that
and are convex and disjoint.Then, due to Corollary 9.10, there exists a hyperplane
which separates and properly.We can choose
so that and .But then,
and lie in the opposite open halfspaces.Thus,
and are strongly separated.
9.12.6. Strong Separation Conditions#
We describe several conditions which are sufficient to achieve strong separation between two sets. See also strict separation theorem (proposition 2.4.3) of [9].
Theorem 9.165 (Strong separation: closed subtraction)
Let
Proof. We proceed as follows.
Since
and are disjoint hence .Since
is closed, hence .By Theorem 9.164,
and are strongly separated.
Theorem 9.166 (Strong separation: closed and compact sets)
Let
Proof. We proceed as follows.
Since
and are disjoint hence .Since
is closed, hence .By Theorem 9.164,
and are strongly separated.
9.12.6.1. Recession Cones#
The conditions below are based on the theory of recession cones and lineality spaces of convex sets. See Recession Cones for a background.
Theorem 9.167 (Strong separation: recession cones)
Let
where
Proof. Due to Corollary 9.18,
9.12.6.2. Strongly Separating Hyperplane#
Recall that the (orthogonal) projection of a vector
Theorem 9.168 (Strongly separating hyperplane)
Let
Consider the vector of minimum norm (projection of the origin)
in
Let
.Let
.Let
.
Then the hyperplane
strongly separates
Proof. .
Since
and are disjoint hence .Hence
. is the nearest point in from . is the nearest point in from .The line segment
connects these nearest points. lies on this line segment.Accordingly,
is projection of in .Similarly
is projection of in .Then, due to Theorem 10.7 (orthogonal projection characterization), for every
But
Hence
.Hence
.Further
since
.Hence for every
, we have .A similar argument shows that for every
, we have .
9.12.7. Closed Convex Sets#
Theorem 9.169 (Strict separation theorem)
Let
In other words, there exists a separating hyperplane
such that
By choosing any
Proof. This is an application of strong separation.
Define a set
.Since
is closed and , hence is not a closure point of .Thus, there exists
such that .Thus,
.Then, by Theorem 9.164,
and are strongly separated.Let
be a hyperplane which strongly separates them.Then one of the closed halfspaces of
contains but not .Let
be described byWe can negate
if necessary so thatAccordingly,
.
Theorem 9.170 (Closed Convex = Intersection of Halfspaces)
Let
Proof. The set
The empty set is closed and convex and every halfspace
contains it. The intersection of all halfspaces
is the empty set. Thus, the statement is vacuously true.
Let us assume that
Let
such that .Since
is closed, hence is not a closure point of .Thus, there exists
such that .Thus,
. Specifically, .Thus, by Theorem 9.164,
and are strongly separated by some hyperplane .Thus, one of the corresponding closed halfspaces contains
but not .In other words, for every
, there exists a closed halfspace that doesn’t contain but contains .Thus, the intersection of all halfspaces containing
cannot contain any element from .If the intersection contained an element
, then there would be a closed halfspace containing but not which would mean that the intersection of all halfspaces containing cannot contain .
Hence, the intersection of all closed halfspaces containing
is exactly equal to .
We now have two different characterizations of closed convex sets.
From Theorem 9.122, a closed and convex set is the closure of the union of all the line segments connecting the points of the set.
From Theorem 9.170, a closed and convex set is the intersection of all the closed half-spaces containing it.
9.12.8. Supporting Hyperplanes#
Definition 9.62 (Supporting hyperplane and halfspaces)
Let
If there exists a nonzero vector
is called a supporting hyperplane of
Convex sets have this beautiful property that there exists a supporting hyperplane at every point in the boundary of the set.
Theorem 9.171 (Supporting hyperplane theorem)
Let
Proof. Consider the case where
Then, due to Theorem 9.125,
.Thus, there exists a hyperplane
such that .Since
is finite dimensional, hence is closed (Theorem 4.200).Thus,
.Thus,
.This
trivially separates and as both are contained in .
Now, assume that
. .Since
, hence the sets and are disjoint.We have
.By Theorem 9.162, there exists a hyperplane
that separates and properly.Consequently
lies entirely in one of the closed halfspaces of .
Corollary 9.11
Let