Dictionaries II
Contents
18.9. Dictionaries II#
This section continues the development of dictionaries for sparse and redundant representations.
18.9.1. Spark#
We present some more results on spark of a dictionary.
18.9.1.1. Upper Bounds for Spark#
Whenever a set of atoms in a dictionary are linearly dependent,
the dependence corresponds to some vector in its null space. Thus,
identifying the spark of a dictionary essentially amounts of
sifting through the vectors in its null space and finding
one with smallest
Note that the solution
The
-th problem constrains the solution to choose atom from the dictionary.Since the minimal set of linearly dependent atoms in
will contain at least two vectors, hence would correspond to the optimal value of one (or more) of the problems (18.58).Formally, if we denote
as an optimal vector for the problem (18.58), thenThus, solving (18.57) is equivalent to solving all
problems specified by (18.58) and then finding the minimum -“norm” among them.The problems (18.58) are still computationally intractable.
We now change each of the
We formally state the upper bound on
Theorem 18.70
Let
where
18.9.2. Coherence#
In this subsection we develop some more bounds using coherence of a dictionary.
As usual, we will be considering an overcomplete dictionary
Theorem 18.71
Suppose that
Equivalently, the rows of
Proof. We recall that the operator norm
The following definition is due to [22, 28].
Definition 18.33
Let
This quantity was introduced in [28] for developing
more accurate bounds compared to bounds based on coherence. At that
time the idea of Babel function was not available. A careful examination reveals
that
Theorem 18.72
Proof. Since
Thus
Since
The following result is due to [28].
Theorem 18.73
Proof. We proceed as follows.
Let
.Then
Subtracting both sides with
we getLet
.By taking columns indexed by
from and corresponding entries in , we can write:Taking
norm on both sides we getWe know that
It is easy to see that:
Thus
This gives us
But
is nothing but the maximum sum of magnitudes of off diagonal entries in along a row in .Consider any row in
.One of the entries in the row (on the main diagonal of
) is 0.Thus, there are a maximum of
nonzero entries in the row. is smallest when .For such a
, there exists a row in such that the sum of off diagonal entries in the row exceeds 1.Let
denote the minimum number of off diagonal elements on a row or a column of such that the sum of their magnitudes exceeds one.Clearly
It is easy to see that
i.e. minimum number of off diagonal elements summing up to 1 or more is at least twice the minimum number of off diagonal elements summing up to
or more on any row (or column due to Hermitian property).Thus
Rewriting, we get
18.9.3. Babel function#
In this subsection, we provide a more general development of Babel function for a pair of dictionaries.
When we consider a single dictionary, we will use
as the dictionary.When considering a pair of dictionaries of equal size, we would typically label them as
and with both .We will assume that the dictionaries are full rank as they span the signal space
.Why a pair of dictionaries?
We consider
as a modeling dictionary from which the sparse signalsare built.
on the other hand is the sensing dictionary which will be used to compute correlations with the signal and try to estimate the approximation .Ideally,
and should be same.But in real life, we may not know
correctly.Hence,
would be a dictionary slightly different from .
18.9.4. p-Babel Function#
See [43] for reference.
Definition 18.34 (
Consider an index set
What is going on here?
Consider the row vector
This vector contains inner products of modeling atoms in
indexed by with the sensing atom .Now
This is the term in (18.60).
Thus
is a measure of the correlation of the sensing atom with a group of modeling atoms in indexed by using the -norm. attempts to find out a sensing atom from outside the index set which is most correlated to the group of modeling atoms in indexed by and returns the maximum correlation value.Different choices of
-norm lead to different correlation values.
We can also measure a correlation of sensing and modeling
atoms inside the index set
Definition 18.35 (Complementary
A complement to the
Finally
So far, we have focused our attention to a specific index set
Definition 18.36 (
The Babel function for a pair of dictionaries
Correspondingly, the complement of Babel function is defined as
It is straightforward to see that
Now consider the special case where
We obtain
Further by choosing
Finally compare this definition of
where the vector
We also know that
Careful observation shows that the definitions of
18.9.5. Exact Recovery Coefficient#
we introduce a measure of similarity between a subdictionary and the remaining atoms from the dictionary known as the exact recovery coefficient.
Definition 18.37 (Exact recovery coefficient)
The Exact Recovery Coefficient
[77, 78, 79] for a subdictionary
We will also use the notation
The quantity is called exact recovery coefficient
since for a number of algorithms the criteria
18.9.5.1. ERC and Babel Function#
We present a lower bound on
Theorem 18.74 (ERC lower bound: babel function)
Suppose that
It follows that
Proof. 1. Let us expand the pseudo-inverse
For the Gram matrix
we recall from Theorem 18.37 that:For the other term we have
Thus, we get
Putting back in the definition of Exact Recovery Coefficient:
This completes the bound on ERC.
Now, we verify the condition for
.Thus, if
, then the lower bound on is positive leading to .
18.9.5.2. ERC and Coherence#
On the same lines we develop a coherence bound for ERC.
Theorem 18.75 (ERC lower bound: coherence)
Suppose that
It follows that
Proof. .
Following the proof of Theorem 18.74 for the Gram matrix
have:For the other term we have
Thus, we get
Putting back in the definition of Exact Recovery Coefficient:
This completes the bound on ERC.
Now, we verify the condition for
.And
Thus
ensures that both numerator and denominator for the coherence lower bound on are positive leading to .
A more accurate bound on
Theorem 18.76
where
Proof. .
Assuming
, we haveFrom Theorem 18.75, we have
Thus under the given conditions, we have
We also need to show that under these conditions
We can see that:
18.9.5.3. Geometrical Interpretation of ERC#
Definition 18.38 (Antipodal convex hull of a subdictionary)
The antipodal convex hull [78] of a subdictionary
It is the smallest convex set that contains every unit multiple of every atom.
We recall that
is the orthogonal projector on to the column space of .Therefore
is a coefficient vector which can be used to synthesize this projection.In other words:
Thus, the quantity
measures how far the projected atom lies from the boundary of .If every projected atom lies well within the antipodal convex hull, then it is possible to recover superpositions of atoms from
.This happens because coefficient associated with an atom outside
must be quite large to represent anything in the span of the subdictionary whenever .
18.9.6. Dirac DCT Dictionary#
Theorem 18.77
The
In particular, the standard Babel function is given by
Proof. TODO prove it.