Block transform
Wavelet packet bases are designed by dividing the frequency axis in intervals of varying sizes. These bases are particularly well adapted to decomposing signals that have different behavior in different frequency intervals. If f has properties that vary in time, it is then more appropriate to decompose f in a block basis that segments the time axis in intervals with sizes that are adapted to the signal structures.
Block Bases
Block orthonormal bases are obtained by dividing the time axis in consecutive intervals [ap,ap + 1] with
limp → − ∞ap = − ∞ and limp → ∞ap = ∞.
The size lp = ap + 1 − ap of each interval is arbitrary. Let g = 1[0,1]. An interval is covered by the dilated rectangular window
$g_p(t)=1_{[a_p,a_{p+1}]}(t)=g({t-a_p \over l_p}).$
Theorem 1. constructs a block orthogonal basis of L2(ℝ) from a single orthonormal basis of L2[0,1].
Theorem 1.
if {ek}k ∈ ℤ is an orthonormal basis of L2[0,1], then
$\{g_{p,k}(t)=g_p(t)\frac{1}{\sqrt{l_p}}e_k(\frac{t-a_p}{l_p})\}_{(p,k)\in \mathbb{Z}}$
is a block orthonormal basis of L2(ℝ)
Proof
One can verify that the dilated and translated family
$\{\frac{1}{\sqrt{l_p}}e_k(\frac{t-a_p}{l_p})\}_{(p,k)\in \mathbb{Z}}$
is an orthonormal basis of L2[ap,ap + 1]. If p ≠ q, then ⟨gp, k, gq, k⟩ = 0 since their supports do not overlap. Thus, the family $\{g_{p,k}(t)=g_p(t)\frac{1}{\sqrt{l_p}}e_k(\frac{t-a_p}{l_p})\}_{(p,k)\in \mathbb{Z}}$ is orthonormal. To expand a signal f in this family, it is decomposed as a sum of separate blocks
$f(t)=\sum_{p=-\infty}^{+\infty}f(t)g_p(t),$
and each block f(t)gp(t) is decomposed in the basis $\{\frac{1}{\sqrt{l_p}}e_k(\frac{t-a_p}{l_p})\}_{(p,k)\in \mathbb{Z}}$
Block Fourier Basis
A block basis is constructed with the Fourier basis of L2[0,1]:
{ek(t) = exp(i2kπt)}k ∈ ℤ
The time support of each block Fourier vector gp, k is [ap,ap + 1] of size lp. The Fourier transform of g = 1[0,1] is
$\hat{g}(w)=\frac{\sin(w/2)}{w/2}exp(\frac{iw}{2})$
and
$\hat{g}_{p,k}(w)=\sqrt{l_p}\hat{g}(l_pw-2k\pi)exp(\frac{-i2\pi ka_p}{l_P}).$
It is centered at 2kπlp−1 and has a slow asymptotic decay proportional to lp−1|w|−1. Because of this poor frequency localization, even though a signal f is smooth, its decomposition in a block Fourier basis may include large high-frequency coefficients. This can also be interpreted as an effect of periodization.
Discrete Block Bases
For all p ∈ ℤ, suppose that ap ∈ ℤ. Discrete block bases are built with discrete rectangular windows having supports on intervals [ap,ap − 1]:
gp[n] = 1[ap,ap + 1−1](n).
Since dilations are not defined in a discrete framework, bases of intervals of varying sizes from a single basis cannot generally be derived. Thus, Theorem 2 supposes an orthonormal basis of ℂl for any l > 0 can be constructed. The proof is:
Theorem 2.
Suppose that {ek, l}0 ≤ k < l is an orthogonal basis of ℂl for any l > 0. The family
{gp, k[n] = gp[n]ek, lp[n−ap]}0 ≤ k < lp, p ∈ ℤ
is a block orthonormal basis of l2(ℤ).
A discrete block basis is constructed with discrete Fourier bases
$\{e_{k,l[n]}=\frac{1}{\sqrt{l}}exp(\frac{i2\pi kn}{l})\}_{0\leqslant k<l}$
The resulting block Fourier vectors gp, k have sharp transitions at the window border, and thus are not well localized in frequency. As in the continuous case, the decomposition of smooth signals f may produce large-amplitude, high-frequency coefficients because of border effects.
Block Bases of Images
General block bases of images are constructed by partitioning the plane ℝ2 into rectangles {[ap,bp] × [cp,dp]}p ∈ ℤ of arbitrary length lp = bp − ap and width wp = dp − cp. Let {ek}k ∈ ℤ be an orthonormal basis of L2[0,1] and g = 1[0,1]. The following can be denoted:
$g_{p,k,j}(x,y)=g(\frac{x-a_p}{l_p})g(\frac{y-c_p}{w_p})\frac{1}{\sqrt{l_pw_p}}e_k(\frac{x-a_p}{l_p})e_j(\frac{y-c_p}{w_p})$.
The family {gp, k, j}(p,k,j) ∈ ℤ3 is an orthonormal basis of L2(ℝ2).
For discrete images, discrete windows that cover each rectangle can be defined
gp = 1[ap,bp−1] × [cp,dp−1].
If {ek, l}0 ≤ k < l is an orthogonal basis of ℂl for any l > 0, then
{gp, k, j[n1,n2] = gp[n1,n2]ek, lp[n1−ap]ej, wp[n2−cp]}(k,j,p) ∈ (Z)3
is a block basis of l2(ℝ2)
References
- St´ephane Mallat, A Wavelet Tour of Signal Processing, 3rd