|
QR-RLS Blind Linear MMSE Detector
Dec 19,2008 00:00
by
admin
QR-RLS Blind Linear MMSE DetectorAssume that Cr[i] is positive definite. Let
be the Cholesky decomposition (i.e., C[i] is the unique upper triangular Cholesky factor with positive diagonal elements). Define the following quantities:
At time i, the a posteriori least-squares (LS) estimate is given by
The a priori LS estimate at time i is given by
It can be shown that x[i] and z[i] are related by [389]
Suppose that C[i – 1] and u[i – 1] are available from the previous recursion. At time i the new observation r[i] becomes available. We construct a block matrix consisting of C[i – 1], u[i – 1], and r[i] and apply an orthogonal transformation as follows:
In (2.62) the matrix Q[i] which zeros the first N elements on the last row of the partitioned matrix appearing on the left-hand side of (2.62), is an orthogonal matrix consisting of N Givens rotations,
where Qn[i] zeros the nth element in the last row by rotating it with the (n + 1)th row. An individual rotation is specified by two scalars, cn and sn (which can be regarded as the cosine and sine, respectively, of a rotation angle fn), and affects only the last row and the (n + 1)th row. The effects on these two rows are
where the rotation factors are defined by
The correctness of (2.62) is shown in the Appendix (Section 2.8.1). It is seen from (2.62) that the computed quantities appearing on the right-hand side are C[i], u[i], v[i], at time n. It is also shown in the Appendix (Section 2.8.1) that the quantities a[i], z[i], and x[i] can be updated according to the equations
Note that g[i] in (2.62) is the last diagonal element of Q[i]. A direct calculation shows that The initialization of the QR-RLS blind adaptive algorithm is
given by Algorithm 2.4: [QR-RLS blind linear MMSE detector—synchronous CDMA]
The orthogonal transformation (2.62) on the block matrix can be mapped onto a triangular systolic array for highly efficient parallel implementation, which is discussed next. Parallel Implementation on Systolic ArraysThe QR-RLS blind adaptive algorithm derived above has good
numerical properties and is well suited for parallel implementation. Figure 2.1 shows systematically a systolic
array implementation of this algorithm, using a triangular array first proposed
in [316]. It
consists of three sections: the basic upper triangular array, which stores and
updates C[i]; the right-hand column of cells, which stores and
updates u[i]; and the final processing cell, which computes the
demodulated data bit. The system is initialized as Figure 2.1. Systematic of the systolic array implementation of the QR-RLS blind adaptive multiuser detection algorithm (N = 4, K = 2) and the operations at each cell.
The systolic array in Fig. 2.1 operates in a highly pipelined manner. The computational wavefront propagates at the received data symbol rate. The demodulated data bits are also output at the received data symbol rate. Note that the demodulated data bit produced on a given clock corresponds to the received vector entered 2N clock cycles earlier. If multiple synchronous user data streams need to be demodulated, we can simply add more column arrays on the right-hand side and initialize each of them by the corresponding signature vector of each user. It is clear that by using the same triangular array, multiple users' data can be demodulated simultaneously. This is also illustrated in Fig. 2.1 for the case of two users. (Also, multiple paths of the same signal can be handled by adding appropriate linear arrays to Fig. 2.1.) |