Formulation of the DA based block ADFE - (1)surya prakash matcha H I G H P E R F O R M A N C E

5.4 formulation of the da based block adfe 66

5.4 formulation of the da based block adfe 67

1: loop

fork=1 :(N/L)do%Block index

fori=1 :number_o f_all_symbol_combinationsdo

decisions(((k−¹)∗^L+2):((k−¹)∗^L+L))) =all_comb_decisions(i); f b f_output= f ilter((decisions(((k−¹)∗^L+2):((k−¹)∗^L+L))); adder_output= f f f_output+ f b f_output;

for j=1 : Ldo

slicer_output(j) =bpsk_decision_device(adder_output(j)); end for

decisions((k−¹)∗^L+1 :k∗^L) =slicer_output;

e_vec= desired_signal_train((k−¹)∗^L+1 :k∗^L)−adder_output;

%For MAD case count=0;

foriii =1 :Ldo

if(abs(e_vec(iii)) ==0) count=count+1;

end end for

if(count== L) break;

end if

%For MSD case mse=0;

foriii =1 :Ldo

mse= mse+ (e_vec(iii))²; end for

if(count== L); break;

end if end for end for

2: end loop

Figure5.3: Algorithm for the computation of unknown decisions.

FFF

FBF Decision

Decisions Computing

Sample Delay Received

Signal

Output Decisions

Block Device

(MAD/MSD)

Figure5.4: Block ADFE with the decisions computing block in the feedback loop.

5.4 formulation of the da based block adfe 68

B-Input NOR Gate

errorin clk reset

undecB BB BBankof registers Btofbf

storing allsymbol values

Figure5.5:ProcessingElementincaseofMAD.

5.4 formulation of the da based block adfe 69

B-Input NOR Gate

errorin clk reset

undecB BB BmultiplierB-bit adderB-bit Register dataindataout CLR

B BBankof

registers storing allsymbol values

Btofbf Figure5.6:ProcessingElementincaseofMSD.

5.4 formulation of the da based block adfe 70 where,Q[.]is the quantization operation of theL-dimensional decision device and ˆd(k)is the vector containing the corresponding samples from the original transmitted sequence in case of training mode and the decisions vectord(k)in case of decision-directed mode.

Further, ˆx(k), ˆv(k)are the output vectors of FFF and FBF respectively. The matrixR_0,L is used for the selection of the last L valid samples of the filter output vector (as obtained from the overlap-save method) and is given as

R_0,L=^h _{0 I}_L ⁱ (5.25)

where 0 is the L×(N−¹)-dimensional (N = N_f for FFF and N = N_b for FBF) all-zero matrix andI_Lis the L-dimensional identity matrix.

The matrices F ^andF⁻¹ are respectively the M×^M ^(M = N_f +L−1 in case of FFF andM =N_b+L−1 in case of FBF)-dimensional Fast Fourier Transform (FFT) and Inverse Fast Fourier Transform (IFFT) matrices which may be given as,

F =







1 1 . . . 1

1 e⁻^j2π/M . . . e⁻^j2π⁽^M⁻¹⁾^/M 1 e⁻^j4π/M . . . e⁻^j4π⁽^M⁻¹⁾^/M

... ... ... ...

1 e⁻^j2π⁽^M⁻¹⁾^/M . . . e⁻^j2π⁽^M⁻¹⁾²^/M







(5.26)

F⁻¹ = ¹ M







1 1 . . . 1

1 e^j2π/^M . . . e^j2π⁽^M⁻¹⁾^/M 1 e^j4π/^M . . . e^j4π⁽^M⁻¹⁾^/M

... ... ... ...

1 e^j2π⁽^M⁻¹⁾^/M . . . e^j2π⁽^M⁻¹⁾²^/M







(5.27)

The matrices X_F(k),V_F(k)are respectively given as

X_F(k) =F^Xc(k)F⁻¹ ^(5.28) V_F(k) =F^Vc(k)F⁻¹ ^(5.29) where,Xc(k)andVc(k)are the N_f +L−¹× ^Nf +L−¹^and(N_b+L−¹)×(N_b+L−¹)- dimensional circular matrices respectively, which are given as,

Xc(k) =







x kL−^Nf +1

. . . x kL+N_f −² x kL−^Nf +2

. . . x kL−^Nf +3

... ... ...

x(kL+L−¹) . . . x kL−^Nf +1







(5.30)

Vc(k) =







d(kL−^Nb+1) . . . d(kL+N_b−²) d(kL−^Nb+2) . . . d(kL−^Nb+3)

... ... ...

d(kL+L−¹) . . . d(kL−^Nb+1)







(5.31)

5.4 formulation of the da based block adfe 71 In (15) and (16), w_F^f (k) and w^b_F(k) are the vectors containing the frequency domain samples of the zero-padded tap-weight vectors of FFF and FBF respectively and are given as

w_F^f (k) =F^w^˜ ^f ^(5.32)

w^b_F(k) =F^w^˜^b ^(5.33)

and

w^f (k) =

w^f (k) 0

(5.34)

w^b(k) =

w^b(k) 0

(5.35) where,w^f (k)andw^b(k)are the tap-weight vectors of FFF and FBF respectively.

From the properties of circular matrices, the matrices X_F(k) and V_F(k) will be the diagonal matrices and the diagonal elements correspond to the FFT of the first column of Xc(k)andVc(k)respectively. In matrix notation, they may be written as

X_F(k) =diag[x_F(k)] (5.36) V_F(k) =diag[v_F(k)] (5.37) and

x_F(k) =F {^x(k)} ^(5.38)

v_F(k) =F {^v(k)} ^(5.39)

where,

x(k) =x kL−^Nf +1

,x kL−^Nf +2

, . . . ,x(kL+L−¹)^T (5.40) v(k) =v kL−^Nf +1

,v kL−^Nf +2

, . . . ,v(kL+L−¹)^T (5.41) are the first columns ofX_c(k)andV_c(k)respectively.

Further, the weight-update recursion for FFF and FBF are respectively given by the equations,

w_F^f (k+1) =w_F^f (k) +µPN_f,0X^∗_F(k)e_F (k) (5.42) w^b_F(k+1) =w^b_F(k) +µP_N_b_,0V^∗_F(k)e_F(k) (5.43) where,

5.4 formulation of the da based block adfe 72

Serial to Parallel Converter

Input Buffer

N_f +L−¹

point FFT using DA

Making the last

‘L−^1’

elements as zeros

IFFT using DA (Last

terms)‘L’

Delay

+ Decision

Device

+ −

Decision Outputs

Delay Buffer

−

Adding Nf−^{1 zeros}

at the beginning

Adding Nb−^{1 zeros}

at the beginning Making

the last

‘L−^1’

elements as zeros

Delay µ

x(n)

L-dimensional

Decisions computing

block (MAD/MSD)

N_f +L−¹

point FFT using DA

N_f +L−¹

point FFT using DA

N_f +L−¹

point FFT using DA

N_b+L−¹

point FFT using DA

N_b+L−¹

point FFT using DA

N_b+L−¹

point FFT using DA

N_b+L−¹

point FFT using DA

IFFT using DA (Last

terms)‘L’

Figure5.7: The block diagram of block ADFE implemented in the frequency domain.

5.4 formulation of the da based block adfe 73

e_F(k) =F^e^˜(k) (5.44)

Here X^∗_F(k) and V^∗_F(k) represent the complex conjugates of X_F(k) and V_F(k) respectively. Further, ˜e(k) = ^h _{0 e}(k)

and the matrices P_N_f_,0, P_N_b_,0 are required to ensure that the lastL−1 samples of the IFFT of w_F^f (k)andw^b_F(k)are constrained to zeros.

Although, the derivations of frequency-domain block LMS based adaptive filters in- volve extending the vectors to a length of L+N−^{1 (N} being the length of filter under consideration), in practice, the vectors are chosen to be of length L+N. Further, L = N may be chosen for maximum efficiency where N is typically in the powers of 2. Hence, assuming N_f, N_b and L are all in powers of 2, the FFT/IFFT operations in (5.19), (5.20), (5.28), (5.29), (5.32), (5.33), (5.38) and (5.39) may be given as

a_F =F^an = √¹ M

M−¹ n

∑

ane⁻^j^2π^M^kn (5.45) where an is the nth element of vector an and M = L+N and N = N_f and N = N_b in case of FFF and FBF respectively. Using the procedure described above, each of the FFT and IFFT operations may be realized using the distributed arithmetic technique for the efficient realization of block ADFE and this can be obtained as follows.

If each ofa_n is represented in signed2’s-complement representation, as given by a_n= −^bn,B−¹+

B−¹

∑

j=1

b_n,B₋₁₋_j2⁻^j (5.46) whereb_n,B₋₁₋_j is the(B−¹−^j)th-bit in theB-bit binary representation ofan, then

a_ne⁻^j^2π^M^kn =−^h^bn,B−¹e⁻^j^2π^M^kni +

B−¹ j

∑

hb_n,B₋₁₋_je⁻^j^2π^M^kni

2⁻^j (5.47)

Now, since b_n,B₋₁₋_j ∈ [0, 1], the expressions inside the square braces of above equations may take one out of 2 possible combinations (partial-products of twiddle factors) which may be stored in a memory as the twiddle factors are known constants prior to the implementation. Hence, (5.47) may be computed by right-shift (due to the term 2⁻^j) and accumulate (due to the summation) operations. This is known as the distributed arithmetic (DA) based realization and requires no hardware multiplier for its implementation.

Hence, all the multipliers present in the FFT/IFFT units can be realized using DA and the IFFTs can also be realized using the same structure of FFT. When the filter lengths are not in the powers of2, other FFT algorithms (such as the Prime-Factor FFT algorithm, Rader’s FFT algorithm etc) may be chosen and the hardware complexity depends on the type of algorithm chosen. Such an implementation for block LMS based adaptive filter can be found in [5,6].

The detailed block diagram of the block ADFE implemented in the frequency domain is shown in Fig. 5.7. The operation of FFF is as follows: The received samples arrive serially which are stored for parallel processing using a serial-to-parallel converter. These samples are buffered taking the newest set ofLsamples along withN_f−1 old samples for conversion into frequency domain using an FFT block as described by (5.38). The set of

5.5 performance analysis 74

Dalam dokumen (1)surya prakash matcha H I G H P E R F O R M A N C E A R C H I T E C T U R E S F O R A D A P T I V E E Q U A L I Z E R S U S I N G D I S T R I B U T E D A R I T H M E T I C (2)H I G H P E R F O R M A N C E A R C H I T E C T U R E S F O R A D A P T I V E E Q U A L I Z E R S U S I N G D I S T R I B U T E D A R I T H M E T I C A Thesis submitted for the award of the degree of Doctor of Philosophy by surya prakash matcha Under the supervision of Dr (Halaman 80-88)