Purohit 2016
Purohit 2016
fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/LES.2015.2499207, IEEE
Embedded Systems Letters
Abstract— This letter presents a new algorithm to construct an WCDMA and the other wireless standards. The Algorithm
XOR-Free architecture of a power efficient Convolutional description, its HW implementation and outcome of the
Encoder. Optimization of XOR operators is the main concern
results are presented in the following subsections.
while implementing polynomials over GF (2), which consumes a
significant amount of dynamic power. The proposed approach
completely removes the XOR-processing operation of a chosen II. Algorithm Design and Implementation
non-systematic, feed-forward generator polynomial and reduces
the logical operators, thereby the decoding cost. Hardware Convolution operation is realized using a deterministic finite
(HW) implementation of the proposed design uses Read-only state machine (DFSM). Its hardware implementation requires
memory (ROM) with a pre-processed addressing operations to a combinational circuit and memory elements. The discrete
reduce ROM size by nearly 50%. The results of the new
convolution for the encoded sequence (CJ) can be expressed
architecture reduce the dynamic power up to 21.4% and HW
cost up to 15% with lesser design complexity as compared to in terms of information sequence (II) with the generator
conventional method. The Hardware co-simulation of the sequences (GI) by the following equation,
architecture is first validated and then implemented with Xilinx
Virtex-V FPGA. (1)
Index Terms—Convolutional Codes, Common Subexpression Further Shift register (SR) based realization of (1) for
Elimination, Finite State Machine, Forward Error Correction, encoded sequence (Cjβ) depends upon the length (L) of SR,
FPGA, HDL, Modulo Adder. the present input Ij and M previous input blocks [I j−1, ··, Ij−M]
to yield (2).
I. INTRODUCTION
Data transmission reliability and stringent QoS are two main (2)
requirements of modern 3GPP and 3GPP2 standards over
unreliable wireless channels. These standards require to The shift register, α Є for each bits of the input and
develop and innovate efficient, cost effective forward error only memories for which are connected to adder β.
correction (FEC) codes for the mobile equipment [1-2]. The (2) is used to realize a CDMA2000 Convolutional
Among the conventional FECs, the Convolutional codes are Encoder with g0(x) = (753)8 and g1(x) = (561)8 as generator
usually preferred over Block codes due to its economical soft polynomials, K=9 and code rate k=1/2 as shown in Fig. 1.
decoding capability and inherent higher coding gain [3]. The
error correction capabilities of such codes rely upon its
Constraint length (K) and maximum free distance (dfree) [4].
The key operation of convolution is multiplication, which is
implemented using shifts and adds. Usually multiple addition
operations, increase complexity and consumes a significant
amount of dynamic power [5-7]. Hence the mitigation of
such complexities by reducing the number of logical
operators is the prime focus in the encoder design. Common
Subexpression Elimination (CSE) methods have been used to Fig. 1 Conventional Convolutional Encoder for CDMA-2000. [4]
eliminate such redundant computation using the most
common bit pattern to optimize the XOR count [8-9]. A deterministic finite automaton of convolution transducer
consists of a finite set of input symbols alphabet (Σ), output
This letter proposes a novel algorithmic approach that finds alphabet Γ, a finite set of states (S), Start state (S0 ∈ S),
an XOR-Free processing architecture for non-systematic, Transition function (δ: S × Σ → S) and the output function ω.
feed-forward Convolutional encoder polynomial represented These six-tuple symbols along with (1) & (2) are used to
over GF (2). However, some concatenated codes such as describe the algorithm 1.
turbo code demands systematic and feedback generator
polynomial over GF (2). In this paper effective The proposed algorithm can be summarized as follows,
implementation of non-recursive convolutional codes is
Step I: Assignment: Assign the information bit as logic ‘0’,
focused. The approach completely replaces modulo-2 based
for all possible encoder states (S), whose next state follows
realization of a standard GF (2) based polynomial into a previous state incremented by one. Compute the output
ROM with parity bits as its elements. The ROM based response using basic convolution process. The encoded
architecture provides ease in FPGA implementation with an output with its encoding states is then arranged in a tabular
additional feature of dynamic reconfiguration. The algorithm form.
is successfully applied with different code rate and constraint
length for CDMA2000, Step II: Grouping: Group the encoder states based on their
1
1943-0663 (c) 2015 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/LES.2015.2499207, IEEE
Embedded Systems Letters
10 4,5,10,11,18,16,28,29,38,39,40,41,48,49,62,63
11 6, 7, 8, 9,16,17,30,31,36,37,42,43,50,51,60,61
CT 00 01 10 11
New RT
00 00 11 10 01 Fig. 3 Reduced Rom XOR-Free Architecture for CDMA-2000 Standard.
01 10 01 00 11
10 01 10 11 00 The algorithm is further tested for CDMA2000 with
11 10 01 00 11 generator polynomials g0(x) = (557)8, g1(x) = (663)8 and g2(x)
2
1943-0663 (c) 2015 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/LES.2015.2499207, IEEE
Embedded Systems Letters
= (711)8 having k=1/3, K=9. The table 5 assigns RT to the Figs. 4 and 5 respectively. The Power estimation relies
isomorphic states. The Isomorphs are extracted by the states equally upon signal and logic power, IO activity and the
formed by combining the RT and MSB of CT i.e. first 6 bits clock toggling rate. It is interesting to note that a significant
of the shift register. Further the RT is modified using the reduction of Clock Domain Power (CDP) up to 38% for
logics of table 6 and results in the array of table 7. The table k=1/2 is achieved. Thereby improving the overall DPC up to
7 is implemented as a ROM. 21.4% as is evident from Fig. 4. However, as code rate
reduces to k=1/3, the CDP reduces to 1.06% leaving the
Table 5: Assignments of RT for the Isomorphic RT-CT groups. overall DPC to only 5% as depicted in the Fig. 5. The
resource utilization for new architecture has been computed
RT Isomorph states (in Dec.) for different rates as reported in the Tables 8 & 9. It is
000 0,13,18,31,39,42,53,56 inferred from the tables that the proposed architecture claims
001 1,12,19,30,38,43,52,57 15% improvement in the HW area for k=1/3 whereas it is
10% for k=1/2. It is observed from the above results that for
010 4,9,22,27,35,46,49,60
a given constraint length, the decrease in code rate improves
011 5,8,23,26,34,47,48,61
the HW cost significantly as compare to the power and delay,
100 3,14,17,28,36,41,54,59 whereas an increase in code rate improves the dynamic
101 2,15,16,29,37,40,55,58 power and the delay as compare to the area.
110 7,10,21,24,32,45,50,63
111 6,11,20,25,33,44,51,62 Table 8: Comparison of Resource Utilization for the Architectures.
Table 6: Modified RT-CT for Reduced ROM Architecture. Conventional Reduced ROM
Parameter
Encoder (k=1/2) XOR-FREE
IP RT(2) Mod. RT (2 bits) Mod. CT (2 bits) FFs 12 11
0 0 LUTs 03 03
1 LUT-FF Pairs 13 11
0
BUFG/BUFGCTRLs 01 01
1 0
Fmax (MHz) 444.6 585.5
1 1
CT 00 01 10 11
New RT
00 000 111 110 001
01 100 011 010 101
10 010 101 100 011
11 101 001 000 111
Fig. 5 Comparison of Dynamic power consumption for two different
III. Results architectures for k=1/3 with 1000 ns run at 500 MHz clock frequency.
IV. Conclusion
1943-0663 (c) 2015 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/LES.2015.2499207, IEEE
Embedded Systems Letters
References
1. Dielissen, J., Eindhoven, Engin, N. , Sawitzki, S., van Berkel, K.:
‘Multistandard FEC Decoders for Wireless Devices’, IEEE Trans.
Circuits Syst. II Express Briefs, 2008, 55(3), pp 284- 288, doi:
10.1109/TCSII.2008.918964.
2. Hagenauer, J.: ‘Forward error correcting for CDMA Systems’ Int.
Symp. Spread Spectr. Tech. Appl. Proc., Mainz, September 1996,
pp. 566-569, doi: 10.1109/ISSSTA.1996.563190.
3. Viterbi, A.J.: ‘Convolutional codes and their performance in
communication systems’, IEEE Trans. Comm. Techn., 1971,
19(5), pp. 751-772, doi: 10.1109/TCOM.1971.1090700.
4. Lin S., Costello, D. J., Jr.: ‘Error Control Coding’ (Prentice-Hall
Englewood Cliffs, NJ, US, 2004, second edition).
5. Yibin, Y., Roy, K., Drechsler, R.: ‘Power Consumption in XOR-
Based Circuits’, Proceedings of the ASP-DAC, January 1999,
pp.299-302, DOI:10.1109/ASPDAC.1999.760018.
6. Igor, M.: ‘Power consumption analysis of XOR based circuits’,
Informatics, 2006, 1(9). pp. 97-103.
7. Rabaey, J.: ‘Low Power Design Essentials’ (Springer Science &
Business Media, US, 2009, First edition).
8. Pasko, R., Schaumont, P., Derudder, V., Vernalde, S., Durackova
D.:‘A New Algorithm for Elimination of Common
Subexpressions’, IEEE Trans. Comput. Des. Integr. Circuits and
Syst., 1999, 18(1), pp. 58-68, DOI:10.1109/43.739059.
9. Huang, C., Li, J., Chen, M.: ‘Optimizing XOR-based codes’, US
Patent 8209577 B2, June 26, 2012.
1943-0663 (c) 2015 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.