# International Journal of Innovative Research in Computer and Communication Engineering 

(A High Impact Factor, Monthly, Peer Reviewed J ournal)
Website: www.ijircce.com
Vol. 6, I ssue 11, November 2018

# A Partial Residue to Decimal Converter for the Moduli Set $\left\{\mathbf{2}^{n}-1,2^{n}, 2^{n}-1\right\}$ 

Alhassan Abdul-Barik<br>Senior Lecturer, Department of Computer Science, Faculty of Mathematical Sciences, University for Development<br>Studies, Tamale, Ghana


#### Abstract

Residue Number System (RNS) is a non-conventional number system that uses remainders to represent numbers. These remainders, called residues are converted back to decimal representations using two types of converters. A forward converter converts from decimal representation to residue whiles a reverse converter converts from residues to decimal representations respectively. In this paper, a fast residue-to-decimal converter for the moduli set $\left(\mathbf{2}^{\boldsymbol{n}}-\mathbf{1}, \mathbf{2}^{\boldsymbol{n}}, \mathbf{2}^{\boldsymbol{n}}+\mathbf{1}\right)$ is proposed. The algorithm used to convert the residues of the moduli to decimal number is based on the Chinese Remainder Theorem (CRT). The approach does not allow full reverse conversion. The resulting implementation is based on two carry-save adders, two carry-propagate adders and a multiplexer. After comparing the proposed scheme to the state-of-the-art reverse converters in terms of area and delays, it is considered to be faster.


KEYWORDS: Residue Number System; Converter; Adder; Chinese Remainder Theorem

## I. Introduction

Numbers play an important role in computer systems. Numbers are the basis and object of computer operations. The main task of computers is computing, which deals with numbers all the time [7]. For that matter, the study of number systems is very necessary as computers generally work on numbers. Residue Number Systems (RNS) is a nonweighted number system that utilizes remainders to represent numbers. In RNS, a set of moduli are chosen which are independent of each other. An integer is represented by the residue of each modulus and the arithmetic operations are based on the residues individually. Let $\left\{m_{1}, m_{2}, \ldots, m_{n}\right\}$ be a set of positive integers all greater than $1 . m_{i}$ is called a modulus, and the n -tuple set $\left\{m_{1}, m_{2}, \ldots, m_{n}\right\}$ is called moduli set. Consider an integer number $X$. For each modulus in $\left\{m_{1}, m_{2}, \ldots, m_{n}\right\}$, we have $x_{i}=\mathrm{X} \bmod \mathrm{m}_{\mathrm{i}}\left(\right.$ denoted as $\left.|X|_{m_{i}}\right)$. Thus, a number $X$ in RNS can be represented as $X=$ $\left\{x_{1}, x_{2}, \ldots, x_{n}\right\}$ [7].

Now, to calculate the number $X$ from its residues, the Chinese Remainder Theorem (CRT) is formulated and applied as follows:
$X=\left.\left|\sum_{\mathrm{i}=1}^{\mathrm{n}} \mathrm{M}_{\mathrm{i}}\right| \mathrm{M}_{\mathrm{i}}^{-1}\right|_{\mathrm{m}_{\mathrm{i}}} \mathrm{X}_{\mathrm{i}} \mid \mathrm{M}$
where; $M=\prod_{i=1}^{N} m_{i} ; M_{i}=\frac{M}{m_{i}} ;\left|M_{i} \times M_{i}^{-1}\right|_{m_{i}}=1$
The CRT provides an algorithmic solution of decoding the residue encoded number back into its conventional representation. This theorem is considered the cornerstone in realizing the utilization of RNS [2]. Residue Number System (RNS) belongs to the second school of thought, to enhance system performance, it was proposed for computation-intensive application design because of its ability to support high-speed concurrent arithmetic [8]. These RNS features has been put to good use in various Digital Signal Processing (DSP) applications [1]. Today RNS is also regarded as one of the most popular techniques for reducing the power dissipation and the computation load in Very Large Scale Integrated Circuits (VLSI) system design [9].

RNS is a non-weighed number system with special carry characteristics and a potential that results in high computations. In RNS, addition, subtraction and multiplication are inherently carry-free, for instance, each digit of the result is a function of only one digit from each operand, hence independent of all other digits. As a result of the carryfree property, it is feasible to mechanize operations such as addition, subtraction and multiplication. These inherent

ISSN(Online): 2320-9801
ISSN (Print) : 2320-9798

# International Journal of Innovative Research in Computer and Communication Engineering 

(A High I mpact Factor, Monthly, Peer Reviewed Journal)

## Website: www.ijircce.com

Vol. 6, Issue 11, November 2018
features enable RNS utilization in the fields of Digital Signal Processing (DSP), computational intensive areas such as; digital filtering, convolutions, correlations, Discrete Fourier Transform (DFT) computations, Fast Fourier Transform (FFT) computations and direct digital frequency synthesis [6], [5].

However, irrespective of the fact that, the residue number system supports high-speed parallel arithmetic, it is not very popular in processor construction due to the following difficult RNS arithmetic operations: reverse conversion between residue and binary numbers, overflow detection, magnitude comparison, sign detection, moduli selection, etc. However, solutions to some of these problems are currently being developed as a result of ongoing research. Out of these numerous RNS challenges, moduli selection and reverse conversion are the two most critical issues [4]. Many interesting reverse converters have been proposed for many moduli sets such as ( $2^{n}-1,2^{n}, 2^{n}+1$ ) [10], ( $2^{n}-$ $1,2^{n}, 2^{2 n+1}-1$ ) [3], to mention just a few. In [10] a new and uniform Adder Based algorithm using the New Chinese Remainder Theorem (CRT) for the RNS to binary conversion is presented whilst [3] presents a new residue to binary converter based on Mixed-Radix Conversion (MRC).

In this paper, the proposed scheme uses partial reverse conversion based on the CRT technique.

## II. Related work

In a study by Molahosseini and Navi (2007), two efficient residue to binary converters for the new three-moduli set $\left\{2^{\mathrm{n}}\right.$, $\left.2^{\mathrm{n}+1}+1,2^{\mathrm{n}+1}-1\right\}$ is presented. The proposed moduli set consists of pairwise relatively prime and balanced moduli, which can offer fast internal RNS processing and efficient implementation of the residue to binary converter. The new Chinese Remainder Theorem (CRT-I) is applied to derive an efficient residue to binary conversion algorithm for the new three-moduli set. Hardware implementation of the proposed residue to binary converters for the moduli set consist of one $(2 n+2)$-bit carry save adder (CSA) with End Around Carry (EAC) and a modulo $\left(2^{2 n+2}-1\right)$ adder. The proposed residue to binary converters are memoryless. In comparison with other residue to binary converter for a three-moduli set, the proposed converters have better area-time complexity whiles Hiasat and Abdel-Aty-Zohdy (1995), introduced a new algorithm for implementing residue to binary conversion for the moduli set ( $2^{\mathrm{k}}, 2^{\mathrm{k}}-1,2^{\mathrm{k}-1}-1$ ). The algorithm incorporates new compact forms for the multiplicative inverses. Binary adders are used in this proposed algorithm for the hardware implementation. If the system is pipelined, then the throughput rate of the converter increase to the equivalent of that of a single ( $3 \mathrm{k}-1$ ) bits binary adder. Thus, hardware requirements and execution time are less, with corresponding larger dynamic range. Therefore, this moduli set could have an increasing role in designing residuebased arithmetic units for different computing applications. In 2011, Stamenkovic and Jovanovic published an alternative architecture derived by Mixed-Radix Conversion (MRC) for a four-moduli set. Due to the use of simple multiplicative inverses of the proposed moduli set, there is considerably reduction in the complexity of the RNS to binary converter based on the MRC. The hardware architecture for the proposed converter is based on the adders and subtracters, without the needed Read Only Memory (ROM) or multipliers. The implementation consists of two levels. The first level, is the algorithm to convert RNS number to mixed-radix digits. The algorithm is improved by using optimal choice of form of moduli set. The second level is a simplified hardware architecture. Carry-Save-Adder (CSA) with End-Around-Carry (EAC) is replaced with Borrow-Save-Subtracter that avoids two complement operations, and EAC adder. Further, the binary subtraction is optimized by using Borrow-Propagate-Subtracter (BSS) with End-Around-Borrow (EAB) which avoids one complement operation and the use of multiplexer(s). The proposed converter architecture is memoryless enabling efficient implementation. However, Hariri, Rastegar, and Navi (2006) studied the moduli set $\left\{2^{\mathrm{n}}, 2^{2 \mathrm{n}}-1,2^{2 \mathrm{n}}+1\right\}$ and proposed a reverse converter. This moduli set provides the dynamic range (DR) of $2^{\mathrm{n}} \times\left(2^{4 \mathrm{n}}-1\right)$ and the implementation results have shown that its reverse converter has better area and time complexities in comparison with the moduli sets with the same dynamic range categories. The authors showed that for majority of the similar dynamic ranges, the proposed reverse converter is faster than the reverse converter of $\left\{2^{n}-1,2^{n}, 2^{n}+1\right\}$ but the reverse converter of $\left\{2^{\mathrm{n}}-1,2^{\mathrm{n}}, 2^{\mathrm{n}}+1\right\}$ has less area.Pettenghi et al. (2013), also proposed methods to design memoryless reverse converters for the proposed moduli sets with large dynamic ranges, up to ( $8 \mathrm{n}+1$ )-bit. Due to the complexity of the reverse conversion, both the Chinese Remainder Theorem (CRT) and the Mixed Radix Conversion (MRC) are applied in the proposed methods to derive efficient reverse converters. Experimental results suggest that the proposed vertical extensions allows the reduction of the area-delay-product up to 1.34 times in comparison with the related state-of-the-art. The horizontal extensions allow larger and more balanced moduli sets, resulting in an improvement of the RNS arithmetic computation, at the cost of lower reverse conversion performance.

# International Journal of Innovative Research in Computer and Communication Engineering 

(A High I mpact Factor, Monthly, Peer Reviewed Journal)

## Website: www.ijircce.com

Vol. 6, Issue 11, November 2018

## III. PROPOSED ALGORITHM

Given the RNS number $X=\left(x_{1}, x_{2}, x_{3}\right)$ with respective moduli set $\left\{2^{n}-1,2^{n}, 2^{n}+1\right\}$, where; $m_{1}=2^{n}-$ $1, m_{2}=2^{n}$, and $m_{3}=2^{n}+1$, then the following holds true;
$M_{1}=2^{n}\left(2^{n}+1\right) ; M_{2}=\left(2^{2 n}-1\right) ;$ and $M_{3}=2^{n}\left(2^{n}-1\right)$
Theorem 1: For the given moduli set, then;
$\left|M_{1}^{-1}\right|_{m_{1}}=\left|2^{n-1}\right|_{m_{1}}$
$\left|M_{2}^{-1}\right|_{m_{2}}=|-1|_{m_{2}}$
$\left|M_{3}^{-1}\right|_{m_{3}}=\left|-2^{n-1}\right|_{m_{3}}$
The proof of (3)-(5) is demontrated in [5].
Theorem 2: For the given moduli set, any RNS number $X$ can be represented as;

$$
\begin{equation*}
X=2^{n} \xi+x_{2} \tag{6}
\end{equation*}
$$

where;
$\xi=\left|\left|\begin{array}{c}\left|\left(2^{n} x_{1}+x_{1}\right) 2^{n-1}\right|_{2^{2 n}-1}+\left|-2^{n} x_{2}\right|_{2^{2 n}-1} \\ +\left|-x_{3}\right|_{2^{2 n-1}}+\left|+2^{n-1} x_{3}\right|_{2^{2 n}-1}\end{array}\right|_{2^{2 n-1}} \|\right.$
Proof: Substituting equations (2) through to (5) into (1) and factorizing out $2^{n}$ we obtain (6).

## HARDWARE IMPLEMENTATION

Here, the mathematical formulations and simplifications for obtaining an effective and robust design is presented. The design and usage of simplified architecture that is cost effective and less complex is also presented.
A. Design Considerations and Mathematical Simplifications

Hardware implementation considers significantly, the minimisation of production cost and cost of usage. It is therefore important to further simplify equation (7) to equation (8) to reduce the implementation cost, which is achieved by reducing equation (7) to utilise Adders and Multiplexers.
Equation (7) can further be simplified as follows;
$\xi=\mid \varphi_{1}+\varphi_{2}+\varphi_{3}+\varphi_{4} I_{2^{2 n}-1}$
where;
$\varphi_{1}=\left.I\left(2^{n} x_{1}+x_{1}\right) 2^{n-1}\right|_{2^{2 n}-1}$
$\varphi_{2}=\left|-2^{n} x_{2}\right|_{2^{2 n}-1}$
$\varphi_{3}=\left|-x_{3}\right|_{2^{2 n}-1}$
$\varphi_{4}=\left|2^{n-1} x_{3}\right|_{2^{2 n}-1}$
Now, considering equations (9)-(12) and simplify them for implementation in a VLSI system. It is necessary to note that $x_{i, j}$ means the $j$-th bit of $x_{i}$.
Evaluation of $\boldsymbol{\varphi}_{\mathbf{1}}$
The residue $x_{1}$ can be represented as follows;
$x_{1}=x_{1, n-1} \ldots x_{1,1} x_{1,0}$
Thus,
$\left|\left(2^{n} x_{1}+x_{1}\right) 2^{n-1}\right|_{2^{2 n}-1}=$
$|2^{n-1}(\underbrace{x_{1, n-1} \ldots x_{1,0} \overbrace{0 \ldots 0}^{n \text { Bits }}}_{2 n-\text { bits }}+\underbrace{\overbrace{0 \ldots 0}^{0 \text { Bits }} x_{1, n-1} \ldots x_{1,0}}_{2 n})|_{2^{2 n-1}}$
$=|2^{n-1}(\underbrace{x_{1, n-1} \ldots x_{1,1} x_{1,0} x_{1, n-1} \ldots x_{1,1} x_{1,0}}_{2 n-\text { bits }})|_{2^{2 n-1}}$

# International Journal of Innovative Research in Computer and Communication Engineering 

(A High I mpact Factor, Monthly, Peer Reviewed Journal)
Website: www.ijircce.com
Vol. 6, Issue 11, November 2018
$=\frac{n+1}{\overbrace{x_{1,0} x_{1, n-1} \ldots x_{1,1} x_{1,0}}^{\text {Bits }}} \underbrace{x_{1, n-1} \ldots x_{1, n+2} x_{1,1}}_{n-1}$

## Evaluation of $\boldsymbol{\varphi}_{2}$ :

The residue $x_{2}$ can be represented as follows;
$x_{2}=x_{2, n-1} \ldots x_{2,1} x_{2,0}$
Therefore,
$\left|-2^{n} x_{2}\right|_{2^{2 n}-1}=\bar{x}_{2, n-1} \ldots \bar{x}_{2,1} \bar{x}_{2,0} \overbrace{11 \ldots .11}^{n \text { Bits }}$

## Evaluation of $\varphi_{3}$ and $\varphi_{4}$ :

The residue $x_{3}$ can be represented as follows;
$x_{3}=x_{3, n} \ldots x_{3,1} x_{3,0}$
Therefore,
$\varphi_{3}=\left|-x_{3}\right|_{2^{2 n}-1}=\overbrace{11 \ldots 11}^{n \text { Bits }} \underbrace{\bar{x}_{3, n-1} \ldots \bar{x}_{3,1} \bar{x}_{3,0}}_{n \text { Bits }}$
Again,
$\varphi_{4}=\left|2^{n-1} x_{3}\right|_{2^{2 n}-1}=\underbrace{0 x_{3, n} \ldots x_{3,1} x_{3,0}}_{n+1 \text { Bits }} \overbrace{00 \ldots .00}^{n-1}$
B. The Proposed Architecture
$\xi$ is computed according to equation (8) where all the parameters are defined in equations (9) - (12). Carry Save Adders (CSAs) and Carry Propagate Adders (CPAs) are used to reduce the hardware complexity. As shown in Figure $1, \xi$ is computed using CSAs 1and 2 and two regular $2 n$-bit CPAs 1 and 2 . The results of these CPAs are passed on to a multiplexer (MUX 1) which would then pass either of them down. MUX 1 will pass on the result of CPA 1 if the carry out of CSA 1 is a ' 0 ', otherwise the result of CPA 2 is passed on.
CSAs 1 and 2 require an area of $2 n \Delta_{F A}$ each as well as CPAs 1 and 2 . Therefore, in order to obtain $\xi$, a total area of $8 n \Delta_{F A}$ will be required.
Regarding the delay, each CSA (i.e. CSAs 1 and 2) impose a delay of $D_{F A}$ while the CPA pair 1 and 2 impose a delay of $2 n D_{F A}$ since they are in parallel, thus delay imposed on computing $\xi$ is $(2 n+2) D_{F A}$. The final computation for (6) is a concatenation which does require any hardware.

ISSN(Online): 2320-9801
ISSN (Print) : 2320-9798

## International Journal of Innovative Research in Computer and Communication Engineering

(A High I mpact Factor, Monthly, Peer Reviewed Journal)
Website: www.ijircce.com
Vol. 6, Issue 11, November 2018

The schematic diagram for the proposed scheme is shown below;


Figure 1: Block Diagram of the Reverse Converter

## IV.SIMULATION RESULTS

In this section, the performance of the proposed system is evaluated by comparing it with [10] and [3]. The evaluation is done in terms of area (A), delay (D) and $A D^{2}$. From the $A D^{2}$ comparison, it can be concluded that, the proposed scheme is better.

Table 1: Area, Delay, AD $^{2}$ Comparison

| Converters | Area | Delay | $\mathbf{A D}^{2}$ |
| :--- | :--- | :--- | :--- |
| $[\mathbf{1 0}]$ | 4 n | $4 \mathrm{n}+2$ | $64 \mathrm{n}^{3}+64 \mathrm{n}^{2}+16 \mathrm{n}$ |
| $[\mathbf{3}]$ | $9 \mathrm{n}+2$ | $10 \mathrm{n}+5$ | $900 \mathrm{n}^{3}+1100 \mathrm{n}^{2}+425 \mathrm{n}+50$ |
| Proposed Scheme | 8 n | $2 \mathrm{n}+2$ | $32 \mathrm{n}^{3}+64 \mathrm{n}^{2}+32 \mathrm{n}$ |

The graph below in Figure 2 shows the performance of the various schemes. In the graph, it is clear that the proposed scheme is more efficient in terms of area and delay comparison.

ISSN(Online): 2320-9801
ISSN (Print) : 2320-9798

# International Journal of Innovative Research in Computer and Communication Engineering 

(A High I mpact Factor, Monthly, Peer Reviewed Journal)

Website: www.ijircce.com
Vol. 6, Issue 11, November 2018


Figure 2: Graph of Area and Delay Comparison

## V. CONCLUSION AND FUTURE WORK

In this paper, a fast residue-to-binary converter for the moduli set ( $2^{n}-1,2^{n}, 2^{n}+1$ ) has been proposed. The approach does not require full RNS-binary conversion. Theoretical analysis from Table 1 shows that, the proposed scheme is more efficient as compared to the state-of-art schemes in [3] and [10] in the Area-Delay comparison

## REFERENCES

[^0]
[^0]:    1. Nannarelli, A., Re, M., and Cardarilli, G. C., "Tradeoffs between Residue Number System and Traditional FIR Filters", IEEE Int. Symp. Circuits Syst. pp. 305-308, 2001.
    2. Omondi, A., and Premkumar, A., "Residue Number System Theory and Implementation (Vol. 2). Imperial College Press, 2007.
    3. Molahosseini, A. S., Navi, K., and Rafsanjani, M. K. "A New Residue to Binary Converter based on Mixed-Radix Conversion," $3^{\text {rd }}$ International Conference on Information and Communication Technologies: From Theory to Applications (ICTTA 2008), pp. 1-6, 2008. Cao, B., Chang, C. H, and Srikanthan, T., "An Efficient Reverse Converter for the 4 -Moduli Set ( $2^{\mathrm{n}}-1,2^{\mathrm{n}}, 2^{\mathrm{n}}+1,2^{2 \mathrm{n}}-1$ )," IEEE Trans. on Circuits and Systems, vol 50, pp. 1296-1303, 2003.
    4. Bankas, E. K., and Gbolagade. K. A. "A New Efficient FPGA Design of Residue-to-Binary Converter". International Journal of VLSI design \& Communication Systems (VLSICS) Vol.4, No.6, 2013.
    5. Gbolagade, K. A., "An Efficient MRC based RNS-to Binary Converter for the $\left\{2^{2 \mathrm{n}}-1,2^{\mathrm{n}}, 2^{2 \mathrm{n}+1}-1\right\}$ Moduli Set". International Journal of Advanced Research in Computer Engineering \& Technology (IJARCET) Volume 2, Issue 4, 2013. 7. Luvanced Research in Computer Engineering \& Technology (IJARCET) Volume 2, Issue 4, 2013.
    6. Sheu, M., Lin, S. H., Chen, C., and Yang, S. W. "An efficient VLSI Design for a Residue to Binary Converter for General Balance Moduli Set $\left\{2^{\mathrm{n}}-3,2^{\mathrm{n}}-1,2^{\mathrm{n}}+1,2^{\mathrm{n}}+3\right\}$ ", IEEE Transactions on Circuits and Systems II, 51(3), pp. 152-155, 2004.
    7. Stouratitis, T. and Paliouras, V. "Considering the Alternatives in Lowpower Design," IEEE Circuits and Devices, pp. 23-29, 2001.
    8. Wang, Y., Song, Y., Aboulhamid, M., and Shen, H., "Adder Based Residue to Binary Number Converters for ( $2^{\mathrm{n}}-1,2^{\mathrm{n}}, 2^{\mathrm{n}}+1$ )", IEEE Trans. on Signal Processing, vol. 50, pp. 1772-1779, 2002.
    9. Hariri, A., Rastegar, R., \& Navi, K, "High Dynamic Range 3-Moduli Set with Efficient Reverse Converter", 2006.
    10. Hiasat, A. A., and Abdel-Aty-Zohdy, H. S., "An Algorithm for the Design of A Residue-To-Binary Converter", Midwest Symposium on Circuits and Systems, 1280-1283, 1995.
    11. Pettenghi, H., Chaves, R., and Sousa, L., "Residue Number System Reverse Converters for Moduli Sets with Dynamic Ranges up to ( $8 \mathrm{n}+1$ )-bit", IEEE, 1487-1500, 2013.
    12. Stamenkovic, N., \& Jovanovic, B., "Reverse Convertor Design for the 4-Moduli Set $\left\{2^{\wedge} \mathrm{n}-1,2^{\wedge} \mathrm{n}+1,2^{\wedge}(2 \mathrm{n}+1)-1\right\}$ Based on the MixedRadix Conversion", SER.: ELEC.ENERG, 89-103, 2011.
