Professional Documents
Culture Documents
net/publication/351421185
CITATION READS
1 233
2 authors, including:
Mohsen Raji
Shiraz University
73 PUBLICATIONS 274 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Mohsen Raji on 13 August 2021.
Abstract—Linear Feedback Shift Registers (LFSR) are exten- called as Single-Point-of-Failure (SPoF). SPoF is defined as
sively used in variety of applications such as Built-In-Self-Test the design point that will result in the entire system failure if it
circuits or Pseudo Random Number Generators. Hence, fault fails [6]. The majority-voting system in TMR is accounted as
tolerant design of LFSR is essential for the applications with
high reliability demands. Traditional fault tolerant LFSRs include its SPoF. So, the number of SPoF is required to be considered
large number of Single-Point-of-Failures (SPoFs) in which any in TMR-based fault tolerant techniques. Some previous works
fault results in the whole system failure. In this paper, a new have proposed fault tolerant LFSR designs [7], [8] which are
fault tolerant architecture for LFSR (named as FT-LFSR) is based on TMR approach. Although, they have provided fault
proposed in which the number of SPoFs are significantly reduced tolerance against single fault, the number of SPoFs is increased
compared to the previous ones. To this end, a modified version of
Triple Modular Redundancy (TMR) empowered with some extra significantly by triplicating the majority-voting system, leading
controlling units for identifying the operational module is used. to a considerable decrease in the reliability of the design.
In addition, a novel metric called Reliability-Area-Factor (RAF) In this paper, a new fault tolerant LFSR (named as FT-
is introduced to evaluate the efficacy of the redundancy-based LFSR) is proposed in which the number of SPoFs is sig-
fault tolerant techniques (such as FT-LFSR) in terms of number nificantly reduced. To this end, a modified version of TMR
of SPoFs and the area overhead. Experimental results show that,
the FT-LFSR is resilient to all single transient and permanent empowered with some extra controlling units for identifying
faults except in its limited SPoFs and many patterns of multiple the operational module is used. We also introduce a novel
faults. comparison metric called Relaibilty-Area-Factor (RAF) to
Index Terms—LFSR, Fault tolerance, FPGA, TMR, Test, Built compare the TMR-based fault tolerance techniques in which
in self test. the number of SPoFs and area overhead of TMR approach is
taken into consideration. The proposed design is simulated
I. I NTRODUCTION and implemented using Verilog on Xilinx Spartan 6. The
Linear Feedback Shift Register (LFSR) is the main com- fault injection-based experimental results show that, FT-LFSR
ponent of the embedded systems that are based on pseudo- tolerates all single transient and permanent faults (except the
random bit sequences such as pseudo-random number and ones which occur in its limited SPoFs) and many patterns of
pseudo-noise sequence generators, fast digital counters, and multiple faults.
Built-In-Self Tests [1], [2], [3], [4], [5]. In real-time and The rest of the paper is organized as follows: Section
safety-critical embedded applications, incorrect functionality II is about LFSR. Section III explains previous works in
may cause catastrophic effects and thus, the fault-tolerance is which a fault tolerant LFSR is presented. Section IV explains
known as a traditional design requirement. In recent years, the architecture of FT-LFSR. Section V introduces RAF. Section
continuous reduction of the transistor dimensions and supply VI is about RAF analysis of FT-LFSR. Section VII talks about
voltage levels have led to an increase in the vulnerability fault injection in different parts of FT-LFSR. Section VIII
of Integrated Circuits (ICs) against various hardware faults. shows simulation and report analysis with 16, 32 and 64 bit
Therefore, fault tolerant design of LFSR considering hardware LFSR and finally, section IX concludes the paper.
faults is very essential in safety-critical embedded applications.
The key concept of fault tolerance is redundancy; i.e. the II. BACKGROUND
extra capabilities are added to the original design which are LFSR is a state machine composed of a shift register whose
not necessary in fault-free operation. The redundancy can be input bit is a linear function of its previous state. There are two
spatial and/or temporal. Triple Modular Redundancy (TMR) main forms of LFSR; i.e. Galois and Fibonacci configurations.
is a well-known and common fault tolerant technique based In the Fibonacci implementation, the outputs from some of the
on spatial redundancy [9]. In TMR, the design is triplicated to registers are XORed with each other and fed back to the input
perform a process in parallel and their result is processed by of the shift register. In the Galois implementation, the gates
a majority-voting system to produce a single output. Despite are placed between the registers. In Figure 1 the conventional
its efficacy, there is a major vulnerability in TMR technique Fibonacci and Galois LFSR implementations are shown. n
is obtained based on majority voting (which is not shown in
Fig 2) on the outputs of the three voters (voter 1, voter 2, and
voter 3). So, the last majority voter (which includes N one-
bit voters) is the SPoF of this architecture and any fault in
the last voter leads to the system failure. Therefore, although
this architecture tolerates the single faults in the intermediate
points of the circuit, its reliability is threatened by N SPoFs.
Each PE is organized as shown in Fig 4. It has two main V. R ELIABILITY-A REA -FACTOR (RAF) M ETRIC
parts: In order to evaluate the efficacy of hardware redundancy-
1. LFSR: LFSR produces N bit pattern called based techniques, we propose a novel metric called Reliabilty-
PE LFSR out. Area-Factor (RAF) which integrates the number of SPoFs and
2. Comparetor: Comparetor determines whether the area overhead of the techniques.
FT LFSR out is correct or not based on its point RAF is computed as:
of view. This operation is done by comparing its LFSR's RAF = S ∗ A (2)
output with other LFSR's outputs in other PEs if the
current PE is selected PE. If the current PE is not where S and A respectively show the ratio of number of SPoFs
selected PE, the LFSR's output is compared just with and the area in the original and fault tolerant architectures.
the LFSR's output of selected PE. Parameter S is calculated as:
∑
PE outputs and inputs are: N SOi
S = ∑i (3)
1. PE LFSR out: PE LFSR out is the N bit pattern which j N SFj
is produced by LFSR.
where N SOi shows the number of logic blocks in ith SPoF
5 PE LFSR out is the N bit output of each PE which is the same as a in the original architecture and N SFj is the number of logic
pettern that a conventional LFSR generates. blocks in j th SPoF in the fault tolerant architecture. Higher
parameter S is proportional to the higher reliability achieved Table I
by the fault tolerant architecture. Parameter A is calculated as: RAF value for different LFSRs
Nb LFSR RAF
A= (4) width FT-LFSR Conventional LFSR P-LFSR
Nf 16 5.33 1 0.33
where Nb and Nf respectively represent the total number of 32 10.66 1 0.33
64 21.33 1 0.33
logic blocks in the original and fault tolerant architectures.
The overhead imposed by the fault tolerant approach leads to
an increase in parameter A.
Therefor, RAF integrates the reliability improvement and the VII. Q UANTITATIVE FAULT A NALYSIS
imposed area overhead achieved by a fault tolerant technique. In this section, it is quantitatively shown that FT-LFSR is
Higher RAF means that the proposed fault tolerant architecture resilient to transient and permanent faults; i.e. the fault will
provides higher reliability in terms of less area overhead. More be either masked (when it occurs in an unselected PE and
precisely, a fault tolerant architecture is better than the original thus, will not destroy the FT LFSR out) or detected (by ready
one in terms of RAF if 1 < RAF < ∞ and it is worse if signal which causes the controller of BIST to change the
0 < RAF < 1. The fault tolerant architecture is as the same sel out). There are some main modules in FT-LFSR which
as original architecture if RAF = 1. should be evaluated in presence of faults as follows:
VI. E VALUATION OF FT-LFSR USING RAF 1. PE
If one fault occurs in one of PEs then the two others
In this section, we evaluate FT-LFSR and P-LFSR using the
will understand the occurence of fault by comparing their
RAF metric. The RAF is computed for FT-LFSR as follows:
∑ PE LFSR out (which is correct value) with the faulty
1. i N SBi = N , because a conventional LFSR has N flip PE LFSR out. If fault occurs in an selected PE it will
∑ that all of them are SPoFs.
flops be detected and if fault occurs in an unselected PE it will
2. j N SFj = V = 1, where V is the number of logic be masked.
blocks in Voter (because we have just one SPoF in 2. Controller
our architecture). As the occupied volume by Voter is If one fault occurs such that sel out changes, fault will be
approximately equivalent to occupied volume by one masked because all PEs are working correctly and if each
register, therefor V = 1. of them connects to sel out the result will be correct. It
3. Nb = N , because the base LFSR has just N FFs. is important to say that PE A is connected to sel out
4. Nf = ((3 ∗ N ) + M ) ≈ 3N , where M is the number of with two values of sel out because we want to ensure
extra logic blocks that is related to Voter, Comparetor and that FT LFSR out connects to one of PE outputs. If one
Controller that is negligible in front of occupied volume fault occurs in SelectedA, SelectedB and SelectedC the
by FFs. fault will be masked because they just determines each
So, RAF for FT-LFSR will be: PE must compare its output with one PE or two PEs
N N 1 N which is not important in this case because all PEs are
RAF = ( ) ∗ ( )=N∗ = (5)
1 3N 3 3 working correctly.
Therefor, FT-LFSR is better than the conventional LFSR if 3. Voter
N > 3. If one fault occurs in Voter such that the output of voter
The RAF analysis of the P-LFSR is as follows: is 1, it will be masked otherwise, it will not be detected.
∑ So Voter is a SPoF.
1. ∑i N SBi = N .
Therefor, FT-LFSR is resilient to all single (transient or
2. j N SFj = N , because this LFSR has N one-bit voters
that each of them are SPoFs. permanent) faults that occurs in any logic except the Voter. It
3. Nb = N . is notable that, FT-LFSR is also resilient to many patterns of
4. Nf = ((3 ∗ N ) + M ) ≈ 3N , where M is the number of multiple faults.
extra logic blocks that is related to voters and comparators VIII. E XPERIMENTAL R ESULTS
that are negligible in comparison to occupied volume by
FFs. This section investigates the efficacy of the proposed fault
tolerant LFSR in various aspects based on the extensive fault
Finally, RAF for the P-LFSR will be:
injection analysis. The proposed method is implemented in
N N 1 1 Verliog language, simulated by ISE tool, and synthesized on
RAF = ( )∗( )=1∗ = (6)
N 3N 3 3 Spartan 6 FPGA. Logic value upset is considered as the fault
In order to compare the RAF value for different widths and model for fault injection. In order to model transient faults,
based on the Equations of 5 and 6, the RAF value of different the logic upset is injected only for one clock cycle and then,
LFSRs are shown in Table I. As it is shown for all widths, becomes corrected in the next cycles while for permanent
FT-LFSR has higher RAF value and that is because of one faults, the logic upset remains in the circuit for more than
SPoF. one clock cycle. Faults have been injected in outputs of main
modules (PE, Controller and voter) and main inputs (sel out)
in two forms wrong zero and wrong one. The used polynomial
generators 6 have been brought in table II.
Table II
Polynomial generator for different LFSR widths
32 x32 + x22 + x2 + x1 + 1
64 x64 + x63 + 1