SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5041
Analysis of Micro Inversion to Improve Fault Tolerance in High Speed
VLSI Circuits
Somashekhar1, Vikas Maheshwari2, R. P. Singh3
1Research Scholar, Dept of ECE, SSSUTMS, Sehore (M.P), India-
2Associate Professor, Dept of ECE, Bharat Institute of Engineering and Technology, Hyderabad. India-
3Vice-Chancelor & Professor, Dept of ECE, SSSUTMS, Sehore (M.P), India-
1somashekhar49@gmail.com, 2maheshwarivikas1982@gmail.com, 3provc@sssutms.co.in
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - With technology scaling,thereliabilityofcircuits
is becoming a rising concern. The emergence of logic errors
in the field cause by faults escaping manufacturing testing,
aging, single event upsets,or processvariationsisincreasing.
Conventional techniques for online testing and circuit
protection repeatedly requirea highdesign effortorresultin
high area overhead and power consumption and are
unsuitable for low cost systems. The primary motive for
introducing fault tolerance in VLSI circuits is yield
enhancement, increasing the percentage of fault free chips
obtained. The active area of monolithic VLSI chips has
always been limited by random fabrication defects, which
appear impossible to eliminate in even the best
manufacturing processes. The larger the circuit, the more
likely it will contain such a defect and fail to operate
correctly. Thus, the defect density in any fabrication line
limits the size of the largest defect free chip producible with
commercially viable yields. Larger circuits demand a fault
tolerance capability to overcome fabrication defects while
avoiding unreasonable costs. In nm technologies, circuits be
more and more sensitive to a variety of perturbations.
Transient faults can take place in a processor as a result of
electrical noise, like crosstalk, or high energy particles, like
neutrons and alpha particles. These faults be able to cause a
program running on the processor to behave erratically, if
they propagate and change the architectural state of the
processor. These faults can occur in memory arrays,
sequential elements or in the combinational logic in the
processor. Protection against transient faults in
combinational logic has not received much attention
traditionally because combinational logic has a natural
barrier stopping the propagation of the faults. System
performance is increased whenthenodesareabletorecover
locally from most errors caused by transient faults. The
circuitry added for concurrent error detection generally
reduces performance. By means of a technique called micro
rollback, it is achievable to eliminate the performance
penalty of concurrent error detection.
Keywords: Micro inversion, Fault Tolerance, VLSI,
Processor, IC, Register file.
1. INTRODUCTION
As with every piece of machinery, ICs are prone to failure.
Through technology scaling, transistor sizes are reduced to
open the way for increased functionality with reduced in
general power dissipation, device dimensions and
manufacturing costs but despite those advantages, the
reliability of ICs hasbeenaffected.Theincreasingprobability
of circuit failure caused by increasing device complexityand
the errors caused by increaseddelayduetotemperature rise
in CMOS circuits. Faults in a distributed embedded system
can be permanent, intermittent or transient (also known as
soft errors). Permanent faults cause long-term
malfunctioning of components. These faults emerge for a
short time. Causes of intermittent faults are within system
boundaries, while causes of transient faults are external to
the system. They might damage data or lead to logic
miscalculations, which can outcome in a fatal failure. Due to
their higher rate, these faults cannot be addressed in a cost-
effective way by applying traditional hardware-based fault
tolerance techniques suitable for tolerating permanent
faults. Embedded systems with fault tolerance have to be
carefully designed and optimized, in order to satisfy strict
timing requirements without exceeding a certain limited
amount of resources. Moreover, not only performance and
cost related requirements have to be considered but also
other issues such as debug ability and testability have to be
taken into account.
2. LITERATURE REVIEW
A fault-tolerant system may be able to tolerate one or more
fault-types including -- i) transient, intermittent or
permanent hardware faults, ii) software and hardware
design errors, iii) operator errors. Gayathri and Prabakaran
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5042
[1] discussed some important factors of failures. One
important factor is arbitrary node or link failure which
results in denial of service. In cloud computing, load
balancing is required to distribute the dynamic local
workload evenly across all the nodes. It helps to attain a
more user fulfillment and resource utilization ratio by
ensuring an efficient and fair allocation of every computing
resource. Identified some of the load balancing algorithms
which distribute workload across multiple computers or a
computer cluster, network links, central processing units,
disk drives, or other resources, to achieve best possible
resource utilization, maximize throughput, minimize
response time, and avoid overload. When all theseissuesare
addressed naturally the system becomesa faulttolerant one.
Win Naing [2] proposed a fault-tolerance
management framework for private clouds development.
Previous researchers developed Eucalyptus in organize to
facilitate the creation of private clouds. But Eucalyptus is no
fault tolerant system and no VM monitoring is performed
thus limiting the support for advanced VM placement
policies (e.g., consolidation). Eucalyptus does not also
include any self-healing features and strictly distinguishes
between cloud and cluster controllers. Therefore, they
proposed the fault-tolerance management framework over
Eucalyptus by adding new component Cluster Controller
Manager (CCM).
Sheheryar and Fabrice [3] proposed a scheme of
fault tolerance mechanism for real time computing on cloud
infrastructure. It has advantages of forward recovery
mechanism. It performs the reverse recovery if the node
with best reliability could not achieve the SRL. There is
another big advantage of this scheme. It does notsuffer from
domino effect as check pointing is made in the end when all
the nodes have produced the result.
Y Tamir [4] proposed a Fault-tolerantsystemfrequentlyrely
on self-checking compute nodes. It detect errors
immediately they occur, hence prevent the spread of invalid
information throughout the system.
3. MICRO INVERSION
A key to achieving a high degree of fault tolerance is the
ability to detect errors as soon as they occur and prevent
erroneous information from spreading throughout the
system. In highly reliable systems, this is usually
accomplished by checkers and isolation circuits in the
communication paths from each module to the rest of the
system. This additional circuitry reduces performance by
requiring either longer clock cycles or additional pipeline
stages. This presents a technique, called micro rollback.
Operations performed on this erroneous information are
“undone” by means of a hardware mechanism for fast
rollback of a few cycles. Straightforward realizationof micro
rollback will need of significant performance and chip area
overheads for replicating all the storage elements in each
module.
This paper discusses techniques for efficient analysis of
micro rollback in VLSI systems. It focuses on the micro
architecture and VLSI realization of a VLSI RISC processor
that is able of micro rollback. A micro rollback of a
subsystem consists of bring the subsystem back a onlysome
cycles to a state reached in the past. It is so necessary tosave
the state of the subsystem at each cycle boundary [10]. Ifthe
‘‘subsystem’’ is a processor, the state is the contents of all
storage elementswhichcarryuseful informationacrosscycle
boundaries. It is composed of the program counter, the
program status word, the instruction register, and the
register file, it also includes the contents of some pipeline
latches and some registers in the statemachinewhichcanbe
changed during the execution of a multicycle instruction.
Since instructions also modify external memory, the state of
the cache must also be preserved. A rollback restores the
contents of the cache to its state a few cycles earlier.
Figure 1. A register file with support for micro Inversion
4.MULTI MODULE SYSTEM
Periodic check pointing of process states and roll back to a
previous state when an error is detected is a common
technique for error recovery in distributed systems[11]. If
each process is check pointed independently, rolling back
one process may require rolling back a second process
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5043
further in time which, in turn, may cause a third process to
roll back, etc. leading to an uncontrolled domino effect [11].
In the worst case this can result in all processes in the
system rolling back to their state when the system is
initialized. In the context of micro rollback, which is done at
the level of hardware modules, the domino effect cannot
occur in such system. However, if themodulesareconnected
in an arbitrary topology, where there are several
independent communication paths between pairs of
modules, the domino effect could, potentially, occur. Since
the range of rollback is severely limited (a few cycles), this
can make recovery impossible.
Figure 2. Multi-Module System
In a system if modules are interconnectedvia a commonbus,
this problem can be solved by using bus transactions as a
common logical clock.
5.CONCLUSION
One of the keys to achieving a high degree of fault tolerance
is the ability to detect errors instantly after they occur &
prevent invalid information from distribution all over the
system. This primary problem in achieving fault tolerancein
VLSI systems is able to overcome by performing checks in
parallel with intermodule communication. This paper
analyses the parallel error checks in concurrencewithmicro
rollback can be used to support fault tolerance in complex
multi module high performance VLSI systems.
REFERENCES
[1] Ms. G. Gayathri and Dr. N. Prabakaran, “Achieving Fault
Tolerance in Cloud Environment by Efficient Load
Balancing”, International Journal of Emerging
Trends & Technology in Computer Science (IJETTCS)
Volume 2, Issue 3, May – June , 2013 ISSN 2278-6856.
[2] WinNaing “Fault-tolerant Management for PrivateCloud
System”, International Journal of Emerging Trends &
Technology in Computer Science (IJETTCS), Volume 1,
Issue 1, May-June 2012 ISSN 2278-6856.
[3] Sheheryar Malik, FabriceHuet,“AdaptiveFaultTolerance
in Real Time Cloud Computing”, IEEE World Congress
on Services, Jul 2011, Washington DC, United States.
IEEE, pp.280-287.
[4] Y Tamir, “Self-checking self-repairing computer nodes
using the Mirror Processor”, IEEE Journal ofsolid-state
circuits, vol. 21. No. I. January 1992.
[5] Han C.C., Shin K. G. and Wu J., “A Fault-Tolerant
Scheduling Algorithm for Real-Time Periodic Tasks
with Possible Software Faults”, IEEE Computers 2003.
[6] Prasenjit Kumar Patra,HarshpreetSingh,GurpreetSingh,
“Fault Tolerance Techniques and Comparative
Implementation in Cloud Computing”, International
Journal of Computer Applications (0975 – 8887)
Volume 64– No.14, February 2013.
[7] Jasbir Kaur, SupriyaKinger, “Efficient AlgorithmforFault
Tolerance in Cloud Computing”, International Journal
of Computer Science and Information
Technologies(IJCSIT), Vol.5 (5) , 2014, 6278-6281.
[8] S. Sudha Lakshmi, Sri Padmavati, “Fault Tolerance in
Cloud Computing”,International Journal ofEngineering
Sciences Research-IJESR, Vol 04, Special Issue01,2013,
issn:2230-8504, e-ISSN-2230-8512.
[9] Pandeeswari.R, Mohamadi Begum “Rsfts: Rule-Based
Semantic Fault Tolerant Scheduling For Cloud
Environment”, Council for Innovative Research
International Journal of Computers & Technology.
Volume 4 No. 2, March-April, 2013, ISSN 2277-3061.
[10] W. W. Hwu and Y. N. Patt, ‘‘Checkpoint Repair for Out-
of-orderExecutionMachines,’’14thAnnual Symposium
on Computer Architecture, Pittsburgh, PA, pp. 18-26
(June 1987).
[11] B. Randell, P. A. Lee, and P. C. Treleaven, ‘‘Reliability
Issues in Computing System Design,’’ Computing
Surveys 10(2), pp. 123165 (June 1978).
[12] L. D. Babu and P. Krishna, “Honey bee behaviorinspired
load balancing of tasks in cloud computing
environments”, in Applied Soft Computing, Vol. 13(5),
pp. 2292-2303, (2013).
[13] R. Kaur and P. Luthra (2012), “Load Balancing in Cloud
Computing”, In Proceedings of International
Conference on Recent Trends in Information,
Telecommunication and Computing, ITC.
[14] M. Dorigo, G. D. Caro and L. M. Gambardella (1999),“Ant
algorithms for discrete optimization”, Artif. Life, Vol.
5(2), pp.137-172.
[15] Virendra Singh Kushwah, Sandip Kumar Goyal and
Priusha Narwariya, “ A survey on various fault tolerant
approaches for cloud Environment during load
balancing”, IJCNWMC, Vol. 4, Issue 6, Dec 2014, 25-34
ISSN(P): 2250-1568; ISSN(E): 2278-9448.
[16] I. Koren, “A Reconfigurable and Fault-Toler- ant VLSI
Multiprocessor Array,” Proc. 8th Ann. Symp. Computer
Architecture, May 1981, pp. 425-441.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5044
[17] N.R. Rejinpaul, L. Maria Michael Visuwasam,
“Checkpoint-based Intelligent Fault tolerance For
Cloud Service Providers”, International Journal of
Computers and Distributed Systems Vol. No.2, Issue 1,
December 2012 ISSN: 2278-5183.
[18] FetahiWuhib, Mike Speritzer and Rolf Stadler (2012),
“Gossip Protocol for DynamicResourceManagementin
Large Cloud Environments” IEEE Transactions on
Network and Service Management, vol. 9, no. 2, pp.
213- 225.
[19] Ramtilak Vemu et al, “A low-cost concurrent error
detection techniqueforprocessorcontrol logic”,978-3-
9810801-3-1/DATE08 © 2008 EDAA.
[20] Peng Hu, Wei Dai, “Enhancing Fault Tolerance based on
Hadoop Cluster”, International Journal of Database
Theory and Application Vol.7, No.1 (2014), pp.37-48.
[21] Samudrala, P. K., Ramos, J., and Katkoori, S. (2004)
“Selective triple modular redundancy (STMR) based
single-event upset (SEU) tolerantsynthesisforFPGAs”,
Nuclear Science, IEEE Transactions on, Vol.51,No.5,pp
2957-2969.

More Related Content

PDF
An Investigation of Fault Tolerance Techniques in Cloud Computing
PDF
2012A8PS309P_AbhishekKumar_FinalReport
PDF
IRJET- Application Backup and Restore across Multiple Devices
PDF
Design approach for fault
PPTX
Ch11-Software Engineering 9
PDF
Ig3514391443
PDF
AN EFFICIENT HYBRID SCHEDULER USING DYNAMIC SLACK FOR REAL-TIME CRITICAL TASK...
PDF
An Investigation of Fault Tolerance Techniques in Cloud Computing
2012A8PS309P_AbhishekKumar_FinalReport
IRJET- Application Backup and Restore across Multiple Devices
Design approach for fault
Ch11-Software Engineering 9
Ig3514391443
AN EFFICIENT HYBRID SCHEDULER USING DYNAMIC SLACK FOR REAL-TIME CRITICAL TASK...

What's hot (19)

PDF
Towards predictive maintenance for marine sector in malaysia
PDF
26 7956 8212-1-rv software (edit)
PPT
Adaptive fault tolerance in cloud survey
PPTX
Ch12 safety engineering
PDF
2 E231 Description Afman 2108
PDF
Proactive cloud service assurance framework for fault remediation in cloud en...
PPTX
CS 5032 L8 dependability engineering 2 2013
PDF
Instruction Set Extension of a Low-End Reconfigurable Microcontroller in Bit-...
PDF
On-line Power System Static Security Assessment in a Distributed Computing Fr...
PDF
Procesamiento multinúcleo óptimo para aplicaciones críticas de seguridad
PPTX
Ch13-Software Engineering 9
PPTX
CS 5032 L6 reliability and security specification 2013
PPTX
Ch13 security engineering
PDF
Protective relaying-philosphy-and-design-guidelines
PDF
IRJET- Secure Scheme For Cloud-Based Multimedia Content Storage
PDF
1 introduction
PDF
Ispe Article
PPTX
Dependability and security (CS 5032 2012)
PDF
Cluster Computing Environment for On - line Static Security Assessment of lar...
Towards predictive maintenance for marine sector in malaysia
26 7956 8212-1-rv software (edit)
Adaptive fault tolerance in cloud survey
Ch12 safety engineering
2 E231 Description Afman 2108
Proactive cloud service assurance framework for fault remediation in cloud en...
CS 5032 L8 dependability engineering 2 2013
Instruction Set Extension of a Low-End Reconfigurable Microcontroller in Bit-...
On-line Power System Static Security Assessment in a Distributed Computing Fr...
Procesamiento multinúcleo óptimo para aplicaciones críticas de seguridad
Ch13-Software Engineering 9
CS 5032 L6 reliability and security specification 2013
Ch13 security engineering
Protective relaying-philosphy-and-design-guidelines
IRJET- Secure Scheme For Cloud-Based Multimedia Content Storage
1 introduction
Ispe Article
Dependability and security (CS 5032 2012)
Cluster Computing Environment for On - line Static Security Assessment of lar...
Ad

Similar to IRJET- Analysis of Micro Inversion to Improve Fault Tolerance in High Speed VLSI Circuits (20)

PDF
An Efficient Approach Towards Mitigating Soft Errors Risks
PDF
A Brief Review Of Approaches For Fault Tolerance In Distributed Systems
PDF
E04702056061
PDF
IRJET - Efficient Load Balancing in a Distributed Environment
PDF
PDF
AN EFFICIENT HYBRID SCHEDULER USING DYNAMIC SLACK FOR REAL-TIME CRITICAL TASK...
PDF
AN EFFICIENT HYBRID SCHEDULER USING DYNAMIC SLACK FOR REAL-TIME CRITICAL TASK...
PDF
Title: An Efficient Hybrid Scheduler Using Dynamic Slack for Real-Time Critic...
PDF
Title: Comparison and Evaluation of Digital Signature Schemes Employed in NDN...
PDF
ON FAULT TOLERANCE OF RESOURCES IN COMPUTATIONAL GRIDS
PDF
Crypto Mark Scheme for Fast Pollution Detection and Resistance over Networking
PDF
IOT-653: ENHANCING SECURITY, RELIABILITY AND REAL-TIME PERFORMANCE IN RESOURC...
PDF
IOT-653: Enhancing Security, Reliability and Real-Time Performance in Resourc...
PDF
IRJET- PLC based Object Sorting Machine on their Height
PDF
Deployment of Debug and Trace for features in RISC-V Core
PDF
Departure Delay Prediction using Machine Learning.
PDF
Simulation-based fault-tolerant multiprocessors system
PDF
IRJET- Smart Tracking System for Healthcare Monitoring using GSM/GPS
PDF
IRJET- A Review on Aluminium (LM 25) Reinforced with Boron Carbide (B4C) & Tu...
PDF
IRJET-A Review on Trends in Multicore Processor Based on Cache and Power Diss...
An Efficient Approach Towards Mitigating Soft Errors Risks
A Brief Review Of Approaches For Fault Tolerance In Distributed Systems
E04702056061
IRJET - Efficient Load Balancing in a Distributed Environment
AN EFFICIENT HYBRID SCHEDULER USING DYNAMIC SLACK FOR REAL-TIME CRITICAL TASK...
AN EFFICIENT HYBRID SCHEDULER USING DYNAMIC SLACK FOR REAL-TIME CRITICAL TASK...
Title: An Efficient Hybrid Scheduler Using Dynamic Slack for Real-Time Critic...
Title: Comparison and Evaluation of Digital Signature Schemes Employed in NDN...
ON FAULT TOLERANCE OF RESOURCES IN COMPUTATIONAL GRIDS
Crypto Mark Scheme for Fast Pollution Detection and Resistance over Networking
IOT-653: ENHANCING SECURITY, RELIABILITY AND REAL-TIME PERFORMANCE IN RESOURC...
IOT-653: Enhancing Security, Reliability and Real-Time Performance in Resourc...
IRJET- PLC based Object Sorting Machine on their Height
Deployment of Debug and Trace for features in RISC-V Core
Departure Delay Prediction using Machine Learning.
Simulation-based fault-tolerant multiprocessors system
IRJET- Smart Tracking System for Healthcare Monitoring using GSM/GPS
IRJET- A Review on Aluminium (LM 25) Reinforced with Boron Carbide (B4C) & Tu...
IRJET-A Review on Trends in Multicore Processor Based on Cache and Power Diss...
Ad

More from IRJET Journal (20)

PDF
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
PDF
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
PDF
Kiona – A Smart Society Automation Project
PDF
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
PDF
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
PDF
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
PDF
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
PDF
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
PDF
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
PDF
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
PDF
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
PDF
Breast Cancer Detection using Computer Vision
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
Kiona – A Smart Society Automation Project
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
BRAIN TUMOUR DETECTION AND CLASSIFICATION
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
Breast Cancer Detection using Computer Vision
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...

Recently uploaded (20)

PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPT
Project quality management in manufacturing
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
composite construction of structures.pdf
PPTX
bas. eng. economics group 4 presentation 1.pptx
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
Lecture Notes Electrical Wiring System Components
PDF
Well-logging-methods_new................
PPTX
Geodesy 1.pptx...............................................
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Project quality management in manufacturing
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
Automation-in-Manufacturing-Chapter-Introduction.pdf
Embodied AI: Ushering in the Next Era of Intelligent Systems
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
composite construction of structures.pdf
bas. eng. economics group 4 presentation 1.pptx
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
R24 SURVEYING LAB MANUAL for civil enggi
Lecture Notes Electrical Wiring System Components
Well-logging-methods_new................
Geodesy 1.pptx...............................................
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
UNIT-1 - COAL BASED THERMAL POWER PLANTS

IRJET- Analysis of Micro Inversion to Improve Fault Tolerance in High Speed VLSI Circuits

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5041 Analysis of Micro Inversion to Improve Fault Tolerance in High Speed VLSI Circuits Somashekhar1, Vikas Maheshwari2, R. P. Singh3 1Research Scholar, Dept of ECE, SSSUTMS, Sehore (M.P), India- 2Associate Professor, Dept of ECE, Bharat Institute of Engineering and Technology, Hyderabad. India- 3Vice-Chancelor & Professor, Dept of ECE, SSSUTMS, Sehore (M.P), India- 1somashekhar49@gmail.com, 2maheshwarivikas1982@gmail.com, 3provc@sssutms.co.in ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - With technology scaling,thereliabilityofcircuits is becoming a rising concern. The emergence of logic errors in the field cause by faults escaping manufacturing testing, aging, single event upsets,or processvariationsisincreasing. Conventional techniques for online testing and circuit protection repeatedly requirea highdesign effortorresultin high area overhead and power consumption and are unsuitable for low cost systems. The primary motive for introducing fault tolerance in VLSI circuits is yield enhancement, increasing the percentage of fault free chips obtained. The active area of monolithic VLSI chips has always been limited by random fabrication defects, which appear impossible to eliminate in even the best manufacturing processes. The larger the circuit, the more likely it will contain such a defect and fail to operate correctly. Thus, the defect density in any fabrication line limits the size of the largest defect free chip producible with commercially viable yields. Larger circuits demand a fault tolerance capability to overcome fabrication defects while avoiding unreasonable costs. In nm technologies, circuits be more and more sensitive to a variety of perturbations. Transient faults can take place in a processor as a result of electrical noise, like crosstalk, or high energy particles, like neutrons and alpha particles. These faults be able to cause a program running on the processor to behave erratically, if they propagate and change the architectural state of the processor. These faults can occur in memory arrays, sequential elements or in the combinational logic in the processor. Protection against transient faults in combinational logic has not received much attention traditionally because combinational logic has a natural barrier stopping the propagation of the faults. System performance is increased whenthenodesareabletorecover locally from most errors caused by transient faults. The circuitry added for concurrent error detection generally reduces performance. By means of a technique called micro rollback, it is achievable to eliminate the performance penalty of concurrent error detection. Keywords: Micro inversion, Fault Tolerance, VLSI, Processor, IC, Register file. 1. INTRODUCTION As with every piece of machinery, ICs are prone to failure. Through technology scaling, transistor sizes are reduced to open the way for increased functionality with reduced in general power dissipation, device dimensions and manufacturing costs but despite those advantages, the reliability of ICs hasbeenaffected.Theincreasingprobability of circuit failure caused by increasing device complexityand the errors caused by increaseddelayduetotemperature rise in CMOS circuits. Faults in a distributed embedded system can be permanent, intermittent or transient (also known as soft errors). Permanent faults cause long-term malfunctioning of components. These faults emerge for a short time. Causes of intermittent faults are within system boundaries, while causes of transient faults are external to the system. They might damage data or lead to logic miscalculations, which can outcome in a fatal failure. Due to their higher rate, these faults cannot be addressed in a cost- effective way by applying traditional hardware-based fault tolerance techniques suitable for tolerating permanent faults. Embedded systems with fault tolerance have to be carefully designed and optimized, in order to satisfy strict timing requirements without exceeding a certain limited amount of resources. Moreover, not only performance and cost related requirements have to be considered but also other issues such as debug ability and testability have to be taken into account. 2. LITERATURE REVIEW A fault-tolerant system may be able to tolerate one or more fault-types including -- i) transient, intermittent or permanent hardware faults, ii) software and hardware design errors, iii) operator errors. Gayathri and Prabakaran
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5042 [1] discussed some important factors of failures. One important factor is arbitrary node or link failure which results in denial of service. In cloud computing, load balancing is required to distribute the dynamic local workload evenly across all the nodes. It helps to attain a more user fulfillment and resource utilization ratio by ensuring an efficient and fair allocation of every computing resource. Identified some of the load balancing algorithms which distribute workload across multiple computers or a computer cluster, network links, central processing units, disk drives, or other resources, to achieve best possible resource utilization, maximize throughput, minimize response time, and avoid overload. When all theseissuesare addressed naturally the system becomesa faulttolerant one. Win Naing [2] proposed a fault-tolerance management framework for private clouds development. Previous researchers developed Eucalyptus in organize to facilitate the creation of private clouds. But Eucalyptus is no fault tolerant system and no VM monitoring is performed thus limiting the support for advanced VM placement policies (e.g., consolidation). Eucalyptus does not also include any self-healing features and strictly distinguishes between cloud and cluster controllers. Therefore, they proposed the fault-tolerance management framework over Eucalyptus by adding new component Cluster Controller Manager (CCM). Sheheryar and Fabrice [3] proposed a scheme of fault tolerance mechanism for real time computing on cloud infrastructure. It has advantages of forward recovery mechanism. It performs the reverse recovery if the node with best reliability could not achieve the SRL. There is another big advantage of this scheme. It does notsuffer from domino effect as check pointing is made in the end when all the nodes have produced the result. Y Tamir [4] proposed a Fault-tolerantsystemfrequentlyrely on self-checking compute nodes. It detect errors immediately they occur, hence prevent the spread of invalid information throughout the system. 3. MICRO INVERSION A key to achieving a high degree of fault tolerance is the ability to detect errors as soon as they occur and prevent erroneous information from spreading throughout the system. In highly reliable systems, this is usually accomplished by checkers and isolation circuits in the communication paths from each module to the rest of the system. This additional circuitry reduces performance by requiring either longer clock cycles or additional pipeline stages. This presents a technique, called micro rollback. Operations performed on this erroneous information are “undone” by means of a hardware mechanism for fast rollback of a few cycles. Straightforward realizationof micro rollback will need of significant performance and chip area overheads for replicating all the storage elements in each module. This paper discusses techniques for efficient analysis of micro rollback in VLSI systems. It focuses on the micro architecture and VLSI realization of a VLSI RISC processor that is able of micro rollback. A micro rollback of a subsystem consists of bring the subsystem back a onlysome cycles to a state reached in the past. It is so necessary tosave the state of the subsystem at each cycle boundary [10]. Ifthe ‘‘subsystem’’ is a processor, the state is the contents of all storage elementswhichcarryuseful informationacrosscycle boundaries. It is composed of the program counter, the program status word, the instruction register, and the register file, it also includes the contents of some pipeline latches and some registers in the statemachinewhichcanbe changed during the execution of a multicycle instruction. Since instructions also modify external memory, the state of the cache must also be preserved. A rollback restores the contents of the cache to its state a few cycles earlier. Figure 1. A register file with support for micro Inversion 4.MULTI MODULE SYSTEM Periodic check pointing of process states and roll back to a previous state when an error is detected is a common technique for error recovery in distributed systems[11]. If each process is check pointed independently, rolling back one process may require rolling back a second process
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5043 further in time which, in turn, may cause a third process to roll back, etc. leading to an uncontrolled domino effect [11]. In the worst case this can result in all processes in the system rolling back to their state when the system is initialized. In the context of micro rollback, which is done at the level of hardware modules, the domino effect cannot occur in such system. However, if themodulesareconnected in an arbitrary topology, where there are several independent communication paths between pairs of modules, the domino effect could, potentially, occur. Since the range of rollback is severely limited (a few cycles), this can make recovery impossible. Figure 2. Multi-Module System In a system if modules are interconnectedvia a commonbus, this problem can be solved by using bus transactions as a common logical clock. 5.CONCLUSION One of the keys to achieving a high degree of fault tolerance is the ability to detect errors instantly after they occur & prevent invalid information from distribution all over the system. This primary problem in achieving fault tolerancein VLSI systems is able to overcome by performing checks in parallel with intermodule communication. This paper analyses the parallel error checks in concurrencewithmicro rollback can be used to support fault tolerance in complex multi module high performance VLSI systems. REFERENCES [1] Ms. G. Gayathri and Dr. N. Prabakaran, “Achieving Fault Tolerance in Cloud Environment by Efficient Load Balancing”, International Journal of Emerging Trends & Technology in Computer Science (IJETTCS) Volume 2, Issue 3, May – June , 2013 ISSN 2278-6856. [2] WinNaing “Fault-tolerant Management for PrivateCloud System”, International Journal of Emerging Trends & Technology in Computer Science (IJETTCS), Volume 1, Issue 1, May-June 2012 ISSN 2278-6856. [3] Sheheryar Malik, FabriceHuet,“AdaptiveFaultTolerance in Real Time Cloud Computing”, IEEE World Congress on Services, Jul 2011, Washington DC, United States. IEEE, pp.280-287. [4] Y Tamir, “Self-checking self-repairing computer nodes using the Mirror Processor”, IEEE Journal ofsolid-state circuits, vol. 21. No. I. January 1992. [5] Han C.C., Shin K. G. and Wu J., “A Fault-Tolerant Scheduling Algorithm for Real-Time Periodic Tasks with Possible Software Faults”, IEEE Computers 2003. [6] Prasenjit Kumar Patra,HarshpreetSingh,GurpreetSingh, “Fault Tolerance Techniques and Comparative Implementation in Cloud Computing”, International Journal of Computer Applications (0975 – 8887) Volume 64– No.14, February 2013. [7] Jasbir Kaur, SupriyaKinger, “Efficient AlgorithmforFault Tolerance in Cloud Computing”, International Journal of Computer Science and Information Technologies(IJCSIT), Vol.5 (5) , 2014, 6278-6281. [8] S. Sudha Lakshmi, Sri Padmavati, “Fault Tolerance in Cloud Computing”,International Journal ofEngineering Sciences Research-IJESR, Vol 04, Special Issue01,2013, issn:2230-8504, e-ISSN-2230-8512. [9] Pandeeswari.R, Mohamadi Begum “Rsfts: Rule-Based Semantic Fault Tolerant Scheduling For Cloud Environment”, Council for Innovative Research International Journal of Computers & Technology. Volume 4 No. 2, March-April, 2013, ISSN 2277-3061. [10] W. W. Hwu and Y. N. Patt, ‘‘Checkpoint Repair for Out- of-orderExecutionMachines,’’14thAnnual Symposium on Computer Architecture, Pittsburgh, PA, pp. 18-26 (June 1987). [11] B. Randell, P. A. Lee, and P. C. Treleaven, ‘‘Reliability Issues in Computing System Design,’’ Computing Surveys 10(2), pp. 123165 (June 1978). [12] L. D. Babu and P. Krishna, “Honey bee behaviorinspired load balancing of tasks in cloud computing environments”, in Applied Soft Computing, Vol. 13(5), pp. 2292-2303, (2013). [13] R. Kaur and P. Luthra (2012), “Load Balancing in Cloud Computing”, In Proceedings of International Conference on Recent Trends in Information, Telecommunication and Computing, ITC. [14] M. Dorigo, G. D. Caro and L. M. Gambardella (1999),“Ant algorithms for discrete optimization”, Artif. Life, Vol. 5(2), pp.137-172. [15] Virendra Singh Kushwah, Sandip Kumar Goyal and Priusha Narwariya, “ A survey on various fault tolerant approaches for cloud Environment during load balancing”, IJCNWMC, Vol. 4, Issue 6, Dec 2014, 25-34 ISSN(P): 2250-1568; ISSN(E): 2278-9448. [16] I. Koren, “A Reconfigurable and Fault-Toler- ant VLSI Multiprocessor Array,” Proc. 8th Ann. Symp. Computer Architecture, May 1981, pp. 425-441.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 03 | Mar 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 5044 [17] N.R. Rejinpaul, L. Maria Michael Visuwasam, “Checkpoint-based Intelligent Fault tolerance For Cloud Service Providers”, International Journal of Computers and Distributed Systems Vol. No.2, Issue 1, December 2012 ISSN: 2278-5183. [18] FetahiWuhib, Mike Speritzer and Rolf Stadler (2012), “Gossip Protocol for DynamicResourceManagementin Large Cloud Environments” IEEE Transactions on Network and Service Management, vol. 9, no. 2, pp. 213- 225. [19] Ramtilak Vemu et al, “A low-cost concurrent error detection techniqueforprocessorcontrol logic”,978-3- 9810801-3-1/DATE08 © 2008 EDAA. [20] Peng Hu, Wei Dai, “Enhancing Fault Tolerance based on Hadoop Cluster”, International Journal of Database Theory and Application Vol.7, No.1 (2014), pp.37-48. [21] Samudrala, P. K., Ramos, J., and Katkoori, S. (2004) “Selective triple modular redundancy (STMR) based single-event upset (SEU) tolerantsynthesisforFPGAs”, Nuclear Science, IEEE Transactions on, Vol.51,No.5,pp 2957-2969.