Lockstep-Based Fault-Tolerant Architectures for Dependable Safety-Critical and Cyber-Physical Computing Systems

Authors

  • Dr. Alexander J. Hoffman Department of Electrical and Computer Engineering, Rheinland Institute of Technology, Germany

Keywords:

Fault tolerance, lockstep architecture, safety-critical systems, multicore processors

Abstract

The rapid proliferation of safety-critical and cyber-physical systems in domains such as automotive electronics, unmanned aerial vehicles, industrial automation, and intelligent energy infrastructures has intensified the demand for computing platforms that combine high performance with stringent fault tolerance and functional safety guarantees. Contemporary systems increasingly rely on multicore and heterogeneous processor architectures, which, while offering superior computational capabilities, introduce complex reliability challenges stemming from transient faults, permanent hardware failures, electromagnetic interference, and design-induced vulnerabilities. Lockstep-based execution paradigms have emerged as a foundational architectural strategy to address these challenges by enabling timely error detection, fault isolation, and deterministic recovery. This article presents a comprehensive and theoretically grounded investigation of lockstep and lockstep-inspired fault-tolerant architectures, drawing exclusively on the provided body of scholarly and industrial references. The discussion spans classical redundancy principles, including dual-core and triple modular redundancy, as well as modern evolutions such as light lockstep, heterogeneous Arm–RISC-V lockstep systems, dynamically coupled cores, and flexible vector lockstep execution in ultra-low-power clusters. Beyond architectural mechanisms, the article situates lockstep computing within broader system-level considerations, including power supply safety design, compliance with functional safety standards, fault injection methodologies, and emerging application contexts such as autonomous vehicles, UAVs, edge intelligence, and secure energy trading. Through extensive theoretical elaboration, the article identifies critical trade-offs between performance, power efficiency, detection latency, and design complexity, while highlighting persistent gaps in scalability, adaptability, and cross-layer integration. The findings underscore that lockstep architectures, when combined with system-aware design principles and rigorous validation strategies, constitute a central pillar for dependable computing in next-generation safety-critical systems.

Downloads

Download data is not yet available.

References

1. Avizienis, A. (1976). Fault-tolerant systems. IEEE Transactions on Computers, 25(12), 1304–1312.

2. Baumann, R. C. (2005). Radiation-induced soft errors in advanced semiconductor technologies. IEEE Transactions on Device and Materials Reliability, 5(3).

3. Gomaa, M., Scarbrough, C., Vijaykumar, T. N., and Pomeranz, I. (2003). Transient-fault recovery for chip multiprocessors. Proceedings of the International Symposium on Computer Architecture.

4. Hernandez, C., and Abella, J. (2014). LiVe: Timely error detection in light lockstep safety-critical systems. Design Automation Conference.

5. Hernandez, C., and Abella, J. (2015). Timely error detection for effective recovery in light-lockstep automotive systems. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

6. Infineon. (2012). AURIX multicore 32-bit microcontroller family to meet safety and powertrain requirements of upcoming vehicle generations.

7. International Standards Organization. (2009). ISO 26262: Road vehicles – Functional safety.

8. Iturbe, X., Venu, B., Ozer, E., and Das, S. (2018). Addressing functional safety challenges in autonomous vehicles with the Arm triple core lock-step architecture. IEEE Design and Test.

9. Kilian, P., Köhler, A., Van Bergen, P., Gebauer, C., Pfeufer, B., Koller, O., and Bertsche, B. (2021). Principle guidelines for safe power supply systems development. IEEE Access, 9, 107751–107766.

10. LaFrieda, C., et al. (2007). Utilizing dynamically coupled cores to form a resilient chip multiprocessor. Dependable Systems and Networks.

11. Lyons, R. E., and Vanderkulk, W. (1962). The use of triple modular redundancy to improve computer reliability. IBM Journal of Research and Development, 6(2), 200–209.

12. Marques, I. D. C. (2020). A loosely-coupled Arm and RISC-V lockstepping technology. Doctoral dissertation.

13. Marques, I., Rodrigues, C., Tavares, A., Pinto, S., and Gomes, T. (2021). Lock-V: A heterogeneous fault tolerance architecture based on Arm and RISC-V. Microelectronics Reliability, 120.

14. Mohsan, S. A. H., Othman, N. Q. H., Li, Y., Alsharif, M. H., and Khan, M. A. (2023). Unmanned aerial vehicles: Practical aspects, applications, open challenges, security issues, and future trends. Intelligent Service Robotics, 16(1), 109–137.

15. Nikiema, P. R., Kritikakou, A., Traiola, M., and Sentieys, O. (2023). Design with low complexity fine-grained dual core lock-step RISC-V processors. Dependable Systems and Networks Supplemental Volume.

16. Nishiyama, H., Fujimoto, D., Sone, H., and Hayashi, Y. (2023). Efficient noninvasive fault injection method utilizing intentional electromagnetic interference. IEEE Transactions on Electromagnetic Compatibility, 65(4), 1211–1219.

17. Ottavi, G., Garofalo, A., Tagliavini, G., Conti, F., Di Mauro, A., Benini, L., and Rossi, D. (2023). Dustin: A 16-cores parallel ultra-low-power cluster with fully flexible bit-precision and vector lockstep execution mode. IEEE Transactions on Circuits and Systems I, 70(6), 2450–2463.

18. Peña-Fernández, M., Serrano-Cases, A., Lindoso, A., Cuenca-Asensi, S., Entrena, L., Morilla, Y., Martín-Holgado, P., and Martínez-Álvarez, A. (2022). Hybrid lockstep technique for soft error mitigation. IEEE Transactions on Nuclear Science, 69(7), 1574–1581.

19. Saha, S. S., Sandha, S. S., and Srivastava, M. (2022). Machine learning for microcontroller-class hardware: A review. IEEE Sensors Journal, 22(22), 21362–21390.

20. Sbai, I., and Krichen, S. (2020). A real-time decision support system for big data analytic: A case of dynamic vehicle routing problems. Procedia Computer Science, 176, 938–947.

21. Sharma, G., Joshi, A. M., and Mohanty, S. P. (2023). sTrade: Blockchain based secure energy trading using vehicle-to-grid mutual authentication in smart transportation. Sustainable Energy Technologies and Assessments, 57.

22. Tam, P., Math, S., Nam, C., and Kim, S. (2021). Adaptive resource optimized edge federated learning in real-time image sensing classifications. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 14, 10929–10940.

Downloads

Published

2025-05-31

How to Cite

Lockstep-Based Fault-Tolerant Architectures for Dependable Safety-Critical and Cyber-Physical Computing Systems. (2025). International Journal of Advance Scientific Research, 5(05), 89-95. https://sciencebring.com/index.php/ijasr/article/view/1056

Similar Articles

31-40 of 183

You may also start an advanced similarity search for this article.