doi: 10.17586/2226-1494-2023-23-3-608-617


Assessment of the readiness of a computer system for timely servicing of requests when combined with information recovery of memory after failures

V. A. Bogatyrev, S. V. Bogatyrev, A. V. Bogatyrev


Read the full article  ';
Article in Russian

For citation:
Bogatyrev V.A., Bogatyrev S.V., Bogatyrev A.V. Assessment of the readiness of a computer system for timely servicing of requests when combined with information recovery of memory after failures. Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 2023, vol. 23, no. 3, pp. 608–617 (in Russian). doi: 10.17586/2226-1494-2023-23-3-608-617


Abstract
The possibilities of increasing the readiness of a redundant computer system for the timely execution of requests critical to service delays are being investigated. A fault-tolerant computer cluster is considered in which nodes are duplicated computing systems that combine computer nodes and memory nodes. Two-stage recovery of memory nodes is assumed: first physical, and then informational, carried out using the resources of computing nodes. The novelty of the approach lies in the fact that for systems with a limitation of the allowable service time of functional requests, the impact of recovery disciplines on the readiness of the system with various options for dividing computing resources to restore information after memory failures and to perform the required functions is evaluated. At the same time, the reliability of the computer systems under study is assessed not only by the probability of their readiness to perform functional tasks (by the readiness coefficient), but also by the probability of the system readiness to perform tasks in a timely manner. Justification of the choice of disciplines for the restoration and maintenance of the flow of functional requests is carried out on the basis of Markov models. At the same time, models are proposed that allow taking into account the impact of the division of computing resources on the joint performance of the required functions and on the information recovery of memory, implemented after its physical recovery. The choice of computer system maintenance disciplines based on the proposed Markov model is aimed at achieving a compromise between the desire to increase the availability factor and the probability of timely execution of the incoming flow of functional requests. The justification of the choice of options for the distribution (separation) of computing resources stored after failures to solve functional queries (required functions) and information recovery of memory, implemented after its physical recovery, is carried out. Based on the proposed Markov models, the dependence of the system readiness for timely execution of requests on the distribution options of computing resources stored in the system for restoring information in memory and for performing functional tasks is investigated. The study was conducted depending on the allowable waiting time for functional requests and the intensity of their traffic. The influence on the system readiness for timely execution of traffic balancing requests of functional tasks between functional computing nodes is analyzed, taking into account the options for their possible joint use for information recovery of memory nodes after their physical recovery. The existence of an optimal share of traffic distribution between computing nodes is shown, taking into account the options for dividing their resources to service functional requests and to restore information in memory nodes after their physical recovery. The results obtained can be used to justify the choice of disciplines for servicing functional requests and recovery after failures of fault-tolerant cluster systems critical to delays in the execution of functional requests.

Keywords: cluster, availability factor, recovery, information recovery of memory, Markov model, recovery discipline, criticality to service delays, probability of timely execution of requests, duplicated system, fault tolerance

References
  1. Shubinsky I.B., Rozenberg I.N., Papic L. Adaptive fault tolerance in real-time information systems. Reliability: Theory & Applications, 2017, vol. 12, no. 1(44), pp. 18–25.
  2. Krasnobaev V., Kuznetsov A., Kiian A., Kuznetsova K. Fault tolerance computer system structures functioning in residue classes. Proc. of the 11th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS). V. 11, 2021, pp. 471–474. https://doi.org/10.1109/idaacs53288.2021.9660919
  3. Tatarnikova T.M., Sikarev I.A., Bogdanov P.Y., Timochkina T.V. Botnet attack detection approach in IoT networks. Automatic Control and Computer Sciences, 2022, vol. 56, no. 8, pp. 838–846. https://doi.org/10.3103/S0146411622080259
  4. Bukhsh M., Abdullah S., Rahman A., Asghar M.N., Arshad H., Alabdulatif A. An energy-aware, highly available, and fault-tolerant method for reliable IoT systems. IEEE Access, 2021, vol. 9, pp. 145363–14538. https://doi.org/10.1109/access.2021.3121033
  5. Wu B.-H., Alexander R.N., Liu S., Zhang Z. Quantum computing with multidimensional continuous-variable cluster states in a scalable photonic platform. Physical Review Research, 2020, vol. 2, no. 2, pp. 023138. https://doi.org/10.1103/physrevresearch.2.023138
  6. Bogatyrev V.A. An interval signal method of dynamic interrupt handling with load balancing. Automatic Control and Computer Sciences, 2000, vol. 34, no. 6, pp. 51–57.
  7. Bogatyrev V.A., Bogatyrev A.V., Bogatyrev S.V. Redundant multi-path service of a flow heterogeneous in delay criticality with defined node passage paths. Journal of Physics: Conference Series, 2021, vol. 1864, no. 1, pp. 012094. https://doi.org/10.1088/1742-6596/1864/1/012094
  8. Dinh T.D., Vishnevsky V., Pham V.D., Le D.T., Kirichek R., Koucheryavy A. Determination of subscribers coordinates using flying network for emergencies. Proc. of the 24th International Conference on Advanced Communication Technology (ICACT), 2022, pp. 1309–1318. https://doi.org/10.23919/icact53585.2022.9728777
  9. Alsweity M., Muthanna A., Elgendy I.A., Koucheryavy A. Traffic management algorithm for V2X-based flying fog system. Lecture Notes in Computer Science, 2021, vol. 13144, pp. 32–41. https://doi.org/10.1007/978-3-030-92507-9_4
  10. Stepanov N., Turlikov A., Begishev V., Koucheryavy Y., Moltchanov D.A. Accuracy assessment of user micromobility models for thz cellular systems. mmNets 2021: Proc. of the 5th ACM Workshop on Millimeter-Wave and Terahertz Networks and Sensing Systems, Part of ACM MobiCom 2021. V. 5, 2021, pp. 37–42. https://doi.org/10.1145/3477081.3481676
  11. Tatarnikova A.A., Turlikov A.M., Pupynin D.A. The spectral analysis of data obtained from accelerometers for musculoskeletal diseases diagnosis. Proc. of the Wave Electronics and its Application in Information and Telecommunication Systems (WECONF), 2018, pp. 8604404. https://doi.org/10.1109/weconf.2018.8604404
  12. Markoval E., Moltchanov D., Pirmagomedov R., Ivanova D., Koucheryavy Y., Samouylov K. Priority-based coexistence of eMBB and URLLC traffic in industrial 5G NR deployments. Proc. of the 12th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT), 2020, pp. 1–6. https://doi.org/10.1109/icumt51630.2020.9222433
  13. Gurjanov A.V., Korobeynikov A.G., Zharinov I.O., Zharinov O.O. Edge, fog and cloud computing in the cyber-physical systems networks. Proc. of the III International Workshop on Modeling, Information Processing and Computing (MIP: Computing-2021), 2021, pp. 103–108. https://doi.org/10.47813/dnit-mip3/2021-2899-103-108
  14. Korobeynikov A.G., Shukalov A.V., Zharinov I.O., Zharinov O.O. Cyber-physical and human control of technological process and equipment sensitive to failures. IOP Conference Series: Earth and Environmental Science, 2021, vol. 839, pp. 42081. https://doi.org/10.1088/1755-1315/839/4/042081
  15. Astakhova T., Verzun N., Kolbanev M., Shamin A. A model for estimating energy consumption seen when nodes of ubiquitous sensor networks communicate information to each other. CEUR Workshop Proceedings, 2019, vol. 2344.
  16. Tatarnikova T.M., Poymanova E.D. Load management in computer networks as a tool to ensure the quality of service delivery. Proc. of the Wave Electronics and its Application in Information and Telecommunication Systems (WECONF), 2021, pp. 9470535. https://doi.org/10.1109/weconf51603.2021.9470535
  17. Biryaltsev E.V., Galimov M.R., Demidov D.E., Elizarov A.M. The platform approach to research and development using high-performance computing. Program Systems: Theory and Applications, 2019, vol. 10, no. 2(41), pp. 93–119. https://doi.org/10.25209/2079-3316-2019-10-2-93-119
  18. Kazimirov A.N. Model-oriented design of a wireless sensor network. Proc. of the 2020 Global Smart Industry Conference (GloSIC), 2020, pp. 391–398. https://doi.org/10.1109/glosic50886.2020.9267874
  19. Rahman P.A. Advanced reliability model of the fault-tolerant disk arrays with data striping and single disk redundancy. Advances in Fault-Tolerance and Reliability Models of the Redundant Disk Arrays: Proc. of the International Scientific and Practical Conference, 2017, pp. 20–25.
  20. Rahman P.A. Advanced reliability model of the fault-tolerant disk arrays with the Dual disk redundancy and data striping. Advances in Fault-Tolerance and Reliability Models of the Redundant Disk Arrays: Proc. of the International Scientific and Practical Conference, 2017, pp. 40–49.
  21. Uspenskaya N.N. Estimation of availability factor for the data storage systems based on redundant disk arrays with the backup. High-Performance and Fault-Tolerant Computing technologies and systems: Proc. of the International Scientific and Practical Conference, 2016, pp. 20–23.
  22. Sharipov M.I. Simplified reliability model of the fault-tolerant disk arrays with data striping and single disk redundancy on basis of the Markov birth-death chain. Advances in Fault-Tolerance and Reliability Models of the Redundant Disk Arrays: Proceedings of the International Scientific and Practical Conference, 2017, pp. 15–19.
  23. Bogatyrev V.A., Bogatyrev S.V., Bogatyrev A.V. Reliability and timeliness of servicing requests in infocommunication systems, taking into account the physical and information recovery of redundant storage devices. Proc. of the 2022 International Conference on Information, Control, and Communication Technologies (ICCT), 2022, pp. 1–4. https://doi.org/10.1109/icct56057.2022.9976800
  24. Bogatyrev V.A., Bogatyrev S.V., Bogatyrev A.V. Reliability of computer systems during physical and informational recovery of duplicated memory. CEUR Workshop Proceedings, 2021, vol. 3057, pp. 274–279.
  25. Bogatyrev V.A., Derkach A.N. Evaluation of a cyber-physical computing system with migration of virtual machines during continuous computing. Computers, 2020, vol. 9, no. 2, pp. 42. https://doi.org/10.3390/computers9020042
  26. Polovko A.M., Gurov S.V. Fundamental Theory of Reliability. St.Petersburg, BHV Publ., 2006, 702 p. (in Russian)


Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
Copyright 2001-2024 ©
Scientific and Technical Journal
of Information Technologies, Mechanics and Optics.
All rights reserved.

Яндекс.Метрика