doi: 10.17586/2226-1494-2024-24-1-62-69


Solving the problem of preliminary partitioning of heterogeneous data into classes in conditions of limited volume

A. V. Sharamet


Read the full article  ';
Article in Russian

For citation:
Sharamet A.V. Solving the problem of preliminary partitioning of heterogeneous data into classes in conditions of limited volume. Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 2024, vol. 24, no. 1, pp. 62–69 (in Russian). doi: 10.17586/2226-1494-2024-24-1-62-69


Abstract
In the context of the formation of heterogeneous data that differ significantly in nature, even of a small volume, it becomes necessary to analyze them for decision-making. This is typical for many high-tech industrial fields of human activity. The problem can be solved by bringing heterogeneous data to a single view and then dividing it into clusters. Instead of searching for a solution for each data element, it is proposed to use the division of the entire set of normalized data into clusters, and thereby simplify the process of isolating the cluster and making a decision on it. The essence of the proposed solution is the automatic grouping of objects with similar data into clusters. This allows you to reduce the amount of analyzed information by combining a lot of data and perform mathematical operations already for the cluster. When splitting, it is proposed to use the theory of fuzzy logic. The possibility of such an approach is due to the fact that different objects always have several characteristics by which they can be combined. These signs are often not obvious and are poorly formalized. A hierarchical modification of the AFC fuzzy clustering method based on the operation (max- min) of the fuzzy similarity ratio is proposed. The basic concepts and definitions of the proposed method of automatic partitioning of a set of input data, a step-by-step scheme of the corresponding cluster procedure are considered. The efficiency of the proposed method is demonstrated by the example of solving the problem of forming a traffic flow. A numerical experiment has shown that the developed algorithm allows you to automatically analyze heterogeneous data and stably divide them into classes. The application of the proposed modification allows for the preliminary partitioning of data into clusters and allows reducing the volume of analyzed data in the future. There is no need to consider the objects in each case separately.

Keywords: reduction of the amount of calculations, automatic division into classes, limited amount of data, hierarchical method, algorithm stability, similarity threshold, traffic flow

Acknowledgements. This article is prepared in memory of Viatchenin D.A. who devoted his whole life to the theory of fuzzy cluster analysis.

References
  1. Svetashov A.K. Using artificial neural networks for their application in the existing and future radio systems: a case study. Young Scientist, 2023, no. 22(469), pp. 52–58. (in Russian)
  2. Alimov Kh.T., Dzamikhova F.Kh., Parovik R.I. Fractional mathematical model McSherry. Bulletin KRASEC. Physical and Mathematical Sciences, 2023, vol. 42, no. 1, pp. 164–179. (in Russian). https://doi.org/10.26117/2079-6641-2023-42-1-164-179
  3. Salimian F., Damiri M., Ramezankhani M., Fariman S.K. Developing a new interval type-2 hesitant fuzzy TOPSIS-based fuzzy best-worst multicriteria decision-making method for competitive pricing in supply chain. Journal of Mathematics, 2022, vol. 2022, pp. 7879028. https://doi.org/10.1155/2022/7879028
  4. Sinitsyn A.V., Lisay N.Yu., Selivanov S.A., Sinitsyn A.A. Development the information systems with fuzzy logic algorithms and network optimization. Information and Innovations, 2023, vol. 18, no. 2, pp. 33–47. https://doi.org/10.31432/1994-2443-2023-18-2-33-47
  5. Gogoi S., Gohain B., Chutia R. Distance measures on intuitionistic fuzzy sets based on cross-information dissimilarity and their diverse applications. Artificial Intelligence Review, 2023, vol. 56, suppl. 3. pp. 3471–3514. https://doi.org/10.1007/s10462-023-10608-y
  6. Viatchenin D.A. Fuzzy Methods for Automatic Classification. Minsk, Tehnoprint Publ., 2004, 219 p. (in Russian)
  7. Belov M.A., Grishko S.I., Zhivetyev A.V., Podgorny S.A., Tokareva N.A. Use of fuzzy logic to create an adaptive individual learning path based on dynamic course complexity management. Modeling, Optimization and Information Technology, 2022, vol. 10, no. 4, pp. 7–8. (in Russian). https://doi.org/10.26102/2310-6018/2022.39.4.018
  8. Edgulova E.K., Tkhabisimova M.M., Bozieva A.M. Features of building knowledge bases in intelligent systems. Information Technologies in Ecology, Education and Business: Conference Proceedings. Nalchik, KBSU, 2021, pp. 168–174. (in Russian)
  9. Zhan Q., Jin L., Yager R.R., Mesiar R.. A novel three-way decision method for interval-valued hesitant fuzzy environment. Soft Computing, 2023, vol. 27, no. 17, pp. 12289–12307. https://doi.org/10.1007/s00500-023-08259-w
  10. Zhao S., Wang D., Changyong L., Lu W. Induced choquet integral aggregation operators with single-valued neutrosophic uncertain linguistic numbers and their application in multiple attribute group decision-making. Mathematical Problems in Engineering, 2019, vol. 2019, pp. 9143624. https://doi.org/10.1155/2019/9143624
  11. Zhang Y., Li P., Wang Y., Ma P., Su X. Multiattribute decision making based on entropy under interval-valued intuitionistic fuzzy environment. Mathematical Problems in Engineering, 2013, vol. 2013, pp. 526871. https://doi.org/10.1155/2013/526871
  12. Tamura S., Higuchi S., Tanaka K. Pattern classification based on fuzzy relations. IEEE Transactions on Systems, Man, and Cybernetics, 1971, vol. SMC-1, no. 1, pp. 61–66. https://doi.org/0.1109/TSMC.1971.5408605
  13. Sidorov D., Belov M. Design of hardware-software systems in the educational process with the use of Virtual Computer Lab. System Analysis in Science and Education, 2020, no. 2, pp. 70–82. (in Russian). https://doi.org/10.37005/2071-9612-2020-2-70-82
  14. Viattchenin D.A. A new heuristic algorithm of fuzzy clustering. Control & Cybernetics, 2004, vol. 33, no. 2, pp. 323–340.
  15. Viatchenin D.A. Parameters of the AFC fuzzy clustering method. Vestnik VA RB, 2004, no. 4, pp. 51–55. (in Russian)
  16. Han Y., Huang Y., Jia S., Liu J. An interval-parameter fuzzy linear programming with stochastic vertices model for water resources management under uncertainty. Mathematical Problems in Engineering, 2013, vol. 2013, pp. 942343. https://doi.org/10.1155/2013/942343
  17. Milovidova A.A., Cheremisina E.N., Dobrynin V.N. Definition algorithm type and parameters of membership function of fuzzy measuring. Modern Science: actual problems of theory and practice", a series "Natural and Technical Sciences, 2019, no. 9, pp. 69–74. (in Russian)


Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
Copyright 2001-2024 ©
Scientific and Technical Journal
of Information Technologies, Mechanics and Optics.
All rights reserved.

Яндекс.Метрика