Skip to main content

2024 | OriginalPaper | Buchkapitel

Machine Learning and Data Mining

verfasst von : Dmitry A. Kurasov, Anton S. Kutuzov, Dmitry S. Zvonarev, Anton P. Devyatkov

Erschienen in: Information Technologies and Intelligent Decision Making Systems

Verlag: Springer Nature Switzerland

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The article discusses the main tasks of machine learning. The functional structure of a computer algorithm for solving machine learning problems and a data mining model are considered. The solution of the simplest machine learning problem using the classification method of linear regression is proposed. The analysis of existing learning algorithms based on a decision tree is carried out. Based on the analysis performed, a decision tree was selected for implementation using the so-called C4.5 algorithm. The article builds a decision tree using specific training data. The application of a simple and understandable algorithm for building trees is to create all possible trees, calculate the number of erroneously classified data for each of them and select a tree with a minimum number of errors. As a result, an optimal learning algorithm is formed for the decision tree in terms of data errors in the learning process. The top-down algorithm implemented in the article for building a decision tree selects the attribute with the largest increase in information at each step. Entropy is used as a metric of the amount of information in the training data set D. In the process of implementation, this algorithm is analyzed to identify opportunities for its application to a more complex task.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Demin, A.V., Ponomaryov, D.K.: Machine learning with probabilistic law discovery: a concise introduction. Bull. Irkutsk State Univ. Ser. Math. 43, 91–109 (2023) Demin, A.V., Ponomaryov, D.K.: Machine learning with probabilistic law discovery: a concise introduction. Bull. Irkutsk State Univ. Ser. Math. 43, 91–109 (2023)
2.
Zurück zum Zitat Fedutinov, K.A.: Machine learning in decision support tasks in nature conservation management. Eng. Bull. Don 9(81), 100–113 (2021) Fedutinov, K.A.: Machine learning in decision support tasks in nature conservation management. Eng. Bull. Don 9(81), 100–113 (2021)
3.
Zurück zum Zitat Pandey, A.: Machine Learning. Int. J. Res. Appl. Sci. Eng. Technol. 11(8), 864–869 (2023)CrossRef Pandey, A.: Machine Learning. Int. J. Res. Appl. Sci. Eng. Technol. 11(8), 864–869 (2023)CrossRef
4.
Zurück zum Zitat Kuzmin, O.V., Golikov, V.A.: Application of the “decision tree” method in the diagnosis of a malfunction of an internal combustion engine of a car. Mod. Technol. Syst. Anal. Model. 2(70), 113–120 (2021) Kuzmin, O.V., Golikov, V.A.: Application of the “decision tree” method in the diagnosis of a malfunction of an internal combustion engine of a car. Mod. Technol. Syst. Anal. Model. 2(70), 113–120 (2021)
5.
Zurück zum Zitat Rich, E.: Artificial Intelligence. McGraw-Hill, New York (1983) Rich, E.: Artificial Intelligence. McGraw-Hill, New York (1983)
6.
Zurück zum Zitat Simonov, D.A., Zernov, M.I.: Application of the ID3 algorithm for robot training. Energetika, informatics, innovations-2016, pp. 343–346. Universum, Smolensk (2016) Simonov, D.A., Zernov, M.I.: Application of the ID3 algorithm for robot training. Energetika, informatics, innovations-2016, pp. 343–346. Universum, Smolensk (2016)
7.
Zurück zum Zitat Nursikuwagus, A.: Implementation ID3 algorithm to predict. J. Eng. Appl. Sci. 12(2), 204–207 (2017) Nursikuwagus, A.: Implementation ID3 algorithm to predict. J. Eng. Appl. Sci. 12(2), 204–207 (2017)
8.
Zurück zum Zitat Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Francisco (1993) Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Francisco (1993)
9.
Zurück zum Zitat Kustiyahningsih, Y., Khotimah B.K., Anamisa D.R., Yusuf, M., Rahayu, T., Purnama, J.: Decision Tree C 4.5 algorithm for classification of poor family scholarship recipients. In: IOP Conference Series: Materials Science and Engineering, vol. 1125, no. 1, p. 012048 (2021) Kustiyahningsih, Y., Khotimah B.K., Anamisa D.R., Yusuf, M., Rahayu, T., Purnama, J.: Decision Tree C 4.5 algorithm for classification of poor family scholarship recipients. In: IOP Conference Series: Materials Science and Engineering, vol. 1125, no. 1, p. 012048 (2021)
10.
Zurück zum Zitat Tuyakbasarova, N.A.: Entropy of the system and information. In: Modern Science: Issues of Theory and Practice: a Collection of Materials of the III Correspondence International Scientific and Practical Conference, pp. 168–176 (2018) Tuyakbasarova, N.A.: Entropy of the system and information. In: Modern Science: Issues of Theory and Practice: a Collection of Materials of the III Correspondence International Scientific and Practical Conference, pp. 168–176 (2018)
11.
Zurück zum Zitat Kurasov, D. A: Digital technologies Industry 4.0. In: CEUR Workshop Proceedings, vol. 2843 (2021) Kurasov, D. A: Digital technologies Industry 4.0. In: CEUR Workshop Proceedings, vol. 2843 (2021)
Metadaten
Titel
Machine Learning and Data Mining
verfasst von
Dmitry A. Kurasov
Anton S. Kutuzov
Dmitry S. Zvonarev
Anton P. Devyatkov
Copyright-Jahr
2024
DOI
https://doi.org/10.1007/978-3-031-60318-1_18

Premium Partner