nach oben

Erschienen in:

2024 | OriginalPaper | Buchkapitel

Deep Learning: How to Apply Machine Learning and Deep Learning Methods to Audio Analysis

verfasst von : Manan Dabral, Tejinder Kaur, Abhay Khanna, Ashish Yadav, Ojas Sharma, Nakul

Erschienen in: Mobile Radio Communications and 5G Networks

Verlag: Springer Nature Singapore

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

So before understanding about deep learning, we should also look at Artificial Intelligence (AI) and Machine Learning (ML). The purpose of AI is to train machines in such a way that they can function like the human mind. The field of AI includes machine learning, the purpose of which is that the machine can learn by itself according to its experience and can develop such skills in which human involvement is not equal. Let us now understand what Deep Learning is. You can also say that very complex neural networks have been named deep learning, and you can also see it as an advancement in machine learning. Basic machine learning had limited data processing capabilities and generally required structured data. While the data processing capacity of deep learning algorithm is very high, and compared to traditional machine learning, it does not require structured data, rather it can handle both structured and unstructured data. In one sentence, deep learning enables computers to think, understand, and experience like humans.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Deep Learning Assisted Diagnosis of Parkinson’s Disease

Nächstes Kapitel Naive Bayes Classifier-Based Smishing Detection Framework to Reduce Cyber Attack

Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. NIPS’12 Proc 25th Int Conf Neural Inf Process Syst 1:1097–1105

O’ Mahony N, Murphy T, Panduru K et al (2016) Adaptive process control and sensor fusion for process analytical technology. In: 2016 27th Irish signals and systems conference (ISSC). IEEE, pp 1–6

Schöning J, Faion P, Heidemann G (2016) Pixel-wise ground truth annotation in videos—an semi-automatic approach for pixel-wise and semantic object annotation. In: Proceedings of the 5th international conference on pattern recognition applications and methods. SCITEPRESS—Science and and Technology Publications, pp 690–697

Zhang X, Lee J-Y, Sunkavalli K, Wang Z (2017) Photometric stabilization for Fastforward videos

Karami E, Shehata M, Smith A (2017) Image identification using SIFT algorithm: performance analysis against different image deformations

Horiguchi S, Ikami D, Aizawa K (2017) Significance of Softmax-based features in comparison to distance metric learning-based features

Alhaija HA, Mustikovela SK, Mescheder L, et al (2017) Augmented reality meets computer vision: efficient data generation for urban driving scenes

AlDahoul N, Md Sabri AQ, Mansoor AM (2018) Real-time human detection for aerial captured video sequences via deep models. Comput Intell Neurosci 2018:1–14. https://doi.org/10.1155/2018/1639561CrossRef

Li F, Wang C, Liu X et al (2018) A composite model of wound segmentation based on traditional methods and deep neural networks. Comput Intell Neurosci 2018:1–12. https://doi.org/10.1155/2018/4149103CrossRef

10.

Zeng G, Zhou J, Jia X, et al (2018) Hand-crafted feature guided deep learning for facial expression recognition. In: 2018 13th IEEE international conference on automatic face & gesture recognition (FG 2018). IEEE, pp 423–430

11.

Ahmed E, Saint A, Shabayek AER et al (2018) Deep learning advances on different 3D data representations: a survey. arXiv Prepr arXiv 180801462

12.

Braeger S, Foroosh H (2018) Curvature augmented deep learning for 3D object recognition. In: 2018 25th IEEE International conference on image processing (ICIP). IEEE, pp 3648–3652

13.

Niall O’ Mahony (Institute of Technology Tralee), Sean Campbell (Institute of Technology Tralee), Lenka Krpalkova (Institute of Technology Tralee), et al (2018) Deep learning for visual navigation of unmanned ground vehicles; a review

14.

Clément M, Kurtz C, Wendling L (2018) Learning spatial relations and shapes for structural object description and scene recognition. Pattern Recognit 84:197–210. https://doi.org/10.1016/J.PATCOG.2018.06.017CrossRef

15.

Hayou S, Doucet A, Rousseau J (2018) On The selection of initialization and activation function for deep neural networks. arXiv Prepr arXiv 180508266v2

16.

Voulodimos A, Doulamis N, Doulamis A, Protopapadakis E (2018) Deep learning for computer vision: a brief review. Comput Intell Neurosci 2018:1–13. https://doi.org/10.1155/2018/7068349CrossRef

17.

Miikkulainen, R, Liang J, Meyerson E, Rawal A, Fink D, Francon O, Raju B et al (2019) Evolving deep neural networks. In: Artificial intelligence in the age of neural networks and brain computing, pp 293–312. Academic Press

18.

Manohar V, Chen S-J, Wang Z, Fujita Y, Watanabe S, Khudanpur S (2019) Acoustic modeling for overlapping speech recognition: Jhu Chime-5 challenge system. In: ICASSP 2019–2019 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 6665–6669. IEEE

19.

Bischke B, Helber P, Folz J, Borth D, Dengel A (2019) Multi-task learning for segmentation of building footprints with deep neural networks. In: 2019 IEEE International conference on image processing (ICIP), pp 1480–1484. IEEE

20.

Chen J, Wu L, Zhang J, Zhang L, Gong D, Zhao Y, Hu S, Wang Y, Hu X, Zheng B et al (2020) Deep learning-based model for detecting 2019 novel coronavirus pneumonia on high-resolution computed tomography: a prospective study, medRxiv

21.

Maghdid HS, Ghafoor KZ, Sadiq AS, Curran K, Rabie K (2020) A novel ai-enabled framework to diagnose coronavirus covid 19 using smartphone embedded sensors: Design study, arXiv preprint arXiv:2003.07434

22.

Kadra A, Lindauer M, Hutter F, Grabocka J (2021) Regularization is all you need: Simple neural nets can excel on tabular data. arXiv preprint arXiv:2106.11189

23.

Ghantasala GSP, Rao DN, Patan R (2022) Recognition of dubious tissue by using supervised machine learning strategy. Applications of computational methods in manufacturing and product design, Springer, Singapore, pp 395–404

24.

Sachdeva RK, Bathla P (2022) A machine learning-based framework for diagnosis of breast cancer. Int J Software Innov 10(1):1–11CrossRef

25.

Sachdeva RK, Bathla P, Rani P, Kukreja V, Ahuja R (2022) A systematic method for breast cancer classification using RFE feature selection. 2022 2nd International conference on advance computing and innovative technologies in engineering (ICACITE), pp 1673–1676. https://doi.org/10.1109/ICACITE53722.2022.9823464

26.

Kumar Sachdeva R, Garg T, Khaira GS, Mitrav D, Ahuja R (2022) A systematic method for lung cancer classification. 2022 10th International conference on reliability, Infocom technologies and optimization (Trends and Future Directions) (ICRITO), Noida, India, pp 1–5. https://doi.org/10.1109/ICRITO56286.2022.9964778

Titel: Deep Learning: How to Apply Machine Learning and Deep Learning Methods to Audio Analysis
verfasst von: Manan Dabral
Tejinder Kaur
Abhay Khanna
Ashish Yadav
Ojas Sharma
Nakul
Verlag: Springer Nature Singapore
Buch: Mobile Radio Communications and 5G Networks
Print ISBN: 978-981-9706-99-0

Electronic ISBN: 978-981-9707-00-3

Copyright-Jahr: 2024
DOI: https://doi.org/10.1007/978-981-97-0700-3_2

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"