nach oben

Erschienen in:

2023 | OriginalPaper | Buchkapitel

4. KI-basiertes akustisches Monitoring: Herausforderungen und Lösungsansätze für datengetriebene Innovationen auf Basis audiovisueller Analyse

verfasst von : Patrick Aichroth, Judith Liebetrau

Erschienen in: Entrepreneurship der Zukunft

Verlag: Springer Fachmedien Wiesbaden

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Zusammenfassung

KI-basierte audiovisuelle Analyse kann datengetriebene Produkt-, Prozess- und auch Geschäftsmodellinnovationen in verschiedenen Anwendungsbereichen befördern. Allerdings müssen dafür wichtige Herausforderungen bezüglich Datenerhebung, Datenschutz, Datensicherheit, sowie von Erklärbarkeit und iterativer Entwicklung von KI-Modellen adressiert werden. In diesem Kapitel werden die Innovationspotenziale, relevante Probleme und Lösungsansätze am Beispiel von akustischem Monitoring erläutert. Dabei wird deutlich, dass der frühzeitige Einsatz von Verfahren und Technologien für vertrauenswürdige KI, adäquate Entwicklungsmethoden und systematische Evaluationsprozesse entscheidend für einen erfolgreichen Einsatz und die Realisierung der Innovationspotenziale sind.

Verfahren und Komponenten für audiovisuelle Analyse sind Algorithmen, die Informationen aus Bild, Video- und Audiomaterial extrahieren. Sie können in vielen Anwendungsbereichen wichtige Bausteine für datengetriebene Innovationen sein. Einige dieser Innovationen, und in diesem Zusammenhang relevante Herausforderungen und Lösungsansätze, werden in diesem Kapitel exemplarisch anhand des KI-basierten akustischen Monitorings zur Überwachung von Prozessen, Maschinen und Produkten beschrieben.

Das Kapitel gliedert sich in drei Teile:

Definition relevanter Begrifflichkeiten und Beschreibung der Anwendungsbereiche und Potenziale von akustischem Monitoring für datengetriebene Geschäftsmodelle,

Erläuterung zentraler Herausforderungen im Kontext von Datenerhebung, Datenschutz, Datensicherheit, Erklärbarkeit, iterativer Entwicklung und Evaluation für die Erschließung der o. g. Potenziale, sowie

Zusammenfassung der Ergebnisse und kurzer Ausblick auf relevante Trends.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Einordnung und Rechnungslegung von KI-basierten Geschäftsmodellen bei Start-ups

Nächstes Kapitel Schutz von KI-Trainingsdaten

Frei verfügbare und häufig genutzte Datensets sind zum Beispiel über die Plattformen zenodo (https://zenodo.org/), datahub.io (https://datahub.io/machine-learning), Kaggle (https://www.kaggle.com/datasets) oder VisualData (https://visualdata.io/discovery) verfügbar.

Abayomi-Alli, O., Damaševičius, R., Qazi, A., Adedoyin-Olowe, M., & Misra, S. (2022). Data Augmentation and Deep Learning Methods in Sound Classification: A Systematic Review. Electronics, 11(22), Article 22. https://doi.org/10.3390/electronics11223795.

Abeßer, J., Loos, A., & Prachi, S. (2022). Construction-sAIt: Multi-modal AI-driven technologies for construction site monitoring. In P. Leistner (Hrsg.), Fortschritte der Akustik – DAGA 2022: 48. Jahrestagung für Akustik (S. 90–93). Berlin.

Abeßer, J., Gourishetti, S., Kátai, A., Clauß, T., Sharma, P., & Liebetrau, J. (2021). IDMT-Traffic: An Open Benchmark Dataset for Acoustic Traffic Monitoring Research. In Proceedings of the 29th European Signal Processing Conference (EUSIPCO), (S. 551–555).

Abeßer, J., Götze, M., Clauß, T., Zapf, D., Kühn, C., Lukashevich, H., Kühnlenz, S., & Mimilakis, S. (2019). Urban Noise Monitoring in the Stadtlärm Project – A Field Report. In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019). New York: New York University, 25–26 October 2019.

Aichroth, P., Battis, V., Dewes, A., Dibak, C., Doroshenko, V., Geiger, B., Graner, L., Holly, S., Huth, M., Kämpgen, B., Kaulartz, M., Mundt, M., Rapp, H., Steinebach, M., Sushko, Y., Swarat, D., Winter, C., & Weiß, R. (2020). Anonymisierung und Pseudonymisierung von Daten für Projekte des maschinellen Lernens: Eine Handreichung für Unternehmen. BITKOM.

Arrieta, A. B., Díaz-Rodríguez, N., Del Ser, J., Bennetot, A., Tabik, S., Barbado, A., Garc’ia, S., Gil-L’opez, S., Molina, D., Benjamins, R., Chatila, R., & Herrera, F. (2020). Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion, 58, 82–115.

Balke, S., Driedger, J., Abeßer, J., Dittmar, C., & Müller, M. (2016). Towards evaluating multiple predominant melody annotations in Jazz Recordings. In Proceedings of the 17th International Society for Music Information Retrieval Conference, ISMIR 2016, New York (S. 246–252).

Becker, C., & Mohr, M. (2020). Federated Machine Learning: über Unternehmensgrenzen hinaus aus Produktionsdaten lernen. atp magazin. 62(5), 18–20.

Berthelot, D., Carlini, N., Cubuk, E. D., Kurakin, A., Sohn, K., Zhang, H., & Raffel, C. (2020). ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring. http://arxiv.org/abs/1911.09785

Biedermann, H. (2008). Anlagenmanagement–Managementinstrumente zur Wertsteigerung. TÜV Media.

Biedermann, H. (2014). Anlagenmanagement im Zeitalter von Industrie 4.0 – Handlungsfelder für die industrielle Instandhaltung. In Instandhaltung im Wandel (S. 23–32).

Bittner, F., Gonzalez Rodriguez, M., Richter, M., Lukashevich, H., & Abeßer, J. (2022). Multi-pitch Estimation meets Microphone Mismatch: Applicability of Domain Adaptation. In Proceedings of 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). Bengaluru, 4–8 December 2022.

Bouee, C. E., & Schaible, S. (2015). Die digitale Transformation der Industrie. Roland Berger/BDI.

Cances, L., Labbé, E., & Pellegrini, T. (2022). Comparison of semi-supervised deep learning algorithms for audio classification. EURASIP Journal on Audio, Speech, and Music Processing, 23(1). https://doi.org/10.1186/s13636-022-00255-6.

Cano, E., Plumbley, M., & Dittmar, C. (2014a). Phase-based harmonic/percussive separation. In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH), Singapore (S. 1628–1632).

Cano, E., Schuller, G., & Dittmar, C. (2014b). Pitch-informed solo and accompaniment separation towards its use in music education applications. EURASIP Journal on Advances in Signal Processing, 1, 1–19.

Carletti, M., Masiero, C., Beghi, A., & Susto, G. A. (2019). Explainable machine learning in industry 4.0: Evaluating feature importance in anomaly detection to enable root cause analysis. In Proceedings of the 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC), Bari (S. 21–26).

Cartwright, M., Seals, A., Salamon, J., Williams, A., Mikloska, S., MacConnell, D., Law, E., Bello, J., & Nov., O. (2017). Seeing sound: Investigating the effects of visualizations and complexity on crowdsourced audio annotations. In Proceedings of the ACM on Human-computer Interaction. Issue CSCW, 1–21.

Chandola, V., Banerjee, A., & Kumar, V. (2009). Anomaly detection. ACM Computing Surveys (CSUR), 41(3), 1–58.

Chen, H., Liu, Z., Liu, Z., Zhang, P., & Yan, Y. (2019). Integrating the data augmentation scheme with various classifiers for acoustic scene modeling. In Proceedings of the Detection and Classification of Acoustic Scenes and Events Workshop (DCASE), New York (S. 25–26).

Chen, H., Tao, R., Fan, Y., Wang, Y., Wang, J., Schiele, B., Xie, X., Raj, B., & Savvides, M. (2022). SoftMatch: Addressing the Quantity-Quality Tradeoff in Semi-supervised Learning. The Eleventh International Conference on Learning Representations. https://openreview.net/forum?id=ymt1zQXBDiF.

Clauß, T., Abeßer, J., Lukashevich, H., Gräfe, R., Häuser, F., Kühn, C., & Sporer, Thomas. (2018). Stadtlärm – A distributed system for noise level measurement and noise source identification in a smart city environment. In B. Seeber (Hrsg.), Fortschritte der Akustik – DAGA 2018: 44. Jahrestagung für Akustik (S. 285–288). München.

CORDIS. (2019). REVerse engineering of audio-VIsual coNtent Data (REWIND). https://cordis.europa.eu/project/id/268478. Zugegriffen: 05. Aug. 2021.

Cramer, A. L., Wu, H.-H., Salamon, J., & Bello, J. P. (2019). Look, Listen, and Learn More: Design Choices for Deep Audio Embeddings. ICASSP 2019–2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 3852–3856. https://doi.org/10.1109/ICASSP.2019.8682475.

Cuccovillo, L., & Aichroth, P. (2016). Open-set microphone classification via blind channel analysis. In Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai (S. 2074–2078).

Cuccovillo, L., Mann, S., Aichroth, P., Tagliasacchi, M., & Dittmar, C. (2013a). Blind microphone analysis and stable tone phase analysis for audio tampering detection. In Proceedings of the 135th Audio Engineering Society convention, New York (S. 271–280).

Cuccovillo, L., Mann, S, Tagliasacchi, M., & Aichroth, P. (2013b). Audio tampering detection via microphone classification. In Proceedings of the 2013b IEEE International Workshop on Multimedia Signal Processing (MMSP), Pula (S. 177–182).

DA3KMU. (o. J.). Datenschutz durch statistische Analyse und Adaptive Anonymisierung von personenbezogenen Daten für KMU https://www.da3kmu.de. Zugegriffen: 28. Nov. 2021.

DCASE (o. J.). Detection and classification of acoustic scenes and events. http://dcase.community/. Zugegriffen: 30. Juli 2021.

Denkena, B., Dittrich, M., Noske, H., Kramer, K., & Schmidt, M. (2021). Anwendungen des maschinellen Lernens in der Produktion aus Auftrags- und Produktsicht: Ein Überblick. Zeitschrift für wirtschaftlichen Fabrikbetrieb, 116(5), 358–362.

Deloitte. (o. J.). KI-Studie 2020: Wie nutzen Unternehmen Künstliche Intelligenz? https://www2.deloitte.com/de/de/pages/technology-media-and-telecommunications/articles/ki-studie-2020.html. Zugegriffen: 30. Juli 2021.

Donahue, C., McAuley, J., & Puckette, M. (2019). Adversarial Audio Synthesis. http://arxiv.org/abs/1802.04208.

Došilović, F. K., Brčić, M., & Hlupić, N. (2018). Explainable artificial intelligence: A survey. In Proceedings of the 2018 41st International convention on information and communication technology, electronics and microelectronics (MIPRO), Opatija (S. 0210–0215).

Dwork, C. (2008). Differential privacy: A survey of rresults. In M. Agrawal, D. Du, Z. Duan, & A. Li (Hrsg.), Theory and applications of models of computation (S. 1–19). Springer. https://doi.org/10.1007/978-3-540-79228-4_1.

Engel, J., Agrawal, K. K., Chen, S., Gulrajani, I., Donahue, C., & Roberts, A. (2018). GANSynth: Adversarial Neural Audio Synthesis. International Conference on Learning Representations. https://openreview.net/forum?id=H1xQVn09FX.

European Commission, Directorate-General for Communications Networks, Content and Technology. (2019). Ethics guidelines for trustworthy AI, Publications Office. https://data.europa.eu/doi/doi.org/10.2759/177365. Zugegriffen: 05. Aug. 2021

Fraunhofer IDMT. (o. J.a). Audioforensik Toolbox. https://www.idmt.fraunhofer.de/de/institute/projects-products/audioforensics.html. Zugegriffen: 05. Aug. 2021.

Fraunhofer IDMT. (o. J.b). Sensor Edge Cloud für verteiltes Lernen (SEC-Learn). https://www.idmt.fraunhofer.de/de/institute/projects-products/projects/sec-learn.html. Zugegriffen: 05. Aug. 2021.

Fraunhofer IDMT. (o. J.c). Trusted Resource Aware ICT (TRAICT). https://www.idmt.fraunhofer.de/de/institute/projects-products/projects/tra-ict.html. Zugegriffen: 05. Aug. 2021

Gassmann, O., Frankenberger, K., & Choudury, M. (2020). Geschäftsmodelle entwickeln: 55+ innovative Konzepte mit dem St. Galler Business Model Navigator (3., überarbeitete und erweiterte Edition). Carl Hanser.

Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B.g, Warde-Farley, D., Ozair, S., Courville, A., & Bengio, Y. (2014). Generative adversarial nets. In Proceedings of the 27th International Conference on Neural Information Processing Systems – Volume 2 (NIPS'14) (S. 2672–2680). MIT Press.

Grollmisch, S., Abeßer, J., Liebetrau, J., & Lukashevich, H. (2019). Sounding industry: Challenges and datasets for industrial sound analysis. In Proceedings of the European Signal Processing Conference. IEEE. doi: https://doi.org/10.23919/EUSIPCO.2019.8902941.

Grollmisch, S., & Cano, E. (2021). Improving Semi-Supervised Learning for Audio Classification with FixMatch. Electronics, 10(15), Article 15. https://doi.org/10.3390/electronics10151807.

Grollmisch, S., Cano, E., Kehling, C., & Taenzer, M. (2021). Analyzing the Potential of Pre-Trained Embeddings for Audio Classification Tasks. 2020 28th European Signal Processing Conference (EUSIPCO), 790–794. https://doi.org/10.23919/Eusipco47968.2020.9287743.

Gudivada, V., Apon, A., & Ding, J. (2017). Data quality considerations for big data and machine learning: Going beyond data cleaning and transformations. International Journal on Advances in Software, 10(1), 1–20.

Hannover Messe. (2019). „You have to beat the Germans“ – Wie lange noch? https://www.hannovermesse.de/de/news/news-fachartikel/-you-have-to-beat-the-germans-wie-lange-noch. Zugegriffen: 05. Aug. 2021

Hartz, C. (2021). Künstliche Intelligenz in der Produktentwicklung im Spannungsfeld zwischen juristischer Perfektion und technischer Machbarkeit. LRZ 2021, Rn. 260–294.

Hennequin, R., Khlif, A., Voituret, F., & Moussallam, M. (2020). Spleeter: A fast and efficient music source separation tool with pre-trained models. Journal of Open Source Software, 5(50), 2154.

Hoffmann, K. (2008). Projektmanagement heute. HMD Praxis der Wirtschaftsinformatik, 45(2), 5–16.

Jo, T. (2021). Machine learning foundations. Springer.

Johnson, D., & Grollmisch, S. (2021). Techniques improving the robustness of deep learning models for industrial sound analysis. In Proceedings of the 28th European Signal Processing Conference (EUSIPCO), Amsterdam (S. 81–85). https://doi.org/10.23919/Eusipco47968.2020.9287327.

Johnson, D., Kirner, J., Grollmisch, S., & Liebetrau, J. (2020). Compressed Air Leakage Detection Using Acoustic Emissions with Neural Networks. INTER-NOISE and NOISE-CON Congress and Conference Proceedings, 261(1), 5662–5673.

Jones, M. C., Downie, J. S., & Ehmann, A. F. (2007). In Proceedings of the 8th International Conference on Music Information Retrieval (ISMIR 2007), Vienna (S. 539-542).

Kodrasi, I., Goetze, S., & Doclo, S. (2013). Regularization for partial multichannel equalization for speech dereverberation. IEEE Transactions on Audio, Speech and Language Processing, 21(9), 1879–1890.

Koizumi, Y., Saito, S., Uematsu, H., Harada, N., & Imoto, K. (2019). ToyADMOS: A dataset of miniature-machine operating sounds for anomalous sound detection. http://arxiv.org/abs/1908.03299. Zugegriffen: 04. Aug. 2021.

Kong, Q., Xu, Y., Iqbal, T., Cao, Y., Wang, W., & Plumbley, M.D. (2019). Acoustic Scene Generation with Conditional SampleRNN. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton (S. 925–929).

Kong, Q., Cao, Y., Iqbal, T., Wang, Y., Wang, W., & Plumbley, M. D. (2020). PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, 2880–2894. https://doi.org/10.1109/TASLP.2020.3030497.

Kouw, W. M., & Loog, M. (2021). A review of domain adaptation without target labels. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(3), 766–785.

Kraus, T., Ganschow, L., Eisenträger, M., & Wischmann, S. (2021). Erklärbare Künstliche Intelligenz – Anforderungen, Anwendungen, Lösungen. Technologieprogramm KI-Innovationswettbewerb des Bundesministeriums für Wirtschaft und Energie/Begleitforschung: Iit-Institut für Innovation und Technik in der VDI/VDE Innovation + Technik GmbH.

Landis, J. R., & Koch, G. G. (1977). The measurement of observer agreement for categorical data. Biometrics, 33(1), 159–174.

Lei, Y., Yang, B., Jiang, X., Jia, F., Li, N., & Nandi, A. K. (2020). Applications of machine learning to machine fault diagnosis: A review and roadmap. Mechanical Systems and Signal Processing, 138, 106587.

Leistungszentrum InSignA. (o. J.). Intelligente Signalanalyse- und Assistenzsysteme. https://www.leistungszentrum-insigna.de/. Zugegriffen: 28. Nov. 2021.

Liang, D., Song, W., & Thomaz, E. (2020). Characterizing the effect of audio degradation on privacy perception and inference performance in audio-based human activity recognition. In Proceedings of the 22nd International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI). Oldenburg, 5–9 October 2020.

Mishra, S., Sturm, B. L., & Dixon, S. (2017). Local interpretable model-agnostic explanations for music content analysis. In Proceedings of the 18th International Society for Music Information Retrieval Conference ISMIR 2017, Suzhou (S. 537–543).

Mitchell, T. (1997). Machine learning. McGraw-Hill.

Montavon, G., Binder, A., Lapuschkin, S., Samek, W., & Müller, K.-R. (2019). Layer-Wise relevance propagation: An overview. In W. Samek, G. Montavon, A. Vedaldi, L. K. Hansen, & K.-R. Müller (Hrsg.), Explainable AI: Interpreting, explaining and visualizing deep learning (S. 193–209). Springer International Publishing. https://doi.org/10.1007/978-3-030-28954-6_10.

Morik, K. (2018). Schlüsseltechnologie Maschinelles Lernen. Digitale Welt, 4, 22–27.

Müller, M. (2015). Fundamentals of music processing: Audio, analysis, algorithms, applications. Springer.

Mun, S., Park, S., Han, D. K., & Ko, H. (2017). Generative adversarial networks based acoustic scene training set augmentation and selection using SVM hyperplane. In Proceedings of the Detection and Classification of Acoustic Scenes and Events Workshop (DCASE). Munich, 16–17 November 2017.

Nikolenko, S. I. (2021). Synthetic data for deep learning. Springer.

NSCAI. (2021). Final report. https://www.nscai.gov/wp-content/uploads/2021/03/Full-Report-Digital-1.pdf. Zugegriffen: 05. Aug. 2021.

Papenfuss, C. (2017). Ist prädiktive Instandhaltung die Killer-App für das Industrial Internet of Things? Industrie 4.0 Management, 33, 57–60.

Patki, N., Wedge, R, & Veeramachaneni, K. (2016). The synthetic data vault. In Proceedings – 3rd IEEE International Conference on Data Science and Advanced Analytics (DSAA), Montreal (S. 399–410).

People + AI Research. (o. J.). What-If Tool: Visually probe the behavior of trained machine learning models, with minimal coding. https://pair-code.github.io/what-if-tool/. Zugegriffen: 05. Aug. 2021.

Piller, F. T. (2000). Mass customization. Deutscher Universitätsverlag.

PricewaterhouseCoopers. (2018). Studie: Künstliche Intelligenz in Unternehmen. https://www.pwc.de/de/digitale-transformation/kuenstliche-intelligenz/kuenstliche-intelligenz-in-unternehmen.html. Zugegriffen: 10. Jan. 2022.

Purohit, H., Tanabe, R., Ichige, K., Endo, T., Nikaido, Y., Suefusa, K., & Kawaguchi, Y. (2019). MIMII Dataset: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection (public 1.0) [dataset]. Zenodo. https://doi.org/10.5281/zenodo.3384388.

Rammer, C. (2021). Herausforderungen beim Einsatz von Künstlicher Intelligenz, Ergebnisse einer Befragung von jungen und mittelständischen Unternehmen in Deutschland. Mannheim: Bundesministerium für Wirtschaft und Energie (BMWi).

Reichel, J., & Müller, G. (2018). Betriebliche Instandhaltung. J. Haeffs (Hrsg.). Springer.

Rombach, R., Blattmann, A., Lorenz, D., Esser, P., & Ommer, B. (2022). High-Resolution Image Synthesis with Latent Diffusion Models. http://arxiv.org/abs/2112.10752.

Saeed, A., Grangier, D., & Zeghidour, N. (2021). Contrastive Learning of General-Purpose Audio Representations. ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 3875–3879. https://doi.org/10.1109/ICASSP39728.2021.9413528.

Samek, W., Montavon, G., Lapuschkin, S., Anders, C. J., & Müller, K. R. (2021). Explaining deep neural networks and beyond: A review of methods and applications. In Proceedings of the IEEE, 109(3), 247–278.

Seifert, I., Bürger, M., Wangler, L., Christmann-Budian, S., Rohde, M., Gabriel, P., & Zinke, G. (2018). Potenziale der Künstlichen Intelligenz im produzierenden Gewerbe in Deutschland: Studie im Auftrag des Bundesministeriums für Wirtschaft und Energie (BMWi). Berlin.

Simonite, T. (2018). Google’s AI guru wants computers to think more like brains. Wired. https://www.wired.com/story/googles-ai-guru-computers-think-more-like-brains/. Zugegriffen: 10. Jan. 2022.

Sohn, K., Berthelot, D., Li, C.-L., Zhang, Z., Carlini, N., Cubuk, E. D., Kurakin, A., Zhang, H., & Raffel, C. (2020). FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence. http://arxiv.org/abs/2001.07685.

Stacke, K., Eilertsen, G., Unger, J., & J. & Lundstrom, C. (2021). Measuring domain shift for deep learning in histopathology. IEEE Journal of Biomedical and Health Informatics, 25(2), 325–336.

Stoesser, K. R. (2017). Prozessoptimierung für produzierende Unternehmen. Springer Fachmedien.

Stowell, D. (2022). Computational bioacoustics with deep learning: A review and roadmap. PeerJ, 10, e13152. https://doi.org/10.7717/peerj.13152.

TensorFlow. (o. J.). TensorBoard: Das Visualisierungs-Toolkit von TensorFlow. https://www.tensorflow.org/tensorboard. Zugegriffen: 05. Aug. 2021

Tercan, H., Guajardo, A., & Meisen, T. (2019). Industrial transfer learning: Boosting machine learning in production. In 2019 IEEE 17th International Conference on Industrial Informatics (INDIN), 274–279.

Theissler, A., Pérez-Velázquez, J., Kettelgerdes, M., & Elger, G. (2021). Predictive maintenance enabled by machine learning: Use cases and challenges in the automotive industry. Reliability engineering & system safety, 215, 107864.

Töpfer, A., & Günther S. (2007). Steigerung des Unternehmenswertes durch Null-Fehler-Qualität als strategisches Ziel: Überblick und Einordnung der Beiträge. In Töpfer A. (Hrsg.), Six Sigma. Springer.

Tricentis. (o. J.). AI approaches compared: Rule-based testing vs. learning. https://www.tricentis.com/artificial-intelligence-software-testing/ai-approaches-rule-based-testing-vs-learning/. Zugegriffen: 05. Aug. 2021.

Turian, J., Shier, J., Khan, H., Raj, B., Schuller, B., Steinmetz, C., Malloy, C., Tzanetakis, G., Velarde, G., McNally, K., Henry, M., Pinto, N., Noufi, C., Clough, C., Herremans, D., Fonseca, E., Engel, J., Salamon, J., Esling, P., Manocha, P., Watanabe, S., Jin, Z., & Bisk, Y. (2022). HEAR: Holistic Evaluation of Audio Representations. In Proceedings of Machine Learning Research. Virtual, 13–14 December 2021.

Virtanen, T., Plumbley, M. D., & Ellis, D. (Hrsg.). (2018). Computational Analysis of Sound Scenes and Events. Springer International Publishing. https://doi.org/10.1007/978-3-319-63450-0.

Wang, M., & Deng, W. (2018). Deep visual domain adaptation: A survey. Neurocomputing, 312, 135–153.

Wang, Y. S., Liu, N. N., Guo, H., & Wang, X. L. (2020). An engine-fault-diagnosis system based on sound intensity analysis and wavelet packet pre-processing neural network. Engineering Applications of Artificial Intelligence, 94.

Werner, E. (2009). Wandler für Luftschallmessungen. In M. Möser (Hrsg.), Messtechnik der Akustik (S. 1–53). Springer.

Wong, P. K., Zhong, J., Yang, Z., & Vong, C. M. (2016). Sparse Bayesian extreme learning committee machine for engine simultaneous fault diagnosis. Neurocomputing, 174, 331–343.

Yan, J., Meng, Y., Lu, L., & Li, L. (2017). Industrial big data in an industry 4.0 environment: Challenges, schemes, and applications for predictive maintenance. IEEE Access, 5, 23484–23491. https://doi.org/10.1109/ACCESS.2017.2765544

Yang, Q., Liu, Y., Chen, T., & Tong, Y. (2019). Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology (TIST). 10(2), Article No. 12.

Zapp, T., Jussen, P., & Kurz, M. (2018). Informations- und Kommunikationstechnologien für die Instandhaltungsplanung und -steuerung. In J. Reichel, G. Müller, & J. Haeffs (Hrsg.), Betriebliche Instandhaltung (S. 205–222). Springer.

Titel: KI-basiertes akustisches Monitoring: Herausforderungen und Lösungsansätze für datengetriebene Innovationen auf Basis audiovisueller Analyse
verfasst von: Patrick Aichroth
Judith Liebetrau
Verlag: Springer Fachmedien Wiesbaden
Buch: Entrepreneurship der Zukunft
Print ISBN: 978-3-658-42059-8

Electronic ISBN: 978-3-658-42060-4

Copyright-Jahr: 2023
DOI: https://doi.org/10.1007/978-3-658-42060-4_4

Springer Professional

Zusammenfassung

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner