
2024 | Original Paper | Book Chapter

Analyzing the Impact of Carbon Emission in Training Neural Machine Translation Models: A Case Study

Authors: Goutam Datta, Nisheeth Joshi, Kusum Gupta

Published in: ICT: Innovation and Computing

Publisher: Springer Nature Singapore


Abstract

As machine learning grows rapidly, increasing attention is being paid to how training complex models affects the environment. The carbon emissions caused by the computational demands of machine learning algorithms are becoming a major concern, since these models require large amounts of computing power and energy. The goal of this paper is to determine how training Neural Machine Translation models affects the environment in terms of carbon footprint and to examine ways of reducing that effect. Machine translation, the automatic translation of text from a source language to a target language, is a long-standing area of research in natural language processing. Neural Machine Translation (NMT) improves translation performance by exploiting Artificial Neural Networks (ANNs) in its model implementation; however, NMT is highly data-hungry and requires long training times. In this paper, an attempt is made to estimate the carbon emissions produced when different NMT models are trained on low-resource language pairs, such as English-Hindi and English-Bengali, under different hardware configurations. Finally, several alternatives are suggested for reducing these emissions and thereby minimizing their adverse impact on the environment.
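The chapter page itself does not include code; as a rough illustration of how training-time emissions can be estimated, the sketch below wraps a placeholder NMT training loop with the open-source codecarbon tracker, which estimates energy use (kWh) and multiplies it by the carbon intensity of the local power grid (kg CO2-eq per kWh). The project name, epoch count, and train_one_epoch function are assumptions for illustration, not taken from the paper.

    # Minimal sketch (not the authors' code): estimating CO2 emitted while
    # training an NMT model, using the open-source codecarbon tracker.
    from codecarbon import EmissionsTracker

    def train_one_epoch(model, data):
        # Placeholder for one epoch of NMT training (forward/backward passes).
        ...

    def train_with_emission_tracking(model, data, epochs=10):
        # Hypothetical project name; codecarbon typically also writes its
        # results to an emissions.csv file in the working directory.
        tracker = EmissionsTracker(project_name="nmt-en-hi")
        tracker.start()
        try:
            for _ in range(epochs):
                train_one_epoch(model, data)
        finally:
            emissions_kg = tracker.stop()  # estimated kg CO2-eq for this run
        print(f"Estimated training emissions: {emissions_kg:.4f} kg CO2-eq")
        return emissions_kg

Repeating the same measurement on different hardware (for example, different GPU types) yields comparable per-run emission estimates, which is the kind of comparison across hardware configurations that the abstract describes.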


Metadata
Title
Analyzing the Impact of Carbon Emission in Training Neural Machine Translation Models: A Case Study
Authors
Goutam Datta
Nisheeth Joshi
Kusum Gupta
Copyright year
2024
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-99-9486-1_7
