Skip to main content

2024 | OriginalPaper | Buchkapitel

A Deep CNN-Based Approach for Revolutionizing Bengali Handwritten Numeral Recognition

verfasst von : Sudipta Progga Islam, Farjana Parvin

Erschienen in: Proceedings of the 2nd International Conference on Big Data, IoT and Machine Learning

Verlag: Springer Nature Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Recognition of Bengali handwritten digits is a fascinating and demanding research problem that has garnered significant interest from researchers in the fields of pattern recognition. In this paper, a task-oriented deep convolutional architecture for recognizing handwritten Bengali digits is proposed. The main goal is to get a high level of accuracy while using a small number of parameters. The proposed architecture is designed to address the challenges posed by the complex and diverse nature of handwritten numerals in Bengali script, while also being computationally efficient with only 1.08 million trainable parameters. The performance of the model was evaluated by conducting experiments on two commonly used benchmark datasets of handwritten numerals in Bengali script, CMATERdb-3.1.1 and BanglaLekha-isolated-numerals. Different augmentation techniques were utilized to enhance the diversity and size of the training set, which led to improved robustness and generalization of the model. On the CMATERdb-3.1.1 dataset, the proposed model achieved an accuracy of 99.28%, and on the BanglaLekha-isolated-numerals dataset, it achieved an accuracy of 99.12%, outperforming several state-of-the-art models with comparable or larger numbers of parameters. The results suggest that this task-oriented model can be an efficient and effective solution for the recognition of Bengali handwritten numerals, with potential applications in document analysis, digitization, and text recognition.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Mori S, Nishida H, Yamada H (1999) Optical character recognition. Wiley, Hoboken Mori S, Nishida H, Yamada H (1999) Optical character recognition. Wiley, Hoboken
2.
Zurück zum Zitat Das N, Sarkar R, Basu S, Saha PK, Kundu M, Nasipuri M (2015) Handwritten Bangla character recognition using a soft computing paradigm embedded in two pass approach. Pattern Recogn 48(6):2054–2071CrossRef Das N, Sarkar R, Basu S, Saha PK, Kundu M, Nasipuri M (2015) Handwritten Bangla character recognition using a soft computing paradigm embedded in two pass approach. Pattern Recogn 48(6):2054–2071CrossRef
3.
Zurück zum Zitat Biswas M, Islam R, Shom GK, Shopon M, Mohammed N, Momen S, Abedin A (2017) Banglalekha-isolated: a multi-purpose comprehensive dataset of handwritten Bangla isolated characters. Data Brief 12:103–107CrossRef Biswas M, Islam R, Shom GK, Shopon M, Mohammed N, Momen S, Abedin A (2017) Banglalekha-isolated: a multi-purpose comprehensive dataset of handwritten Bangla isolated characters. Data Brief 12:103–107CrossRef
4.
Zurück zum Zitat Dutta A, Chaudhury S (1993) Bengali alpha-numeric character recognition using curvature features. Pattern Recogn 26(12):1757–1770CrossRef Dutta A, Chaudhury S (1993) Bengali alpha-numeric character recognition using curvature features. Pattern Recogn 26(12):1757–1770CrossRef
5.
Zurück zum Zitat Pal U, Chaudhuri BB (2001) Automatic recognition of unconstrained off-line Bangla handwritten numerals. In: Advances in multimodal interfaces-ICMI 2000: third international conference Beijing, China, October 14–16, 2000 Proceedings. Springer, pp 371–378 Pal U, Chaudhuri BB (2001) Automatic recognition of unconstrained off-line Bangla handwritten numerals. In: Advances in multimodal interfaces-ICMI 2000: third international conference Beijing, China, October 14–16, 2000 Proceedings. Springer, pp 371–378
6.
Zurück zum Zitat Bhattacharya U, Shridhar M, Parui SK (2006) On recognition of handwritten Bangla characters. In: Proceedings of the computer vision, graphics and image processing: 5th Indian conference, ICVGIP 2006, Madurai, India, December 13–16, 2006. Springer, pp 817–828 Bhattacharya U, Shridhar M, Parui SK (2006) On recognition of handwritten Bangla characters. In: Proceedings of the computer vision, graphics and image processing: 5th Indian conference, ICVGIP 2006, Madurai, India, December 13–16, 2006. Springer, pp 817–828
7.
Zurück zum Zitat Azim R, Rahman W, Fazlul Karim M (2016) Bangla hand-written character recognition using support vector machine. Int J Eng Works 3(6):36–46 Azim R, Rahman W, Fazlul Karim M (2016) Bangla hand-written character recognition using support vector machine. Int J Eng Works 3(6):36–46
8.
Zurück zum Zitat Aziz TI, Rubel AS, Salekin MS, Kushol R (2017) Bangla handwritten numeral character recognition using directional pattern. In: Proceedings of the 2017 20th international conference of computer and information technology (ICCIT). IEEE, pp 1–5 Aziz TI, Rubel AS, Salekin MS, Kushol R (2017) Bangla handwritten numeral character recognition using directional pattern. In: Proceedings of the 2017 20th international conference of computer and information technology (ICCIT). IEEE, pp 1–5
9.
Zurück zum Zitat Bhattacharya U, Chaudhuri BB (2008) Handwritten numeral databases of Indian scripts and multistage recognition of mixed numerals. IEEE Trans Pattern Anal Mach Intell 31(3):444–457CrossRef Bhattacharya U, Chaudhuri BB (2008) Handwritten numeral databases of Indian scripts and multistage recognition of mixed numerals. IEEE Trans Pattern Anal Mach Intell 31(3):444–457CrossRef
10.
Zurück zum Zitat Pramanik R, Dansena P, Bag S (2019) A study on the effect of CNN-based transfer learning on handwritten Indic and mixed numeral recognition. In: Document analysis and recognition: 4th workshop, DAR 2018, held in conjunction with ICVGIP 2018, Hyderabad, India, December 18, 2018, Revised Selected Papers 4. Springer, pp 41–51 Pramanik R, Dansena P, Bag S (2019) A study on the effect of CNN-based transfer learning on handwritten Indic and mixed numeral recognition. In: Document analysis and recognition: 4th workshop, DAR 2018, held in conjunction with ICVGIP 2018, Hyderabad, India, December 18, 2018, Revised Selected Papers 4. Springer, pp 41–51
11.
Zurück zum Zitat Sayeed A, Shin J, Hasan MAM, Srizon AY, Hasan MM (2021) Bengalinet: a low-cost novel convolutional neural network for Bengali handwritten characters recognition. Appl Sci 11(15):6845CrossRef Sayeed A, Shin J, Hasan MAM, Srizon AY, Hasan MM (2021) Bengalinet: a low-cost novel convolutional neural network for Bengali handwritten characters recognition. Appl Sci 11(15):6845CrossRef
12.
Zurück zum Zitat Bloice MD, Stocker C, Holzinger A (2017) Augmentor: an image augmentation library for machine learning. arXiv preprint arXiv:1708.04680 Bloice MD, Stocker C, Holzinger A (2017) Augmentor: an image augmentation library for machine learning. arXiv preprint arXiv:​1708.​04680
13.
Zurück zum Zitat Valueva MV, Nagornov NN, Lyakhov PA, Valuev GV, Chervyakov NI (2020) Application of the residue number system to reduce hardware costs of the convolutional neural network implementation. Math Comput Simul 177:232–243MathSciNetCrossRef Valueva MV, Nagornov NN, Lyakhov PA, Valuev GV, Chervyakov NI (2020) Application of the residue number system to reduce hardware costs of the convolutional neural network implementation. Math Comput Simul 177:232–243MathSciNetCrossRef
14.
Zurück zum Zitat Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: Proceedings of the 2009 IEEE conference on computer vision and pattern recognition. IEEE, pp 248–255 Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: Proceedings of the 2009 IEEE conference on computer vision and pattern recognition. IEEE, pp 248–255
15.
Zurück zum Zitat Sarkhel R, Das N, Das A, Kundu M, Nasipuri M (2017) A multi-scale deep quad tree based feature extraction method for the recognition of isolated handwritten characters of popular indic scripts. Pattern Recogn 71:78–93CrossRef Sarkhel R, Das N, Das A, Kundu M, Nasipuri M (2017) A multi-scale deep quad tree based feature extraction method for the recognition of isolated handwritten characters of popular indic scripts. Pattern Recogn 71:78–93CrossRef
17.
Zurück zum Zitat Basu S, Das N, Sarkar R, Kundu M, Nasipuri M, Basu DK (2012) An MLP based approach for recognition of hand written Bangla numerals. arXiv preprint arXiv:1203.0876 Basu S, Das N, Sarkar R, Kundu M, Nasipuri M, Basu DK (2012) An MLP based approach for recognition of hand written Bangla numerals. arXiv preprint arXiv:​1203.​0876
18.
Zurück zum Zitat Ghosh S, Chatterjee A, Singh PK, Bhowmik S, Sarkar R (2021) Language-invariant novel feature descriptors for handwritten numeral recognition. Vis Comput 37(7):1781–1803CrossRef Ghosh S, Chatterjee A, Singh PK, Bhowmik S, Sarkar R (2021) Language-invariant novel feature descriptors for handwritten numeral recognition. Vis Comput 37(7):1781–1803CrossRef
19.
Zurück zum Zitat Alom MZ, Sidike P, Taha TM, Asari VK (2017) Handwritten Bangla digit recognition using deep learning. arXiv preprint arXiv:1705.02680 Alom MZ, Sidike P, Taha TM, Asari VK (2017) Handwritten Bangla digit recognition using deep learning. arXiv preprint arXiv:​1705.​02680
20.
Zurück zum Zitat Keserwani P, Ali T, Roy PP (2019) Handwritten Bangla character and numeral recognition using convolutional neural network for low-memory GPU. Int J Mach Learn Cybern 10:3485–3497CrossRef Keserwani P, Ali T, Roy PP (2019) Handwritten Bangla character and numeral recognition using convolutional neural network for low-memory GPU. Int J Mach Learn Cybern 10:3485–3497CrossRef
21.
Zurück zum Zitat Shawon A, Jamil-Ur Rahman M, Mahmud F, Arefin Zaman MM (2018) Bangla handwritten digit recognition using deep CNN for large and unbiased dataset. In: Proceedings of the 2018 international conference on Bangla speech and language processing (ICBSLP), pp 1–6 Shawon A, Jamil-Ur Rahman M, Mahmud F, Arefin Zaman MM (2018) Bangla handwritten digit recognition using deep CNN for large and unbiased dataset. In: Proceedings of the 2018 international conference on Bangla speech and language processing (ICBSLP), pp 1–6
Metadaten
Titel
A Deep CNN-Based Approach for Revolutionizing Bengali Handwritten Numeral Recognition
verfasst von
Sudipta Progga Islam
Farjana Parvin
Copyright-Jahr
2024
Verlag
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-99-8937-9_14

Premium Partner