A real-time air-writing model to recognize Bengali characters

Mohammed Abdul Kader; Muhammad Ahsan Ullah; Md Saiful Islam; Fermín Ferriol Sánchez; Md Abdus Samad; Imran Ashraf

doi:10.3934/math.2024325

AIMS Mathematics

2024, Volume 9, Issue 3: 6668-6698. doi: 10.3934/math.2024325


Research article


1. Department of Electrical and Electronic Engineering, International Islamic University Chittagong, Kumira-4318, Bangladesh
2. Department of Electrical and Electronic Engineering, Chittagong University of Engineering & Technology, Chittagong-4349, Bangladesh
3. Department of Electronics and Telecommunication Engineering, Chittagong University of Engineering & Technology, Chittagong-4349, Bangladesh
4. Universidad Europea del Atlántico, Isabel Torres 21, 39011 Santander, Spain
5. Universidad Internacional Iberoamericana, Campeche 24560, México
6. Universidad Internacional Iberoamericana, Arecibo, PR 00613, USA
7. Department of Information and Communication Engineering, Yeungnam University, Gyeongsan, Republic of Korea

Received: 30 November 2023 Revised: 24 January 2024 Accepted: 25 January 2024 Published: 18 February 2024
MSC : 68T01, 68T20

Air-writing is a widely used technique for writing arbitrary characters or numbers in the air. In this study, a data collection technique was developed to collect hand motion data for Bengali air-writing, and a motion sensor-based data set was prepared. The feature set was then used to determine the most effective machine learning (ML) model among well-known supervised ML models for classifying Bengali characters from air-written data. Our results showed that the medium Gaussian SVM had the highest accuracy (96.5%) in classifying Bengali characters from air-writing data. In addition, the proposed system achieved over 81% accuracy in real-time classification. A comparison with other studies showed that the existing supervised ML models classified the created data set more accurately than many models that have been proposed for other languages.


Citation: Mohammed Abdul Kader, Muhammad Ahsan Ullah, Md Saiful Islam, Fermín Ferriol Sánchez, Md Abdus Samad, Imran Ashraf. A real-time air-writing model to recognize Bengali characters[J]. AIMS Mathematics, 2024, 9(3): 6668-6698. doi: 10.3934/math.2024325



1. Introduction

Nowadays, people are used to interacting with the digital world through touch screens, which provide both input and output. It is expected that, as technology improves, digital interaction will become possible without physical products such as laptops and phones. According to ^[1], this technology is anticipated to further enhance our cognitive abilities and provide a seamless connection between individuals and the digital realm. Future developments in virtual and augmented reality are expected to replace existing display components with specialized glasses that project visual output directly onto the user's eyes. However, a comprehensive and effective input method for such future technology has yet to be developed. Voice and gesture recognition are two well-researched approaches, but both have significant constraints. In noisy locations, voice is an inefficient means of interacting with devices. Furthermore, speaking to a device in public spaces is often unacceptable because of privacy concerns.

Gesture recognition relies on a finite number of predefined gestures, so it may not cover all possible interaction scenarios. To address these constraints, air-writing has been proposed. Air-writing allows users to write linguistic characters in the air without memorizing special movements, making it a natural and user-friendly input method for next-generation devices. Additionally, air-writing could be a smart text entry approach for small touchscreen devices, such as smartwatches, where tap-on-screen text input is prone to errors and voice input is subject to privacy concerns and contamination by ambient noise ^[2]. Air-writing has been studied extensively worldwide; most studies have focused on English, and only a few have addressed Bengali. Bengali is the fifth most spoken language worldwide by number of native speakers and the seventh by total speakers, with approximately 267 million speakers, including 230 million native speakers ^[3]. To bring this large Bengali-speaking population into modern technology, it is essential to increase the use of Bengali in technology. Despite being the world's seventh most spoken language, Bengali is not among the top 40 languages used on the internet. To achieve progress in business, education, and information technology through the internet, it is necessary to promote one's mother tongue online. This cannot be accomplished without developing Bengali writing tools that anticipate future challenges. Since air-writing is a potential text entry approach for future technology, many researchers are developing air-writing tools for various languages. Air-writing tools for Bengali are therefore needed to establish its presence in future technology.

In this study, an air-writing model is developed for all Bengali characters; it achieves a recognition rate of 96.5% for Bengali characters (97.2% for Bengali numerals). The recognition rate is unaffected by changes in the environment. The significant contributions of this research are as follows:

● A portable system is developed to record hand movements while writing Bengali characters in the air. In addition, an application is developed that receives data from the portable device over Bluetooth. The application includes a number of visualization options, including the ability to view received data as bar graphs, box plots, frequency domain plots, and raw and preprocessed data.

● No motion sensor-based air-writing data set of Bengali symbols was found. In this study, an air-writing data set is prepared using a 3-dimensional motion sensing technique. All symbols of the language (vowels, consonants, and digits) are written in the air, and the accelerations of the hand in three-dimensional space are recorded using the developed data acquisition system. The data set includes motion data for fifty instances of each character. In total, it contains 3050 instances covering 40 consonants, 11 vowels, and 10 digits.

● The different statistical, time domain, and frequency domain features of the Bengali air-writing data set are examined, and the most useful feature set is determined. The feature set is then used to find the most effective machine learning model for classifying Bengali characters from the data collected by air-writing.

● A real-time application of air-writing of Bengali characters is developed.

2. Related work

Air-writing has already been studied extensively. The proposed approaches fall into four categories: computer vision-based air-writing, radar sensor-based air-writing, WiFi signal-based air-writing, and motion sensor-based air-writing.

In computer vision-based air-writing recognition, a sequence of images is captured while a person traces a symbol in the air with a finger or an object of fixed color. Computer vision algorithms are then applied to these images to detect, segment, and finally identify the gesture ^[4,5,6]. Md. Shahinur et al. proposed a trajectory-based air-writing system that enables a user to write a linguistic character or word in open space by waving a finger in front of a camera ^[4]. In ^[5], Oyndrila De et al. developed a system that can classify air-written digits from a real-time video stream. In ^[6], the classification of English digits and letters from hand gestures based on cosine similarity and fast nearest neighbor (NN) techniques is proposed. Chengzhang Qu proposed and implemented a user-friendly human-computer interaction system based on Kinect handwriting ^[7]. A slope variation detection-based air-writing recognition system for Persian numbers is proposed in ^[8]. Pradeep Kumar et al. ^[9] proposed real-time recognition of sign language gestures and air-writing using Leap Motion. They also used a hidden Markov model (HMM) and bidirectional long short-term memory neural networks (BLSTM-NNs) to perform 3D text recognition, obtaining word recognition accuracies of 86.88% and 81.25% for the two methods, respectively ^[10]. In another study, Xiwen Qu et al. ^[11] used convolutional neural networks (CNNs) to recognize air-written Chinese characters and found that their method achieved high recognition accuracy. Ji Gan and colleagues introduced a system for recognizing 3D in-air handwritten Chinese text (IAHCTR). They employed a new architecture called the temporal convolutional recurrent network (TCRN), specifically designed for online handwritten Chinese text recognition (HCTR) ^[12], and reported superior results compared with the latest online HCTR methods. The review of these papers reveals that, under fixed illumination conditions, the majority of computer vision-based air-writing models are highly accurate. Unfortunately, inconsistent lighting and the background adversely affect the accuracy of these models.

In recent years, the millimeter wave radar sensor has also become a viable gesture-based air-writing solution because of its low power use, noncontact sensing, and independence from variations in light intensity. In ^[13], the authors proposed a millimeter wave radar-based air-writing application that includes a signal processing technique and a system design for gesture-based air-writing. They tested the system on five different gestures, 10 numerical symbols, and 9 alphabetic symbols in a two-dimensional space. In ^[14], a novel 60 GHz millimeter wave radar-based air-writing device has been developed that allows users to write arbitrary characters or numbers in the air while being encircled by a network of radars. Using trilateration and an alpha-beta tracking algorithm, the system is able to locate and follow the user's hand marker. The local hand trajectory was sensed using a millimeter-wave frequency-modulated continuous wave (FMCW) radar operating at 60 GHz, and a dataset of 3750 character instances was recorded in ^[14]. The accuracy of recognizing the drawn character was then demonstrated using a 1D temporal convolutional neural network (TCN). Faheem Khan et al. created an impulse radio ultra-wideband (IR-UWB) radar-based system that can detect alphanumeric characters in midair without a handheld device; the hardware consisted of four IR-UWB radar sensors arranged in a rectangular layout. In character classification, the method was found to perform better than the state of the art ^[15]. The precision of radar sensor-based air-writing systems is good; however, such systems are not portable because they need an environment equipped with several radar sensors.

Another approach is motion sensor-based air-writing. This method allows users to write in the air by tracking hand movements using motion sensors, such as accelerometers and gyroscopes. The accuracy of this method does not depend on the conditions of the surroundings. The user holds a motion sensor in the hand or wears it on the body and performs a gesture in the air to create a linguistic symbol. The sensor readings are then analyzed to determine the character that the user has drawn in the air. A variety of techniques have been proposed to recognize the written character from the raw motion sensor data. In ^[1], a novel algorithm named 2-DifViz is presented that converts hand movements in the air (captured by a Myo armband worn by the user) into text. Gesture-based robot control using an accelerometer sensor is proposed in ^[16]. A contour-based gesture model that converts human gestures into contours in 3D space and then recognizes these contours as characters is presented in ^[17]. An accelerometer and gyroscope-based air-writing character recognition system using CHMM is proposed in ^[18]. In ^[19], Jeen-Shing Wang et al. present an accelerometer-based digital pen for handwritten digit and gesture trajectory recognition.

The majority of the publications cited here are concerned with English character identification by air-writing. Only one study on Bengali was found, in which Prasun Roy et al. ^[20] proposed a video camera-dependent air-writing framework for English, Bengali, and Devanagari numerals. For Bengali numerals, the recognition rate was 95.4% under constant illumination conditions. Because the system relies on color-based segmentation, its performance fluctuates substantially with lighting conditions. Recognition of the Bengali alphabet was not considered.

3. Materials and methods

3.1. Overview of the system

A full illustration of the system is presented in Figure 1. The system has a data acquisition unit. With this unit in hand, each Bengali character is written in the air fifty times. To create the dataset, the data acquisition device recorded and labeled the velocity of motion in the x, y, and z directions for every Bengali character written in the air. The dataset is preprocessed, and features are extracted from the labeled data. These features are used to train the classification model. Once trained, the model is ready for real-time classification of Bengali air-written characters. To use the classification model in a real-time scenario, the user traces a Bengali character in the air, and the computer receives the data generated by the air-writing.

Figure 1. Block diagram of the proposed system.

Name of parameters	No. of features	Feature label (Assumed)
Mean	3	$DC_x, DC_y, DC_z$
Standard deviation	3	$SD_x, SD_y, SD_z$
RMS value	3	$RMS_x, RMS_y, RMS_z$
Correlation among axes	3	$Corr_{xy}, Corr_{yz}, Corr_{zx}$
Entropy	3	$EntropyX, EntropyY, EntropyZ$
Principal component coefficients	3	$PCA_x, PCA_y, PCA_z$
Zero crossing rate	3	$ZCR_x, ZCR_y, ZCR_z$
Interquartile range	3	$IQR_x, IQR_y, IQR_z$
Energy	3	$E_x, E_y, E_z$
Mean frequency	3	$MF_x, MF_y, MF_z$
Spectral roll-off	3	$SR_x, SR_y, SR_z$
Spectral bandwidth	3	$SB_x, SB_y, SB_z$
Total features	36
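
As an illustration of how a few of the time-domain features in the table above could be computed from a three-axis recording, the following is a minimal sketch using only the Python standard library. The function names, the exact definitions (for example, sample standard deviation and sign-change zero crossing rate), and the toy signal are assumptions made for illustration; they are not taken from the paper.

```python
import math
import statistics


def axis_features(samples):
    """Return a few time-domain features for one axis of a motion recording."""
    n = len(samples)
    mean = statistics.fmean(samples)
    sd = statistics.stdev(samples)  # sample standard deviation (assumed variant)
    rms = math.sqrt(sum(v * v for v in samples) / n)
    # Zero crossing rate: fraction of consecutive pairs whose signs differ.
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if a * b < 0)
    zcr = crossings / (n - 1)
    # Interquartile range from the quartile cut points.
    q1, _, q3 = statistics.quantiles(samples, n=4)
    return {"DC": mean, "SD": sd, "RMS": rms, "ZCR": zcr, "IQR": q3 - q1}


def correlation(a, b):
    """Pearson correlation between two equally long axis recordings."""
    ma, mb = statistics.fmean(a), statistics.fmean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


# Hypothetical toy recording (not data from the paper).
x = [1.0, -1.0, 2.0, -2.0, 1.0, -1.0]
y = [2.0, -2.0, 4.0, -4.0, 2.0, -2.0]
fx = axis_features(x)
corr_xy = correlation(x, y)
```

Applying `axis_features` to each of the three axes and `correlation` to each axis pair would yield the mean, standard deviation, RMS, zero crossing rate, interquartile range, and correlation entries of the table; the entropy, principal component, and frequency-domain features are not covered by this sketch.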

Name of parameters	No. of features	Selected features
Mean	2	$DC_x, DC_z$
Standard deviation	1	$SD_z$
RMS value	3	$RMS_x, RMS_y, RMS_z$
Correlation among axes	2	$Corr_{xy}, Corr_{zx}$
Entropy	1	$EntropyX$
Principal component coefficients	3	$PCA_x, PCA_y, PCA_z$
Zero crossing rate	3	$ZCR_x, ZCR_y, ZCR_z$
Interquartile range	2	$IQR_y, IQR_z$
Energy	3	$E_x, E_y, E_z$
Spectral bandwidth	2	$SB_y, SB_z$
Total features	22
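
The reduction from 36 to 22 features can be expressed as a simple selection by label. The sketch below assumes that a full feature vector is ordered as in the first table (row by row, axes in the listed order); that ordering and the helper name `select_features` are illustrative assumptions, while the labels themselves come from the two tables.

```python
# Labels of the full 36-dimensional feature vector, in the order of the first table.
ALL_LABELS = [
    "DC_x", "DC_y", "DC_z",
    "SD_x", "SD_y", "SD_z",
    "RMS_x", "RMS_y", "RMS_z",
    "Corr_xy", "Corr_yz", "Corr_zx",
    "EntropyX", "EntropyY", "EntropyZ",
    "PCA_x", "PCA_y", "PCA_z",
    "ZCR_x", "ZCR_y", "ZCR_z",
    "IQR_x", "IQR_y", "IQR_z",
    "E_x", "E_y", "E_z",
    "MF_x", "MF_y", "MF_z",
    "SR_x", "SR_y", "SR_z",
    "SB_x", "SB_y", "SB_z",
]

# Labels of the 22 selected features, copied from the second table.
SELECTED_LABELS = [
    "DC_x", "DC_z", "SD_z",
    "RMS_x", "RMS_y", "RMS_z",
    "Corr_xy", "Corr_zx", "EntropyX",
    "PCA_x", "PCA_y", "PCA_z",
    "ZCR_x", "ZCR_y", "ZCR_z",
    "IQR_y", "IQR_z",
    "E_x", "E_y", "E_z",
    "SB_y", "SB_z",
]

# Positions of the selected features inside the full vector.
SELECTED_INDEX = [ALL_LABELS.index(label) for label in SELECTED_LABELS]


def select_features(full_vector):
    """Keep only the selected entries of a 36-element feature vector."""
    if len(full_vector) != len(ALL_LABELS):
        raise ValueError("expected a vector with 36 features")
    return [full_vector[i] for i in SELECTED_INDEX]


# Hypothetical usage: a dummy vector whose values equal their own positions.
reduced = select_features(list(range(36)))
```

Note that, according to the tables, the mean frequency and spectral roll-off features are dropped entirely, while RMS, principal component coefficients, zero crossing rate, and energy are kept for all three axes.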

Classification model	Accuracy	Classification model	Accuracy
Fine Tree	60.3%	Medium Tree	22.0%
Coarse Tree	6.1%	Linear Discriminant	95.4%
Gaussian Naïve Bayes	93.3%	Kernel Naïve Bayes	92.9%
Linear SVM	95.3%	Quadratic SVM	95.8%
Cubic SVM	94.9%	Fine Gaussian SVM	35.4%
Medium Gaussian SVM	96.5%	Coarse Gaussian SVM	91.7%
Fine KNN	94.1%	Medium KNN	91.7%
Coarse KNN	80.5%	Cosine KNN	90.8%
Weighted KNN	93.4%	Cubic KNN	90.6%
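
The table shows a large gap between the fine (35.4%), medium (96.5%), and coarse (91.7%) Gaussian SVMs. One plausible reading, which the text above does not state, is that these variants differ in the width of the Gaussian kernel. The sketch below illustrates how a kernel scale changes the similarity between two feature vectors. The kernel parameterization, the three scale values, and the example vectors are all assumptions for illustration, not settings reported in the paper.

```python
import math


def gaussian_kernel(u, v, kernel_scale):
    """Gaussian (RBF) similarity exp(-||u - v||^2 / kernel_scale^2)."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-squared_distance / kernel_scale ** 2)


N_FEATURES = 22                        # size of the selected feature set
medium_scale = math.sqrt(N_FEATURES)   # assumed heuristic, not from the paper
fine_scale = medium_scale / 4          # assumed: narrower kernel
coarse_scale = medium_scale * 4        # assumed: wider kernel

# Two hypothetical feature vectors that differ by 1.0 in every coordinate.
u = [0.0] * N_FEATURES
v = [1.0] * N_FEATURES

k_fine = gaussian_kernel(u, v, fine_scale)
k_medium = gaussian_kernel(u, v, medium_scale)
k_coarse = gaussian_kernel(u, v, coarse_scale)
```

With a narrow kernel, even moderately distant samples look almost unrelated, which tends to make a classifier memorize individual training points; a very wide kernel makes all samples look alike. Under these assumptions, an intermediate scale is a reasonable compromise, which would be consistent with the medium variant scoring highest in the table.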

Character	Attempt 1	Attempt 2	Attempt 3	Attempt 4	Attempt 5	Attempt 6	Attempt 7	Attempt 8	Attempt 9	Attempt 10	Accuracy
ka ()		$\checkmark$	$\checkmark$			$\checkmark$		$\checkmark$	$\checkmark$	$\checkmark$	60.0%
kha ()	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$		$\checkmark$	90.0%
ek ()	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$		$\checkmark$	$\checkmark$	$\checkmark$	90.0%
i ()	$\checkmark$	$\checkmark$		$\checkmark$	$\checkmark$	$\checkmark$		$\checkmark$	$\checkmark$	$\checkmark$	80.0%
cha ()	$\checkmark$			$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$		$\checkmark$	70.0%
tin ()			$\checkmark$	$\checkmark$		$\checkmark$	$\checkmark$		$\checkmark$	$\checkmark$	60.0%
rri ()	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$		$\checkmark$	$\checkmark$	90.0%
ngo ()	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	100.0%
pach ()		$\checkmark$		$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	80.0%
jha ()	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$		$\checkmark$	$\checkmark$	$\checkmark$	90.0%
	Overall accuracy										81%
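
The per-character and overall accuracies in the table above follow directly from the check marks. The short sketch below transcribes each row as a string of ten attempts (`1` for a correct recognition, `0` for a miss) and recomputes the percentages; the dictionary layout and variable names are illustrative choices.

```python
# One string per character: ten real-time attempts, "1" = recognized correctly.
ATTEMPTS = {
    "ka":   "0110010111",
    "kha":  "1111111101",
    "ek":   "1111110111",
    "i":    "1101110111",
    "cha":  "1001111101",
    "tin":  "0011011011",
    "rri":  "1111111011",
    "ngo":  "1111111111",
    "pach": "0101111111",
    "jha":  "1111110111",
}

# Accuracy per character, in percent.
per_character = {
    name: 100 * marks.count("1") / len(marks)
    for name, marks in ATTEMPTS.items()
}

# Overall accuracy: correct attempts over all attempts, in percent.
total_correct = sum(marks.count("1") for marks in ATTEMPTS.values())
total_attempts = sum(len(marks) for marks in ATTEMPTS.values())
overall = 100 * total_correct / total_attempts
```

Here 81 of the 100 attempts are correct, which reproduces the 81% overall accuracy in the last row of the table.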

Research work	Language	Method	Accuracy
^[31]	English	Computer vision based	96.11%
^[32]	English	Computer vision based	86.9%
^[33]	Chinese	Computer vision based	98.11%
^[34]	Japanese	Computer vision based	92.5%
^[35]	English Digit	Kinect sensor based	96.8%
^[36]	English Letter	Motion sensor based	95.0%
^[17]	English Letter	Motion sensor based	94.3%
^[37]	English	Motion sensor based	88.4%
^[14]	English Letter	Radar sensor based	98.33%
^[38]	English Letter	WiFi signal based	88.74%
^[39]	Latin	Leap Motion	72.25%
^[20]	Bengali digit	Computer vision based	95.4%
Proposed	Bengali Alphabet	Motion sensor based	96.5%

References

[1]	A. Dash, A. Sahu, R. Shringi, J. Gamboa, M. Z. Afzal, M. I. Malik, et al., Airscript-creating documents in air, In: 2017 14th IAPR international conference on document analysis and recognition (ICDAR), 2017, 908–913. https://doi.org/10.1109/ICDAR.2017.153
[2]	X. Lin, Y. Chen, X. Chang, X. Liu, X. Wang, Show: Smart handwriting on watches, In: Proceedings of the ACM on interactive, mobile, wearable and ubiquitous technologies, 1 (2018), 151. https://doi.org/10.1145/3161412
[3]	The Bengali language and the history of its evolution, LingoStar, 2021. Available from: https://lingo-star.com/bengali-language/?v=4326ce96e26c.
[4]	M. S. Alam, K. C. Kwon, M. A. Alam, M. Y. Abbass, S. M. Imtiaz, N. Kim, Trajectory-based air-writing recognition using deep neural network and depth sensor, Sensors, 20 (2020), 376. https://doi.org/10.3390/s20020376
[5]	O. De, P. Deb, S. Mukherjee, S. Nandy, T. Chakraborty, S. Saha, Computer vision based framework for digit recognition by hand gesture analysis, In: 2016 IEEE 7th annual information technology, electronics and mobile communication conference (IEMCON), 2016. https://doi.org/10.1109/IEMCON.2016.7746361
[6]	S. Poularakis, I. Katsavounidis, Low-complexity hand gesture recognition system for continuous streams of digits and letters, IEEE T. Cybernetics, 46 (2016), 2094–2108. https://doi.org/10.1109/TCYB.2015.2464195
[7]	C. Qu, D. Zhang, J. Tian, Online kinect handwritten digit recognition based on dynamic time warping and support vector machine, J. Inform. Comput. Sci., 12 (2015), 413–422.
[8]	S. Mohammadi, R. Maleki, Air-writing recognition system for Persian numbers with a novel classifier, The Visual Comput., 36 (2020), 1001–1015. https://doi.org/10.1007/s00371-019-01717-3
[9]	P. Kumar, R. Saini, S. K. Behera, D. P. Dogra, P. P. Roy, Real-time recognition of sign language gestures and air-writing using leap motion, In: 2017 fifteenth IAPR international conference on machine vision applications (MVA), 2017. https://doi.org/10.23919/MVA.2017.7986825
[10]	P. Kumar, R. Saini, P. P. Roy, D. P. Dogra, Study of text segmentation and recognition using leap motion sensor, IEEE Sens. J., 17 (2017), 1293–1301. https://doi.org/10.1109/JSEN.2016.2643165
[11]	X. Qu, W. Wang, K. Lu, J. Zhou, Data augmentation and directional feature maps extraction for in-air handwritten Chinese character recognition based on convolutional neural network, Pattern Recogn. Lett., 111 (2018), 9–15. https://doi.org/10.1016/j.patrec.2018.04.001
[12]	J. Gan, W. Wang, K. Lu, In-air handwritten Chinese text recognition with temporal convolutional recurrent network, Pattern Recogn., 97 (2020), 107025. https://doi.org/10.1016/j.patcog.2019.107025
[13]	P. Wang, J. Lin, F. Wang, J. Xiu, Y. Lin, N. Yan, et al., A gesture air-writing tracking method that uses 24 GHz SIMO radar SoC, IEEE Access, 8 (2020), 152728–152741. https://doi.org/10.1109/ACCESS.2020.3017869
[14]	M. Arsalan, A. Santra, K. Bierzynski, V. Issakov, Air-writing with sparse network of radars using spatio-temporal learning, In: 2020 25th international conference on pattern recognition (ICPR), 2021. https://doi.org/10.1109/ICPR48806.2021.9413332
[15]	F. Khan, S. K. Leem, S. H. Cho, In-air continuous writing using UWB impulse radar sensors, IEEE Access, 8 (2020), 99302–99311. https://doi.org/10.1109/ACCESS.2020.2994281
[16]	M. K. Chakravarthi, R. K. Tiwari, S. Handa, Accelerometer based static gesture recognition and mobile monitoring system using neural networks, Procedia Comput. Sci., 70 (2015), 683–687. https://doi.org/10.1016/j.procs.2015.10.105
[17]	Y. Yin, L. Xie, T. Gu, Y. Lu, S. Lu, AirContour: Building contour-based model for in-air writing gesture recognition, ACM T. Sensor. Network, 15 (2019), 44. https://doi.org/10.1145/3343855
[18]	S. Xu, Y. Xue, Air-writing characters modelling and recognition on modified CHMM, In: 2016 IEEE international conference on systems, man, and cybernetics (SMC), 2016. https://doi.org/10.1109/SMC.2016.7844452
[19]	J. S. Wang, F. C. Chuang, An accelerometer-based digital pen with a trajectory recognition algorithm for handwritten digit and gesture recognition, IEEE T. Ind. Electron., 59 (2012), 2998–3007. https://doi.org/10.1109/TIE.2011.2167895
[20]	P. Roy, S. Ghosh, U. Pal, A CNN based framework for unistroke numeral recognition in air-writing, In: 2018 16th international conference on frontiers in handwriting recognition (ICFHR), 2018. https://doi.org/10.1109/ICFHR-2018.2018.00077
[21]	Coursera, Data processing and feature engineering with MATLAB, Available from: https://www.coursera.org/learn/feature-engineering-matlab.
[22]	Entropy calculation, information gain & decision tree learning, 2020. Available from: https://medium.com/analytics-vidhya/entropy-calculation-information-gain-decision-tree-learning-771325d16f
[23]	T. Giannakopoulos, A. Pikrakis, Introduction to audio analysis: A MATLAB® approach, 1st Eds, Cambridge, Massachusetts, US: Academic Press, 2014.
[24]	E. Scheirer, M. Slaney, Construction and evaluation of a robust multifeature speech/music discriminator, In: 1997 IEEE international conference on acoustics, speech, and signal processing, 1997. https://doi.org/10.1109/ICASSP.1997.596192
[25]	M. Müller, Fundamentals of music processing: Audio, analysis, algorithms, applications, Springer Cham, 2015. https://doi.org/10.1007/978-3-319-21945-5
[26]	M. A. Kader, M. A. Ullah, M. S. Islam, A real-time classification model for Bengali character recognition in air-writing, In: Computer vision and image analysis for industry 4.0, 1st Eds, Chapman and Hall/CRC, 2023.
[27]	Javatpoint, Regression vs. classification in machine learning, Available from: https://www.javatpoint.com/regression-vs-classification-in-machine-learning.
[28]	A. Burkov, The hundred-page machine learning book, 1st Eds, Quebec City, QC, Canada: Andriy Burkov, 2019.
[29]	M. Mohammed, M. B. Khan, E. B. M. Bashier, Machine learning: Algorithms and applications, 1st Eds, Boca Raton: CRC Press, 2016. https://doi.org/10.1201/9781315371658
[30]	B. Dickson, Machine learning: What is dimensionality reduction? 2021. Available from: https://bdtechtalks.com/2021/05/13/machine-learning-dimensionality-reduction/.
[31]	S. Mukherjee, S. A. Ahmed, D. P. Dogra, S. Kar, P. P. Roy, Fingertip detection and tracking for recognition of air-writing in videos, Expert Syst. Appl., 136 (2019), 217–229. https://doi.org/10.1016/j.eswa.2019.06.034
[32]	V. Joseph, A. Talpade, N. Suvarna, Z. Mendonca, Visual gesture recognition for text writing in air, In: 2018 second international conference on intelligent computing and control systems (ICICCS), 2018. https://doi.org/10.1109/ICCONS.2018.8663176
[33]	J. Gan, W. Wang, K. Lu, A new perspective: Recognizing online handwritten Chinese characters via 1-dimensional CNN, Inform. Sci., 478 (2019), 375–390. https://doi.org/10.1016/j.ins.2018.11.035
[34]	S. Hayakawa, I. Goncharenko, Y. Gu, Air writing in Japanese: A CNN-based character recognition system using hand tracking, In: 2022 IEEE 4th global conference on life sciences and technologies (LifeTech), 2022. https://doi.org/10.1109/LifeTech53646.2022.9754825
[35]	C. Wang, C. Y. Su, C. L. Lin, A novel recognition system for digits writing in the air using coordinated path ordering, In: HotMobile '15: Proceedings of the 16th international workshop on mobile computing systems and applications, 2015, 9–14. https://doi.org/10.1109/ICIIBMS.2015.7439500
[36]	C. Xu, P. H. Pathak, P. Mohapatra, Finger-writing with smartwatch: A case for finger and hand gesture recognition using smartwatch, In: Proceedings of the 16th International Workshop on Mobile Computing Systems and Applications, 2015, 9-14. https://doi.org/10.1145/2699343.2699350
[37]	Y. Luo, J. Liu, S. Shimamoto, Wearable air-writing recognition system employing dynamic time warping, In: 2021 IEEE 18th annual consumer communications & networking conference (CCNC), 2021. https://doi.org/10.1109/CCNC49032.2021.9369458
[38]	Z. Fu, J. Xu, Z. Zhu, A. X. Liu, X. Sun, Writing in the air with WiFi signals for virtual reality devices, IEEE T. Mobile Comput., 18 (2019), 473–484. https://doi.org/10.1109/TMC.2018.2831709
[39]	P. Kumar, R. Saini, P. P. Roy, U. Pal, A lexicon-free approach for 3D handwriting recognition using classifier combination, Pattern Recogn. Lett., 103 (2018), 1–7. https://doi.org/10.1016/j.patrec.2017.12.014
