Research article

Bird sound recognition based on adaptive frequency cepstral coefficient and improved support vector machine using a hunter-prey optimizer


  • Received: 30 July 2023 Revised: 12 October 2023 Accepted: 13 October 2023 Published: 19 October 2023
  • Bird sound recognition is crucial for bird protection. As bird populations have decreased at an alarming rate, monitoring and analyzing bird species helps us observe diversity and environmental adaptation. A machine learning model was used to classify bird sound signals. To improve the accuracy of bird sound recognition in low-cost hardware systems, a recognition method based on the adaptive frequency cepstrum coefficient and an improved support vector machine model using a hunter-prey optimizer was proposed. First, in sound-specific feature extraction, an adaptive factor is introduced into the extraction of the frequency cepstrum coefficients. The adaptive factor was used to adjust the continuity, smoothness and shape of the filters. The features in the full frequency band are extracted by complementing the two groups of filters. Then, the feature was used as the input for the following support vector machine classification model. A hunter-prey optimizer algorithm was used to improve the support vector machine model. The experimental results show that the recognition accuracy of the proposed method for five types of bird sounds is 93.45%, which is better than that of state-of-the-art support vector machine models. The highest recognition accuracy is obtained by adjusting the adaptive factor. The proposed method improved the accuracy of bird sound recognition. This will be helpful for bird recognition in various applications.

    Citation: Xiao Chen, Zhaoyou Zeng. Bird sound recognition based on adaptive frequency cepstral coefficient and improved support vector machine using a hunter-prey optimizer[J]. Mathematical Biosciences and Engineering, 2023, 20(11): 19438-19453. doi: 10.3934/mbe.2023860

    Related Papers:

  • Bird sound recognition is crucial for bird protection. As bird populations have decreased at an alarming rate, monitoring and analyzing bird species helps us observe diversity and environmental adaptation. A machine learning model was used to classify bird sound signals. To improve the accuracy of bird sound recognition in low-cost hardware systems, a recognition method based on the adaptive frequency cepstrum coefficient and an improved support vector machine model using a hunter-prey optimizer was proposed. First, in sound-specific feature extraction, an adaptive factor is introduced into the extraction of the frequency cepstrum coefficients. The adaptive factor was used to adjust the continuity, smoothness and shape of the filters. The features in the full frequency band are extracted by complementing the two groups of filters. Then, the feature was used as the input for the following support vector machine classification model. A hunter-prey optimizer algorithm was used to improve the support vector machine model. The experimental results show that the recognition accuracy of the proposed method for five types of bird sounds is 93.45%, which is better than that of state-of-the-art support vector machine models. The highest recognition accuracy is obtained by adjusting the adaptive factor. The proposed method improved the accuracy of bird sound recognition. This will be helpful for bird recognition in various applications.



    加载中


    [1] L. Patrik, S. Panu, L. Petteri, L. Geres, T. Richter, S. Seibold, et al., Domain-specific neural networks improve automated bird sound recognition already with small amount of local data, Methods Ecol. Evol., 13 (2022), 2799–2810. https://doi.org/10.1111/2041-210X.14003 doi: 10.1111/2041-210X.14003
    [2] O. Küc̣üktopcu, E. Masazade, C. Ünsalan, P. K. Varshney, A real-time bird sound recognition system using a low-cost microcontroller, Appl. Acoust., 148 (2019), 194–201. https://doi.org/10.1016/j.apacoust.2018.12.028 doi: 10.1016/j.apacoust.2018.12.028
    [3] J. Xie, Y. Zhong, J. Zhang, S. Liu, C. Ding, A. Triantafyllopoulos, A review of automatic recognition technology for bird vocalizations in the deep learning era, Ecol. Inf., 73 (2023), 101927. https://doi.org/10.1016/j.ecoinf.2022.101927 doi: 10.1016/j.ecoinf.2022.101927
    [4] K. Liu, Y. Fu, L. Wu, X. Li, C. Aggarwal, H. Xiong, Automated feature selection: A reinforcement learning perspective, IEEE Trans. Knowl. Data Eng., 35 (2023), 2272–2284. http://dx.doi.org/10.1109/TKDE.2021.3115477 doi: 10.1109/TKDE.2021.3115477
    [5] Y. Dai, J. Yang, Y. Dong, H. Zou, M. Hu, B. Wang, Blind source separation-based IVA-Xception model for bird sound recognition in complex acoustic environments, Electron. Lett., 57 (2021), 454–456. http://dx.doi.org/10.1049/ell2.12160 doi: 10.1049/ell2.12160
    [6] Q. Tang, L. Xu, B. Zheng, C. He, Transound: Hyper-head attention transformer for birds sound recognition, Ecol. Inf., 75 (2023), 102001. https://doi.org/10.1016/j.ecoinf.2023.102001 doi: 10.1016/j.ecoinf.2023.102001
    [7] T. Jung, H. Jeon, C. Jeon, A. Cook, A. Weiss, M. Lee, et al., Deep learning-based bird sound recognition system with data pre-processing, in Korean Electronics Engineering Association Academic Conference, (2019), 756–759.
    [8] S. Xu, Y. Sun, L. Huang-Fu, W. Fang, Design of a comprehensive birdsong recognition classifier based on MFCC, time-frequency map and other features, Lab. Res. Explor., 37 (2018), 81–86.
    [9] A. E. Mehyadin, A. M. Abdulazeez, D. A. Hasan, J. N. Saeed, Birds sound classification based on machine learning algorithms, Asian J. Res. Comput. Sci., 9 (2021), 1–11. https://doi.org/10.9734/AJRCOS/2021/v9i430227 doi: 10.9734/AJRCOS/2021/v9i430227
    [10] X. Chen, Y. Gao, C. Wang, Fractional derivative method to reduce noise and improve SNR for Lamb wave signals, J. Vibroeng., 17 (2015), 4211–4218.
    [11] X. Chen, C. Wang, Tsallis distribution-based fractional derivative method for Lamb wave signal recovery, Res. Nondestr. Eval., 26 (2015), 174–188. https://doi.org/10.1080/09349847.2015.1023913 doi: 10.1080/09349847.2015.1023913
    [12] X. Chen, C. Wang, Noise removing for Lamb wave signals by fractional differential, J. Vibroeng., 16 (2014), 2676–2684.
    [13] X. Chen, C. Wang, Noise suppression for Lamb wave signals by Tsallis mode and fractional-order differential (in Chinese), Acta Phys. Sin., 63 (2014), 184301. http://dx.doi.org/10.7498/aps.63.184301 doi: 10.7498/aps.63.184301
    [14] X. Chen, J. Li, Noise reduction for ultrasonic Lamb wave signals by empirical mode decomposition and wavelet transform, J. Vibroeng., 15 (2013), 1157–1165.
    [15] X. Chen, D. Ma, Mode separation for multimodal ultrasonic Lamb waves using dispersion compensation and independent component analysis of forth-order cumulant, Appl. Sci., 9 (2019), 555. http://dx.doi.org/10.3390/app9030555 doi: 10.3390/app9030555
    [16] L. Ni, X. Chen, Mode separation for multimode Lamb waves based on dispersion compensation and fractional differential, Acta Phys. Sin., 67 (2018), 204301. http://dx.doi.org/10.7498/aps.67.20180561 doi: 10.7498/aps.67.20180561
    [17] X. Chen, Y. Gao, L. Bao, Lamb wave signal retrieval by wavelet ridge, J. Vibroeng., 16 (2014), 464–476.
    [18] K. Salaheddine, K. Fathallah, A. Issam, B. Mohamed, Performance evaluation and implementations of MFCC, SVM and MLP algorithms in the FPGA board, Int. J. Electr. Comput. Eng. Syst., 12 (2021), 139–153. http://dx.doi.org/10.32985/ijeces.12.3.3
    [19] G. Ruan, Y. Zhong, J. Jiang, Design of speech interaction system based on MFCC coefficient (in Chinese), Autom. Instrum., (2022), 167–171. https://doi.org/10.14016/j.cnki.1001-9227.2022.06.167
    [20] B. Liu, H. Bai, W. Chen, H. Chen, Z. Zhang, Automatic detection method of epileptic seizures based on IRCMDE and PSO-SVM, Math. Biosci. Eng., 20 (2023), 9349–9363. https://doi.org/10.3934/mbe.2023410 doi: 10.3934/mbe.2023410
    [21] X. Dai, K. Sheng, F. Shu, Ship power load forecasting based on PSO-SVM, Math. Biosci. Eng., 19 (2022), 4547–4567. https://doi.org/10.3934/mbe.2022210 doi: 10.3934/mbe.2022210
    [22] X. Chen, R. Jing, C. Sun, Attention mechanism feedback network for image super-resolution, J. Electron. Imaging, 31 (2022), 043006. https://doi.org/10.1117/1.JEI.31.4.043006 doi: 10.1117/1.JEI.31.4.043006
    [23] X. Chen, J. Zhu, Land scene classification for remote sensing images with an improved capsule network, J. Appl. Remote Sens., 16 (2022), 026510. http://dx.doi.org/10.1117/1.JRS.16.026510 doi: 10.1117/1.JRS.16.026510
    [24] X. Chen, C. Sun, Multiscale recursive feedback network for image super-resolution, IEEE Access, 10 (2022), 6393–6406. https://doi.org/10.1109/ACCESS.2022.3142510. doi: 10.1109/ACCESS.2022.3142510
    [25] X. Chen, S. Zou, Improved Wi-Fi indoor positioning based on particle swarm optimization, IEEE Sens. J., 17 (2017), 7143–7148. https://doi.org/10.1109/JSEN.2017.2749762 doi: 10.1109/JSEN.2017.2749762
    [26] R. Rajan, A. Noumida, Multi-label bird species classification using transfer learning, in International Conference on Communication, Control and Information Sciences, (2021), 1–5.
    [27] X. Chen, W. Zhan, Effect of transducer shadowing of ultrasonic anemometers on wind velocity measurement, IEEE Sens. J., 21 (2021), 4731–4738. https://doi.org/10.1109/JSEN.2020.3030634 doi: 10.1109/JSEN.2020.3030634
    [28] X. Chen, B. Zhang, 3D DV-hop localisation scheme based on particle swarm optimisation in wireless sensor networks, Int. J. Sens. Netw., 16 (2014), 100–105. https://doi.org/10.1504/IJSNET.2014.065869 doi: 10.1504/IJSNET.2014.065869
    [29] X. Chen, B. Zhang, Improved DV-Hop node localization algorithm in wireless sensor networks, Int. J. Distrib. Sens. Netw., 2012 (2012), 213980. https://doi.org/10.1155/2012/213980 doi: 10.1155/2012/213980
    [30] X. Chen, C. Hu, Adaptive medical image encryption algorithm based on multiple chaotic mapping, Saudi J. Biol. Sci., 24 (2017), 1821–1827. https://doi.org/10.1016/j.sjbs.2017.11.023 doi: 10.1016/j.sjbs.2017.11.023
  • Reader Comments
  • © 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(1031) PDF downloads(60) Cited by(1)

Article outline

Figures and Tables

Figures(11)  /  Tables(3)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog