References Abdel-Hamid, O., Mohamed, A., Jiang, H., Deng, L., Penn, G., & Yu, D. (2014). Convolutional neural networks for speech recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22(10), 1533–1545. Abdel-Hamid, O., Mohamed, A., Jiang, H., & Penn, G. (2012). APPLYING CONVOLUTIONAL NEURAL NETWORKS CONCEPTS TO HYBRID NN-HMM MODEL FOR SPEECH RECOGNITION. 4. Abdel-Hamid, O., Mohamed, A., Jiang, H., & Penn, G. (2012). Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4277–4280. Abdulazeez, A. M., Sulaiman, M. A., & Qader, D. (2020). Evaluating Data Mining Classification Methods Performance in Internet of Things Applications. 1(2), 15. Abdulazeez, A., Salim, B., Zeebaree, D., & Doghramachi, D. (2020). Comparison of VPN Protocols at Network Layer Focusing on Wire Guard Protocol. Abdulazeeza, A. M., Nahmatwllab, L. L., & Qader, D. (2020). Pipelined Parallel Processing Implementation based on Distributed Memory Systems. International Journal of Innovation, 13(7), 12. Abdulqader, D. M., Abdulazeez, A. M., & Zeebaree, D. Q. (2020). Machine Learning Supervised Algorithms of Gene Selection: A Review. 62(03), 13. Adebowale, M. A., Lwin, K. T., & Hossain, M. A. (2020). Intelligent phishing detection scheme using deep learning algorithms. Journal of Enterprise Information Management, ahead-of-print(ahead-of-print). Adeen, I. M. N., Abdulazeez, A. M., & Zeebaree, D. Q. (2020). Systematic Review of Unsupervised Genomic Clustering Algorithms Techniques for High Dimensional Datasets. Ahmed, J. A., & Brifcani, A. M. A. (2015). A new internal architecture based on feature selection for holonic manufacturing system. International Journal of Mechanical, Aerospace, Industrial, Mechatronic and Manufacturing Engineering, 2(8), 1431. Ahmed, O., & Brifcani, A. (2019). Gene Expression Classification Based on Deep Learning. 2019 4th Scientific International Conference Najaf (SICN), 145–149. Ahmed, O. M., & Abduallah, W. M. (2017). A Review on Recent Steganography Techniques in Cloud Computing. Academic Journal of Nawroz University, 6(3), 106–111. Anasuya, M. A., & Katti, S. K. (2009). Speech recognition by machine: A review. International Journal of Computer Science and Information Security, 6(3), 181–205. Anuradha, B., & Reddy, V. V. (2008). ANN for classification of cardiac arrhythmias. ARPN Journal of Engineering and Applied Sciences, 3(3), 1–6. Anvarjon, T., Mustaqeem, & Kwon, S. (2020). Deep-Net: A Lightweight CNN-Based Speech Emotion Recognition System Using Deep Frequency Features. Sensors, 20(18), 5212. Bargarai, F., Abdulazeez, A., Tiryaki, V., & Zeebaree, D. (2020). Management of Wireless Communication Systems Using Artificial Intelligence-Based Software Defined Radio. Bingol, M. C., & Aydogmus, O. (2020). Performing predefined tasks using the human–robot interaction on speech recognition for an industrial robot. Engineering Applications of Artificial Intelligence, 95, 103903. Bird, J. J., Wanner, E., Ekárt, A., & Faria, D. R. (2020). Optimisation of phonetic aware speech recognition through multi-objective evolutionary algorithms. Expert Systems with Applications, 153, 113402. Bocklet, T., Maier, A., Bauer, J. G., Burkhardt, F., & Noth, E. (2008). Age and gender recognition for telephone applications based on gmm supervectors and support vector machines. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 1605–1608. Chaudhary, D., & Vasuja, E. R. (2019). A Review on Various Algorithms used in Machine Learning. Deepak, S., & Ameer, P. M. (2019). Brain tumor classification using deep CNN features via transfer learning. Computers in Biology and Medicine, 111, 103345. Fantaye, T. G., Yu, J., & Hailu, T. T. (2020). Advanced Convolutional Neural Network-Based Hybrid Acoustic Models for Low-Resource Speech Recognition. Computers, 9(2), 36. Furui, S. (1991). Speaker-dependent-feature extraction, recognition and processing techniques. Speech Communication, 10(5–6), 505–520. G., S., R., V., & K.P., S. (2018). Diabetes detection using deep learning algorithms. ICT Express, 4(4), 243–246. Gaikwad, S. K., Gawali, B. W., & Yannawar, P. (2010). A review on speech recognition technique. International Journal of Computer Applications, 10(3), 16–24. Ghosal, D., & Kolekar, M. H. (2018). Music Genre Recognition Using Deep Neural Networks and Transfer Learning. Interspeech 2018, 2087–2091. Graves, A., Mohamed, A., & Hinton, G. (2013). Speech recognition with deep recurrent neural networks. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 6645–6649. Hasan, D. A., & Abdulazeez, A. M. (2020). A Modified Convolutional Neural Networks Model for Medical Image Segmentation. Learning, 20, 22. Huang, K.-Y., Wu, C.-H., Hong, Q.-B., Su, M.-H., & Chen, Y.-H. (2019). Speech Emotion Recognition Using Deep Neural Network Considering Verbal and Nonverbal Speech Sounds. ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 5866–5870. Jahwar, A. F., & Abdulazeez, A. M. (2020). META-HEURISTIC ALGORITHMS FOR K-MEANS CLUSTERING: A REVIEW. PalArch’s Journal of Archaeology of Egypt/Egyptology, 17(7), 12002–12020. Jansson, P. (2018). Single-word speech recognition with Convolutional Neural Networks on raw waveforms. 31. Jing, W., Jiang, T., Zhang, X., & Zhu, L. (2012). The optimisation of speech recognition based on convolutional neural network. 10. Kavi B. Obaid, Zeebaree, S. R. M., & Ahmed, O. M. (2020). Deep Learning Models Based on Image Classification: A Review. Klevans, R. L., & Rodman, R. D. (1997). Voice recognition. Artech House, Inc. Korvel, G., Treigys, P., Tamulevicus, G., Bernataviciene, J., & Kostek, B. (2018). Analysis of 2D Feature Spaces for Deep Learning-Based Speech Recognition. Journal of the Audio Engineering Society, 66(12), 1072–1081. https://doi.org/10.17743/jaes.2018.0066 Laine, U. (2017). Analytic Filter Bank for Speech Analysis, Feature Extraction and Perceptual Studies (p. 453). https://doi.org/10.21437/Interspeech.2017-1232 Lalitha, S., Tripathi, S., & Gupta, D. (2019). Enhanced speech emotion detection using deep neural networks. International Journal of Speech Technology, 22(3), 497–510. LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436–444. LeCun, Y., Bengio, Y., & Laboratories, T. B. (1995). Convolutional Networks for Images, Speech, and Time-Series. 15. LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11), 2278–2324. LeCun, Y., Huang, F. J., & Bottou, L. (2004a). Learning methods for generic object recognition with invariance to pose and lighting. Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., 2, II–104. LeCun, Y., Huang, F. J., & Bottou, L. (2004b). Learning methods for generic object recognition with invariance to pose and lighting. Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., 2, II–104. Liu, C.-R., Qu, D., & Yang, X.-K. (2019). Long Short Term Memory Networks Weighted Prediction Error for Far- Field Speech Recognition. 4. Maulud, D., & Abdulazeez, A. M. (2020). A Review on Linear Regression Comprehensive in Machine Learning. Journal of Applied Science and Technology Trends, 1(4), 140–147. Meng, H., Yan, T., Yuan, F., & Wei, H. (2019). Speech Emotion Recognition From 3D Log-Mel Spectrograms With Deep Learning Network. IEEE Access, 7, 125868–125881. Métais, E., Meziane, F., Vadera, S., Sugumaran, V., & Saraee, M. (Eds.). (2019). Natural Language Processing and Information Systems: 24th International Conference on Applications of Natural Language to Information Systems, NLDB 2019, Salford, UK, June 26–28, 2019, Proceedings (Vol. 11608). Springer International Publishing. Morgan, N. (2011). Deep and wide: Multiple layers in automatic speech recognition. Ieee Transactions on Audio, Speech, and Language Processing, 20(1), 7–13. Muhammad, M. A., Zeebaree, D. Q., Abdulazeez, A. M., Saeed, J. N., & Zebari, A. (2020). A Review on Region of Interest Segmentation Based on Clustering Techniques for Breast Cancer Ultrasound Images. 01(03), 14. Nagajyothi, D., & Siddaiah, P. (2018). Speech Recognition Using Convolutional Neural Networks. International Journal of Engineering & Technology, 7(4.6), 133. Nassif, A. B., Shahin, I., Attili, I., Azzeh, M., & Shaalan, K. (2019). Speech Recognition Using Deep Neural Networks: A Systematic Review. IEEE Access, 7, 19143–19165. Nweke, H. F., Teh, Y. W., Al-garadi, M. A., & Alo, U. R. (2018). Deep learning algorithms for human activity recognition using mobile and wearable sensor networks: State of the art and research challenges. Expert Systems with Applications, 105, 233–261. Omar, N., Abdulazeez, A. M., Sengur, A., & Al-Ali, S. G. S. (2020). Fused faster RCNNs for efficient detection of the license plates. Indonesian Journal of Electrical Engineering and Computer Science, 19(2), 974–982. Ordóñez, F. J., & Roggen, D. (2016). Deep convolutional and lstm recurrent neural networks for multimodal wearable activity recognition. Sensors, 16(1), 115. Palaz, D., & Collobert, R. (2015). Analysis of cnn-based speech recognition system using raw speech as input. Idiap. Palaz, D., Magimai.-Doss, M., & Collobert, R. (2015). Convolutional Neural Networks-based continuous speech recognition using raw speech signal. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4295–4299. Passricha, V., & Aggarwal, R. K. (2019). A Hybrid of Deep CNN and Bidirectional LSTM for Automatic Speech Recognition. Journal of Intelligent Systems, 29(1), 1261–1274. Qian, Y., Bi, M., Tan, T., & Yu, K. (2016). Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(12), 2263–2276. Reynolds, D. A. (2002). An overview of automatic speaker recognition technology. 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, 4, IV-4072-IV–4075. Sainath, T. N., Kingsbury, B., Saon, G., Soltau, H., Mohamed, A., Dahl, G., & Ramabhadran, B. (2015). Deep convolutional neural networks for large-scale speech tasks. Neural Networks, 64, 39–48. Sajjad, M., Khan, S., Muhammad, K., Wu, W., Ullah, A., & Baik, S. W. (2019). Multi-grade brain tumor classification using deep CNN with extensive data augmentation. Journal of Computational Science, 30, 174–182. Sethi, J., & Mittal, M. (2019). Ambient air quality estimation using supervised learning techniques. EAI Endorsed Transactions on Scalable Information Systems, 6(22). Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. ArXiv Preprint ArXiv:1409.1556. Solanki, A., & Pandey, S. (2019). Music instrument recognition using deep convolutional neural networks. International Journal of Information Technology. Song, Z. (2020). English speech recognition based on deep learning with multiple features. Computing, 102(3), 663–682. Sornam, M., Muthusubash, K., & Vanitha, V. (2017). A Survey on Image Classification and Activity Recognition using Deep Convolutional Neural Network Architecture. 2017 Ninth International Conference on Advanced Computing (ICoAC), 121–126. Swietojanski, P., Ghoshal, A., & Renals, S. (2014). Convolutional Neural Networks for Distant Speech Recognition. IEEE Signal Processing Letters, 21(9), 1120–1124. Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., & Rabinovich, A. (2015). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1–9. V.Chapaneri, S. (2012). Spoken Digits Recognition using Weighted MFCC and Improved Features for Dynamic Time Warping. International Journal of Computer Applications, 40(3), 6–12. Vogt, T., & André, E. (2006). Improving Automatic Emotion Recognition from Speech via Gender Differentiaion. LREC, 1123–1126. Zebari, R., Abdulazeez, A., Zeebaree, D., Zebari, D., & Saeed, J. (2020). A Comprehensive Review of Dimensionality Reduction Techniques for Feature Selection and Feature Extraction. Journal of Applied Science and Technology Trends, 1(2), 56–70. Zeebaree, D. Q., Haron, H., & Abdulazeez, A. M. (2018). Gene Selection and Classification of Microarray Data Using Convolutional Neural Network. 2018 International Conference on Advanced Science and Engineering (ICOASE), 145–150. Zeebaree, D. Q., Haron, H., Abdulazeez, A. M., & Zebari, D. A. (2019a). Machine learning and Region Growing for Breast Cancer Segmentation. 2019 International Conference on Advanced Science and Engineering (ICOASE), 88–93. Zeebaree, D. Q., Haron, H., Abdulazeez, A. M., & Zebari, D. A. (2019b). Trainable Model Based on New Uniform LBP Feature to Identify the Risk of the Breast Cancer. 2019 International Conference on Advanced Science and Engineering (ICOASE), 106–111. Zeebaree, D. Q., Haron, H., Abdulazeez, A. M., & Zeebaree, S. R. (2017). Combination of K-means clustering with Genetic Algorithm: A review. International Journal of Applied Engineering Research, 12(24), 14238–14245. Zeebaree, S. R., Haji, L. M., Rashid, I., Zebari, R. R., Ahmed, O. M., Jacksi, K., & Shukur, H. M. (2020). Multicomputer Multicore System Influence on Maximum Multi-Processes Execution Time. Zhang, K., Zuo, W., Chen, Y., Meng, D., & Zhang, L. (2017). Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising. IEEE Transactions on Image Processing, 26(7), 3142–3155. Zhang, M., Zeng, Y., Han, Z., & Gong, Y. (2018). Automatic Modulation Recognition Using Deep Learning Architectures. 2018 IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), 1–5. Zhang, S., Zhang, S., Huang, T., & Gao, W. (2018). Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching. IEEE Transactions on Multimedia, 20(6), 1576–1590. Zhao, J., Mao, X., & Chen, L. (2019). Speech emotion recognition using deep 1D & 2D CNN LSTM networks. Biomedical Signal Processing and Control, 47, 312–323. Zoughi, T., Homayounpour, M. M., & Deypir, M. (2020). Adaptive windows multiple deep residual networks for speech recognition. Expert Systems with Applications, 139, 112840.