Gradient Boost algorithms for Modelling Malayalam Poem Syllable Duration Gradient Boost algorithms for Modelling Malayalam Poem Syllable Duration

Main Article Content

Mr. Jasir M.P
Dr. B Kannan
Ms. Jaseena K.U

Abstract

Emulating natural speech has been a top priority ever since the research activities began in the
area of Natural Language Processing (NLP). Text To Speech Synthesis (TTS) consists of several stages,
which include Text Normalization, Syllabification and Unit Selection, Duration Analysis Modelling,
and Prosody Analysis Modelling. Proper syllabification was required earlier when rule-based concatenative synthesis was used as the main method to synthesize speech. Now statistical parametric speech
synthesis is the state of the art. Supervised and unsupervised machine learning frameworks can be used
to model different aspects of speech such as duration, prosody etc. The proposed work uses classical
poem construct Vruta (meter) to identify the features determine syllable duration. Nineteen features are
extracted from the orthographic representation of poem according to the Vruta definition. Kakali, Keka,
and Manjari are the Vrutas considered. Also the contextual features of the syllables and the accoustic properties like the origin of the syllable are considered to build the feature set. The proposed work
employs Gradient Boost Algorithms for modelling the duration of Malayalam poem syllables. All the
models give superior values for the coefficient of determination (R2) compared to other major models.
Simple Gradient Boost Machine (GBM) is able to produce 90.723 for R2. Similarly, XGBoost gives
90.726, LightBoost yields 90.693 and CatBoost delivers 90.819. Also, the models exhibit lesser values
for different Statistical Error Indicators (SEI) - MAE, RMSE, and MAPE

Article Details

How to Cite
M.P, J., Mullayil, K. B., & Jaseena K.U. (2022). Gradient Boost algorithms for Modelling Malayalam Poem Syllable Duration: Gradient Boost algorithms for Modelling Malayalam Poem Syllable Duration. INFOCOMP Journal of Computer Science, 21(1). Retrieved from https://infocomp.dcc.ufla.br/index.php/infocomp/article/view/1867
Section
Machine Learning and Computational Intelligence
Author Biographies

Mr. Jasir M.P, Department of Computer Applications, Cochin University of Science and Technology, Kochi, 682022, Department of Computer Applications, MES College, Marampally, Aluva, 683105

Jasir M P is a research fellow at the Department of Computer Application, Cochin University of Science and Technology. He presently works as an Assistant Professor in the department of Computer Application, MES College Marampally, Aluva. His areas of interest are Natural Language Processing, Linguistic characteristics of Malayalam in the context of Text to Speech Synthesis and Speech Recognition.

               

Dr. B Kannan, Department of Computer Applications, Cochin University of Science and Technology

Kannan Balakrishnan received his M.Sc and M. Phil degrees in Mathematics from University of Kerala, India, M. Tech degree in Computer and Information Science from Cochin University of Science & Technology (CUSAT), Cochin, India and Ph. D in Futures Studies from University of Kerala, India in   1982, 1983, 1988 and 2006 respectively. He is currently working with CUSAT, Cochin, India, as Professor Emeritus, in the Department of Computer Applications. He has visited Netherlands as part of a MHRD project on Computer Networks. Also he visited Slovenia as the co-investigator of Indo-Slovenian joint research project by Department of Science and Technology, Government of India. He has published several papers in International Journals and National and International conference proceedings.  His present areas of interest are Graph Algorithms, Intelligent Systems, Image Processing, CBIR and Machine Translation. He is a reviewer of American Mathematical Reviews. He is a recognized Research Guide in the Faculties of Technology and Science in CUSAT, Cochin, India. He has served in many academic bodies of various universities in Kerala, India. Also currently he is a member of the Board of Studies of Cochin, Calicut and Kannur Universities in India. He is also a member of MIR labs India.

Ms. Jaseena K.U, Department of Computer Applications, MES College Marampally, Aluva, 683105

Jaseena K.U. received her M.Tech. Degree in Information Systems Security from Indira Gandhi National Open University, New Delhi, India. She is working as Assistant Professor in the Department of Computer Applications of MES College, Marampally, Aluva, Cochin, India. She has published several papers in International Journals and also in international Conference Proceedings. Her research interests include Data Mining, Big Data Analytics, Machine Learning and Information Security. She is member of Computer Society of India and IEEE.