Automatic Punjabi Caption Generation For Sports Images

Manleen Kaur; Gurpreet Josan; Jagroop Kaur

pdf

Published: Jun 4, 2021

Manleen Kaur

Gurpreet Josan

Punjabi University Patiaa

Jagroop Kaur

Punjabi University Patiaa

Abstract

Image understanding and language generation have always been a difficult task in the field of Artificial Intelligence. Automatic Image Caption Generation is concerned with the task of understanding the image and generating a caption for it. In this paper, we represented our research work that uses the Deep Learning technique to create Punjabi captions for a given image and its associated news document. High-level features of the images are extracted using the pre-trained VGG-19 (Visual Geometry Group) model. These image features are merged with features of news text which are extracted using LSTM (Long Short Term Memory). The proposed model augments keywords from associated news text to generate suitable captions. Using both BLEU scores and human evaluations, we show that the proposed method is successful in generating intelligible and suitable captions.

How to Cite

Kaur, M., Josan, G., & Kaur, J. (2021). Automatic Punjabi Caption Generation For Sports Images. INFOCOMP Journal of Computer Science, 20(1). Retrieved from https://infocomp.dcc.ufla.br/index.php/infocomp/article/view/1180

Issue

Vol. 20 No. 1 (2021): June 2021

Section

Computer Graphics, Image Processing, Visualization and Virtual Reality

Upon receipt of accepted manuscripts, authors will be invited to complete a copyright license to publish the paper. At least the corresponding author must send the copyright form signed for publication. It is a condition of publication that authors grant an exclusive licence to the the INFOCOMP Journal of Computer Science. This ensures that requests from third parties to reproduce articles are handled efficiently and consistently and will also allow the article to be as widely disseminated as possible. In assigning the copyright license, authors may use their own material in other publications and ensure that the INFOCOMP Journal of Computer Science is acknowledged as the original publication place.

Author Biographies

Manleen Kaur

Manleen Kaur is a research scholar at the Department of Computer Science at Punjabi University Patiala. She completed her Masters in Computer Science and Engineering in 2018. Her area of interest is Natural Language Processing, Image Processing, and Machine Learning.

Gurpreet Josan, Punjabi University Patiaa

Dr. Gurpreet Singh Josan is working as an Associate professor in the Department of Computer Science at Punjabi University Patiala. He obtained his master's degree in Computer Science and Engineering from Punjabi University Patiala in 2001 and Ph.D. in Computer Science and Engineering in 2009. He has more than 20 years of teaching experience. His area of interest is Natural Language Processing, Machine Learning, and Social Media Text Mining.

Jagroop Kaur, Punjabi University Patiaa

Dr. Jagroop Kaur is working as an Assistant professor in the department of computer engineering at Punjabi University Patiala. She obtained her master's degree in Computer Science and Engineering from Punjabi University Patiala in 2007. She has more than 18 years of teaching experience. Her area of interest is Natural Language Processing, Machine Learning, and Social Media Text Mining.

Article Sidebar

Main Article Content

Abstract

Article Details

Manleen Kaur

Gurpreet Josan, Punjabi University Patiaa

Jagroop Kaur, Punjabi University Patiaa