Main Article Content
Image understanding and language generation have always been a difficult task in the field of Artificial Intelligence. Automatic Image Caption Generation is concerned with the task of understanding the image and generating a caption for it. In this paper, we represented our research work that uses the Deep Learning technique to create Punjabi captions for a given image and its associated news document. High-level features of the images are extracted using the pre-trained VGG-19 (Visual Geometry Group) model. These image features are merged with features of news text which are extracted using LSTM (Long Short Term Memory). The proposed model augments keywords from associated news text to generate suitable captions. Using both BLEU scores and human evaluations, we show that the proposed method is successful in generating intelligible and suitable captions.
Upon receipt of accepted manuscripts, authors will be invited to complete a copyright license to publish the paper. At least the corresponding author must send the copyright form signed for publication. It is a condition of publication that authors grant an exclusive licence to the the INFOCOMP Journal of Computer Science. This ensures that requests from third parties to reproduce articles are handled efficiently and consistently and will also allow the article to be as widely disseminated as possible. In assigning the copyright license, authors may use their own material in other publications and ensure that the INFOCOMP Journal of Computer Science is acknowledged as the original publication place.