1 ACE Engineering College, Hyderabad, India.
2 CSE-AI and ML, ACE Engineering College, Hyderabad, India.
World Journal of Advanced Research and Reviews, 2025, 26(02), 3134-3143
Article DOI: 10.30574/wjarr.2025.26.2.1705
Received on 07 April 2025; revised on 19 May 2025; accepted on 21 May 2025
Image captioning is a task that Involves Natural Language Processing concepts to recognize the context of an image and describe them in a natural language like English. It requires good knowledge of Deep learning. Python, working on Jupyter notebooks, Keras library, Numpy, and Natural language processing It is a Python based project where we will use deep learning techniques of Convolutional Neural Networks and a type of Recurrent Neural Network (LSTM) together. The biggest challenge is most definitely being able to create a description that must capture not only the objects contained in an image, but also express how these objects relate to each other. Automatically describing the content of an image is a fundamental problem in artificial intelligence that connects computer vision and natural language processing here, we present a generative model based on a deep recurrent architecture that combines recent advances in computer vision and machine translation and that can be used to generate natural sentences describing an image. It could have great impact, for instance by helping visually impaired people better understand the content of images on the web.
CNN; LSTM; Image detection; Deep learning; Natural Language Processing
Preview Article PDF
Kavitha Soppari, Pakide Kavya, Kotla Pranay Teja and Bethi Pavan Sai. A survey on image captioning methods. World Journal of Advanced Research and Reviews, 2025, 26(2), 3134-3143. Article DOI: https://doi.org/10.30574/wjarr.2025.26.2.1705