Home /Research /Automatic Image and Video Captioning Production using Deep Learning
LEARNING

Automatic Image and Video Captioning Production using Deep Learning

Swapna Yenugula, M.Sai Sirisha, K. Sindhu Priya, Y.Sujeeth Reddy, M.Nikhil Raghava Rao

Year
2022
Citations
2

Abstract

Deep Learning-based methodologies have a lot of promise for applications that try to automatically produce captions or explanations for photos and video frames. In the field of imaging science, Captioning images and videos is regarded to be a difficult intellectual task. People's images and videos automatically generate captions (or descriptions). with disabilities is one of the application domains, from a range of visual impairments; automatic metadata generation for photos; picture and video search engine indexing; a variety of general-purpose robot vision systems; and so forth. Each of these application domains has its own set of requirements. has a large and beneficial impact on a variety of other task-specific applications This isn't designed to be a complete look of picture captioning; rather, it's a quick rundown of deep learning-based image and video captioning techniques. The algorithmic overlap between picture and video captioning, as well as audio, is the topic of this study.

Keywords

Closed captioningComputer scienceTask (project management)Artificial intelligenceMetadataField (mathematics)Search engine indexingSet (abstract data type)Variety (cybernetics)Deep learning

Related papers

Browse all LEARNING papers