Automatic Image and Video Captioning Production using Deep Learning
Swapna Yenugula, M. Sai Sirisha, K. Sindhu Priya, Y. Sujeeth Reddy, M. Nikhil Raghava Rao
- Year: 2022
- Citations: 2
Abstract
Deep learning-based methods hold considerable promise for applications that automatically produce captions or descriptions for photos and video frames. In imaging science, captioning images and videos is regarded as a difficult intellectual task. Automatically generating captions (or descriptions) for images and videos has many application domains: assistance for people with a range of visual impairments; automatic metadata generation for photos; picture and video search engine indexing; a variety of general-purpose robot vision systems; and so forth. Each of these application domains has its own set of requirements. Captioning also has a large and beneficial impact on a variety of other task-specific applications. This paper is not intended as a complete survey of image captioning; rather, it is a brief overview of deep learning-based image and video captioning techniques. The study focuses on the algorithmic overlap between image and video captioning, as well as audio.