Arnav Gupta

Work place: Department of Information Technology, K J Somaiya College of Engineering, Mumbai, Maharashtra, India

E-mail: arnav.gupta@somaiya.edu

Website:

Research Interests:

Biography

Author Articles
A Novel Approach for Video Inpainting Using Autoencoders

By Irfan Siddavatam Ashwini Dalvi Dipti Pawade Akshay Bhatt Jyeshtha Vartak Arnav Gupta

DOI: https://doi.org/10.5815/ijieeb.2021.06.05, Pub. Date: 8 Dec. 2021

Inpainting is a task undertaken to fill in damaged or missing parts of an image or video frame, with believable content. The aim of this operation is to realistically complete images or frames of videos for a variety of applications such as conservation and restoration of art, editing images and videos for aesthetic purposes, but might cause malpractices such as evidence tampering. From the image and video editing perspective, inpainting is used mainly in the context of generating content to fill the gaps left after removing a particular object from the image or the video. Video Inpainting, an extension of Image Inpainting, is a much more challenging task due to the constraint added by the time dimension. Several techniques do exist that achieve the task of removing an object from a given video, but they are still in a nascent stage. The major objective of this paper is to study the available approaches of inpainting and propose a solution to the limitations of existing inpainting techniques. After studying existing inpainting techniques, we realized that most of them make use of a ground truth frame to generate plausible results. A 'ground truth' frame is an image without the target object or in other words, an image that provides maximum information about the background, which is then used to fill spaces after object removal. In this paper, we propose an approach where there is no requirement of a 'ground truth' frame, provided that the video has enough contexts available about the background that is to be recreated. We would be using frames from the video in hand, to gather context for the background. As the position of the target object to be removed will vary from one frame to the next, each subsequent frame will reveal the region that was initially behind the object, and provide more information about the background as a whole. Later, we have also discussed the potential limitations of our approach and some workarounds for the same, while showing the direction for further research.

[...] Read more.
Other Articles