Zobrazeno 1 - 10
of 17
pro vyhledávání: '"Badri N. Patro"'
Publikováno v:
Patro, B, Lunayach, M & Namboodiri, V 2021, ' Uncertainty Class Activation Map (U-CAM) using Gradient Certainty method ', IEEE Transactions on Image Processing, vol. 30, pp. 1910-1924 . https://doi.org/10.1109/TIP.2020.3046916
Understanding and explaining deep learning models is an imperative task. Towards this, we propose a method that obtains gradient-based certainty estimates that also provide visual attention maps. Particularly, we solve for visual question answering t
Publikováno v:
Neurocomputing. 420:149-161
In this paper, we propose a method for obtaining sentence-level embeddings. While the problem of securing word-level embeddings is very well studied, we propose a novel method for obtaining sentence-level embeddings. This is obtained by a simple meth
Publikováno v:
Kumar, S, Patro, B N & Namboodiri, V P 2022, Auto QA : The Question Is Not only What, but Also Where . in 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW) . vol. 2022, 9707532, Proceedings-2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2022, IEEE, U. S. A., pp. 272-281, 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2022, Waikoloa, USA United States, 4/01/22 . https://doi.org/10.1109/WACVW54805.2022.00033
Visual Question Answering can be a functionally relevant task if purposed as such. In this paper, we aim to investigate and evaluate its efficacy in terms of localization-based question answering. We do this specifically in the context of autonomous
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1367ac6fa562e833b2ee0392986304f8
https://purehost.bath.ac.uk/ws/files/239903566/AUTO_QA.pdf
https://purehost.bath.ac.uk/ws/files/239903566/AUTO_QA.pdf
Publikováno v:
Pattern Recognition. 132:108898
Autor:
Vinay P. Namboodiri, K. S. Venkatesh, Vinod K Kurmi, Preethi Jyothi, Badri N. Patro, Vipul Bajaj
Publikováno v:
ICASSP
There have been a number of techniques that have demonstrated the generation of multimedia data for one modality at a time using GANs, such as the ability to generate images, videos, and audio. However, so far, the task of multi-modal generation of d
Autor:
Vinay P. Namboodiri, Mayank Lunayach, Hunar Singh, Sarvesh, Badri N. Patro, Deepankar Srivastava
Publikováno v:
WACV
A great number of situational comedies (sitcoms) are being regularly made and the task of adding laughter tracks to these is a critical task. Providing an ability to be able to predict whether something will be humorous to the audience is also crucia
Publikováno v:
WACV
One of the major limitations of deep learning models is that they face catastrophic forgetting in an incremental learning scenario. There have been several approaches proposed to tackle the problem of incremental learning. Most of these methods are b
Publikováno v:
WACV
In recent years, the attention mechanism has become a fairly popular concept and has proven to be successful in many machine learning applications. However, deep learning models do not employ supervision for these attention mechanisms which can impro
Autor:
Vinay P. Namboodiri, Riddhiman Dasgupta, Yokesh Kumar, Francis Tom, Sudhir Kumar, Badri N. Patro, Mithun Das Gupta
Publikováno v:
ACM Multimedia
We present the problem of Visually Precise Query (VPQ) generation which enables a more intuitive match between a user's information need and an e-commerce site's product description. Given an image of a fashion item, what is the most optimum search q
Publikováno v:
WACV
Patro, B N, Kurmi, V K, Kumar, S & Namboodiri, V P 2020, Deep bayesian network for visual question generation . in Proceedings-2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020 ., 9093293, Proceedings-2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020, IEEE, pp. 1555-1565, 2020 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2020, Snowmass Village, USA United States, 1/03/20 . https://doi.org/10.1109/WACV45572.2020.9093293
Patro, B N, Kurmi, V K, Kumar, S & Namboodiri, V P 2020, Deep bayesian network for visual question generation . in Proceedings-2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020 ., 9093293, Proceedings-2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020, IEEE, pp. 1555-1565, 2020 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2020, Snowmass Village, USA United States, 1/03/20 . https://doi.org/10.1109/WACV45572.2020.9093293
Generating natural questions from an image is a semantic task that requires using vision and language modalities to learn multimodal representations. Images can have multiple visual and language cues such as places, captions, and tags. In this paper,