Search results - "Sarkar, Sayan Deb"

Report

HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language

Authors: Parida, Shantipriya; Abdulmumin, Idris; Muhammad, Shamsuddeen Hassan; Bose, Aneesh; Kohli, Guneet Singh; Ahmad, Ibrahim Said; Kotwal, Ketan; Sarkar, Sayan Deb; Bojar, Ondřej; Kakudi, Habeebah Adamu

This paper presents HaVQA, the first multimodal dataset for visual question-answering (VQA) tasks in the Hausa language. The dataset was created by manually translating 6,022 English question-answer pairs, which are associated with 1,555 unique image

External link: http://arxiv.org/abs/2305.17690


Report

SGAligner: 3D Scene Alignment with Scene Graphs

Authors: Sarkar, Sayan Deb; Miksik, Ondrej; Pollefeys, Marc; Barath, Daniel; Armeni, Iro

Building 3D scene graphs has recently emerged as a topic in scene representation for several embodied AI applications to represent the world in a structured and rich manner. With their increased use in solving downstream tasks (e.g., navigation and roo

External link: http://arxiv.org/abs/2304.14880


Report

HO-3D_v3: Improving the Accuracy of Hand-Object Annotations of the HO-3D Dataset

Authors: Hampali, Shreyas; Sarkar, Sayan Deb; Lepetit, Vincent

HO-3D is a dataset providing image sequences of various hand-object interaction scenarios annotated with the 3D pose of the hand and the object and was originally introduced as HO-3D_v2. The annotations were obtained automatically using an optimizati

External link: http://arxiv.org/abs/2107.00887


Report

Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation

Authors: Hampali, Shreyas; Sarkar, Sayan Deb; Rad, Mahdi; Lepetit, Vincent

We propose a robust and accurate method for estimating the 3D poses of two hands in close interaction from a single color image. This is a very challenging problem, as large occlusions and many confusions between the joints may happen. State-of-the-a

External link: http://arxiv.org/abs/2104.14639


Report

Monte Carlo Scene Search for 3D Scene Understanding

Authors: Hampali, Shreyas; Stekovic, Sinisa; Sarkar, Sayan Deb; Kumar, Chetan Srinivasa; Fraundorfer, Friedrich; Lepetit, Vincent

We explore how a general AI algorithm can be used for 3D scene understanding to reduce the need for training data. More exactly, we propose a modification of the Monte Carlo Tree Search (MCTS) algorithm to retrieve objects and room layouts from noisy

External link: http://arxiv.org/abs/2103.07969


Report

General 3D Room Layout from a Single View by Render-and-Compare

Authors: Stekovic, Sinisa; Hampali, Shreyas; Rad, Mahdi; Sarkar, Sayan Deb; Fraundorfer, Friedrich; Lepetit, Vincent

We present a novel method to reconstruct the 3D layout of a room (walls, floors, ceilings) from a single perspective view in challenging conditions, by contrast with previous single-view methods restricted to cuboid-shaped layouts. This input view ca

External link: http://arxiv.org/abs/2001.02149


Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation

Authors: Hampali, Shreyas; Sarkar, Sayan Deb; Rad, Mahdi; Lepetit, Vincent

Published in: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d352ee0ac5f5f5ccc76e0a7b89d45836
https://doi.org/10.1109/cvpr52688.2022.01081

