Showing 1 - 10 of 30,869 results for search: '"Kartik, A."'
Face parsing refers to the semantic segmentation of human faces into key facial regions such as eyes, nose, hair, etc. It serves as a prerequisite for various advanced applications, including face editing, face swapping, and facial makeup, which ofte
External link:
http://arxiv.org/abs/2412.08647
Author:
Narayan, Kartik, Nair, Nithin Gopalakrishnan, Xu, Jennifer, Chellappa, Rama, Patel, Vishal M.
Pre-training on large-scale datasets and utilizing margin-based loss functions have been highly successful in training models for high-resolution face recognition. However, these models struggle with low-resolution face datasets, in which the faces l
External link:
http://arxiv.org/abs/2412.07771
Author:
Singhal, Kartik, Shroff, Gautam
The Abstraction and Reasoning Corpus (ARC) poses a significant challenge to artificial intelligence, demanding broad generalization and few-shot learning capabilities that remain elusive for current deep learning methods, including large language mod
External link:
http://arxiv.org/abs/2412.07322
Author:
Patel, Kartik, Zhang, Junbo, Kimionis, John, Kampianakis, Lefteris, Eggleston, Michael S., Du, Jinfeng
Backscatter radio is a promising technology for low-cost and low-power Internet-of-Things (IoT) networks. The conventional monostatic backscatter radio is constrained by its limited communication range, which restricts its utility in wide-area applic
External link:
http://arxiv.org/abs/2412.06732
Author:
Patwari, Kartik, Schneider, David, Sun, Xiaoxiao, Chuah, Chen-Nee, Lyu, Lingjuan, Sharma, Vivek
Growing privacy concerns and regulations like GDPR and CCPA necessitate pseudonymization techniques that protect identity in image datasets. However, retaining utility is also essential. Traditional methods like masking and blurring degrade quality a
External link:
http://arxiv.org/abs/2412.06248
Author:
Shahed, Naafis Ahnaf, Samanta, Kartik, Elekhtiar, Mohamed, Huang, Kai, Eom, Chang-Beom, Rzchowski, Mark S., Belashchenko, Kirill D., Tsymbal, Evgeny Y.
The recent surge of interest in moiré superlattices of twisted van der Waals compounds has spotlighted the emergence of unconventional superconductivity and novel electronic phases. However, the range of moiré phenomena can be dramatically expand
External link:
http://arxiv.org/abs/2412.03798
The intricate interplay between magnetism and the topology of electronic structures provides a rich avenue for tailoring materials with unique and potent anomalous transport properties. In this paper, we present a strategy for inducing robust Berry c
External link:
http://arxiv.org/abs/2412.02324
Author:
Danish, Muhammad Sohail, Munir, Muhammad Akhtar, Shah, Syed Roshaan Ali, Kuckreja, Kartik, Khan, Fahad Shahbaz, Fraccaro, Paolo, Lacoste, Alexandre, Khan, Salman
While numerous recent benchmarks focus on evaluating generic Vision-Language Models (VLMs), they fall short in addressing the unique demands of geospatial applications. Generic VLM benchmarks are not designed to handle the complexities of geospatial
External link:
http://arxiv.org/abs/2411.19325
Mechanistic interpretability aims to understand the inner workings of large neural networks by identifying circuits, or minimal subgraphs within the model that implement algorithms responsible for performing specific tasks. These circuits are typical
External link:
http://arxiv.org/abs/2411.16105
Author:
Vayani, Ashmal, Dissanayake, Dinura, Watawana, Hasindri, Ahsan, Noor, Sasikumar, Nevasini, Thawakar, Omkar, Ademtew, Henok Biadglign, Hmaiti, Yahya, Kumar, Amandeep, Kuckreja, Kartik, Maslych, Mykola, Ghallabi, Wafa Al, Mihaylov, Mihail, Qin, Chao, Shaker, Abdelrahman M, Zhang, Mike, Ihsani, Mahardika Krisna, Esplana, Amiel, Gokani, Monil, Mirkin, Shachar, Singh, Harsh, Srivastava, Ashay, Hamerlik, Endre, Izzati, Fathinah Asma, Maani, Fadillah Adamsyah, Cavada, Sebastian, Chim, Jenny, Gupta, Rohit, Manjunath, Sanjay, Zhumakhanova, Kamila, Rabevohitra, Feno Heriniaina, Amirudin, Azril, Ridzuan, Muhammad, Kareem, Daniya, More, Ketan, Li, Kunyang, Shakya, Pramesh, Saad, Muhammad, Ghasemaghaei, Amirpouya, Djanibekov, Amirbek, Azizov, Dilshod, Jankovic, Branislava, Bhatia, Naman, Cabrera, Alvaro, Obando-Ceron, Johan, Otieno, Olympiah, Farestam, Fabian, Rabbani, Muztoba, Baliah, Sanoojan, Sanjeev, Santosh, Shtanchaev, Abduragim, Fatima, Maheen, Nguyen, Thao, Kareem, Amrin, Aremu, Toluwani, Xavier, Nathan, Bhatkal, Amit, Toyin, Hawau, Chadha, Aman, Cholakkal, Hisham, Anwer, Rao Muhammad, Felsberg, Michael, Laaksonen, Jorma, Solorio, Thamar, Choudhury, Monojit, Laptev, Ivan, Shah, Mubarak, Khan, Salman, Khan, Fahad
Existing Large Multimodal Models (LMMs) generally focus on only a few regions and languages. As LMMs continue to improve, it is increasingly important to ensure they understand cultural contexts, respect local sensitivities, and support low-resource
External link:
http://arxiv.org/abs/2411.16508