Zobrazeno 1 - 10
of 17 119
pro vyhledávání: '"Sanket"'
Although foundational vision-language models (VLMs) have proven to be very successful for various semantic discrimination tasks, they still struggle to perform faithfully for fine-grained categorization. Moreover, foundational models trained on one d
Externí odkaz:
http://arxiv.org/abs/2409.01835
The proliferation of scene text in both structured and unstructured environments presents significant challenges in optical character recognition (OCR), necessitating more efficient and robust text spotting solutions. This paper presents FastTextSpot
Externí odkaz:
http://arxiv.org/abs/2408.14998
Autor:
Mishra, Debasis, Patil, Sanket
We study undominated mechanisms with transfers for regulating a monopolist who privately observes the marginal cost of production. We show that in any undominated mechanism, there is a quantity floor, which depends only on the primitives, and the reg
Externí odkaz:
http://arxiv.org/abs/2408.09473
Autor:
Upadhyay, Harsh Vardhan, Tripathy, Sanket Kumar, Tan, Ting Rei, Suri, Baladitya, Shankar, Athreya
We propose a protocol for the preparation of generalized Greenberger-Horne-Zeilinger (GHZ) states of $N$ atoms each with $d=3$ or $4$ internal levels. We generalize the celebrated one-axis twisting (OAT) Hamiltonian for $N$ qubits to qudits by includ
Externí odkaz:
http://arxiv.org/abs/2407.19735
Autor:
Gandhi, Sanket, Atul, Mahajan, Samanyu, Sharma, Vishal, Gupta, Rushil, Mondal, Arnab Kumar, Singla, Parag
Recent work has shown that object-centric representations can greatly help improve the accuracy of learning dynamics while also bringing interpretability. In this work, we take this idea one step further, ask the following question: "can learning dis
Externí odkaz:
http://arxiv.org/abs/2407.03216
We introduce a hybrid optomechanical system containing an annularly trapped Bose-Einstein condensate (BEC) inside an optical cavity driven by Lauguerre-Gaussian (LG) modes. Spiral phase elements serve as the end mirrors of the cavity such that the re
Externí odkaz:
http://arxiv.org/abs/2407.01990
Autor:
Lozano, Alejandro, Nirschl, Jeffrey, Burgess, James, Gupte, Sanket Rajan, Zhang, Yuhui, Unell, Alyssa, Yeung-Levy, Serena
Recent advances in microscopy have enabled the rapid generation of terabytes of image data in cell biology and biomedical research. Vision-language models (VLMs) offer a promising solution for large-scale biological image analysis, enhancing research
Externí odkaz:
http://arxiv.org/abs/2407.01791
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Autor:
Pilligua, Maria, Biescas, Nil, Vazquez-Corral, Javier, Lladós, Josep, Valveny, Ernest, Biswas, Sanket
The rapid evolution of intelligent document processing systems demands robust solutions that adapt to diverse domains without extensive retraining. Traditional methods often falter with variable document types, leading to poor performance. To overcom
Externí odkaz:
http://arxiv.org/abs/2406.08610
Autor:
Biswas, Sanket, Jain, Rajiv, Morariu, Vlad I., Gu, Jiuxiang, Mathur, Puneet, Wigington, Curtis, Sun, Tong, Lladós, Josep
While the generation of document layouts has been extensively explored, comprehensive document generation encompassing both layout and content presents a more complex challenge. This paper delves into this advanced domain, proposing a novel approach
Externí odkaz:
http://arxiv.org/abs/2406.08354