Zobrazeno 1 - 10
of 10 498
pro vyhledávání: '"P, Keen"'
Autor:
Perera, Derek, Miller Jr, John H, Williams, Liliya L. R., Liesenborgs, Jori, Keen, Allison, Li, Sung Kei, Limousin, Marceau
The increasingly large numbers of multiple images in cluster-scale gravitational lenses have allowed for tighter constraints on the mass distributions of these systems. Most lens models have progressed alongside this increase in image number. The gen
Externí odkaz:
http://arxiv.org/abs/2411.05083
Autor:
Li, Zhangheng, You, Keen, Zhang, Haotian, Feng, Di, Agrawal, Harsh, Li, Xiujun, Moorthy, Mohana Prasad Sathya, Nichols, Jeff, Yang, Yinfei, Gan, Zhe
Building a generalist model for user interface (UI) understanding is challenging due to various foundational issues, such as platform diversity, resolution variation, and data limitation. In this paper, we introduce Ferret-UI 2, a multimodal large la
Externí odkaz:
http://arxiv.org/abs/2410.18967
Autor:
Miron, Marius, Keen, Sara, Liu, Jen-Yu, Hoffman, Benjamin, Hagiwara, Masato, Pietquin, Olivier, Effenberger, Felix, Cusimano, Maddie
Animal vocalization denoising is a task similar to human speech enhancement, a well-studied field of research. In contrast to the latter, it is applied to a higher diversity of sound production mechanisms and recording environments, and this higher d
Externí odkaz:
http://arxiv.org/abs/2410.03427
Autor:
Chen, Hong-You, Lai, Zhengfeng, Zhang, Haotian, Wang, Xinze, Eichner, Marcin, You, Keen, Cao, Meng, Zhang, Bowen, Yang, Yinfei, Gan, Zhe
Contrastive Language-Image Pre-training (CLIP) has been a celebrated method for training vision encoders to generate image/text representations facilitating various applications. Recently, CLIP has been widely adopted as the vision backbone of multim
Externí odkaz:
http://arxiv.org/abs/2410.02746
Autor:
Zhang, Haotian, Gao, Mingfei, Gan, Zhe, Dufter, Philipp, Wenzel, Nina, Huang, Forrest, Shah, Dhruti, Du, Xianzhi, Zhang, Bowen, Li, Yanghao, Dodge, Sam, You, Keen, Yang, Zhen, Timofeev, Aleksei, Xu, Mingze, Chen, Hong-You, Fauconnier, Jean-Philippe, Lai, Zhengfeng, You, Haoxuan, Wang, Zirui, Dehghan, Afshin, Grasch, Peter, Yang, Yinfei
We present MM1.5, a new family of multimodal large language models (MLLMs) designed to enhance capabilities in text-rich image understanding, visual referring and grounding, and multi-image reasoning. Building upon the MM1 architecture, MM1.5 adopts
Externí odkaz:
http://arxiv.org/abs/2409.20566
the action of the function on its Julia set is still ergodic if some, but not all of the asymptotic values land on infinity, and the remaining ones land on a compact repeller. In this paper, we complete the characterization of ergodicity for Nevanlin
Externí odkaz:
http://arxiv.org/abs/2409.12127
The Abstraction and Reasoning Corpus (ARC) is a visual program synthesis benchmark designed to test challenging out-of-distribution generalization in humans and machines. Since 2019, limited progress has been observed on the challenge using existing
Externí odkaz:
http://arxiv.org/abs/2409.01374
Autor:
Harbourne, Elodie A., Cattermull, John, Boström, Hanna L. B., Witte, Rebecca, Roth, Nikolaj, Pasta, Mauro, Keen, David A., Goodwin, Andrew L.
Jahn-Teller distortions of transition-metal coordination environments link orbital occupancies to structure. In the solid state, such distortions can be strongly correlated through the propagation of strain and/or through orbital interactions. Cooper
Externí odkaz:
http://arxiv.org/abs/2408.13169
Autor:
Lepori, Michael A., Tartaglini, Alexa R., Vong, Wai Keen, Serre, Thomas, Lake, Brenden M., Pavlick, Ellie
Though vision transformers (ViTs) have achieved state-of-the-art performance in a variety of settings, they exhibit surprising failures when performing tasks involving visual relations. This begs the question: how do ViTs attempt to perform tasks tha
Externí odkaz:
http://arxiv.org/abs/2406.15955
Autor:
Guéroult, Quentin, Bulled, Jonathan, Patteson, Henry, Coates, Chloe, Smith, Ronald, Playford, Helen, Keen, David, Goodwin, Andrew
The transition-metal dicyanides M(CN)$_2$ (M = Zn, Cd) are amongst the most important negative thermal expansion (NTE) materials known, favoured for the magnitude, isotropy, and thermal persistence of the NTE behaviour they show. The conventional pic
Externí odkaz:
http://arxiv.org/abs/2406.11381