Zobrazeno 1 - 10
of 634
pro vyhledávání: '"Park Jae Sung"'
Autor:
Awadalla, Anas, Xue, Le, Shu, Manli, Yan, An, Wang, Jun, Purushwalkam, Senthil, Shen, Sheng, Lee, Hannah, Lo, Oscar, Park, Jae Sung, Guha, Etash, Savarese, Silvio, Schmidt, Ludwig, Choi, Yejin, Xiong, Caiming, Xu, Ran
We introduce BLIP3-KALE, a dataset of 218 million image-text pairs that bridges the gap between descriptive synthetic captions and factual web-scale alt-text. KALE augments synthetic dense image captions with web-scale alt-text to generate factually
Externí odkaz:
http://arxiv.org/abs/2411.07461
Autor:
Salehi, Mohammadreza, Park, Jae Sung, Yadav, Tanush, Kusupati, Aditya, Krishna, Ranjay, Choi, Yejin, Hajishirzi, Hannaneh, Farhadi, Ali
Publikováno v:
NeurIPS 2024 Track Datasets and Benchmarks
Our world is full of varied actions and moves across specialized domains that we, as humans, strive to identify and understand. Within any single domain, actions can often appear quite similar, making it challenging for deep models to distinguish the
Externí odkaz:
http://arxiv.org/abs/2410.05774
Autor:
Deitke, Matt, Clark, Christopher, Lee, Sangho, Tripathi, Rohun, Yang, Yue, Park, Jae Sung, Salehi, Mohammadreza, Muennighoff, Niklas, Lo, Kyle, Soldaini, Luca, Lu, Jiasen, Anderson, Taira, Bransom, Erin, Ehsani, Kiana, Ngo, Huong, Chen, YenSung, Patel, Ajay, Yatskar, Mark, Callison-Burch, Chris, Head, Andrew, Hendrix, Rose, Bastani, Favyen, VanderBilt, Eli, Lambert, Nathan, Chou, Yvonne, Chheda, Arnavi, Sparks, Jenna, Skjonsberg, Sam, Schmitz, Michael, Sarnat, Aaron, Bischoff, Byron, Walsh, Pete, Newell, Chris, Wolters, Piper, Gupta, Tanmay, Zeng, Kuo-Hao, Borchardt, Jon, Groeneveld, Dirk, Nam, Crystal, Lebrecht, Sophie, Wittlif, Caitlin, Schoenick, Carissa, Michel, Oscar, Krishna, Ranjay, Weihs, Luca, Smith, Noah A., Hajishirzi, Hannaneh, Girshick, Ross, Farhadi, Ali, Kembhavi, Aniruddha
Today's most advanced vision-language models (VLMs) remain proprietary. The strongest open-weight models rely heavily on synthetic data from proprietary VLMs to achieve good performance, effectively distilling these closed VLMs into open ones. As a r
Externí odkaz:
http://arxiv.org/abs/2409.17146
Autor:
Chandu, Khyathi Raghavi, Li, Linjie, Awadalla, Anas, Lu, Ximing, Park, Jae Sung, Hessel, Jack, Wang, Lijuan, Choi, Yejin
The ability to acknowledge the inevitable uncertainty in their knowledge and reasoning is a prerequisite for AI systems to be truly truthful and reliable. In this paper, we present a taxonomy of uncertainty specific to vision-language AI systems, dis
Externí odkaz:
http://arxiv.org/abs/2407.01942
Autor:
Shen, Ethan, Fan, Alan, Pratt, Sarah M., Park, Jae Sung, Wallingford, Matthew, Kakade, Sham M., Holtzman, Ari, Krishna, Ranjay, Farhadi, Ali, Kusupati, Aditya
Many applications today provide users with multiple auto-complete drafts as they type, including GitHub's code completion, Gmail's smart compose, and Apple's messaging auto-suggestions. Under the hood, language models support this by running an autor
Externí odkaz:
http://arxiv.org/abs/2405.18400
Autor:
Durante, Zane, Huang, Qiuyuan, Wake, Naoki, Gong, Ran, Park, Jae Sung, Sarkar, Bidipta, Taori, Rohan, Noda, Yusuke, Terzopoulos, Demetri, Choi, Yejin, Ikeuchi, Katsushi, Vo, Hoi, Fei-Fei, Li, Gao, Jianfeng
Multi-modal AI systems will likely become a ubiquitous presence in our everyday lives. A promising approach to making these systems more interactive is to embody them as agents within physical and virtual environments. At present, systems leverage ex
Externí odkaz:
http://arxiv.org/abs/2401.03568
Autor:
Park, Jae Sung, Hessel, Jack, Chandu, Khyathi Raghavi, Liang, Paul Pu, Lu, Ximing, West, Peter, Yu, Youngjae, Huang, Qiuyuan, Gao, Jianfeng, Farhadi, Ali, Choi, Yejin
Instruction following vision-language (VL) models offer a flexible interface that supports a broad range of multimodal tasks in a zero-shot fashion. However, interfaces that operate on full images do not directly enable the user to "point to" and acc
Externí odkaz:
http://arxiv.org/abs/2312.04837
Autor:
Mirfendereski, Siamak, Park, Jae Sung
The rheological behaviour of dense suspensions of ideally conductive particles in the presence of both electric field and shear flow is studied using large-scale numerical simulations. Under the action of an electric field, these particles are known
Externí odkaz:
http://arxiv.org/abs/2311.09121
The transition to turbulence in a plane Poiseuille flow of dilute polymer solutions is studied by direct numerical simulations of a FENE-P fluid. A range of Reynolds number ($Re$) in $2000 \le Re \le 5000$ is studied but with the same level of elasti
Externí odkaz:
http://arxiv.org/abs/2311.03188
Autor:
Huang, Qiuyuan, Park, Jae Sung, Gupta, Abhinav, Bennett, Paul, Gong, Ran, Som, Subhojit, Peng, Baolin, Mohammed, Owais Khan, Pal, Chris, Choi, Yejin, Gao, Jianfeng
Despite the growing adoption of mixed reality and interactive AI agents, it remains challenging for these systems to generate high quality 2D/3D scenes in unseen environments. The common practice requires deploying an AI agent to collect large amount
Externí odkaz:
http://arxiv.org/abs/2305.00970