Zobrazeno 1 - 10
of 14 793
pro vyhledávání: '"Yang, Yue"'
Autor:
Deitke, Matt, Clark, Christopher, Lee, Sangho, Tripathi, Rohun, Yang, Yue, Park, Jae Sung, Salehi, Mohammadreza, Muennighoff, Niklas, Lo, Kyle, Soldaini, Luca, Lu, Jiasen, Anderson, Taira, Bransom, Erin, Ehsani, Kiana, Ngo, Huong, Chen, YenSung, Patel, Ajay, Yatskar, Mark, Callison-Burch, Chris, Head, Andrew, Hendrix, Rose, Bastani, Favyen, VanderBilt, Eli, Lambert, Nathan, Chou, Yvonne, Chheda, Arnavi, Sparks, Jenna, Skjonsberg, Sam, Schmitz, Michael, Sarnat, Aaron, Bischoff, Byron, Walsh, Pete, Newell, Chris, Wolters, Piper, Gupta, Tanmay, Zeng, Kuo-Hao, Borchardt, Jon, Groeneveld, Dirk, Dumas, Jen, Nam, Crystal, Lebrecht, Sophie, Wittlif, Caitlin, Schoenick, Carissa, Michel, Oscar, Krishna, Ranjay, Weihs, Luca, Smith, Noah A., Hajishirzi, Hannaneh, Girshick, Ross, Farhadi, Ali, Kembhavi, Aniruddha
Today's most advanced multimodal models remain proprietary. The strongest open-weight models rely heavily on synthetic data from proprietary VLMs to achieve good performance, effectively distilling these closed models into open ones. As a result, the
Externí odkaz:
http://arxiv.org/abs/2409.17146
Autor:
Chen, Yihao, Yang, Yue
We develop a deep reinforcement learning method for training a jellyfish-like swimmer to effectively track a moving target in a two-dimensional flow. This swimmer is a flexible object equipped with a muscle model based on torsional springs. We employ
Externí odkaz:
http://arxiv.org/abs/2409.08815
Autor:
Wang, Zhaowei, Hao, Ying, Wei, Hao, Xiao, Qing, Chen, Lulu, Li, Yulong, Yang, Yue, Li, Tianyi
Recent advancements in text-to-image diffusion models have significantly transformed visual content generation, yet their application in specialized fields such as interior design remains underexplored. In this paper, we present RoomDiffusion, a pion
Externí odkaz:
http://arxiv.org/abs/2409.03198
Autor:
Li, Keqin, Wang, Jin, Wu, Xubo, Peng, Xirui, Chang, Runmian, Deng, Xiaoyu, Kang, Yiwen, Yang, Yue, Ni, Fanghao, Hong, Bo
With the rapid growth of global e-commerce, the demand for automation in the logistics industry is increasing. This study focuses on automated picking systems in warehouses, utilizing deep learning and reinforcement learning technologies to enhance p
Externí odkaz:
http://arxiv.org/abs/2408.16633
Autor:
Lu, Qiuyu, Yi, Semina, Gan, Mentian, Huang, Jihong, Zhang, Xiao, Yang, Yue, Shen, Chenyi, Yao, Lining
While it seems counterintuitive to think of degradation within an operating device as beneficial, one may argue that when rationally designed, the controlled breakdown of materials can be harnessed for specific functions. To apply this principle to t
Externí odkaz:
http://arxiv.org/abs/2408.01660
Autor:
Xu, Boyan, Wen, Liang, Li, Zihao, Yang, Yuxing, Wu, Guanlan, Tang, Xiongpeng, Li, Yu, Wu, Zihao, Su, Qingxian, Shi, Xueqing, Yang, Yue, Tong, Rui, Ng, How Yong
Recent advancements in Large Language Models (LLMs) have sparked interest in their potential applications across various fields. This paper embarked on a pivotal inquiry: Can existing LLMs effectively serve as "water expert models" for water engineer
Externí odkaz:
http://arxiv.org/abs/2407.21045
Vision-language foundation models have been incredibly successful in a wide range of downstream computer vision tasks using adaptation methods. However, due to the high cost of obtaining pre-training datasets, pairs with weak image-text correlation i
Externí odkaz:
http://arxiv.org/abs/2407.08787
Autor:
Yang, Zhantao, Feng, Ruili, Yan, Keyu, Wang, Huangji, Wang, Zhicai, Zhu, Shangwen, Zhang, Han, Xiao, Jie, Wu, Pingyu, Zhu, Kai, Chen, Jixuan, Xie, Chen-Wei, Mao, Chaojie, Yang, Yue, Zhang, Hongyang, Liu, Yu, Cheng, Fan
This paper presents Bag-of-Concept Graph (BACON) to gift models with limited linguistic abilities to taste the privilege of Vision Language Models (VLMs) and boost downstream tasks such as detection, visual question answering (VQA), and image generat
Externí odkaz:
http://arxiv.org/abs/2407.03314
We propose a geometry-to-flow diffusion model that utilizes the input of obstacle shape to predict a flow field past the obstacle. The model is based on a learnable Markov transition kernel to recover the data distribution from the Gaussian distribut
Externí odkaz:
http://arxiv.org/abs/2407.00735
Autor:
Meng, Fanqing, Shao, Wenqi, Luo, Lixin, Wang, Yahong, Chen, Yiran, Lu, Quanfeng, Yang, Yue, Yang, Tianshuo, Zhang, Kaipeng, Qiao, Yu, Luo, Ping
Text-to-image (T2I) models have made substantial progress in generating images from textual prompts. However, they frequently fail to produce images consistent with physical commonsense, a vital capability for applications in world simulation and eve
Externí odkaz:
http://arxiv.org/abs/2406.11802