Zobrazeno 1 - 10
of 12 387
pro vyhledávání: '"Chiao P"'
Autor:
Hsieh, He-Yen, Li, Ziyun, Zhang, Sai Qian, Ting, Wei-Te Mark, Chang, Kao-Den, De Salvo, Barbara, Liu, Chiao, Kung, H. T.
We present GazeGen, a user interaction system that generates visual content (images and videos) for locations indicated by the user's eye gaze. GazeGen allows intuitive manipulation of visual content by targeting regions of interest with gaze. Using
Externí odkaz:
http://arxiv.org/abs/2411.04335
Multi-object 3D Grounding involves locating 3D boxes based on a given query phrase from a point cloud. It is a challenging and significant task with numerous applications in visual understanding, human-computer interaction, and robotics. To tackle th
Externí odkaz:
http://arxiv.org/abs/2410.22306
Autor:
Polyak, Adam, Zohar, Amit, Brown, Andrew, Tjandra, Andros, Sinha, Animesh, Lee, Ann, Vyas, Apoorv, Shi, Bowen, Ma, Chih-Yao, Chuang, Ching-Yao, Yan, David, Choudhary, Dhruv, Wang, Dingkang, Sethi, Geet, Pang, Guan, Ma, Haoyu, Misra, Ishan, Hou, Ji, Wang, Jialiang, Jagadeesh, Kiran, Li, Kunpeng, Zhang, Luxin, Singh, Mannat, Williamson, Mary, Le, Matt, Yu, Matthew, Singh, Mitesh Kumar, Zhang, Peizhao, Vajda, Peter, Duval, Quentin, Girdhar, Rohit, Sumbaly, Roshan, Rambhatla, Sai Saketh, Tsai, Sam, Azadi, Samaneh, Datta, Samyak, Chen, Sanyuan, Bell, Sean, Ramaswamy, Sharadh, Sheynin, Shelly, Bhattacharya, Siddharth, Motwani, Simran, Xu, Tao, Li, Tianhe, Hou, Tingbo, Hsu, Wei-Ning, Yin, Xi, Dai, Xiaoliang, Taigman, Yaniv, Luo, Yaqiao, Liu, Yen-Cheng, Wu, Yi-Chiao, Zhao, Yue, Kirstain, Yuval, He, Zecheng, He, Zijian, Pumarola, Albert, Thabet, Ali, Sanakoyeu, Artsiom, Mallya, Arun, Guo, Baishan, Araya, Boris, Kerr, Breena, Wood, Carleigh, Liu, Ce, Peng, Cen, Vengertsev, Dimitry, Schonfeld, Edgar, Blanchard, Elliot, Juefei-Xu, Felix, Nord, Fraylie, Liang, Jeff, Hoffman, John, Kohler, Jonas, Fire, Kaolin, Sivakumar, Karthik, Chen, Lawrence, Yu, Licheng, Gao, Luya, Georgopoulos, Markos, Moritz, Rashel, Sampson, Sara K., Li, Shikai, Parmeggiani, Simone, Fine, Steve, Fowler, Tara, Petrovic, Vladan, Du, Yuming
We present Movie Gen, a cast of foundation models that generates high-quality, 1080p HD videos with different aspect ratios and synchronized audio. We also show additional capabilities such as precise instruction-based video editing and generation of
Externí odkaz:
http://arxiv.org/abs/2410.13720
Multivariate time-series data in fields like healthcare and industry are informative but challenging due to high dimensionality and lack of labels. Recent self-supervised learning methods excel in learning rich representations without labels but stru
Externí odkaz:
http://arxiv.org/abs/2410.12606
Autor:
Zhao, Yiwei, Li, Ziyun, Khwa, Win-San, Sun, Xiaoyu, Zhang, Sai Qian, Sarwar, Syed Shakib, Stangherlin, Kleber Hugo, Lu, Yi-Lun, Gomez, Jorge Tomas, Seo, Jae-Sun, Gibbons, Phillip B., De Salvo, Barbara, Liu, Chiao
Low-Latency and Low-Power Edge AI is essential for Virtual Reality and Augmented Reality applications. Recent advances show that hybrid models, combining convolution layers (CNN) and transformers (ViT), often achieve superior accuracy/performance tra
Externí odkaz:
http://arxiv.org/abs/2410.08326
Autor:
Chen, Yu-Hua, Cheng, Yuan-Chiao, Yeh, Yen-Tung, Wu, Jui-Te, Ho, Yu-Hsiang, Jang, Jyh-Shing Roger, Yang, Yi-Hsuan
Electric guitar tone modeling typically focuses on the non-linear transformation from clean to amplifier-rendered audio. Traditional methods rely on one-to-one mappings, incorporating device parameters into neural models to replicate specific amplifi
Externí odkaz:
http://arxiv.org/abs/2410.04702
Subsampling layers play a crucial role in deep nets by discarding a portion of an activation map to reduce its spatial dimensions. This encourages the deep net to learn higher-level representations. Contrary to this motivation, we hypothesize that th
Externí odkaz:
http://arxiv.org/abs/2410.01083
Advancements in open-source pre-trained backbones make it relatively easy to fine-tune a model for new tasks. However, this lowered entry barrier poses potential risks, e.g., bad actors developing models for harmful applications. A question arises: I
Externí odkaz:
http://arxiv.org/abs/2409.19210
Autor:
Fornari, Fabrizio, Compagnucci, Ivan, De Donato, Massimo Callisto, Bertrand, Yannis, Beyel, Harry Herbert, Carrión, Emilio, Franceschetti, Marco, Groher, Wolfgang, Grüger, Joscha, Kilic, Emre, Koschmider, Agnes, Leotta, Francesco, Li, Chiao-Yun, Lugaresi, Giovanni, Malburg, Lukas, Mangler, Juergen, Mecella, Massimo, Pastor, Oscar, Riss, Uwe, Seiger, Ronny, Serral, Estefania, Torres, Victoria, Valderas, Pedro
Modern organizations necessitate continuous business processes improvement to maintain efficiency, adaptability, and competitiveness. In the last few years, the Internet of Things, via the deployment of sensors and actuators, has heavily been adopted
Externí odkaz:
http://arxiv.org/abs/2410.08219
Autor:
Gao, Jun, Krishna, Govind, Yeung, Edith, Yu, Lingxi, Gangopadhyay, Sayan, Chan, Kai-Sum, Huang, Chiao-Tzu, Descamps, Thomas, Reimer, Michael E., Poole, Philip J., Dalacu, Dan, Zwiller, Val, Elshaari, Ali W.
Coherent control of single photon sources is a key requirement for the advancement of photonic quantum technologies. Among them, nanowire-based quantum dot sources are popular due to their potential for on-chip hybrid integration. Here we demonstrate
Externí odkaz:
http://arxiv.org/abs/2409.14964