Zobrazeno 1 - 10
of 8 754
pro vyhledávání: '"Bilge, P."'
Autor:
Xiong, Yunyang, Zhou, Chong, Xiang, Xiaoyu, Wu, Lemeng, Zhu, Chenchen, Liu, Zechun, Suri, Saksham, Varadarajan, Balakrishnan, Akula, Ramya, Iandola, Forrest, Krishnamoorthi, Raghuraman, Soran, Bilge, Chandra, Vikas
Segment Anything Model 2 (SAM 2) has emerged as a powerful tool for video object segmentation and tracking anything. Key components of SAM 2 that drive the impressive video object segmentation performance include a large multistage image encoder for
Externí odkaz:
http://arxiv.org/abs/2411.18933
Autor:
Fedorov, Igor, Plawiak, Kate, Wu, Lemeng, Elgamal, Tarek, Suda, Naveen, Smith, Eric, Zhan, Hongyuan, Chi, Jianfeng, Hulovatyy, Yuriy, Patel, Kimish, Liu, Zechun, Zhao, Changsheng, Shi, Yangyang, Blankevoort, Tijmen, Pasupuleti, Mahesh, Soran, Bilge, Coudert, Zacharie Delpierre, Alao, Rachad, Krishnamoorthi, Raghuraman, Chandra, Vikas
This paper presents Llama Guard 3-1B-INT4, a compact and efficient Llama Guard model, which has been open-sourced to the community during Meta Connect 2024. We demonstrate that Llama Guard 3-1B-INT4 can be deployed on resource-constrained devices, ac
Externí odkaz:
http://arxiv.org/abs/2411.17713
The Newman-Unti-Tamburino (NUT) solution is characterized as the unique Petrov Type $D$ vacuum metric such that the two double principal null directions form an integrable distribution. The uniqueness of the NUT is established by evaluating the integ
Externí odkaz:
http://arxiv.org/abs/2411.11400
Leveraging generative AI (for example, Large Language Models) for language understanding within robotics opens up possibilities for LLM-driven robot end-user development (EUD). Despite the numerous design opportunities it provides, little is understo
Externí odkaz:
http://arxiv.org/abs/2411.04273
Autor:
Shen, Xiaoqian, Xiong, Yunyang, Zhao, Changsheng, Wu, Lemeng, Chen, Jun, Zhu, Chenchen, Liu, Zechun, Xiao, Fanyi, Varadarajan, Balakrishnan, Bordes, Florian, Liu, Zhuang, Xu, Hu, Kim, Hyunwoo J., Soran, Bilge, Krishnamoorthi, Raghuraman, Elhoseiny, Mohamed, Chandra, Vikas
Multimodal Large Language Models (MLLMs) have shown promising progress in understanding and analyzing video content. However, processing long videos remains a significant challenge constrained by LLM's context size. To address this limitation, we pro
Externí odkaz:
http://arxiv.org/abs/2410.17434
Autor:
Lee, Yejin, Sun, Anna, Hosmer, Basil, Acun, Bilge, Balioglu, Can, Wang, Changhan, Hernandez, Charles David, Puhrsch, Christian, Haziza, Daniel, Guessous, Driss, Massa, Francisco, Kahn, Jacob, Wan, Jeffrey, Reizenstein, Jeremy, Zhai, Jiaqi, Isaacson, Joe, Schlosser, Joel, Pino, Juan, Sadagopan, Kaushik Ram, Shamis, Leonid, Ma, Linjian, Hwang, Min-Jae, Chen, Mingda, Elhoushi, Mostafa, Rodriguez, Pedro, Pasunuru, Ram, Yih, Scott, Popuri, Sravya, Liu, Xing, Wu, Carole-Jean
Generative artificial intelligence (AI) technology is revolutionizing the computing industry. Not only its applications have broadened to various sectors but also poses new system design and optimization opportunities. The technology is capable of un
Externí odkaz:
http://arxiv.org/abs/2410.00215
Autor:
Tuorila, Heidi, Viheriälä, Jukka, Arafat, Yeasir, Atar, Fatih Bilge, Gunning, Fatima, Corbett, Brian, Guina, Mircea
3D integration of GaSb-based gain chips on a silicon photonics platform using micro-transfer printing is demonstrated for the first time. The release process of GaSb coupons, and their transfer for the demonstration of hybrid GaSb/Silicon-photonics o
Externí odkaz:
http://arxiv.org/abs/2409.13413
In recent years, transformer-based architectures become the de facto standard for sequence modeling in deep learning frameworks. Inspired by the successful examples, we propose a causal visual-inertial fusion transformer (VIFT) for pose estimation in
Externí odkaz:
http://arxiv.org/abs/2409.08769
Autor:
Hu, Yaxin, Lim, Hajin, Kakonge, Lisa, Mitchell, Jade T., Johnson, Hailey L., Turkstra, Lyn, Duff, Melissa C., Toma, Catalina L., Mutlu, Bilge
Traumatic brain injury (TBI) can cause a range of cognitive and communication challenges that negatively affect social participation in both face-to-face interactions and computer-mediated communication. In particular, individuals with TBI report bar
Externí odkaz:
http://arxiv.org/abs/2408.09683
Autor:
İzci, Bilge, Özkoç, Murad
The aim of this paper is to introduce the notion of $a$-locally closed set by utilizing $a$-open sets defined by Ekici and to study some properties of this new notion. Also, some characterizations and many fundamental results regarding this new conce
Externí odkaz:
http://arxiv.org/abs/2408.03169