Zobrazeno 1 - 10
of 18 188
pro vyhledávání: '"Sheen, A."'
At Expedia, learning-to-rank (LTR) models plays a key role on our website in sorting and presenting information more relevant to users, such as search filters, property rooms, amenities, and images. A major challenge in deploying these models is ensu
Externí odkaz:
http://arxiv.org/abs/2410.01959
In-context learning (ICL) is a cornerstone of large language model (LLM) functionality, yet its theoretical foundations remain elusive due to the complexity of transformer architectures. In particular, most existing work only theoretically explains h
Externí odkaz:
http://arxiv.org/abs/2409.10559
We study gradient flow on the exponential loss for a classification problem with a one-layer softmax attention model, where the key and query weight matrices are trained separately. Under a separability assumption on the data, we show that when gradi
Externí odkaz:
http://arxiv.org/abs/2403.08699
Autor:
Kim, Duho, Sheen, Yun-Kyeong, Jaffé, Yara L., Kelkar, Kshitija, Ranjan, Adarsh, Piraino-Cerda, Franco, Crossett, Jacob P., Lourenço, Ana Carolina Costa, Martin, Garreth, Nantais, Julie B., Demarco, Ricardo, Treister, Ezequiel, Yi, Sukyoung K.
Publikováno v:
ApJ 966 124 (2024)
We study the incidence and spatial distribution of galaxies that are currently undergoing gravitational merging (M) or that have signs of a post merger (PM) in six galaxy clusters (A754, A2399, A2670, A3558, A3562, and A3716) within the redshift rang
Externí odkaz:
http://arxiv.org/abs/2403.06437
We study the dynamics of gradient flow for training a multi-head softmax attention model for in-context learning of multi-task linear regression. We establish the global convergence of gradient flow under suitable choices of initialization. In additi
Externí odkaz:
http://arxiv.org/abs/2402.19442
Autor:
Turan, Nurettin, Fesl, Benedikt, Joham, Michael, Ma, Zhengxiang, Soong, Anthony C. K., Sheen, Baoling, Xiao, Weimin, Utschick, Wolfgang
Discrete Fourier transform (DFT) codebook-based solutions are well-established for limited feedback schemes in frequency division duplex (FDD) systems. In recent years, data-aided solutions have been shown to achieve higher performance, enabled by th
Externí odkaz:
http://arxiv.org/abs/2401.01721
Voice plays an important role in our lives by facilitating communication, conveying emotions, and indicating health. Therefore, tracking vocal interactions can provide valuable insight into many aspects of our lives. This paper presents our ongoing e
Externí odkaz:
http://arxiv.org/abs/2312.10265
Autor:
Pak, Mina, Baek, Junhyun, Lee, Joon Hyeop, Chung, Aeree, Owers, Matt, Jeong, Hyunjin, Sung, Eon-Chang, Sheen, Yun-Kyeong
We present the discovery of a new H I structure in the NGC 7194 group from the observations using the Karl G. Jansky Very Large Array. NGC 7194 group is a nearby (z ~ 0.027) small galaxy group with five quiescent members. The observations reveal a 20
Externí odkaz:
http://arxiv.org/abs/2312.09567
Autor:
Byun, Woowon, Kim, Minjin, Sheen, Yun-Kyeong, Lee, Dongseob, Ho, Luis C., Ko, Jongwan, Seon, Kwang-Il, Shim, Hyunjin, Kim, Dohyeong, Kim, Yongjung, Lee, Joon Hyeop, Jeong, Hyunjin, Woo, Jong-Hak, Jeong, Woong-Seob, Park, Byeong-Gon, Kim, Sang Chul, Lee, Yongseok, Cha, Sang-Mok, Song, Hyunmi, Son, Donghoon, Yang, Yujin
We search for quasi-stellar objects (QSOs) in a wide area of the south ecliptic pole (SEP) field, which has been and will continue to be intensively explored through various space missions. For this purpose, we obtain deep broadband optical images of
Externí odkaz:
http://arxiv.org/abs/2307.15307
Autor:
Gaver, Bill, Boucher, Andy, Brown, Dean, Matsuda, Naho, Ovalle, Liliana, Sheen, Andy, Vanis, Mike
Publikováno v:
Designing More-than-Human Smart Cities: Beyond Sustainability, Towards Cohabitation.
Externí odkaz:
https://doi.org/10.1093/9780191980060.003.006