Zobrazeno 1 - 10
of 12 023
pro vyhledávání: '"Szekely, Or"'
Autor:
Cho, Hyundong, Jedema, Nicolaas, Ribeiro, Leonardo F. R., Sharma, Karishma, Szekely, Pedro, Moschitti, Alessandro, Janssen, Ruben, May, Jonathan
Current instruction-tuned language models are exclusively trained with textual preference data and thus are often not aligned with the unique requirements of other modalities, such as speech. To better align language models with the speech domain, we
Externí odkaz:
http://arxiv.org/abs/2409.14672
Autor:
Bora, Zsófia, Könyves-Tóth, Réka, Vinkó, József, Bánhidi, Dominik, Bíró, Imre Barna, Bostroem, K. Azalee, Bódi, Attila, Burke, Jamison, Csányi, István, Cseh, Borbála, Farah, Joseph, Filippenko, Alexei V., Hegedűs, Tibor, Hiramatsu, Daichi, Horti-Dávid, Ágoston, Howell, D. Andrew, Jha, Saurabh W., Kalup, Csilla, Krezinger, Máté, Kriskovics, Levente, McCully, Curtis, Newsome, Megan, Ordasi, András, Gonzalez, Estefania Padilla, Pál, András, Pellegrino, Craig, Seli, Bálint, Sódor, Ádám, Szabó, Zsófia Marianna, Szabó, Norton O., Szakáts, Róbert, Szalai, Tamás, Székely, Péter, Terreran, Giacomo, Varga, Vázsony, Vida, Krisztián, Wang, Xiaofeng, Wheeler, J. Craig
The progenitor system(s) as well as the explosion mechanism(s) of thermonuclear (Type Ia) supernovae are long-standing issues in astrophysics. Here we present ejecta masses and other physical parameters for 28 recent Type Ia supernovae inferred from
Externí odkaz:
http://arxiv.org/abs/2408.11928
Autor:
Mehta, Shivam, Lameris, Harm, Punmiya, Rajiv, Beskow, Jonas, Székely, Éva, Henter, Gustav Eje
Converting input symbols to output audio in TTS requires modelling the durations of speech sounds. Leading non-autoregressive (NAR) TTS models treat duration modelling as a regression problem. The same utterance is then spoken with identical timings
Externí odkaz:
http://arxiv.org/abs/2406.05401
Autor:
Szalai, T., Könyves-Tóth, R., Nagy, A. P., Hiramatsu, D., Arcavi, I., Bostroem, A., Howell, D. A., Farah, J., McCully, C., Newsome, M., Gonzalez, E. Padilla, Pellegrino, C., Terreran, G., Berger, E., Blanchard, P., Gomez, S., Székely, P., Bánhidi, D., Bíró, I. B., Csányi, I., Pál, A., Rho, J., Vinkó, J.
Publikováno v:
A&A 690, A17 (2024)
There is a growing number of peculiar events that cannot be assigned to any of the main supernova (SN) classes. SN 1987A and a handful of similar objects, thought to be explosive outcomes of blue supergiant stars, belong to them: while their spectra
Externí odkaz:
http://arxiv.org/abs/2406.02498
Autor:
Wang, Siyang, Székely, Éva
Recent advances in generative language modeling applied to discrete speech tokens presented a new avenue for text-to-speech (TTS) synthesis. These speech language models (SLMs), similarly to their textual counterparts, are scalable, probabilistic, an
Externí odkaz:
http://arxiv.org/abs/2405.09768
Autor:
Móri, Tamás F., Székely, Gábor J.
In this paper three unrelated problems will be discussed. What connects them is the rich methodology of classical probability theory. In the first two problems we have a complete answer to the problems raised; in the third case, what we call the Hung
Externí odkaz:
http://arxiv.org/abs/2404.10654
Autor:
Biron, Tirza, Barboy, Moshe, Ben-Artzy, Eran, Golubchik, Alona, Marmor, Yanir, Szekely, Smadar, Winter, Yaron, Harel, David
Non-verbal signals in speech are encoded by prosody and carry information that ranges from conversation action to attitude and emotion. Despite its importance, the principles that govern prosodic structure are not yet adequately understood. This pape
Externí odkaz:
http://arxiv.org/abs/2403.03522
Autor:
Greene, Michelle R., Balas, Benjamin J., Lescroart, Mark D., MacNeilage, Paul R., Hart, Jennifer A., Binaee, Kamran, Hausamann, Peter A., Mezile, Ronald, Shankar, Bharath, Sinnott, Christian B., Capurro, Kaylie, Halow, Savannah, Howe, Hunter, Josyula, Mariam, Li, Annie, Mieses, Abraham, Mohamed, Amina, Nudnou, Ilya, Parkhill, Ezra, Riley, Peter, Schmidt, Brett, Shinkle, Matthew W., Si, Wentao, Szekely, Brian, Torres, Joaquin M., Weissmann, Eliana
We introduce the Visual Experience Dataset (VEDB), a compilation of over 240 hours of egocentric video combined with gaze- and head-tracking data that offers an unprecedented view of the visual world as experienced by human observers. The dataset con
Externí odkaz:
http://arxiv.org/abs/2404.18934
The manual modeling of complex systems is a daunting task; and although a plethora of methods exist that mitigate this issue, the problem remains very difficult. Recent advances in generative AI have allowed the creation of general-purpose chatbots,
Externí odkaz:
http://arxiv.org/abs/2401.02245
Methodologies for development of complex systems and models include external reviews by domain and technology experts. Among others, such reviews can uncover undocumented built-in assumptions that may be critical for correct and safe operation or con
Externí odkaz:
http://arxiv.org/abs/2312.16507