Showing 1 - 10
of 12,807
for search: '"A, Groeneveld"'
Author:
van Weeren, R. J., Timmerman, R., Vaidya, V., Gendron-Marsolais, M.-L., Botteon, A., Roberts, I. D., Hlavacek-Larrondo, J., Bonafede, A., Brüggen, M., Brunetti, G., Cassano, R., Cuciti, V., Edge, A. C., Gastaldello, F., Groeneveld, C., Shimwell, T. W.
The Perseus cluster is the brightest X-ray cluster in the sky and is known as a cool-core galaxy cluster. Being a very nearby cluster, it has been extensively studied. This has provided a comprehensive view of the physical processes that operate in t…
External link:
http://arxiv.org/abs/2410.02863
Author:
Deitke, Matt, Clark, Christopher, Lee, Sangho, Tripathi, Rohun, Yang, Yue, Park, Jae Sung, Salehi, Mohammadreza, Muennighoff, Niklas, Lo, Kyle, Soldaini, Luca, Lu, Jiasen, Anderson, Taira, Bransom, Erin, Ehsani, Kiana, Ngo, Huong, Chen, YenSung, Patel, Ajay, Yatskar, Mark, Callison-Burch, Chris, Head, Andrew, Hendrix, Rose, Bastani, Favyen, VanderBilt, Eli, Lambert, Nathan, Chou, Yvonne, Chheda, Arnavi, Sparks, Jenna, Skjonsberg, Sam, Schmitz, Michael, Sarnat, Aaron, Bischoff, Byron, Walsh, Pete, Newell, Chris, Wolters, Piper, Gupta, Tanmay, Zeng, Kuo-Hao, Borchardt, Jon, Groeneveld, Dirk, Dumas, Jen, Nam, Crystal, Lebrecht, Sophie, Wittlif, Caitlin, Schoenick, Carissa, Michel, Oscar, Krishna, Ranjay, Weihs, Luca, Smith, Noah A., Hajishirzi, Hannaneh, Girshick, Ross, Farhadi, Ali, Kembhavi, Aniruddha
Today's most advanced multimodal models remain proprietary. The strongest open-weight models rely heavily on synthetic data from proprietary VLMs to achieve good performance, effectively distilling these closed models into open ones. As a result, the…
External link:
http://arxiv.org/abs/2409.17146
Author:
Muennighoff, Niklas, Soldaini, Luca, Groeneveld, Dirk, Lo, Kyle, Morrison, Jacob, Min, Sewon, Shi, Weijia, Walsh, Pete, Tafjord, Oyvind, Lambert, Nathan, Gu, Yuling, Arora, Shane, Bhagia, Akshita, Schwenk, Dustin, Wadden, David, Wettig, Alexander, Hui, Binyuan, Dettmers, Tim, Kiela, Douwe, Farhadi, Ali, Smith, Noah A., Koh, Pang Wei, Singh, Amanpreet, Hajishirzi, Hannaneh
We introduce OLMoE, a fully open, state-of-the-art language model leveraging sparse Mixture-of-Experts (MoE). OLMoE-1B-7B has 7 billion (B) parameters but uses only 1B per input token. We pretrain it on 5 trillion tokens and further adapt it to creat…
External link:
http://arxiv.org/abs/2409.02060
Author:
Li, Jeffrey, Fang, Alex, Smyrnis, Georgios, Ivgi, Maor, Jordan, Matt, Gadre, Samir, Bansal, Hritik, Guha, Etash, Keh, Sedrick, Arora, Kushal, Garg, Saurabh, Xin, Rui, Muennighoff, Niklas, Heckel, Reinhard, Mercat, Jean, Chen, Mayee, Gururangan, Suchin, Wortsman, Mitchell, Albalak, Alon, Bitton, Yonatan, Nezhurina, Marianna, Abbas, Amro, Hsieh, Cheng-Yu, Ghosh, Dhruba, Gardner, Josh, Kilian, Maciej, Zhang, Hanlin, Shao, Rulin, Pratt, Sarah, Sanyal, Sunny, Ilharco, Gabriel, Daras, Giannis, Marathe, Kalyani, Gokaslan, Aaron, Zhang, Jieyu, Chandu, Khyathi, Nguyen, Thao, Vasiljevic, Igor, Kakade, Sham, Song, Shuran, Sanghavi, Sujay, Faghri, Fartash, Oh, Sewoong, Zettlemoyer, Luke, Lo, Kyle, El-Nouby, Alaaeldin, Pouransari, Hadi, Toshev, Alexander, Wang, Stephanie, Groeneveld, Dirk, Soldaini, Luca, Koh, Pang Wei, Jitsev, Jenia, Kollar, Thomas, Dimakis, Alexandros G., Carmon, Yair, Dave, Achal, Schmidt, Ludwig, Shankar, Vaishaal
We introduce DataComp for Language Models (DCLM), a testbed for controlled dataset experiments with the goal of improving language models. As part of DCLM, we provide a standardized corpus of 240T tokens extracted from Common Crawl, effective pretrai…
External link:
http://arxiv.org/abs/2406.11794
Author:
Groeneveld, C., van Weeren, R. J., Osinga, E., Williams, W. L., Callingham, J. R., de Gasperin, F., Botteon, A., Shimwell, T., de Jong, J. M. G. H. J., Jansen, L. F., Miley, G. K., Brunetti, G., Brüggen, M., Röttgering, H. J. A.
Published in:
Nature Astronomy 8 (2024) 786-795
The largely unexplored decameter radio band (10-30 MHz) provides a unique window for studying a range of astronomical topics, such as auroral emission from exoplanets, inefficient cosmic ray acceleration mechanisms, fossil radio plasma, and free-free…
External link:
http://arxiv.org/abs/2405.05311
Author:
Groeneveld, Dirk, Beltagy, Iz, Walsh, Pete, Bhagia, Akshita, Kinney, Rodney, Tafjord, Oyvind, Jha, Ananya Harsh, Ivison, Hamish, Magnusson, Ian, Wang, Yizhong, Arora, Shane, Atkinson, David, Authur, Russell, Chandu, Khyathi Raghavi, Cohan, Arman, Dumas, Jennifer, Elazar, Yanai, Gu, Yuling, Hessel, Jack, Khot, Tushar, Merrill, William, Morrison, Jacob, Muennighoff, Niklas, Naik, Aakanksha, Nam, Crystal, Peters, Matthew E., Pyatkin, Valentina, Ravichander, Abhilasha, Schwenk, Dustin, Shah, Saurabh, Smith, Will, Strubell, Emma, Subramani, Nishant, Wortsman, Mitchell, Dasigi, Pradeep, Lambert, Nathan, Richardson, Kyle, Zettlemoyer, Luke, Dodge, Jesse, Lo, Kyle, Soldaini, Luca, Smith, Noah A., Hajishirzi, Hannaneh
Language models (LMs) have become ubiquitous in both NLP research and in commercial product offerings. As their commercial importance has surged, the most powerful models have become closed off, gated behind proprietary interfaces, with important det…
External link:
http://arxiv.org/abs/2402.00838
Author:
Soldaini, Luca, Kinney, Rodney, Bhagia, Akshita, Schwenk, Dustin, Atkinson, David, Authur, Russell, Bogin, Ben, Chandu, Khyathi, Dumas, Jennifer, Elazar, Yanai, Hofmann, Valentin, Jha, Ananya Harsh, Kumar, Sachin, Lucy, Li, Lyu, Xinxi, Lambert, Nathan, Magnusson, Ian, Morrison, Jacob, Muennighoff, Niklas, Naik, Aakanksha, Nam, Crystal, Peters, Matthew E., Ravichander, Abhilasha, Richardson, Kyle, Shen, Zejiang, Strubell, Emma, Subramani, Nishant, Tafjord, Oyvind, Walsh, Pete, Zettlemoyer, Luke, Smith, Noah A., Hajishirzi, Hannaneh, Beltagy, Iz, Groeneveld, Dirk, Dodge, Jesse, Lo, Kyle
Information about pretraining corpora used to train the current best-performing language models is seldom discussed: commercial models rarely detail their data, and even open models are often released without accompanying training data or recipes to…
External link:
http://arxiv.org/abs/2402.00159
Published in:
European Journal of Engineering Education, 19 December 2023, pages 1-18
Creativity is a critical skill that professional software engineers leverage to tackle difficult problems. In higher education, multiple efforts have been made to spark creative skills of engineering students. However, creativity is a vague concept t…
External link:
http://arxiv.org/abs/2312.12014
Author:
Magnusson, Ian, Bhagia, Akshita, Hofmann, Valentin, Soldaini, Luca, Jha, Ananya Harsh, Tafjord, Oyvind, Schwenk, Dustin, Walsh, Evan Pete, Elazar, Yanai, Lo, Kyle, Groeneveld, Dirk, Beltagy, Iz, Hajishirzi, Hannaneh, Smith, Noah A., Richardson, Kyle, Dodge, Jesse
Language models (LMs) commonly report perplexity on monolithic data held out from training. Implicitly or explicitly, this data is composed of domains – varying distributions of language. Rather than assuming perplexity on one distribut…
External link:
http://arxiv.org/abs/2312.10523
Author:
Groeneveld, Dirk, Awadalla, Anas, Beltagy, Iz, Bhagia, Akshita, Magnusson, Ian, Peng, Hao, Tafjord, Oyvind, Walsh, Pete, Richardson, Kyle, Dodge, Jesse
The success of large language models has shifted the evaluation paradigms in natural language processing (NLP). The community's interest has drifted towards comparing NLP models across many tasks, domains, and datasets, often at an extreme scale. Thi…
External link:
http://arxiv.org/abs/2312.10253