Výsledky vyhledávání - "Oberski, Daniel L."

Report

PATCH! Psychometrics-AssisTed benCHmarking of Large Language Models: A Case Study of Proficiency in 8th Grade Mathematics

Autor: Fang, Qixiang, Oberski, Daniel L., Nguyen, Dong

Many existing benchmarks of large (multimodal) language models (LLMs) focus on measuring LLMs' academic proficiency, often with also an interest in comparing model performance with human test takers. While these benchmarks have proven key to the deve

Externí odkaz: http://arxiv.org/abs/2404.01799

Zobrazit plný text záznamu

Report

General-Purpose User Modeling with Behavioral Logs: A Snapchat Case Study

Autor: Fang, Qixiang, Zhou, Zhihan, Barbieri, Francesco, Liu, Yozen, Neves, Leonardo, Nguyen, Dong, Oberski, Daniel L., Bos, Maarten W., Dotsch, Ron

Learning general-purpose user representations based on user behavioral logs is an increasingly popular user modeling approach. It benefits from easily available, privacy-friendly yet expressive data, and does not require extensive re-tuning of the up

Externí odkaz: http://arxiv.org/abs/2312.12111

Zobrazit plný text záznamu

Report

On Text-based Personality Computing: Challenges and Future Directions

Autor: Fang, Qixiang, Giachanou, Anastasia, Bagheri, Ayoub, Boeschoten, Laura, van Kesteren, Erik-Jan, Kamalabad, Mahdi Shafiee, Oberski, Daniel L

Text-based personality computing (TPC) has gained many research interests in NLP. In this paper, we describe 15 challenges that we consider deserving the attention of the research community. These challenges are organized by the following topics: per

Externí odkaz: http://arxiv.org/abs/2212.06711

Zobrazit plný text záznamu

Report

Evaluating the Construct Validity of Text Embeddings with Application to Survey Questions

Autor: Fang, Qixiang, Nguyen, Dong, Oberski, Daniel L

Publikováno v: EPJ Data Sci. 11, 39 (2022)

Text embedding models from Natural Language Processing can map text data (e.g. words, sentences, documents) to supposedly meaningful numerical representations (a.k.a. text embeddings). While such models are increasingly applied in social science rese

Externí odkaz: http://arxiv.org/abs/2202.09166

Zobrazit plný text záznamu

Report

Digital trace data collection through data donation

Autor: Boeschoten, Laura, Ausloos, Jef, Moeller, Judith, Araujo, Theo, Oberski, Daniel L.

A potentially powerful method of social-scientific data collection and investigation has been created by an unexpected institution: the law. Article 15 of the EU's 2018 General Data Protection Regulation (GDPR) mandates that individuals have electron

Externí odkaz: http://arxiv.org/abs/2011.09851

Zobrazit plný text záznamu

Akademický článek

A systematic literature review of time series methods applied to epidemic prediction

Autor: Batoure Bamana, Apollinaire, Shafiee Kamalabad, Mahdi, Oberski, Daniel L.

Publikováno v: In Informatics in Medicine Unlocked 2024 50

Zobrazit plný text záznamu

Report

Multimodal Learning for Cardiovascular Risk Prediction using EHR Data

Autor: Bagheri, Ayoub, Groenhof, T. Katrien J., Veldhuis, Wouter B., de Jong, Pim A., Asselbergs, Folkert W., Oberski, Daniel L.

Electronic health records (EHRs) contain structured and unstructured data of significant clinical and research value. Various machine learning approaches have been developed to employ information in EHRs for risk prediction. The majority of these att

Externí odkaz: http://arxiv.org/abs/2008.11979

Zobrazit plný text záznamu

Report

The effect of measurement error on clustering algorithms

Autor: Pankowska, Paulina, Oberski, Daniel L.

Clustering consists of a popular set of techniques used to separate data into interesting groups for further analysis. Many data sources on which clustering is performed are well-known to contain random and systematic measurement errors. Such errors

Externí odkaz: http://arxiv.org/abs/2005.11743

Zobrazit plný text záznamu

Report

Fair inference on error-prone outcomes

Autor: Boeschoten, Laura, van Kesteren, Erik-Jan, Bagheri, Ayoub, Oberski, Daniel L.

Fair inference in supervised learning is an important and active area of research, yielding a range of useful methods to assess and account for fairness criteria when predicting ground truth targets. As shown in recent work, however, when target labe

Externí odkaz: http://arxiv.org/abs/2003.07621

Zobrazit plný text záznamu

Report

Privacy-Preserving Generalized Linear Models using Distributed Block Coordinate Descent

Autor: van Kesteren, Erik-Jan, Sun, Chang, Oberski, Daniel L., Dumontier, Michel, Ippel, Lianne

Combining data from varied sources has considerable potential for knowledge discovery: collaborating data parties can mine data in an expanded feature space, allowing them to explore a larger range of scientific questions. However, data sharing among

Externí odkaz: http://arxiv.org/abs/1911.03183

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání