Zobrazeno 1 - 10
of 30 712
pro vyhledávání: '"evaluation practices"'
This paper critically examines current methodologies for evaluating models in Conditional and Average Treatment Effect (CATE/ATE) estimation, identifying several key pitfalls in existing practices. The current approach of over-reliance on specific me
Externí odkaz:
http://arxiv.org/abs/2409.05161
Autor:
Schmidtová, Patrícia, Mahamood, Saad, Balloccu, Simone, Dušek, Ondřej, Gatt, Albert, Gkatzia, Dimitra, Howcroft, David M., Plátek, Ondřej, Sivaprasad, Adarsa
Automatic metrics are extensively used to evaluate natural language processing systems. However, there has been increasing focus on how they are used and reported by practitioners within the field. In this paper, we have conducted a survey on the use
Externí odkaz:
http://arxiv.org/abs/2408.09169
Autor:
Jacinta, Mutie Mwikali1 mutiemj@outlook.com, Wambugu, Lydia2 lydiah.nyaguthii@uonbi.ac.ke, Nyonje, Raphael3 raphael.nyonje@uonbi.ac.ke, Kikwatha, Reuben4 kikwathar@uonbi.ac.ke
Publikováno v:
International Journal of Professional Business Review (JPBReview). 2024, Vol. 9 Issue 9, p1-18. 18p.
While multilingual language models (MLMs) have been trained on 100+ languages, they are typically only evaluated across a handful of them due to a lack of available test data in most languages. This is particularly problematic when assessing MLM's po
Externí odkaz:
http://arxiv.org/abs/2406.14267