Zobrazeno 1 - 10
of 298
pro vyhledávání: '"Ginter Filip"'
Autor:
Luukkonen, Risto, Komulainen, Ville, Luoma, Jouni, Eskelinen, Anni, Kanerva, Jenna, Kupari, Hanna-Mari, Ginter, Filip, Laippala, Veronika, Muennighoff, Niklas, Piktus, Aleksandra, Wang, Thomas, Tazi, Nouamane, Scao, Teven Le, Wolf, Thomas, Suominen, Osma, Sairanen, Samuli, Merioksa, Mikko, Heinonen, Jyrki, Vahtola, Aija, Antao, Samuel, Pyysalo, Sampo
Large language models (LLMs) excel in many tasks in NLP and beyond, but most open models have very limited coverage of smaller languages and LLM work tends to focus on languages where nearly unlimited data is available for pretraining. In this work,
Externí odkaz:
http://arxiv.org/abs/2311.05640
Relation Extraction (RE) remains a challenging task, especially when considering realistic out-of-domain evaluations. One of the main reasons for this is the limited training size of current RE datasets: obtaining high-quality (manually annotated) da
Externí odkaz:
http://arxiv.org/abs/2305.11016
Most research in Relation Extraction (RE) involves the English language, mainly due to the lack of multi-lingual resources. We propose Multi-CrossRE, the broadest multi-lingual dataset for RE, including 26 languages in addition to English, and coveri
Externí odkaz:
http://arxiv.org/abs/2305.10985
Autor:
Haris, Muhammad Junaid, Upreti, Aanchal, Kurtaran, Melih, Ginter, Filip, Lafond, Sebastien, Azimi, Sepinoud
The problem of gender bias is highly prevalent and well known. In this paper, we have analysed the portrayal of gender roles in English movies, a medium that effectively influences society in shaping people's beliefs and opinions. First, we gathered
Externí odkaz:
http://arxiv.org/abs/2211.12504
Autor:
Gehrmann, Sebastian, Bhattacharjee, Abhik, Mahendiran, Abinaya, Wang, Alex, Papangelis, Alexandros, Madaan, Aman, McMillan-Major, Angelina, Shvets, Anna, Upadhyay, Ashish, Yao, Bingsheng, Wilie, Bryan, Bhagavatula, Chandra, You, Chaobin, Thomson, Craig, Garbacea, Cristina, Wang, Dakuo, Deutsch, Daniel, Xiong, Deyi, Jin, Di, Gkatzia, Dimitra, Radev, Dragomir, Clark, Elizabeth, Durmus, Esin, Ladhak, Faisal, Ginter, Filip, Winata, Genta Indra, Strobelt, Hendrik, Hayashi, Hiroaki, Novikova, Jekaterina, Kanerva, Jenna, Chim, Jenny, Zhou, Jiawei, Clive, Jordan, Maynez, Joshua, Sedoc, João, Juraska, Juraj, Dhole, Kaustubh, Chandu, Khyathi Raghavi, Perez-Beltrachini, Laura, Ribeiro, Leonardo F. R., Tunstall, Lewis, Zhang, Li, Pushkarna, Mahima, Creutz, Mathias, White, Michael, Kale, Mihir Sanjay, Eddine, Moussa Kamal, Daheim, Nico, Subramani, Nishant, Dusek, Ondrej, Liang, Paul Pu, Ammanamanchi, Pawan Sasanka, Zhu, Qi, Puduppully, Ratish, Kriz, Reno, Shahriyar, Rifat, Cardenas, Ronald, Mahamood, Saad, Osei, Salomey, Cahyawijaya, Samuel, Štajner, Sanja, Montella, Sebastien, Shailza, Jolly, Shailza, Mille, Simon, Hasan, Tahmid, Shen, Tianhao, Adewumi, Tosin, Raunak, Vikas, Raheja, Vipul, Nikolaev, Vitaly, Tsai, Vivian, Jernite, Yacine, Xu, Ying, Sang, Yisi, Liu, Yixin, Hou, Yufang
Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better
Externí odkaz:
http://arxiv.org/abs/2206.11249
Autor:
Kanerva, Jenna, Ginter, Filip
The prevailing practice in the academia is to evaluate the model performance on in-domain evaluation data typically set aside from the training corpus. However, in many real world applications the data on which the model is applied may very substanti
Externí odkaz:
http://arxiv.org/abs/2204.10621
In this paper, we approach the problem of semantic search by framing the search task as paraphrase span detection, i.e. given a segment of text as a query phrase, the task is to identify its paraphrase in a given document, the same modelling setup as
Externí odkaz:
http://arxiv.org/abs/2112.04886
Publikováno v:
BMC Bioinformatics, Vol 11, Iss Suppl 5, p O2 (2010)
Externí odkaz:
https://doaj.org/article/e43ffe2d54b34571a9f6dd922b895f67
Publikováno v:
BMC Bioinformatics, Vol 9, Iss Suppl 3, p S6 (2008)
Abstract Background Growing interest in the application of natural language processing methods to biomedical text has led to an increasing number of corpora and methods targeting protein-protein interaction (PPI) extraction. However, there is no gene
Externí odkaz:
https://doaj.org/article/442c6f684c544eed8b08bf9dbe105fd8
Autor:
Rönnqvist, Samuel, Myntti, Amanda, Kyröläinen, Aki-Juhani, Pyysalo, Sampo, Laippala, Veronika, Ginter, Filip
In recent years, several methods have been proposed for explaining individual predictions of deep learning models, yet there has been little study of how to aggregate these predictions to explain how such models view classes as a whole in text classi
Externí odkaz:
http://arxiv.org/abs/2108.13653