Zobrazeno 1 - 10
of 19
pro vyhledávání: '"Doostmohammadi, Ehsan"'
Work on instruction-tuned Large Language Models (LLMs) has used automatic methods based on text overlap and LLM judgments as cost-effective alternatives to human evaluation. In this paper, we perform a meta-evaluation of such methods and assess their
Externí odkaz:
http://arxiv.org/abs/2402.10770
Augmenting language models with a retrieval mechanism has been shown to significantly improve their performance while keeping the number of parameters low. Retrieval-augmented models commonly rely on a semantic retrieval mechanism based on the simila
Externí odkaz:
http://arxiv.org/abs/2305.16243
Recent work on the Retrieval-Enhanced Transformer (RETRO) model has shown that off-loading memory from trainable weights to a retrieval database can significantly improve language modeling and match the performance of non-retrieval models that are an
Externí odkaz:
http://arxiv.org/abs/2302.12128
Autor:
Taghizadeh, Nasrin, Doostmohammadi, Ehsan, Seifossadat, Elham, Rabiee, Hamid R., Tahaei, Maedeh S.
We have released Sina-BERT, a language model pre-trained on BERT (Devlin et al., 2018) to address the lack of a high-quality Persian language model in the medical domain. SINA-BERT utilizes pre-training on a large-scale corpus of medical contents inc
Externí odkaz:
http://arxiv.org/abs/2104.07613
Words are properly segmented in the Persian writing system; in practice, however, these writing rules are often neglected, resulting in single words being written disjointedly and multiple words written without any white spaces between them. This pap
Externí odkaz:
http://arxiv.org/abs/2010.00287
Keyphrases provide an extremely dense summary of a text. Such information can be used in many Natural Language Processing tasks, such as information retrieval and text summarization. Since previous studies on Persian keyword or keyphrase extraction h
Externí odkaz:
http://arxiv.org/abs/2009.12269
Keyphrases are a very short summary of an input text and provide the main subjects discussed in the text. Keyphrase extraction is a useful upstream task and can be used in various natural language processing problems, for example, text summarization
Externí odkaz:
http://arxiv.org/abs/2009.12271
Identification of the languages written using cuneiform symbols is a difficult task due to the lack of resources and the problem of tokenization. The Cuneiform Language Identification task in VarDial 2019 addresses the problem of identifying seven la
Externí odkaz:
http://arxiv.org/abs/2009.10794
This paper presents the models submitted by Ghmerti team for subtasks A and B of the OffensEval shared task at SemEval 2019. OffensEval addresses the problem of identifying and categorizing offensive language in social media in three subtasks; whethe
Externí odkaz:
http://arxiv.org/abs/2009.10792
Ezafe is a grammatical particle in some Iranian languages that links two words together. Regardless of the important information it conveys, it is almost always not indicated in Persian script, resulting in mistakes in reading complex sentences and e
Externí odkaz:
http://arxiv.org/abs/2009.09474