Zobrazeno 1 - 10
of 60
pro vyhledávání: '"Çano, Erion"'
Autor:
Çano, Erion, Lamaj, Dario
The scarcity of available text corpora for low-resource languages like Albanian is a serious hurdle for research in natural language processing tasks. This paper introduces AlbNews, a collection of 600 topically labeled news headlines and 2600 unlabe
Externí odkaz:
http://arxiv.org/abs/2402.04028
Autor:
Çano, Erion
Scarcity of resources such as annotated text corpora for under-resourced languages like Albanian is a serious impediment in computational linguistics and natural language processing research. This paper presents AlbNER, a corpus of 900 sentences with
Externí odkaz:
http://arxiv.org/abs/2309.08741
Autor:
Çano, Erion, Vogli, Xhesilda
Corporate Social Responsibility (CSR) has become an important topic that is gaining academic interest. This research paper presents CSREU, a new dataset with attributes of 115 European companies, which includes several performance indicators and the
Externí odkaz:
http://arxiv.org/abs/2306.09798
Autor:
Çano, Erion
Lack of available resources such as text corpora for low-resource languages seriously hinders research on natural language processing and computational linguistics. This paper presents AlbMoRe, a corpus of 800 sentiment annotated movie reviews in Alb
Externí odkaz:
http://arxiv.org/abs/2306.08526
Autor:
Kougia, Vasiliki, Fetzel, Simon, Kirchmair, Thomas, Çano, Erion, Baharlou, Sina Moayed, Sharifzadeh, Sahand, Roth, Benjamin
Memes are a popular form of communicating trends and ideas in social media and on the internet in general, combining the modalities of images and text. They can express humor and sarcasm but can also have offensive content. Analyzing and classifying
Externí odkaz:
http://arxiv.org/abs/2305.18391
Autor:
Vogli, Xhesilda, Çano, Erion
As stakeholders' pressure on corporates for disclosing their corporate social responsibility operations grows, it is crucial to understand how efficient corporate disclosure systems are in bridging the gap between corporate social responsibility repo
Externí odkaz:
http://arxiv.org/abs/2301.03404
Autor:
Ziaee, Amir, Çano, Erion
This study introduces a new normalization layer termed Batch Layer Normalization (BLN) to reduce the problem of internal covariate shift in deep neural network layers. As a combined version of batch and layer normalization, BLN adaptively puts approp
Externí odkaz:
http://arxiv.org/abs/2209.08898
Compared to standard Named Entity Recognition (NER), identifying persons, locations, and organizations in historical texts constitutes a big challenge. To obtain machine-readable corpora, the historical text is usually scanned and Optical Character R
Externí odkaz:
http://arxiv.org/abs/2205.15575
Autor:
Çano, Erion, Roth, Benjamin
Collections of research article data harvested from the web have become common recently since they are important resources for experimenting on tasks such as named entity recognition, text summarization, or keyword generation. In fact, certain types
Externí odkaz:
http://arxiv.org/abs/2205.11249
Autor:
Roth, Benjamin, Çano, Erion
We propose a scheme for self-training of grammaticality models for constituency analysis based on linguistic tests. A pre-trained language model is fine-tuned by contrastive estimation of grammatical sentences from a corpus, and ungrammatical sentenc
Externí odkaz:
http://arxiv.org/abs/2109.15159