Zobrazeno 1 - 10
of 22
pro vyhledávání: '"Jiaul H. Paik"'
Autor:
Sudeshna Das, Jiaul H. Paik
Publikováno v:
Journal of the Association for Information Science and Technology. 74:461-475
Publikováno v:
ACM Transactions on Information Systems. 40:1-24
Existing probabilistic retrieval models do not restrict the domain of the random variables that they deal with. In this article, we show that the upper bound of the normalized term frequency ( tf ) from the relevant documents is much smaller than the
Publikováno v:
Journal of Informetrics. 17:101392
Autor:
Sudeshna Das, Jiaul H. Paik
Publikováno v:
Information Processing & Management. 58:102423
The gender information of named entities is an important prerequisite for many text analysis tasks such as gender bias detection and targeted advertising. Despite its valuable use cases, gender tagging of named entities has traditionally been databas
Autor:
Jiaul H. Paik
Publikováno v:
ACM Transactions on Intelligent Systems and Technology. 7:1-21
This article proposes a term weighting scheme for measuring query-document similarity that attempts to explicitly model the dependency between separate occurrences of a term in a document. The assumption is that, if a term appears once in a document,
Publikováno v:
ACM Transactions on Information Systems. 33:1-34
The multinomial language model has been one of the most effective models of retrieval for over a decade. However, the multinomial distribution does not model one important linguistic phenomenon relating to term-dependency, that is the tendency of a t
Publikováno v:
ACM Transactions on Asian Language Information Processing. 13:1-22
Automatic query expansion (AQE) is a useful technique for enhancing the effectiveness of information retrieval systems. In this article, we propose a novel AQE algorithm which first adopts a systematic incremental approach to choose feedback document
Publikováno v:
ACM Transactions on Information Systems. 31:1-29
Stemming is a widely used technique in information retrieval systems to address the vocabulary mismatch problem arising out of morphological phenomena. The major shortcoming of the commonly used stemmers is that they accept the morphological variants
Autor:
Jimmy Lin, Jiaul H. Paik
Publikováno v:
ICTIR
"Evaluation as a service" (EaaS) refers to a family of related evaluation methodologies that enables community-wide evaluations and the construction of test collections on documents that cannot be easily distributed. In the API-based approach, the ba
Publikováno v:
ACM Transactions on Information Systems. 29:1-24
A novel graph-based language-independent stemming algorithm suitable for information retrieval is proposed in this article. The main features of the algorithm are retrieval effectiveness, generality, and computational efficiency. We test our approach