Zobrazeno 1 - 10
of 288
pro vyhledávání: '"Vasilyev, Oleg A."'
A dense passage retrieval system can serve as the initial stages of information retrieval, selecting the most relevant text passages for downstream tasks. In this work we conducted experiments with the goal of finding how much the quality of a multil
Externí odkaz:
http://arxiv.org/abs/2407.00923
Autor:
Vasilyev, Oleg, Bohannon, John
We found that a simple property of clusters in a clustered dataset of news correlate strongly with importance and urgency of news (IUN) as assessed by LLM. We verified our finding across different news datasets, dataset sizes, clustering algorithms a
Externí odkaz:
http://arxiv.org/abs/2402.10302
Autor:
Yurchenko, Sergei N., Brady, Ryan P., Tennyson, Jonathan, Smirnov, Alexander N., Vasilyev, Oleg A., Solomonik, Victor G.
Empirical line lists for the open shell molecule $^{89}$Y$^{16}$O (yttrium oxide) and its isotopologues are presented. The line lists cover the 6 lowest electronic states: $X {}^{2}\Sigma^{+}$, $A {}^{2}\Pi$, $A' {}^{2}\Delta$, $B {}^{2}\Sigma^{+}$,
Externí odkaz:
http://arxiv.org/abs/2308.04173
Semantics of a sentence is defined with much less ambiguity than semantics of a single word, and we assume that it should be better preserved by translation to another language. If multilingual sentence embeddings intend to represent sentence semanti
Externí odkaz:
http://arxiv.org/abs/2305.14256
Using Monte Carlo simulations we study the two-dimensional Ising model on triangular, square, and hexagonal lattices with various topologies. We focus on the behavior of the magnetic susceptibility and of the specific heat near the critical point of
Externí odkaz:
http://arxiv.org/abs/2212.14358
Autor:
Vasilyev, Oleg, Bohannon, John
We propose a new kind of embedding for natural language text that deeply represents semantic meaning. Standard text embeddings use the outputs from hidden layers of a pretrained language model. In our method, we let a language model learn from the te
Externí odkaz:
http://arxiv.org/abs/2208.08386
We propose a simple and practical method for named entity linking (NEL), based on entity representation by multiple embeddings. To explore this method, and to review its dependency on parameters, we measure its performance on Namesakes, a highly chal
Externí odkaz:
http://arxiv.org/abs/2205.10498
Autor:
Vasilyev, Oleg, Bohannon, John
Factual consistency is one of important summary evaluation dimensions, especially as summary generation becomes more fluent and coherent. The ESTIME measure, recently proposed specifically for factual consistency, achieves high correlations with huma
Externí odkaz:
http://arxiv.org/abs/2112.11638
We present Namesakes, a dataset of ambiguously named entities obtained from English-language Wikipedia and news articles. It consists of 58862 mentions of 4148 unique entities and their namesakes: 1000 mentions from news, 28843 from Wikipedia article
Externí odkaz:
http://arxiv.org/abs/2111.11372
The creation of a quality summarization dataset is an expensive, time-consuming effort, requiring the production and evaluation of summaries by both trained humans and machines. If such effort is made in one language, it would be beneficial to be abl
Externí odkaz:
http://arxiv.org/abs/2109.08129