Search results - "Artemova Ekaterina"

Report

U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills in LLMs

Authors: Chernyshev, Konstantin; Polshkov, Vitaliy; Artemova, Ekaterina; Myasnikov, Alex; Stepanov, Vlad; Miasnikov, Alexei; Tilga, Sergei

The current evaluation of mathematical skills in LLMs is limited, as existing benchmarks are either relatively small, primarily focus on elementary and high-school problems, or lack diversity in topics. Additionally, the inclusion of visual elements

External link: http://arxiv.org/abs/2412.03205

Report

Hands-On Tutorial: Labeling with LLM and Human-in-the-Loop

Authors: Artemova, Ekaterina; Tsvigun, Akim; Schlechtweg, Dominik; Fedorova, Natalia; Tilga, Sergei; Obmoroshev, Boris

Training and deploying machine learning models relies on a large amount of human-annotated data. As human labeling becomes increasingly expensive and time-consuming, recent research has developed multiple strategies to speed up annotation and reduce

External link: http://arxiv.org/abs/2411.04637

Report

Beemo: Benchmark of Expert-edited Machine-generated Outputs

Authors: Artemova, Ekaterina; Lucas, Jason; Venkatraman, Saranya; Lee, Jooyoung; Tilga, Sergei; Uchendu, Adaku; Mikhailov, Vladislav

The rapid proliferation of large language models (LLMs) has increased the volume of machine-generated texts (MGTs) and blurred text authorship in various domains. However, most existing MGT benchmarks include single-author texts (human-written and ma

External link: http://arxiv.org/abs/2411.04032

Report

LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection

The ease of access to large language models (LLMs) has enabled the wide spread of machine-generated texts, and now it is often hard to tell whether a piece of text was human-written or machine-generated. This raises concerns about potential misuse, part

External link: http://arxiv.org/abs/2408.04284

Report

Papilusion at DAGPap24: Paper or Illusion? Detecting AI-generated Scientific Papers

Authors: Andreev, Nikita; Shirnin, Alexander; Mikhailov, Vladislav; Artemova, Ekaterina

This paper presents Papilusion, an AI-generated scientific text detector developed within the DAGPap24 shared task on detecting automatically generated scientific papers. We propose an ensemble-based approach and conduct ablation studies to analyze t

External link: http://arxiv.org/abs/2407.17629

Report

RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs

Authors: Taktasheva, Ekaterina; Bazhukov, Maxim; Koncha, Kirill; Fenogenova, Alena; Artemova, Ekaterina; Mikhailov, Vladislav

Minimal pairs are a well-established approach to evaluating the grammatical knowledge of language models. However, existing resources for minimal pairs address a limited number of languages and lack diversity of language-specific grammatical phenomen

External link: http://arxiv.org/abs/2406.19232

Report

AIpom at SemEval-2024 Task 8: Detecting AI-produced Outputs in M4

Authors: Shirnin, Alexander; Andreev, Nikita; Mikhailov, Vladislav; Artemova, Ekaterina

This paper describes AIpom, a system designed to detect a boundary between human-written and machine-generated text (SemEval-2024 Task 8, Subtask C: Human-Machine Mixed Text Detection). We propose a two-stage pipeline combining predictions from an in

External link: http://arxiv.org/abs/2403.19354

Report

RuBia: A Russian Language Bias Detection Dataset

Authors: Grigoreva, Veronika; Ivanova, Anastasiia; Alimova, Ilseyar; Artemova, Ekaterina

Warning: this work contains upsetting or disturbing content. Large language models (LLMs) tend to learn the social and cultural biases present in the raw pre-training data. To test if an LLM's behavior is fair, functional datasets are employed, and d

External link: http://arxiv.org/abs/2403.17553

Report

Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data

Authors: Peng, Siyao; Sun, Zihang; Shan, Huangyan; Kolm, Marie; Blaschke, Verena; Artemova, Ekaterina; Plank, Barbara

Named Entity Recognition (NER) is a fundamental task to extract key information from texts, but annotated resources are scarce for dialects. This paper introduces the first dialectal NER dataset for German, BarNER, with 161K tokens annotated on Bavar

External link: http://arxiv.org/abs/2403.12749

Report

Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties

Authors: Artemova, Ekaterina; Blaschke, Verena; Plank, Barbara

Mainstream cross-lingual task-oriented dialogue (ToD) systems leverage the transfer learning paradigm by training a joint model for intent recognition and slot-filling in English and applying it, zero-shot, to other languages. We address a gap in pri

External link: http://arxiv.org/abs/2402.02078
