Zobrazeno 1 - 10
of 1 555
pro vyhledávání: '"68t50"'
Autor:
Ng, Hunter
This study investigates the emerging phenomenon of "ghost hiring" or "ghost jobs", where employers advertise job openings without intending to fill them. Using a novel dataset from Glassdoor and employing a LLM-BERT technique, I find that up to 21% o
Externí odkaz:
http://arxiv.org/abs/2410.21771
Keyphrase selection is a challenging task in natural language processing that has a wide range of applications. Adapting existing supervised and unsupervised solutions for the Russian language faces several limitations due to the rich morphology of R
Externí odkaz:
http://arxiv.org/abs/2410.18040
Autor:
Bensch, Oliver, Bensch, Leonie, Nilsson, Tommy, Saling, Florian, Sadri, Wafa M., Hartmann, Carsten, Hecking, Tobias, Kutz, J. Nathan
As humanity prepares for new missions to the Moon and Mars, astronauts will need to operate with greater autonomy, given the communication delays that make real-time support from Earth difficult. For instance, messages between Mars and Earth can take
Externí odkaz:
http://arxiv.org/abs/2410.16397
Crowdsourced annotations of data play a substantial role in the development of Artificial Intelligence (AI). It is broadly recognised that annotations of text data can contain annotator bias, where systematic disagreement in annotations can be traced
Externí odkaz:
http://arxiv.org/abs/2410.15726
Autor:
Kmainasi, Mohamed Bayan, Shahroor, Ali Ezzat, Hasanain, Maram, Laskar, Sahinur Rahman, Hassan, Naeemul, Alam, Firoj
Large Language Models (LLMs) have demonstrated remarkable success as general-purpose task solvers across various fields, including NLP, healthcare, finance, and law. However, their capabilities remain limited when addressing domain-specific problems,
Externí odkaz:
http://arxiv.org/abs/2410.15308
Objective: Recognizing diseases from discharge letters is crucial for cohort selection and epidemiological analyses, as this is the only type of data consistently produced across hospitals. This is a classic document classification problem, typically
Externí odkaz:
http://arxiv.org/abs/2410.15051
Autor:
Wagner, Jan-Samuel, DeCaprio, Dave, Raja, Abishek Chiffon Muthu, Holman, Jonathan M., Brady, Lauren K., Cheung, Sky C., Barzekar, Hosein, Yang, Eric, Martinez II, Mark Anthony, Soong, David, Sridhar, Sriram, Si, Han, Higgs, Brandon W., Hamadeh, Hisham, Ogden, Scott
We introduce Controller-Embedded Language Model Interactions (CELI), a framework that integrates control logic directly within language model (LM) prompts, facilitating complex, multi-stage task execution. CELI addresses limitations of existing promp
Externí odkaz:
http://arxiv.org/abs/2410.14627
Predicting case criticality helps legal professionals in the court system manage large volumes of case law. This paper introduces the Criticality Prediction dataset, a new resource for evaluating the potential influence of Swiss Federal Supreme Court
Externí odkaz:
http://arxiv.org/abs/2410.13460
Autor:
Rolshoven, Luca, Rasiah, Vishvaksenan, Bose, Srinanda Brügger, Stürmer, Matthias, Niklaus, Joel
Legal research is a time-consuming task that most lawyers face on a daily basis. A large part of legal research entails looking up relevant caselaw and bringing it in relation to the case at hand. Lawyers heavily rely on summaries (also called headno
Externí odkaz:
http://arxiv.org/abs/2410.13456
We present a novel approach to automatically generate non-trivial task-specific synthetic datasets for hallucination detection. Our approach features a two-step generation-selection pipeline, using hallucination pattern guidance and a language style
Externí odkaz:
http://arxiv.org/abs/2410.12278