Výsledky vyhledávání

Report

Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models

Autor: Wang, Fei, Wan, Xingchen, Sun, Ruoxi, Chen, Jiefeng, Arık, Sercan Ö.

Retrieval-Augmented Generation (RAG), while effective in integrating external knowledge to address the limitations of large language models (LLMs), can be undermined by imperfect retrieval, which may introduce irrelevant, misleading, or even maliciou

Externí odkaz: http://arxiv.org/abs/2410.07176

Zobrazit plný text záznamu

Report

Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG

Autor: Jin, Bowen, Yoon, Jinsung, Han, Jiawei, Arik, Sercan O.

Retrieval-augmented generation (RAG) empowers large language models (LLMs) to utilize external knowledge sources. The increasing capacity of LLMs to process longer input sequences opens up avenues for providing more retrieved information, to potentia

Externí odkaz: http://arxiv.org/abs/2410.05983

Zobrazit plný text záznamu

Report

CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL

Autor: Pourreza, Mohammadreza, Li, Hailong, Sun, Ruoxi, Chung, Yeounoh, Talaei, Shayan, Kakkar, Gaurav Tarlok, Gan, Yu, Saberi, Amin, Ozcan, Fatma, Arik, Sercan O.

In tackling the challenges of large language model (LLM) performance for Text-to-SQL tasks, we introduce CHASE-SQL, a new framework that employs innovative strategies, using test-time compute in multi-agent modeling to improve candidate generation an

Externí odkaz: http://arxiv.org/abs/2410.01943

Zobrazit plný text záznamu

Report

SQL-GEN: Bridging the Dialect Gap for Text-to-SQL Via Synthetic Data And Model Merging

Autor: Pourreza, Mohammadreza, Sun, Ruoxi, Li, Hailong, Miculicich, Lesly, Pfister, Tomas, Arik, Sercan O.

Recent advances in Text-to-SQL have largely focused on the SQLite dialect, neglecting the diverse landscape of SQL dialects like BigQuery and PostgreSQL. This limitation is due to the diversity in SQL syntaxes and functions, along with the high cost

Externí odkaz: http://arxiv.org/abs/2408.12733

Zobrazit plný text záznamu

Report

CROME: Cross-Modal Adapters for Efficient Multimodal LLM

Autor: Ebrahimi, Sayna, Arik, Sercan O., Nama, Tejas, Pfister, Tomas

Multimodal Large Language Models (MLLMs) demonstrate remarkable image-language capabilities, but their widespread use faces challenges in cost-effective training and adaptation. Existing approaches often necessitate expensive language model retrainin

Externí odkaz: http://arxiv.org/abs/2408.06610

Zobrazit plný text záznamu

Report

Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions

Autor: Yoon, Jinsung, Sinha, Raj, Arik, Sercan O, Pfister, Tomas

Embeddings from Large Language Models (LLMs) have emerged as critical components in various applications, particularly for information retrieval. While high-dimensional embeddings generally demonstrate superior performance as they contain more salien

Externí odkaz: http://arxiv.org/abs/2407.20243

Zobrazit plný text záznamu

Report

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Autor: Su, Hongjin, Yen, Howard, Xia, Mengzhou, Shi, Weijia, Muennighoff, Niklas, Wang, Han-yu, Liu, Haisu, Shi, Quan, Siegel, Zachary S., Tang, Michael, Sun, Ruoxi, Yoon, Jinsung, Arik, Sercan O., Chen, Danqi, Yu, Tao

Existing retrieval benchmarks primarily consist of information-seeking queries (e.g., aggregated questions from search engines) where keyword or semantic-based retrieval is usually sufficient. However, many complex real-world queries require in-depth

Externí odkaz: http://arxiv.org/abs/2407.12883

Zobrazit plný text záznamu

Report

Late Breaking Results: Fortifying Neural Networks: Safeguarding Against Adversarial Attacks with Stochastic Computing

Autor: Banitaba, Faeze S., Aygun, Sercan, Najafi, M. Hassan

In neural network (NN) security, safeguarding model integrity and resilience against adversarial attacks has become paramount. This study investigates the application of stochastic computing (SC) as a novel mechanism to fortify NN models. The primary

Externí odkaz: http://arxiv.org/abs/2407.04861

Zobrazit plný text záznamu

Report

Teach Better or Show Smarter? On Instructions and Exemplars in Automatic Prompt Optimization

Autor: Wan, Xingchen, Sun, Ruoxi, Nakhost, Hootan, Arik, Sercan O.

Large language models have demonstrated remarkable capabilities, but their performance is heavily reliant on effective prompt engineering. Automatic prompt optimization (APO) methods are designed to automate this and can be broadly categorized into t

Externí odkaz: http://arxiv.org/abs/2406.15708

Zobrazit plný text záznamu

Report

Learned Feature Importance Scores for Automated Feature Engineering

Autor: Dong, Yihe, Arik, Sercan, Yoder, Nathanael, Pfister, Tomas

Feature engineering has demonstrated substantial utility for many machine learning workflows, such as in the small data regime or when distribution shifts are severe. Thus automating this capability can relieve much manual effort and improve model pe

Externí odkaz: http://arxiv.org/abs/2406.04153

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání