Výsledky vyhledávání

Report

CoDi: Conversational Distillation for Grounded Question Answering

Autor: Huber, Patrick, Einolghozati, Arash, Conway, Rylan, Narang, Kanika, Smith, Matt, Nayyar, Waqar, Sagar, Adithya, Aly, Ahmed, Shrivastava, Akshat

Distilling conversational skills into Small Language Models (SLMs) with approximately 1 billion parameters presents significant challenges. Firstly, SLMs have limited capacity in their model parameters to learn extensive knowledge compared to larger

Externí odkaz: http://arxiv.org/abs/2408.11219

Zobrazit plný text záznamu

Report

Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning

Autor: Agrawal, Rishabh, Dahlin, Nathan, Jain, Rahul, Nayyar, Ashutosh

Imitation learning (IL) is notably effective for robotic tasks where directly programming behaviors or defining optimal control costs is challenging. In this work, we address a scenario where the imitator relies solely on observed behavior and cannot

Externí odkaz: http://arxiv.org/abs/2408.09125

Zobrazit plný text záznamu

Report

FACTS About Building Retrieval Augmented Generation-based Chatbots

Enterprise chatbots, powered by generative AI, are emerging as key applications to enhance employee productivity. Retrieval Augmented Generation (RAG), Large Language Models (LLMs), and orchestration frameworks like Langchain and Llamaindex are cruci

Externí odkaz: http://arxiv.org/abs/2407.07858

Zobrazit plný text záznamu

Report

Fruit Classification System with Deep Learning and Neural Architecture Search

Autor: Dewi, Christine, Thiruvady, Dhananjay, Zaidi, Nayyar

The fruit identification process involves analyzing and categorizing different types of fruits based on their visual characteristics. This activity can be achieved using a range of methodologies, encompassing manual examination, conventional computer

Externí odkaz: http://arxiv.org/abs/2406.01869

Zobrazit plný text záznamu

Report

Scaling Data Plane Verification with Intent-based Slicing

Autor: Chou, Kuan-Yen, Prabhu, Santhosh, Subramanian, Giri, Zhou, Wenxuan, Nayyar, Aanand, Godfrey, Brighten, Caesar, Matthew

Data plane verification has grown into a powerful tool to ensure network correctness. However, existing monolithic data plane models have high memory requirements with large networks, and the existing method of scaling out is too limited in expressiv

Externí odkaz: http://arxiv.org/abs/2405.20982

Zobrazit plný text záznamu

Report

Pure Exploration for Constrained Best Mixed Arm Identification with a Fixed Budget

Autor: Tang, Dengwang, Jain, Rahul, Nayyar, Ashutosh, Nuzzo, Pierluigi

In this paper, we introduce the constrained best mixed arm identification (CBMAI) problem with a fixed budget. This is a pure exploration problem in a stochastic finite armed bandit model. Each arm is associated with a reward and multiple types of co

Externí odkaz: http://arxiv.org/abs/2405.15090

Zobrazit plný text záznamu

Report

Using Explainable AI and Hierarchical Planning for Outreach with Robots

Autor: Dobhal, Daksh, Nagpal, Jayesh, Karia, Rushang, Verma, Pulkit, Nayyar, Rashmeet Kaur, Shah, Naman, Srivastava, Siddharth

Understanding how robots plan and execute tasks is crucial in today's world, where they are becoming more prevalent in our daily lives. However, teaching non-experts the complexities of robot planning can be challenging. This work presents an open-so

Externí odkaz: http://arxiv.org/abs/2404.00808

Zobrazit plný text záznamu

Report

Model approximation in MDPs with unbounded per-step cost

Autor: Bozkurt, Berk, Mahajan, Aditya, Nayyar, Ashutosh, Ouyang, Yi

We consider the problem of designing a control policy for an infinite-horizon discounted cost Markov decision process $\mathcal{M}$ when we only have access to an approximate model $\hat{\mathcal{M}}$. How well does an optimal policy $\hat{\pi}^{\sta

Externí odkaz: http://arxiv.org/abs/2402.08813

Zobrazit plný text záznamu

Report

HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent

Autor: Xu, Weijie, Huang, Zicheng, Hu, Wenxiang, Fang, Xi, Cherukuri, Rajesh Kumar, Nayyar, Naumaan, Malandri, Lorenzo, Sengamedu, Srinivasan H.

Publikováno v: EACL 2024

Recent advancements in Large Language Models (LLMs) have been reshaping Natural Language Processing (NLP) task in several domains. Their use in the field of Human Resources (HR) has still room for expansions and could be beneficial for several time c

Externí odkaz: http://arxiv.org/abs/2402.01018

Zobrazit plný text záznamu

Report

Posterior Sampling-based Online Learning for Episodic POMDPs

Autor: Tang, Dengwang, Ye, Dongze, Jain, Rahul, Nayyar, Ashutosh, Nuzzo, Pierluigi

Learning in POMDPs is known to be significantly harder than MDPs. In this paper, we consider the online learning problem for episodic POMDPs with unknown transition and observation models. We propose a Posterior Sampling-based reinforcement learning

Externí odkaz: http://arxiv.org/abs/2310.10107

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání