Search results - "Li, Xiaocheng"

Report

Towards Better Understanding of In-Context Learning Ability from In-Context Uncertainty Quantification

Authors: Liu, Shang; Cai, Zhongze; Chen, Guanting; Li, Xiaocheng

Predicting simple function classes has been widely used as a testbed for developing theory and understanding of the trained Transformer's in-context learning (ICL) ability. In this paper, we revisit the training of Transformers on linear regression t

External link: http://arxiv.org/abs/2405.15115

Report

Understanding the Training and Generalization of Pretrained Transformer for Sequential Decision Making

Authors: Wang, Hanzhao; Pan, Yu; Sun, Fupeng; Liu, Shang; Talluri, Kalyan; Chen, Guanting; Li, Xiaocheng

In this paper, we consider the supervised pretrained transformer for a class of sequential decision-making problems. The class of considered problems is a subset of the general formulation of reinforcement learning in that there is no transition prob

External link: http://arxiv.org/abs/2405.14219

Report

Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach

Authors: Liu, Linyu; Pan, Yu; Li, Xiaocheng; Chen, Guanting

In this paper, we study the problem of uncertainty estimation and calibration for LLMs. We first formulate the uncertainty estimation problem for LLMs and then propose a supervised approach that takes advantage of the labeled datasets and estimates t

External link: http://arxiv.org/abs/2404.15993

Report

Towards Better Statistical Understanding of Watermarking LLMs

Authors: Cai, Zhongze; Liu, Shang; Wang, Hanzhao; Zhong, Huaiyang; Li, Xiaocheng

In this paper, we study the problem of watermarking large language models (LLMs). We consider the trade-off between model distortion and detection ability and formulate it as a constrained optimization problem based on the green-red algorithm of Kirc

External link: http://arxiv.org/abs/2403.13027

Report

Transformer Choice Net: A Transformer Neural Network for Choice Prediction

Authors: Wang, Hanzhao; Li, Xiaocheng; Talluri, Kalyan

Discrete-choice models, such as Multinomial Logit, Probit, or Mixed-Logit, are widely used in Marketing, Economics, and Operations Research: given a set of alternatives, the customer is modeled as choosing one of the alternatives to maximize a (laten

External link: http://arxiv.org/abs/2310.08716

Report

Facilitating Battery Swapping Services for Freight Trucks with Spatial-Temporal Demand Prediction

Authors: Liu, Linyu; Dai, Zhen; Song, Shiji; Li, Xiaocheng; Chen, Guanting

Electrifying heavy-duty trucks offers a substantial opportunity to curtail carbon emissions, advancing toward a carbon-neutral future. However, the inherent challenges of limited battery energy and the sheer weight of heavy-duty trucks lead to reduce

External link: http://arxiv.org/abs/2310.04440

Report

Learning to Make Adherence-Aware Advice

Authors: Chen, Guanting; Li, Xiaocheng; Sun, Chunlin; Wang, Hanzhao

As artificial intelligence (AI) systems play an increasingly prominent role in human decision-making, challenges surface in the realm of human-AI interactions. One challenge arises from the suboptimal AI policies due to the inadequate consideration o

External link: http://arxiv.org/abs/2310.00817

Report

A Neural Network Based Choice Model for Assortment Optimization

Authors: Wang, Hanzhao; Cai, Zhongze; Li, Xiaocheng; Talluri, Kalyan

Discrete-choice models are used in economics, marketing and revenue management to predict customer purchase probabilities, say as a function of prices and other features of the offered assortment. While they have been shown to be expressive, capturin

External link: http://arxiv.org/abs/2308.05617

Report

When No-Rejection Learning is Consistent for Regression with Rejection

Authors: Li, Xiaocheng; Liu, Shang; Sun, Chunlin; Wang, Hanzhao

Learning with rejection has been a prototypical model for studying the human-AI interaction on prediction tasks. Upon the arrival of a sample instance, the model first uses a rejector to decide whether to accept and use the AI predictor to make a pre

External link: http://arxiv.org/abs/2307.02932

Report

Understanding Uncertainty Sampling

Authors: Liu, Shang; Li, Xiaocheng

Uncertainty sampling is a prevalent active learning algorithm that queries sequentially the annotations of data samples which the current prediction model is uncertain about. However, the usage of uncertainty sampling has been largely heuristic: (i)

External link: http://arxiv.org/abs/2307.02719
