Showing 1 - 10 of 302 for search '"WU Jingfeng"'
Published in:
陆军军医大学学报 (Journal of Army Medical University), Vol. 45, Iss. 21, pp. 2195-2205 (2023)
Objective: To investigate the regulatory effect of nobiletin (NOB) at typical effective doses on intestinal stem cells in vivo and in vitro. Methods: After a 3D culture model of the mouse colorectal tumor cell line MC38 was constructed, the death and survival …
External link:
https://doaj.org/article/27559199d3dc444087d8a262f0ab5d78
Published in:
陆军军医大学学报 (Journal of Army Medical University), Vol. 45, Iss. 19, pp. 1995-2006 (2023)
Objective: To investigate the effects of interleukin-22 (IL-22) in acute colitis induced by dextran sulfate sodium (DSS) in mice and its underlying mechanism. Methods: Three IL-22 knockout (IL-22-/-) mice and 3 control (IL-22+/+) mice, 8-week-old male …
External link:
https://doaj.org/article/56f043d10b83446e9682e31077f6967b
In the context of Machine Learning as a Service (MLaaS) clouds, the extensive use of Large Language Models (LLMs) often requires efficient management of heavy query loads. When providing real-time inference services, several challenges arise. …
External link:
http://arxiv.org/abs/2409.14961
Cloud-native applications are increasingly popular in modern software design. Employing a microservice-based architecture in these applications is a prevalent strategy that enhances system availability and flexibility. However, cloud-native …
External link:
http://arxiv.org/abs/2409.05093
The typical training of neural networks using large-stepsize gradient descent (GD) under the logistic loss often involves two distinct phases: the empirical risk oscillates in the first phase but decreases monotonically in the second. We …
External link:
http://arxiv.org/abs/2406.08654
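The two-phase behavior described above is easy to reproduce in a few lines. A minimal simulation sketch, assuming an illustrative dataset, stepsize, and iteration budget (none of these are taken from the paper):

```python
import numpy as np
from scipy.special import expit  # numerically stable sigmoid

# Illustrative setup: logistic regression on linearly separable 2-D data,
# trained by full-batch gradient descent with a deliberately large stepsize.
rng = np.random.default_rng(0)
n = 50
X = rng.normal(size=(n, 2))
y = np.sign(X[:, 0])               # labels separable by the first coordinate

def risk_and_grad(w):
    margins = y * (X @ w)
    risk = np.mean(np.logaddexp(0.0, -margins))             # logistic loss
    grad = -(X * (y * expit(-margins))[:, None]).mean(axis=0)
    return risk, grad

w = np.zeros(2)
eta = 40.0                         # far above the classical stability threshold
for t in range(201):
    risk, grad = risk_and_grad(w)
    if t % 20 == 0:
        print(f"iter {t:3d}  empirical risk {risk:.4f}")
    w -= eta * grad
```

Typically the printed risk bounces around in the early iterations (the first phase) and then decreases monotonically once the margins have grown (the second phase).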
Empirically, large-scale deep learning models often satisfy a neural scaling law: the test error of the trained model improves polynomially as the model size and data size grow. However, conventional wisdom suggests the test error consists of approximation …
External link:
http://arxiv.org/abs/2406.08466
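For orientation, a commonly reported empirical form of such a scaling law, written in LaTeX; the constants and exponents are generic placeholders rather than anything from the paper:

```latex
% Test error as a joint power law in model size N and data size D;
% a, b, c, alpha, beta are fitted, problem-dependent constants.
\mathrm{Err}(N, D) \;\approx\; a\,N^{-\alpha} + b\,D^{-\beta} + c,
\qquad \alpha, \beta > 0.
```

The tension the abstract gestures at is that the classical decomposition of test error into approximation, optimization, and generalization terms does not obviously produce a clean joint power law of this form.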
We consider gradient descent (GD) with a constant stepsize applied to logistic regression with linearly separable data, where the constant stepsize $\eta$ is so large that the loss initially oscillates. We show that GD exits this initial oscillatory phase …
External link:
http://arxiv.org/abs/2402.15926
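Written out, the standard objective and update behind this setup (notation assumed here, not copied from the paper):

```latex
% Logistic loss on linearly separable data (x_i, y_i), y_i in {-1, +1},
% minimized by constant-stepsize gradient descent.
L(w) \;=\; \frac{1}{n} \sum_{i=1}^{n} \log\!\left(1 + e^{-y_i \langle x_i,\, w \rangle}\right),
\qquad
w_{t+1} \;=\; w_t - \eta\, \nabla L(w_t).
```

Heuristically, oscillation appears once $\eta$ exceeds roughly $2/\beta$, where $\beta$ is the smoothness constant of $L$; because the logistic loss flattens as the margins $y_i \langle x_i, w \rangle$ grow on separable data, the local smoothness shrinks along the trajectory, which is consistent with GD eventually leaving the oscillatory phase.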
We study the in-context learning (ICL) ability of a Linear Transformer Block (LTB) that combines a linear attention component and a linear multi-layer perceptron (MLP) component. For ICL of linear regression with a Gaussian prior and a …
External link:
http://arxiv.org/abs/2402.14951
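A toy sketch of how a linear-attention-style readout can act as an in-context estimator for linear regression. All names, shapes, and the hand-set preconditioner are illustrative assumptions, not the LTB construction from the paper:

```python
import numpy as np

# In-context linear regression: a prompt holds n (x_i, y_i) pairs plus a
# query x_q. A linear-attention-style readout predicts
#   y_hat = x_q^T Gamma (1/n) sum_i y_i x_i,
# i.e. one preconditioned gradient step from zero. Gamma is hand-set here.
rng = np.random.default_rng(1)
d, n = 4, 200
w_star = rng.normal(size=d)                  # task vector, drawn fresh per prompt
X = rng.normal(size=(n, d))                  # context inputs
y = X @ w_star + 0.1 * rng.normal(size=n)    # noisy context labels
x_q = rng.normal(size=d)                     # query input

Gamma = np.linalg.inv(X.T @ X / n)           # illustrative preconditioner
y_hat = x_q @ (Gamma @ (X.T @ y / n))        # attention-style linear readout

print(f"prediction {y_hat:.3f} vs. target {x_q @ w_star:.3f}")
```

With Gamma set to the inverse context covariance the readout coincides with ordinary least squares, while Gamma = I gives a single gradient-descent step from zero; this is roughly the family of estimators a single linear attention layer can express.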
Accelerated stochastic gradient descent (ASGD) is a workhorse in deep learning and often achieves better generalization performance than SGD. However, existing optimization theory can only explain the faster convergence of ASGD, but cannot explain its better generalization. …
External link:
http://arxiv.org/abs/2311.14222
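For reference, a minimal sketch of a Nesterov-style accelerated stochastic update on a toy quadratic; the objective, noise model, and hyperparameters are placeholders rather than anything from the paper:

```python
import numpy as np

# Nesterov-style accelerated SGD on an ill-conditioned 2-D quadratic;
# grad() stands in for a stochastic gradient oracle.
rng = np.random.default_rng(2)
A = np.diag([10.0, 1.0])                      # Hessian of the quadratic

def grad(w):
    return A @ w + 0.1 * rng.normal(size=2)   # noisy gradient

w, v = np.array([5.0, 5.0]), np.zeros(2)
eta, mu = 0.05, 0.9                           # stepsize and momentum
for t in range(100):
    g = grad(w + mu * v)                      # look-ahead gradient
    v = mu * v - eta * g
    w = w + v

print("final iterate:", w)
```

Setting mu = 0 recovers plain SGD, which is the baseline the abstract compares against.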
Author:
Wu, Jingfeng, Zou, Difan, Chen, Zixiang, Braverman, Vladimir, Gu, Quanquan, Bartlett, Peter L.
Transformers pretrained on diverse tasks exhibit remarkable in-context learning (ICL) capabilities, enabling them to solve unseen tasks solely based on input contexts without adjusting model parameters. In this paper, we study ICL in one of its simplest …
External link:
http://arxiv.org/abs/2310.08391