Showing 1 - 10 of 741 for the search: '"Kaur Simran"'
As large language models (LLMs) become increasingly advanced, their ability to exhibit compositional generalization -- the capacity to combine learned skills in novel ways not encountered during training -- has garnered significant attention. …
External link:
http://arxiv.org/abs/2409.19808
We introduce Instruct-SkillMix, an automated approach for creating diverse, high-quality SFT data. The Instruct-SkillMix pipeline involves two stages, each leveraging an existing powerful LLM: (1) Skill extraction: uses the LLM to extract core "skills" …
External link:
http://arxiv.org/abs/2408.14774
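For illustration only, a minimal sketch of such a two-stage pipeline, assuming a generic `query_llm` helper for the underlying model call; the function names and prompts below are hypothetical and are not the authors' implementation:

```python
# Hypothetical two-stage SFT-data sketch: skill extraction, then data
# generation by mixing sampled skills. `query_llm` is a placeholder for a
# call to an existing powerful LLM; nothing here is the paper's code.
import json
import random

def query_llm(prompt: str) -> str:
    """Placeholder: send `prompt` to an LLM and return its text response."""
    raise NotImplementedError

def extract_skills(seed_instructions: list[str]) -> list[str]:
    """Stage 1: ask the LLM to name the core skills the seed instructions exercise."""
    prompt = ("List, one per line, the core skills needed to answer these instructions:\n"
              + "\n".join(seed_instructions))
    return [s.strip() for s in query_llm(prompt).splitlines() if s.strip()]

def generate_example(skills: list[str], k: int = 2) -> dict:
    """Stage 2: sample k skills and ask the LLM for an (instruction, response) pair combining them."""
    chosen = random.sample(skills, k)
    prompt = (f"Write a challenging instruction that requires combining the skills "
              f"{', '.join(chosen)}, then write a high-quality response. "
              "Return JSON with keys 'instruction' and 'response'.")
    return json.loads(query_llm(prompt))
```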
Published in:
Saudi Journal of Kidney Diseases and Transplantation, Vol 32, Iss 4, Pp 1163-1165 (2021)
Liddle’s syndrome is a rare cause of secondary hypertension (HTN). The basic characteristics of this disease are HTN, reduced aldosterone concentration and renin activity, and increased excretion of potassium, leading to hypokalemia and metabolic alkalosis. …
External link:
https://doaj.org/article/d4f9123f97e940319694f748caa77b9d
With LLMs shifting their role from statistical modeling of language to serving as general-purpose AI agents, how should LLM evaluations change? Arguably, a key ability of an AI agent is to flexibly combine, as needed, the basic skills it has learned.
External link:
http://arxiv.org/abs/2310.17567
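As a toy sketch of the skill-combination idea behind such an evaluation, the snippet below builds a prompt that asks a model to exhibit k randomly chosen skills at once; the skill and topic lists are illustrative placeholders, not the benchmark's own:

```python
# Toy prompt generator for a k-skill-combination evaluation; the lists of
# skills and topics are made-up examples, not the paper's actual lists.
import random

SKILLS = ["metaphor", "modus ponens", "red herring", "statistical reasoning"]
TOPICS = ["gardening", "sewing", "chess"]

def combined_skill_prompt(k: int, rng: random.Random) -> str:
    """Ask for a short text on a random topic that exhibits k randomly chosen skills."""
    skills = rng.sample(SKILLS, k)
    topic = rng.choice(TOPICS)
    return (f"Write a short paragraph about {topic} that naturally illustrates "
            f"all of the following skills: {', '.join(skills)}.")

print(combined_skill_prompt(k=2, rng=random.Random(0)))
```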
Author:
Lather, Anu Singh; Kaur, Simran
Published in:
VUCA and Other Analytics in Business Resilience, Part B
A number of competing hypotheses have been proposed to explain why small-batch Stochastic Gradient Descent (SGD) leads to improved generalization over the full-batch regime, with recent work crediting the implicit regularization of various quantities. …
External link:
http://arxiv.org/abs/2211.15853
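For a concrete sense of the quantity at the heart of that debate (a toy illustration on a synthetic least-squares problem, not the paper's setup), one can measure how far a mini-batch gradient strays from the full-batch gradient as the batch size shrinks:

```python
# Toy illustration: the mini-batch gradient is a noisy estimate of the
# full-batch gradient, and the noise grows as the batch size shrinks.
# Synthetic least-squares data; the weight vector w is fixed, not trained.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1024, 10))
y = X @ rng.normal(size=10) + 0.1 * rng.normal(size=1024)
w = np.zeros(10)

def grad(idx):
    """Gradient of the mean squared error over the rows indexed by idx."""
    r = X[idx] @ w - y[idx]
    return X[idx].T @ r / len(idx)

full = grad(np.arange(len(X)))
for b in (8, 64, 512):
    noise = np.std([np.linalg.norm(grad(rng.choice(len(X), b, replace=False)) - full)
                    for _ in range(200)])
    print(f"batch={b:4d}  gradient-noise norm ~ {noise:.3f}")
```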
Network-on-Chips (NoCs) have been widely employed in the design of multiprocessor system-on-chips (MPSoCs) as a scalable communication solution. NoCs enable communications between on-chip Intellectual Property (IP) cores and allow those cores to achieve …
External link:
http://arxiv.org/abs/2211.02378
The mechanisms by which certain training interventions, such as increasing learning rates and applying batch normalization, improve the generalization of deep networks remain a mystery. Prior works have speculated that "flatter" solutions generalize better …
External link:
http://arxiv.org/abs/2206.10654
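As a worked example of one common "flatness" proxy (an illustrative toy on a quadratic loss, not the paper's experiments), the top Hessian eigenvalue can be estimated by power iteration:

```python
# Estimate the sharpness of a least-squares loss as its top Hessian
# eigenvalue via power iteration. Synthetic data; illustrative only.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1024, 10))
H = X.T @ X / len(X)          # Hessian of the (half) mean squared error loss
v = rng.normal(size=10)
for _ in range(100):          # power iteration toward the top eigenvector
    v = H @ v
    v /= np.linalg.norm(v)
print("top Hessian eigenvalue (sharpness proxy):", float(v @ H @ v))
```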
Published in:
In: Asian Journal of Psychiatry, December 2024, 102