Search results - "Mercat, Jean"

Report

Authors: Mercat, Jean, Vasiljevic, Igor, Keh, Sedrick, Arora, Kushal, Dave, Achal, Gaidon, Adrien, Kollar, Thomas

Linear transformers have emerged as a subquadratic-time alternative to softmax attention and have garnered significant interest due to their fixed-size recurrent state that lowers inference cost. However, their original formulation suffers from poor

External link: http://arxiv.org/abs/2405.06640

Report

DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

Authors: Khazatsky, Alexander, Pertsch, Karl, Nair, Suraj, Balakrishna, Ashwin, Dasari, Sudeep, Karamcheti, Siddharth, Nasiriany, Soroush, Srirama, Mohan Kumar, Chen, Lawrence Yunliang, Ellis, Kirsty, Fagan, Peter David, Hejna, Joey, Itkina, Masha, Lepert, Marion, Ma, Yecheng Jason, Miller, Patrick Tree, Wu, Jimmy, Belkhale, Suneel, Dass, Shivin, Ha, Huy, Jain, Arhan, Lee, Abraham, Lee, Youngwoon, Memmel, Marius, Park, Sungjae, Radosavovic, Ilija, Wang, Kaiyuan, Zhan, Albert, Black, Kevin, Chi, Cheng, Hatch, Kyle Beltran, Lin, Shan, Lu, Jingpei, Mercat, Jean, Rehman, Abdul, Sanketi, Pannag R, Sharma, Archit, Simpson, Cody, Vuong, Quan, Walke, Homer Rich, Wulfe, Blake, Xiao, Ted, Yang, Jonathan Heewon, Yavary, Arefeh, Zhao, Tony Z., Agia, Christopher, Baijal, Rohan, Castro, Mateo Guaman, Chen, Daphne, Chen, Qiuyu, Chung, Trinity, Drake, Jaimyn, Foster, Ethan Paul, Gao, Jensen, Herrera, David Antonio, Heo, Minho, Hsu, Kyle, Hu, Jiaheng, Jackson, Donovon, Le, Charlotte, Li, Yunshuang, Lin, Kevin, Lin, Roy, Ma, Zehan, Maddukuri, Abhiram, Mirchandani, Suvir, Morton, Daniel, Nguyen, Tony, O'Neill, Abigail, Scalise, Rosario, Seale, Derick, Son, Victor, Tian, Stephen, Tran, Emi, Wang, Andrew E., Wu, Yilin, Xie, Annie, Yang, Jingyun, Yin, Patrick, Zhang, Yunchu, Bastani, Osbert, Berseth, Glen, Bohg, Jeannette, Goldberg, Ken, Gupta, Abhinav, Gupta, Abhishek, Jayaraman, Dinesh, Lim, Joseph J, Malik, Jitendra, Martín-Martín, Roberto, Ramamoorthy, Subramanian, Sadigh, Dorsa, Song, Shuran, Wu, Jiajun, Yip, Michael C., Zhu, Yuke, Kollar, Thomas, Levine, Sergey, Finn, Chelsea

The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipul

External link: http://arxiv.org/abs/2403.12945

Report

Language models scale reliably with over-training and on downstream tasks

Scaling laws are useful guides for derisking expensive training runs, as they predict performance of large models using cheaper, small-scale experiments. However, there remain gaps between current scaling studies and how language models are ultimatel

External link: http://arxiv.org/abs/2403.08540

Report

Residual Q-Learning: Offline and Online Policy Customization without Value

Authors: Li, Chenran, Tang, Chen, Nishimura, Haruki, Mercat, Jean, Tomizuka, Masayoshi, Zhan, Wei

Imitation Learning (IL) is a widely used framework for learning imitative behavior from demonstrations. It is especially appealing for solving complex real-world tasks where handcrafting reward function is difficult, or when the goal is to mimic huma

External link: http://arxiv.org/abs/2306.09526

Report

RAP: Risk-Aware Prediction for Robust Planning

Authors: Nishimura, Haruki, Mercat, Jean, Wulfe, Blake, McAllister, Rowan, Gaidon, Adrien

Robust planning in interactive scenarios requires predicting the uncertain future to make risk-aware decisions. Unfortunately, due to long-tail safety-critical events, the risk is often under-estimated by finite-sampling approximations of probabilist

External link: http://arxiv.org/abs/2210.01368

Report

Control-Aware Prediction Objectives for Autonomous Driving

Authors: McAllister, Rowan, Wulfe, Blake, Mercat, Jean, Ellis, Logan, Levine, Sergey, Gaidon, Adrien

Autonomous vehicle software is typically structured as a modular pipeline of individual components (e.g., perception, prediction, and planning) to help separate concerns into interpretable sub-tasks. Even when end-to-end training is possible, each mo

External link: http://arxiv.org/abs/2204.13319

Report

Dynamics-Aware Comparison of Learned Reward Functions

Authors: Wulfe, Blake, Balakrishna, Ashwin, Ellis, Logan, Mercat, Jean, McAllister, Rowan, Gaidon, Adrien

The ability to learn reward functions plays an important role in enabling the deployment of intelligent agents in the real world. However, comparing reward functions, for example as a means of evaluating reward learning methods, presents a challenge.

External link: http://arxiv.org/abs/2201.10081

Report

Higher Order Linear Transformer

Author: Mercat, Jean

Following up on the linear transformer part of the article from Katharopoulos et al., that takes this idea from Shen et al., the trick that produces a linear complexity for the attention mechanism is re-used and extended to a second-order approximati

External link: http://arxiv.org/abs/2010.14816

Report

Social Attention for Autonomous Decision-Making in Dense Traffic

Authors: Leurent, Edouard, Mercat, Jean

We study the design of learning architectures for behavioural planning in a dense traffic setting. Such architectures should deal with a varying number of nearby vehicles, be invariant to the ordering chosen to describe them, while staying accurate a

External link: http://arxiv.org/abs/1911.12250
