Výsledky vyhledávání - "Mondal, Pradipto"

Report

DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets

Autor: Rangwani, Harsh, Mondal, Pradipto, Mishra, Mayank, Asokan, Ashish Ramayee, Babu, R. Venkatesh

Vision Transformer (ViT) has emerged as a prominent architecture for various computer vision tasks. In ViT, we divide the input image into patch tokens and process them through a stack of self attention blocks. However, unlike Convolutional Neural Ne

Externí odkaz: http://arxiv.org/abs/2404.02900

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání