Search results - "Antoine Miech"

Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos

Authors: Tomas Soucek, Jean-Baptiste Alayrac, Antoine Miech, Ivan Laptev, Josef Sivic

Published in: CVPR 2022-IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States

Human actions often induce changes of object states such as "cutting an apple", "cleaning shoes" or "pouring coffee". In this paper, we seek to temporally localize object states (e.g. "empty" and "full" cup) together with the corresponding state-modi

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2c183ca2149b4b99492e69cb248e1698
http://arxiv.org/abs/2203.11637

End-to-End Learning of Visual Representations from Uncurated Instructional Videos

Authors: Jean-Baptiste Alayrac, Andrew Zisserman, Lucas Smaira, Ivan Laptev, Josef Sivic, Antoine Miech

Published in: CVPR 2020-IEEE Conference on Computer Vision and Pattern Recognition, Jun 2020, Seattle / Virtual, United States

Annotating videos is cumbersome, expensive and not scalable. Yet, many strong video models still rely on manually annotated data. With the recent introduction of the HowTo100M dataset, narrated videos now offer the possibility of learning video repre

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::29ec1d48b3d728bdba9d2fd617cb6ced
https://inria.hal.science/hal-01569540v2/file/miech17ICCV.pdf

Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers

Authors: Antoine Miech, Josef Sivic, Jean-Baptiste Alayrac, Ivan Laptev, Andrew Zisserman

Published in: CVPR 2021-Conference on Computer Vision and Pattern Recognition, Jun 2021, Nashville, United States

Our objective is language-based search of large-scale image and video datasets. For this task, the approach that consists of independently mapping text and vision to a joint embedding space, a.k.a. dual encoders, is attractive as retrieval scales and

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::67b7a5501272501fa0c1d39703bdfade
https://doi.org/10.1109/cvpr46437.2021.00970

HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips

Authors: Jean-Baptiste Alayrac, Josef Sivic, Antoine Miech, Ivan Laptev, Dimitri Zhukov, Makarand Tapaswi

Published in: ICCV 2019-International Conference on Computer Vision, Oct 2019, Seoul, South Korea

Learning text-video embeddings usually requires a dataset of video clips with manually provided captions. However, such datasets are expensive and time-consuming to create and therefore difficult to obtain on a large scale. In this work, we propose i

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::720f44a13124db3df7f8d3e8279c7bc2
https://hal.science/hal-02433497

Leveraging the Present to Anticipate the Future in Videos

Authors: Ivan Laptev, Antoine Miech, Josef Sivic, Lorenzo Torresani, Du Tran, Heng Wang

Published in: CVPR Workshops

Anticipating actions before they are executed is crucial for a wide range of practical applications including autonomous driving and the moderation of live video streaming. While most prior work in this area requires partial observation of executed a

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c466112269a0221fd8cc5dd3401a8360
https://doi.org/10.1109/cvprw.2019.00351

Learning from Video and Text via Large-Scale Discriminative Clustering

Authors: Antoine Miech, Jean-Baptiste Alayrac, Josef Sivic, Ivan Laptev, Piotr Bojanowski

Published in: ICCV

Discriminative clustering has been successfully applied to a number of weakly supervised learning tasks. Such applications include person and action recognition, text-to-video alignment, object co-segmentation and co-localization in videos and images

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::5c5345b5c532c5021c3d3770c4719d92
https://doi.org/10.1109/iccv.2017.562
