Zobrazeno 1 - 7
of 7
pro vyhledávání: '"Antoine Miech"'
Publikováno v:
CVPR 2022-IEEE/CVF Conference on Computer Vision and Pattern Recognition
CVPR 2022-IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States
CVPR 2022-IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States
Human actions often induce changes of object states such as "cutting an apple", "cleaning shoes" or "pouring coffee". In this paper, we seek to temporally localize object states (e.g. "empty" and "full" cup) together with the corresponding state-modi
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2c183ca2149b4b99492e69cb248e1698
http://arxiv.org/abs/2203.11637
http://arxiv.org/abs/2203.11637
Autor:
Jean-Baptiste Alayrac, Andrew Zisserman, Lucas Smaira, Ivan Laptev, Josef Sivic, Antoine Miech
Publikováno v:
CVPR 2020-IEEE Conference on Computer Vision and Pattern Recognition
CVPR 2020-IEEE Conference on Computer Vision and Pattern Recognition, Jun 2020, Seattle / Virtual, United States
CVPR
CVPR 2020-IEEE Conference on Computer Vision and Pattern Recognition, Jun 2020, Seattle / Virtual, United States
CVPR
Annotating videos is cumbersome, expensive and not scalable. Yet, many strong video models still rely on manually annotated data. With the recent introduction of the HowTo100M dataset, narrated videos now offer the possibility of learning video repre
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::29ec1d48b3d728bdba9d2fd617cb6ced
https://inria.hal.science/hal-01569540v2/file/miech17ICCV.pdf
https://inria.hal.science/hal-01569540v2/file/miech17ICCV.pdf
Publikováno v:
CVPR
CVPR 2021-Conference on Computer Vision and Pattern Recognition
CVPR 2021-Conference on Computer Vision and Pattern Recognition, Jun 2021, Nashville, United States
CVPR 2021-Conference on Computer Vision and Pattern Recognition
CVPR 2021-Conference on Computer Vision and Pattern Recognition, Jun 2021, Nashville, United States
Our objective is language-based search of large-scale image and video datasets. For this task, the approach that consists of independently mapping text and vision to a joint embedding space, a.k.a. dual encoders, is attractive as retrieval scales and
Autor:
Jean-Baptiste Alayrac, Josef Sivic, Antoine Miech, Ivan Laptev, Dimitri Zhukov, Makarand Tapaswi
Publikováno v:
ICCV 2019-International Conference on Computer Vision
ICCV 2019-International Conference on Computer Vision, Oct 2019, Séoul, South Korea
ICCV
ICCV 2019-International Conference on Computer Vision, Oct 2019, Séoul, South Korea
ICCV
Learning text-video embeddings usually requires a dataset of video clips with manually provided captions. However, such datasets are expensive and time consuming to create and therefore difficult to obtain on a large scale. In this work, we propose i
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::720f44a13124db3df7f8d3e8279c7bc2
https://hal.science/hal-02433497
https://hal.science/hal-02433497
Publikováno v:
CVPR Workshops
Anticipating actions before they are executed is crucial for a wide range of practical applications including autonomous driving and the moderation of live video streaming. While most prior work in this area requires partial observation of executed a
Publikováno v:
ICCV
Discriminative clustering has been successfully applied to a number of weakly supervised learning tasks. Such applications include person and action recognition, text-to-video alignment, object co-segmentation and co-localization in videos and images
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.