Výsledky vyhledávání - "Matt Feiszli"

FASTER Recurrent Networks for Efficient Video Classification

Autor: Du Tran, Laura Sevilla-Lara, Yi Yang, Linchao Zhu, Heng Wang, Matt Feiszli

Publikováno v: AAAI
University of Technology Sydney

Typical video classification methods often divide a video into short clips, do inference on each clip independently, then aggregate the clip-level predictions to generate the video-level results. However, processing visually similar clips independent

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::140409fe0b4935c9f7b0d71f928c75e6
https://doi.org/10.1609/aaai.v34i07.7012

Zobrazit plný text záznamu

GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval

Autor: Yuxuan Wang, Difei Gao, Licheng Yu, Weixian Lei, Matt Feiszli, Mike Zheng Shou

Publikováno v: Lecture Notes in Computer Science ISBN: 9783031198328

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::62812e1488e820b6a50b4b4c6aa0f240
https://doi.org/10.1007/978-3-031-19833-5_41

Zobrazit plný text záznamu

PyTorchVideo: A Deep Learning Library for Video Understanding

Autor: Jitendra Malik, Haoqi Fan, Ross Girshick, Aaron Adcock, Matt Feiszli, Haichuan Yang, Bo Xiong, Nikhila Ravi, Yanghao Li, Christoph Feichtenhofer, Tullie Murrell, Heng Wang, Kalyan Vasudev Alwala, Meng Li, Wan-Yen Lo, Yilei Li

Publikováno v: ACM Multimedia

We introduce PyTorchVideo, an open-source deep-learning library that provides a rich set of modular, efficient, and reproducible components for a variety of video understanding tasks, including classification, detection, self-supervised learning, and

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::41950dc1e1f1cc0f6f48cfaea938e168
http://arxiv.org/abs/2111.09887

Zobrazit plný text záznamu

Only Time Can Tell: Discovering Temporal Data for Temporal Modeling

Autor: Laura Sevilla-Lara, Vedanuj Goswami, Lorenzo Torresani, Zhicheng Yan, Shengxin Zha, Matt Feiszli

Publikováno v: WACV

Understanding temporal information and how the visual world changes over time, is a fundamental ability of intelligent systems. In video understanding, temporal information is at the core of many current challenges, including compression, efficient i

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::402f927d729c2c67cd994d1abaaeb0c8
https://doi.org/10.1109/wacv48630.2021.00058

Zobrazit plný text záznamu

Don’t Judge an Object by Its Context: Learning to Overcome Contextual Bias

Autor: Krishna Kumar Singh, Matt Feiszli, Kristen Grauman, Yong Jae Lee, Dhruv Mahajan, Deepti Ghadiyaram

Publikováno v: CVPR

Existing models often leverage co-occurrences between objects and their context to improve recognition accuracy. However, strongly relying on context risks a model's generalizability, especially when typical co-occurrence patterns are absent. This wo

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::a27a5aaa809339c105eb27f7be33558e
https://doi.org/10.1109/cvpr42600.2020.01108

Zobrazit plný text záznamu

SF-Net: Single-Frame Supervision for Temporal Action Localization

Autor: Fan Ma, Matt Feiszli, Shengxin Zha, Yi Yang, Linchao Zhu, Zheng Shou, Gourab Kundu

Publikováno v: Computer Vision – ECCV 2020 ISBN: 9783030585471
ECCV (4)
University of Technology Sydney

In this paper, we study an intermediate form of supervision, i.e., single-frame supervision, for temporal action localization (TAL). To obtain the single-frame supervision, the annotators are asked to identify only a single frame within the temporal

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5de4c1848d2cd0d3a54d98df87c3ae89
https://doi.org/10.1007/978-3-030-58548-8_25

Zobrazit plný text záznamu

Numerical Computation of Weil--Peterson Geodesics in the Universal Teichmüller Space

Autor: Akil Narayan, Matt Feiszli

Publikováno v: SIAM Journal on Imaging Sciences. 10:1322-1345

We propose an optimization algorithm for computing geodesics on the universal Teichmuller space $T(1)$ in the Weil--Petersson (WP) metric. Another realization for T(1) is the space of planar shapes, modulo translation and scale, and thus our algorith

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::020a6e7dff10ad1278141a2de597728e
https://doi.org/10.1137/15m1043947

Zobrazit plný text záznamu

Video Modeling with Correlation Networks

Autor: Heng Wang, Matt Feiszli, Lorenzo Torresani, Du Tran

Publikováno v: CVPR

Motion is a salient cue to recognize actions in video. Modern action recognition models leverage motion information either explicitly by using optical flow as input or implicitly by means of 3D convolutional filters that simultaneously capture appear

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::97d0eebfc39d5cbd252fd0f4a9beb2d7
http://arxiv.org/abs/1906.03349

Zobrazit plný text záznamu

What Makes Training Multi-Modal Classification Networks Hard?

Autor: Matt Feiszli, Weiyao Wang, Du Tran

Publikováno v: CVPR

Consider end-to-end training of a multi-modal vs. a single-modal network on a task with multiple input modalities: the multi-modal network receives more information, so it should match or outperform its single-modal counterpart. In our experiments, h

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7d4b62e2db6b5a646b036045f2649cc9

Zobrazit plný text záznamu

Video Classification with Channel-Separated Convolutional Networks

Autor: Du Tran, Matt Feiszli, Lorenzo Torresani, Heng Wang

Publikováno v: ICCV

Group convolution has been shown to offer great computational savings in various 2D convolutional architectures for image classification. It is natural to ask: 1) if group convolution can help to alleviate the high computational cost of video classif

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4189e75666b6f5def0822a6b54b315db

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání