Search results

Report

Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders

Authors: Yang, Haosen; Huang, Deng; Wen, Bin; Wu, Jiannan; Yao, Hongxun; Jiang, Yi; Zhu, Xiatian; Yuan, Zehuan

Masked autoencoders (MAEs) have emerged recently as state-of-the-art self-supervised spatiotemporal representation learners. Inheriting from the image counterparts, however, existing video MAEs still focus largely on static appearance learning whilst are limited

External link: http://arxiv.org/abs/2210.04154

Academic article

2-Methyl-4'-(methylthio)-2-morpholinopropiophenone: A commercial photoinitiator being used as a new psychoactive substance

Authors: Yen, Yao-Te; Zhou, Song-Lin; Huang, Deng-Ying; Tseng, Shih-Hao; Wang, Chung-Feng; Chyueh, San-Chong

Published in: Forensic Science International, July 2024, 360

Report

ASCNet: Self-supervised Video Representation Learning with Appearance-Speed Consistency

Authors: Huang, Deng; Wu, Wenhao; Hu, Weiwen; Liu, Xu; He, Dongliang; Wu, Zhihua; Wu, Xiangmiao; Tan, Mingkui; Ding, Errui

We study self-supervised video representation learning, which is a challenging task due to 1) lack of labels for explicit supervision; 2) unstructured and noisy visual information. Existing methods mainly use contrastive loss with video clips as the

External link: http://arxiv.org/abs/2106.02342

Dissertation/ Thesis

Determination of High Density Lipoprotein Concentration and Cholesterol Content by ApoE Aptamer Based Electrochemical Sensor

Author: HUANG, DENG-YING (黃鐙瑩)

107
With changing lifestyles, the lack of regular exercise, unhealthy eating habits, and extreme weather are risk factors for cardiovascular diseases (CVD). Not only elders and disease-specific group especially

External link: http://ndltd.ncl.edu.tw/handle/6h5zet

Report

RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning

Authors: Chen, Peihao; Huang, Deng; He, Dongliang; Long, Xiang; Zeng, Runhao; Wen, Shilei; Tan, Mingkui; Gan, Chuang

We study unsupervised video representation learning that seeks to learn both motion and appearance features from unlabeled video only, which can be reused for downstream tasks such as action recognition. This task, however, is extremely challenging d

External link: http://arxiv.org/abs/2011.07949

Report

Location-aware Graph Convolutional Networks for Video Question Answering

Authors: Huang, Deng; Chen, Peihao; Zeng, Runhao; Du, Qing; Tan, Mingkui; Gan, Chuang

We addressed the challenging task of video question answering, which requires machines to answer questions about videos in a natural language form. Previous state-of-the-art methods attempt to apply spatio-temporal attention mechanism on video frame

External link: http://arxiv.org/abs/2008.09105

Report

Foley Music: Learning to Generate Music from Videos

Authors: Gan, Chuang; Huang, Deng; Chen, Peihao; Tenenbaum, Joshua B.; Torralba, Antonio

In this paper, we introduce Foley Music, a system that can synthesize plausible music for a silent video clip about people playing musical instruments. We first identify two key intermediate representations for a successful video to music generator:

External link: http://arxiv.org/abs/2007.10984

Report

Generating Visually Aligned Sound from Videos

Authors: Chen, Peihao; Zhang, Yang; Tan, Mingkui; Xiao, Hongdong; Huang, Deng; Gan, Chuang

We focus on the task of generating sound from natural videos, and the sound should be both temporally and content-wise aligned with visual signals. This task is extremely challenging because some sounds generated outside a camera can not be in

External link: http://arxiv.org/abs/2008.00820

Report

Music Gesture for Visual Sound Separation

Authors: Gan, Chuang; Huang, Deng; Zhao, Hang; Tenenbaum, Joshua B.; Torralba, Antonio

Recent deep learning approaches have achieved impressive performance on visual sound separation tasks. However, these approaches are mostly built on appearance and optical-flow-like motion feature representations, which exhibit limited abilities to f

External link: http://arxiv.org/abs/2004.09476
