Výsledky vyhledávání - "Zhang, David"

Report

SIAVC: Semi-Supervised Framework for Industrial Accident Video Classification

Autor: Li, Zuoyong, Lin, Qinghua, Fan, Haoyi, Zhao, Tiesong, Zhang, David

Semi-supervised learning suffers from the imbalance of labeled and unlabeled training data in the video surveillance scenario. In this paper, we propose a new semi-supervised learning method called SIAVC for industrial accident video classification.

Externí odkaz: http://arxiv.org/abs/2405.14506

Zobrazit plný text záznamu

Report

Ada-HGNN: Adaptive Sampling for Scalable Hypergraph Neural Networks

Autor: Wang, Shuai, Zhang, David W., Huang, Jia-Hong, Rudinac, Stevan, Kackovic, Monika, Wijnberg, Nachoem, Worring, Marcel

Hypergraphs serve as an effective model for depicting complex connections in various real-world scenarios, from social to biological networks. The development of Hypergraph Neural Networks (HGNNs) has emerged as a valuable method to manage the intric

Externí odkaz: http://arxiv.org/abs/2405.13372

Zobrazit plný text záznamu

Report

Graph Neural Networks for Learning Equivariant Representations of Neural Networks

Autor: Kofinas, Miltiadis, Knyazev, Boris, Zhang, Yan, Chen, Yunlu, Burghouts, Gertjan J., Gavves, Efstratios, Snoek, Cees G. M., Zhang, David W.

Neural networks that process the parameters of other neural networks find applications in domains as diverse as classifying implicit neural representations, generating neural network weights, and predicting generalization errors. However, existing ap

Externí odkaz: http://arxiv.org/abs/2403.12143

Zobrazit plný text záznamu

Report

DragAnything: Motion Control for Anything using Entity Representation

Autor: Wu, Weijia, Li, Zhuang, Gu, Yuchao, Zhao, Rui, He, Yefei, Zhang, David Junhao, Shou, Mike Zheng, Li, Yan, Gao, Tingting, Zhang, Di

We introduce DragAnything, which utilizes a entity representation to achieve motion control for any object in controllable video generation. Comparison to existing motion control methods, DragAnything offers several advantages. Firstly, trajectory-ba

Externí odkaz: http://arxiv.org/abs/2403.07420

Zobrazit plný text záznamu

Report

Perceptive self-supervised learning network for noisy image watermark removal

Autor: Tian, Chunwei, Zheng, Menghua, Li, Bo, Zhang, Yanning, Zhang, Shichao, Zhang, David

Popular methods usually use a degradation model in a supervised way to learn a watermark removal model. However, it is true that reference images are difficult to obtain in the real world, as well as collected images by cameras suffer from noise. To

Externí odkaz: http://arxiv.org/abs/2403.02211

Zobrazit plný text záznamu

Report

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Autor: Butt, Natasha, Manczak, Blazej, Wiggers, Auke, Rainone, Corrado, Zhang, David W., Defferrard, Michaël, Cohen, Taco

Large language models are increasingly solving tasks that are commonly believed to require human-level reasoning ability. However, these models still perform very poorly on benchmarks of general intelligence such as the Abstraction and Reasoning Corp

Externí odkaz: http://arxiv.org/abs/2402.04858

Zobrazit plný text záznamu

Report

Improved Generalization of Weight Space Networks via Augmentations

Autor: Shamsian, Aviv, Navon, Aviv, Zhang, David W., Zhang, Yan, Fetaya, Ethan, Chechik, Gal, Maron, Haggai

Learning in deep weight spaces (DWS), where neural networks process the weights of other neural networks, is an emerging research direction, with applications to 2D and 3D neural fields (INRs, NeRFs), as well as making inferences about other types of

Externí odkaz: http://arxiv.org/abs/2402.04081

Zobrazit plný text záznamu

Report

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions

Autor: Zhang, David Junhao, Li, Dongxu, Le, Hung, Shou, Mike Zheng, Xiong, Caiming, Sahoo, Doyen

Most existing video diffusion models (VDMs) are limited to mere text conditions. Thereby, they are usually lacking in control over visual appearance and geometry structure of the generated videos. This work presents Moonshot, a new video generation m

Externí odkaz: http://arxiv.org/abs/2401.01827

Zobrazit plný text záznamu

Report

Diffusing More Objects for Semi-Supervised Domain Adaptation with Less Labeling

Autor: Heuvel, Leander van den, Burghouts, Gertjan, Zhang, David W., Englebienne, Gwenn, van Rooij, Sabina B.

For object detection, it is possible to view the prediction of bounding boxes as a reverse diffusion process. Using a diffusion model, the random bounding boxes are iteratively refined in a denoising step, conditioned on the image. We propose a stoch

Externí odkaz: http://arxiv.org/abs/2312.12000

Zobrazit plný text záznamu

Report

Latent Space Editing in Transformer-Based Flow Matching

Autor: Hu, Vincent Tao, Zhang, David W, Mettes, Pascal, Tang, Meng, Zhao, Deli, Snoek, Cees G. M.

This paper strives for image editing via generative models. Flow Matching is an emerging generative modeling technique that offers the advantage of simple and efficient training. Simultaneously, a new transformer-based U-ViT has recently been propose

Externí odkaz: http://arxiv.org/abs/2312.10825

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání