Zobrazeno 1 - 10
of 211
pro vyhledávání: '"S. P. Gary"'
Autor:
Chen, Jierun, Wei, Fangyun, Zhao, Jinjing, Song, Sizhe, Wu, Bohuai, Peng, Zhuoxuan, Chan, S. -H. Gary, Zhang, Hongyang
Referring expression comprehension (REC) involves localizing a target instance based on a textual description. Recent advancements in REC have been driven by large multimodal models (LMMs) like CogVLM, which achieved 92.44% accuracy on RefCOCO. Howev
Externí odkaz:
http://arxiv.org/abs/2406.16866
Classifying a pedestrian in one of the three conveyor states of "elevator," "escalator" and "neither" is fundamental to many applications such as indoor localization and people flow analysis. We estimate, for the first time, the pedestrian conveyor s
Externí odkaz:
http://arxiv.org/abs/2405.03218
Autor:
Peng, Zhuoxuan, Chan, S. -H. Gary
Due to its promising results, density map regression has been widely employed for image-based crowd counting. The approach, however, often suffers from severe performance degradation when tested on data from unseen scenarios, the so-called "domain sh
Externí odkaz:
http://arxiv.org/abs/2403.09124
Knowledge distillation (KD) has been recognized as an effective tool to compress and accelerate models. However, current KD approaches generally suffer from an accuracy drop and/or an excruciatingly long distillation process. In this paper, we tackle
Externí odkaz:
http://arxiv.org/abs/2312.13223
Unsupervised domain adaptation (UDA) seeks to bridge the domain gap between the target and source using unlabeled target data. Source-free UDA removes the requirement for labeled source data at the target to preserve data privacy and storage. However
Externí odkaz:
http://arxiv.org/abs/2312.00540
Time series data, including univariate and multivariate ones, are characterized by unique composition and complex multi-scale temporal variations. They often require special consideration of decomposition and multi-scale modeling to analyze. Existing
Externí odkaz:
http://arxiv.org/abs/2310.11959
Deep neural networks achieve superior performance for learning from independent and identically distributed (i.i.d.) data. However, their performance deteriorates significantly when handling out-of-distribution (OoD) data, where the training and test
Externí odkaz:
http://arxiv.org/abs/2307.12219
Autor:
Zhuo, Weipeng, Chiu, Ka Ho, Chen, Jierun, Zhao, Ziqi, Chan, S. -H. Gary, Ha, Sangtae, Lee, Chul-Ho
Floor labels of crowdsourced RF signals are crucial for many smart-city applications, such as multi-floor indoor localization, geofencing, and robot surveillance. To build a prediction model to identify the floor number of a new RF signal upon its me
Externí odkaz:
http://arxiv.org/abs/2307.05914
Order-driven market simulation mimics the trader behaviors to generate order streams to support interactive studies of financial strategies. In market simulator, the multi-agent approach is commonly adopted due to its explainability. Existing multi-a
Externí odkaz:
http://arxiv.org/abs/2307.12987
Autor:
Chen, Jierun, Kao, Shiu-hong, He, Hao, Zhuo, Weipeng, Wen, Song, Lee, Chul-Ho, Chan, S. -H. Gary
To design fast neural networks, many works have been focusing on reducing the number of floating-point operations (FLOPs). We observe that such reduction in FLOPs, however, does not necessarily lead to a similar level of reduction in latency. This ma
Externí odkaz:
http://arxiv.org/abs/2303.03667