Zobrazeno 1 - 10
of 188
pro vyhledávání: '"Okatani, Takayuki"'
Diffusion models have demonstrated impressive image generation capabilities. Personalized approaches, such as textual inversion and Dreambooth, enhance model individualization using specific images. These methods enable generating images of specific
Externí odkaz:
http://arxiv.org/abs/2407.05312
Publikováno v:
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV2024)
Computer vision has become increasingly prevalent in solving real-world problems across diverse domains, including smart agriculture, fishery, and livestock management. These applications may not require processing many image frames per second, leadi
Externí odkaz:
http://arxiv.org/abs/2311.03747
Autor:
Charoenpitaks, Korawat, Nguyen, Van-Quang, Suganuma, Masanori, Takahashi, Masahiro, Niihara, Ryoma, Okatani, Takayuki
Publikováno v:
IEEE Trans. Intell. Veh. (2024) 1-11
This paper addresses the problem of predicting hazards that drivers may encounter while driving a car. We formulate it as a task of anticipating impending accidents using a single input image captured by car dashcams. Unlike existing approaches to dr
Externí odkaz:
http://arxiv.org/abs/2310.04671
Recent studies on visual anomaly detection (AD) of industrial objects/textures have achieved quite good performance. They consider an unsupervised setting, specifically the one-class setting, in which we assume the availability of a set of normal (\t
Externí odkaz:
http://arxiv.org/abs/2307.03243
Previous works on unsupervised industrial anomaly detection mainly focus on local structural anomalies such as cracks and color contamination. While achieving significantly high detection performance on this kind of anomaly, they are faced with logic
Externí odkaz:
http://arxiv.org/abs/2307.03101
Smartphones equipped with a multi-camera system comprising multiple cameras with different field-of-view (FoVs) are becoming more prevalent. These camera configurations are compatible with reference-based SR and video SR, which can be executed simult
Externí odkaz:
http://arxiv.org/abs/2307.02897
Despite the recent advancement in the study of removing motion blur in an image, it is still hard to deal with strong blurs. While there are limits in removing blurs from a single image, it has more potential to use multiple images, e.g., using an ad
Externí odkaz:
http://arxiv.org/abs/2307.02875
In this paper, a bridge member damage cause estimation framework is proposed by calculating the image position using Structure from Motion (SfM) and acquiring its information via Visual Question Answering (VQA). For this, a VQA model was developed th
Externí odkaz:
http://arxiv.org/abs/2302.09208
Advanced visual localization techniques encompass image retrieval challenges and 6 Degree-of-Freedom (DoF) camera pose estimation, such as hierarchical localization. Thus, they must extract global and local features from input images. Previous method
Externí odkaz:
http://arxiv.org/abs/2212.13105
Open-set object detection (OSOD) has recently gained attention. It is to detect unknown objects while correctly detecting known objects. In this paper, we first point out that the recent studies' formalization of OSOD, which generalizes open-set reco
Externí odkaz:
http://arxiv.org/abs/2207.09775