Zobrazeno 1 - 10
of 147
pro vyhledávání: '"Yutaka, Satoh"'
Publikováno v:
IEEE Access, Vol 12, Pp 142291-142305 (2024)
Pre-training for 3D object recognition typically requires a large-scale 3D dataset to learn effective 3D geometric representations. However, constructing such datasets is costly due to the extensive 3D data collection and human annotation required. T
Externí odkaz:
https://doaj.org/article/f49bdbe143ab42aabd174ce17d8fae06
Publikováno v:
IEEE Access, Vol 11, Pp 136166-136178 (2023)
Does formula-driven supervised learning (FDSL) work effectively with fine-tuning on small datasets? Additionally, how many natural images do a network pre-trained with FDSL require to acquire sufficient image features? FDSL is a pre-training method t
Externí odkaz:
https://doaj.org/article/fd5b25a4ba32476391ddd53beaf9caa6
Publikováno v:
Journal of the Japan Society for Precision Engineering. 89:99-104
Publikováno v:
IEEE Transactions on Intelligent Transportation Systems. 23:11917-11929
Autor:
Kodai Nakashima, Hirokatsu Kataoka, Asato Matsumoto, Kenji Iwata, Nakamasa Inoue, Yutaka Satoh
Publikováno v:
Proceedings of the AAAI Conference on Artificial Intelligence. 36:1990-1998
Is it possible to complete Vision Transformer (ViT) pre-training without natural images and human-annotated labels? This question has become increasingly relevant in recent months because while current ViT pre-training tends to rely heavily on a larg
Publikováno v:
Sensors, Vol 20, Iss 17, p 4761 (2020)
This study proposes a framework for describing a scene change using natural language text based on indoor scene observations conducted before and after a scene change. The recognition of scene changes plays an essential role in a variety of real-worl
Externí odkaz:
https://doaj.org/article/3169401acabd47dca1f4aeba71180bb4
Publikováno v:
Sensors, Vol 20, Iss 8, p 2281 (2020)
This paper proposes a framework that allows the observation of a scene iteratively to answer a given question about the scene. Conventional visual question answering (VQA) methods are designed to answer given questions based on single-view images. Ho
Externí odkaz:
https://doaj.org/article/d85a4ca10a724f53b6a8aa1df363e7fd
Publikováno v:
Journal of the Japan Society for Precision Engineering. 88:66-71
Autor:
Yue Qiu, Shintaro Yamamoto, Ryosuke Yamada, Ryota Suzuki, Hirokatsu Kataoka, Kenji Iwata, Yutaka Satoh
Publikováno v:
2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).
Autor:
Yue Qiu, Yoshiki Nagasaki, Kensho Hara, Hirokatsu Kataoka, Ryota Suzuki, Kenji Iwata, Yutaka Satoh
Publikováno v:
2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).