Zobrazeno 1 - 5
of 5
pro vyhledávání: '"Voutharoja, Bhanu Prakash"'
Image-to-recipe retrieval is a challenging vision-to-language task of significant practical value. The main challenge of the task lies in the ultra-high redundancy in the long recipe and the large variation reflected in both food item combination and
Externí odkaz:
http://arxiv.org/abs/2305.11327
Automatic radiology report generation is challenging as medical images or reports are usually similar to each other due to the common content of anatomy. This makes a model hard to capture the uniqueness of individual images and is prone to producing
Externí odkaz:
http://arxiv.org/abs/2305.07176
Recent works on form understanding mostly employ multimodal transformers or large-scale pre-trained language models. These models need ample data for pre-training. In contrast, humans can usually identify key-value pairings from a form only by lookin
Externí odkaz:
http://arxiv.org/abs/2305.04460
In this paper, we propose a novel pipeline that leverages language foundation models for temporal sequential pattern mining, such as for human mobility forecasting tasks. For example, in the task of predicting Place-of-Interest (POI) customer flows,
Externí odkaz:
http://arxiv.org/abs/2209.05479
Autor:
Guan V; School of Medical, Indigenous and Health Sciences, Faculty of Science, Medicine and Health, University of Wollongong, Wollongong, New South Wales, Australia., Zhou C; School of Computing and Information Technology, Faculty of Engineering and Information Sciences, University of Wollongong, Wollongong, New South Wales, Australia., Wan H; School of Computing and Information Technology, Faculty of Engineering and Information Sciences, University of Wollongong, Wollongong, New South Wales, Australia., Zhou R; School of Computing and Information Technology, Faculty of Engineering and Information Sciences, University of Wollongong, Wollongong, New South Wales, Australia., Zhang D; School of Computing and Information Technology, Faculty of Engineering and Information Sciences, University of Wollongong, Wollongong, New South Wales, Australia., Zhang S; School of Computing and Information Technology, Faculty of Engineering and Information Sciences, University of Wollongong, Wollongong, New South Wales, Australia., Yang W; School of Computing and Information Technology, Faculty of Engineering and Information Sciences, University of Wollongong, Wollongong, New South Wales, Australia., Voutharoja BP; School of Computing and Information Technology, Faculty of Engineering and Information Sciences, University of Wollongong, Wollongong, New South Wales, Australia., Wang L; School of Computing and Information Technology, Faculty of Engineering and Information Sciences, University of Wollongong, Wollongong, New South Wales, Australia., Win KT; School of Computing and Information Technology, Faculty of Engineering and Information Sciences, University of Wollongong, Wollongong, New South Wales, Australia., Wang P; School of Computing and Information Technology, Faculty of Engineering and Information Sciences, University of Wollongong, Wollongong, New South Wales, Australia.
Publikováno v:
JMIR formative research [JMIR Form Res] 2023 Aug 07; Vol. 7, pp. e46839. Date of Electronic Publication: 2023 Aug 07.