Zobrazeno 1 - 10
of 29 991
pro vyhledávání: '"Open domain"'
The real estate market relies heavily on structured data, such as property details, market trends, and price fluctuations. However, the lack of specialized Tabular Question Answering datasets in this domain limits the development of automated questio
Externí odkaz:
http://arxiv.org/abs/2412.10104
Training-free diffusion models have achieved remarkable progress in generating multi-subject consistent images within open-domain scenarios. The key idea of these methods is to incorporate reference subject information within the attention layer. How
Externí odkaz:
http://arxiv.org/abs/2411.19261
Publikováno v:
ACM Transactions on Information Systems 40(1): 9:1-9:44 (2022)
Incorporating external knowledge into dialogue generation has been proven to benefit the performance of an open-domain Dialogue System (DS), such as generating informative or stylized responses, controlling conversation topics. In this article, we st
Externí odkaz:
http://arxiv.org/abs/2411.09166
Autor:
Liu, Siyi, Ning, Qiang, Halder, Kishaloy, Xiao, Wei, Qi, Zheng, Htut, Phu Mon, Zhang, Yi, John, Neha Anna, Min, Bonan, Benajiba, Yassine, Roth, Dan
Open domain question answering systems frequently rely on information retrieved from large collections of text (such as the Web) to answer questions. However, such collections of text often contain conflicting information, and indiscriminately depend
Externí odkaz:
http://arxiv.org/abs/2410.12311
Climate change's destruction of marine biodiversity is threatening communities and economies around the world which rely on healthy oceans for their livelihoods. The challenge of applying computer vision to niche, real-world domains such as ocean con
Externí odkaz:
http://arxiv.org/abs/2412.02262
Autor:
Wang, Ruicheng, Xu, Sicheng, Dai, Cassie, Xiang, Jianfeng, Deng, Yu, Tong, Xin, Yang, Jiaolong
We present MoGe, a powerful model for recovering 3D geometry from monocular open-domain images. Given a single image, our model directly predicts a 3D point map of the captured scene with an affine-invariant representation, which is agnostic to true
Externí odkaz:
http://arxiv.org/abs/2410.19115
Open-domain long-form text generation requires generating coherent, comprehensive responses that address complex queries with both breadth and depth. This task is challenging due to the need to accurately capture diverse facets of input queries. Exis
Externí odkaz:
http://arxiv.org/abs/2410.15511
We introduce MDSGen, a novel framework for vision-guided open-domain sound generation optimized for model parameter size, memory consumption, and inference speed. This framework incorporates two key innovations: (1) a redundant video feature removal
Externí odkaz:
http://arxiv.org/abs/2410.02130
Shared memories between two individuals strengthen their bond and are crucial for facilitating their ongoing conversations. This study aims to make long-term dialogue more engaging by leveraging these shared memories. To this end, we introduce a new
Externí odkaz:
http://arxiv.org/abs/2410.20682
Autor:
Rony, Md Rashad Al Hasan, Shaha, Sudipto Kumar, Hasan, Rakib Al, Dey, Sumon Kanti, Rafi, Amzad Hossain, Sirajee, Ashraf Hasan, Lehmann, Jens
Bengali is the seventh most spoken language on earth, yet considered a low-resource language in the field of natural language processing (NLP). Question answering over unstructured text is a challenging NLP task as it requires understanding both ques
Externí odkaz:
http://arxiv.org/abs/2410.10229