Search results

Report

RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector

Authors: Wang, Zhensheng; Yang, Wenmian; Zhou, Kun; Zhang, Yiquan; Jia, Weijia

The real estate market relies heavily on structured data, such as property details, market trends, and price fluctuations. However, the lack of specialized Tabular Question Answering datasets in this domain limits the development of automated questio

External link: http://arxiv.org/abs/2412.10104

Report

Improving Multi-Subject Consistency in Open-Domain Image Generation with Isolation and Reposition Attention

Authors: He, Huiguo; Wang, Qiuyue; Zhou, Yuan; Cai, Yuxuan; Chao, Hongyang; Yin, Jian; Yang, Huan

Training-free diffusion models have achieved remarkable progress in generating multi-subject consistent images within open-domain scenarios. The key idea of these methods is to incorporate reference subject information within the attention layer. How

External link: http://arxiv.org/abs/2411.19261

Report

Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey

Authors: Ma, Longxuan; Li, Mingda; Zhang, Weinan; Li, Jiapeng; Liu, Ting

Published in: ACM Transactions on Information Systems 40(1): 9:1-9:44 (2022)

Incorporating external knowledge into dialogue generation has been proven to benefit the performance of an open-domain Dialogue System (DS), such as generating informative or stylized responses, controlling conversation topics. In this article, we st

External link: http://arxiv.org/abs/2411.09166

Report

Open Domain Question Answering with Conflicting Contexts

Authors: Liu, Siyi; Ning, Qiang; Halder, Kishaloy; Xiao, Wei; Qi, Zheng; Htut, Phu Mon; Zhang, Yi; John, Neha Anna; Min, Bonan; Benajiba, Yassine; Roth, Dan

Open domain question answering systems frequently rely on information retrieved from large collections of text (such as the Web) to answer questions. However, such collections of text often contain conflicting information, and indiscriminately depend

External link: http://arxiv.org/abs/2410.12311

Report

Composing Open-domain Vision with RAG for Ocean Monitoring and Conservation

Authors: Dyanatkar, Sepand; Li, Angran; Dungate, Alexander

Climate change's destruction of marine biodiversity is threatening communities and economies around the world which rely on healthy oceans for their livelihoods. The challenge of applying computer vision to niche, real-world domains such as ocean con

External link: http://arxiv.org/abs/2412.02262

Report

MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Authors: Wang, Ruicheng; Xu, Sicheng; Dai, Cassie; Xiang, Jianfeng; Deng, Yu; Tong, Xin; Yang, Jiaolong

We present MoGe, a powerful model for recovering 3D geometry from monocular open-domain images. Given a single image, our model directly predicts a 3D point map of the captured scene with an affine-invariant representation, which is agnostic to true

External link: http://arxiv.org/abs/2410.19115

Report

ConTReGen: Context-driven Tree-structured Retrieval for Open-domain Long-form Text Generation

Authors: Roy, Kashob Kumar; Akash, Pritom Saha; Chang, Kevin Chen-Chuan; Popa, Lucian

Open-domain long-form text generation requires generating coherent, comprehensive responses that address complex queries with both breadth and depth. This task is challenging due to the need to accurately capture diverse facets of input queries. Exis

External link: http://arxiv.org/abs/2410.15511

Report

MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation

Authors: Pham, Trung X.; Ton, Tri; Yoo, Chang D.

We introduce MDSGen, a novel framework for vision-guided open-domain sound generation optimized for model parameter size, memory consumption, and inference speed. This framework incorporates two key innovations: (1) a redundant video feature removal

External link: http://arxiv.org/abs/2410.02130

Report

SHARE: Shared Memory-Aware Open-Domain Long-Term Dialogue Dataset Constructed from Movie Script

Authors: Kim, Eunwon; Park, Chanho; Chang, Buru

Shared memories between two individuals strengthen their bond and are crucial for facilitating their ongoing conversations. This study aims to make long-term dialogue more engaging by leveraging these shared memories. To this end, we introduce a new

External link: http://arxiv.org/abs/2410.20682

Report

BanglaQuAD: A Bengali Open-domain Question Answering Dataset

Authors: Rony, Md Rashad Al Hasan; Shaha, Sudipto Kumar; Hasan, Rakib Al; Dey, Sumon Kanti; Rafi, Amzad Hossain; Sirajee, Ashraf Hasan; Lehmann, Jens

Bengali is the seventh most spoken language on earth, yet considered a low-resource language in the field of natural language processing (NLP). Question answering over unstructured text is a challenging NLP task as it requires understanding both ques

External link: http://arxiv.org/abs/2410.10229
