FathomGPT: A Natural Language Interface for Interactively Exploring Ocean Science Data

Authors: Khanal, Nabin; Yu, Chun Meng; Chiu, Jui-Cheng; Chaudhary, Anav; Zhang, Ziyue; Katija, Kakani; Forbes, Angus G.
Publication year: 2024
Subject:
Source: UIST 2024: Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology
Document type: Working Paper
DOI: 10.1145/3654777.3676462
Description: We introduce FathomGPT, an open-source system for the interactive investigation of ocean science data via a natural language interface. FathomGPT was developed in close collaboration with marine scientists to enable researchers to explore and analyze the FathomNet image database. FathomGPT provides a custom information retrieval pipeline that leverages OpenAI's large language models to support: creating complex queries to retrieve images, taxonomic information, and scientific measurements; mapping common names and morphological features to scientific names; generating interactive charts on demand; and searching by image or by specified patterns within an image. In designing FathomGPT, particular emphasis was placed on enhancing the user's experience by facilitating free-form exploration and optimizing response times. We present an architectural overview and implementation details of FathomGPT, along with a series of ablation studies that demonstrate the effectiveness of our approach to name resolution, fine-tuning, and prompt modification. We also present usage scenarios of interactive data exploration sessions and document feedback from ocean scientists and machine learning experts.
Comment: The first two authors contributed equally to this work. Accepted to the 37th Annual ACM Symposium on User Interface Software and Technology (UIST 2024)
Database: arXiv