ClimateNet: an expert-labeled open dataset and deep learning architecture for enabling high-precision analyses of extreme weather

Authors:	Andrew Lou, Sathyavat Chandran, Mayur Mudigonda, Lukas Kapp-Schwoerer, Ankur Mahesh, Annette Greiner, Prabhat, Katherine Dagon, Thorsten Kurth, Ege Karaismailoglu, W. Chapman, William D. Collins, Andre Graubner, Karthik Kashinath, Ben Toms, Jiayi Chen, Sol Kim, Colby Lewis, Christine A. Shields, Leo von Kleist, Michael Wehner, Travis A. O'Brien, Kevin Yang
Year of publication:	2021
Subject:	010504 meteorology & atmospheric sciences; business.industry; Computer science; lcsh:QE1-996.5; Supervised learning; Context (language use); Weather and climate; Atmospheric model; Machine learning; computer.software_genre; 01 natural sciences; lcsh:Geology; 010104 statistics & probability; Extreme weather; Analytics; Pattern recognition (psychology); Climate model; Artificial intelligence; 0101 mathematics; business; computer; 0105 earth and related environmental sciences
Source:	Geoscientific Model Development, Vol 14, Pp 107-124 (2021)
ISSN:	1991-9603
Description:	Identifying, detecting, and localizing extreme weather events is a crucial first step in understanding how they may vary under different climate change scenarios. Pattern recognition tasks such as classification, object detection, and segmentation (i.e., pixel-level classification) have remained challenging problems in the weather and climate sciences. While there exist many empirical heuristics for detecting extreme events, the disparities between the output of these different methods even for a single event are large and often difficult to reconcile. Given the success of deep learning (DL) in tackling similar problems in computer vision, we advocate a DL-based approach. DL, however, works best in the context of supervised learning – when labeled datasets are readily available. Reliable labeled training data for extreme weather and climate events is scarce. We create “ClimateNet” – an open, community-sourced human-expert-labeled curated dataset that captures tropical cyclones (TCs) and atmospheric rivers (ARs) in high-resolution climate model output from a simulation of a recent historical period. We use the curated ClimateNet dataset to train a state-of-the-art DL model for pixel-level identification – i.e., segmentation – of TCs and ARs. We then apply the trained DL model to historical and climate change scenarios simulated by the Community Atmospheric Model (CAM5.1) and show that the DL model accurately segments the data into TCs, ARs, or “the background” at a pixel level. Further, we show how the segmentation results can be used to conduct spatially and temporally precise analytics by quantifying distributions of extreme precipitation conditioned on event types (TC or AR) at regional scales. The key contribution of this work is that it paves the way for DL-based automated, high-fidelity, and highly precise analytics of climate data using a curated expert-labeled dataset – ClimateNet.
ClimateNet and the DL-based segmentation method provide several unique capabilities: (i) they can be used to calculate a variety of TC and AR statistics at a fine-grained level; (ii) they can be applied to different climate scenarios and different datasets without tuning as they do not rely on threshold conditions; and (iii) the proposed DL method is suitable for rapidly analyzing large amounts of climate model output. While our study has been conducted for two important extreme weather patterns (TCs and ARs) in simulation datasets, we believe that this methodology can be extended to a much broader class of patterns and, via transfer learning, to observational and reanalysis data products.
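The abstract's idea of quantifying precipitation distributions conditioned on event type can be illustrated with a minimal Python sketch. This is not the authors' code: the integer label encoding (0 = background, 1 = TC, 2 = AR), the function name `conditional_precip_percentile`, and the toy values are all assumptions made for illustration. It only shows how a pixel-level segmentation mask can select the precipitation values that belong to one event class.

```python
import numpy as np

# Assumed label encoding for the segmentation mask (hypothetical,
# not taken from the catalog record or the paper).
BACKGROUND, TC, AR = 0, 1, 2


def conditional_precip_percentile(precip, labels, event, q=99.0):
    """Return the q-th percentile of precipitation over pixels labeled `event`.

    `precip` and `labels` must have identical shapes, e.g. (time, lat, lon).
    Returns NaN when no pixel carries the requested label.
    """
    precip = np.asarray(precip, dtype=float)
    labels = np.asarray(labels)
    if precip.shape != labels.shape:
        raise ValueError("precip and labels must have the same shape")
    selected = precip[labels == event]  # boolean mask picks one event class
    if selected.size == 0:
        return float("nan")
    return float(np.percentile(selected, q))


# Toy 2x3 grid: made-up segmentation labels and precipitation values.
labels = np.array([[0, 1, 1],
                   [2, 2, 0]])
precip = np.array([[0.0, 30.0, 50.0],
                   [10.0, 20.0, 1.0]])

tc_median = conditional_precip_percentile(precip, labels, TC, q=50)
ar_median = conditional_precip_percentile(precip, labels, AR, q=50)
```

A regional analysis along these lines would first restrict both arrays to the latitude and longitude window of interest and then apply the same masking step.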
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::bfd3be8619132ccee57e29942fad10e3 https://doi.org/10.5194/gmd-14-107-2021