A large-scale dataset for training deep learning segmentation and tracking of extreme weather.

Journal: Scientific data
Published Date:

Abstract

As Earth's climate continues to undergo changes, it is imperative to gain understanding of how high-impact, extreme weather events will change. Researchers are increasingly relying on data-driven, learning-based approaches for the detection and tracking of extreme weather events. While several attempts to generate datasets of hand-labeled weather or climate have been made, a significant challenge has been to gather a sufficient number of expert-annotated samples. To address this challenge, we introduce the largest dataset of expert-guided, hand-labeled segmentation masks of extreme weather events. It contains global annotations for atmospheric rivers, tropical cyclones, and atmospheric blocking events from the European Centre for Medium-Range Weather Forecasting's reanalysis version 5. Every timestep for each event is annotated by two separate annotators to bring the total number of labeled timesteps to 49,184. Professional annotators were trained and guided to identify these features by domain-experts, and event-specific experts were consulted for each of the annotation guides. The resulting annotations are demonstrated to have characteristics similar to other methods and those generated directly by domain experts.

Authors

  • Sol Kim
    Department of Pediatrics, Seoul St. Mary's Hospital, The Catholic University of Korea, Seoul, Republic of Korea; Department of Pediatrics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea.
  • Andre Graubner
    ETH Zurich, Institute of Geodesy and Photogrammetry, Zurich, Switzerland. andregr@phys.ethz.ch.
  • Lukas Kapp-Schwoerer
    ETH Zurich, Institute of Geodesy and Photogrammetry, Zurich, Switzerland. Lukas.Kapp.Schwoerer@gmail.com.
  • Karthik Kashinath
    NVIDIA Corporation, HPC + AI, Santa Clara, 95051, USA.
  • Konrad Schindler

Keywords

No keywords available for this article.