15
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Meteorological data rescue: Citizen science lessons learned from Southern Weather Discovery

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Summary

          Daily weather reconstructions (called “reanalyses”) can help improve our understanding of meteorology and long-term climate changes. Adding undigitized historical weather observations to the datasets that underpin reanalyses is desirable; however, time requirements to capture those data from a range of archives is usually limited. Southern Weather Discovery is a citizen science data rescue project that recovered tabulated handwritten meteorological observations from ship log books and land-based stations spanning New Zealand, the Southern Ocean, and Antarctica. We describe the Zooniverse-hosted Southern Weather Discovery campaign, highlight promotion tactics, and replicate keying levels needed to obtain 100% complete transcribed datasets with minimal type 1 and type 2 transcription errors. Rescued weather observations can augment optical character recognition (OCR) text recognition libraries. Closer links between citizen science data rescue and OCR-based scientific data capture will accelerate weather reconstruction improvements, which can be harnessed to mitigate impacts on communities and infrastructure from weather extremes.

          Highlights

          • Data rescue on Zooniverse allowed rapid transcription of historic weather observations

          • Eight replicate data entries can be used to obtain consensus with minimal errors

          • Transcribed weather observations can dramatically expand OCR character libraries

          The bigger picture

          Citizen science has the potential to capture historical handwritten scientific tabulated data that are not held in digital databases. However, undertaking a citizen science campaign for that purpose is not well described, which we address here. Our citizen science data rescue approach constrained data keying targets, developed participant instructions using clear examples, established replication levels to maximize completeness and confidence of data transcription, and demonstrated common data rescue pitfalls. We highlight how an effective communications strategy helps to maintain project momentum. Collaborating with industry to enhance optical character recognition (OCR) capability has the benefit of accelerating data rescue progress that can rapidly augment scientific data repositories. The resulting improvements to comprehensive historical weather datasets with global coverage can support models and predictive capabilities that help mitigate impacts on society from extreme weather.

          Abstract

          Southern Weather Discovery is a citizen science project on Zooniverse that captured handwritten historical weather observations. This descriptor article outlines how we ran that citizen science project, which can be adapted to a wide range of disciplines. We highlight replicated data keying requirements to minimize transcription errors, some common pitfalls to avoid, and the importance of a good communications strategy. Our partnership with industry on optical character recognition shows potential to harness computer vision to accelerate historical scientific data capture.

          Related collections

          Most cited references52

          • Record: found
          • Abstract: not found
          • Article: not found

          Gradient-based learning applied to document recognition

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            The NCEP/NCAR 40-Year Reanalysis Project

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              The Twentieth Century Reanalysis Project

                Bookmark

                Author and article information

                Contributors
                Journal
                Patterns (N Y)
                Patterns (N Y)
                Patterns
                Elsevier
                2666-3899
                27 May 2022
                10 June 2022
                27 May 2022
                : 3
                : 6
                : 100495
                Affiliations
                [1 ]National Institute of Water and Atmospheric Research LTD, 41 Market Place, Auckland 1011 New Zealand
                [2 ]Met Office Hadley Centre, FitzRoy Road, Exeter EX1 3PB, UK
                [3 ]RECLAIM, Climatic Research Unit, University of East Anglia, Norwich NR4 7TJ, UK
                [4 ]Now based at Department of Paleobiology, National Museum of Natural History, Smithsonian Institution, Washington DC 20560, USA
                [5 ]Microsoft New Zealand LTD, Microsoft House, Level 5/22 Viaduct Harbour Avenue, Auckland CBD, Auckland 1010, New Zealand
                [6 ]Cooperative Institute for Research in Environmental Sciences, University of Colorado at Boulder, NOAA Physical Sciences Laboratory, Boulder, USA
                [7 ]National Centre for Atmospheric Science, Department of Meteorology, University of Reading, Reading RG6 6BB, UK
                Author notes
                []Corresponding author a.lorrey@ 123456niwa.co.nz
                [8]

                Lead contact

                Article
                S2666-3899(22)00080-0 100495
                10.1016/j.patter.2022.100495
                9214331
                dbbea95d-1c37-4223-80f4-6223ad5ef6ea
                © 2022 The Authors

                This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

                History
                : 9 January 2020
                : 8 March 2022
                : 28 March 2022
                Categories
                Descriptor

                data rescue,meteorology,climate,reanalysis,citizen science,zooniverse,optical character recognition

                Comments

                Comment on this article