You are on page 1of 1

Saving Canadian surface observations of weather, climate, and hydrological variables, 1984-1998 A data rescue project

Katie Schmitt Graduate School of Library and Information Science -- University of Illinois at Urbana-Champaign Data collection
Dr. Rouse, Professor Emeritus at McMaster University, was considered to be one of the top scientists in his field by the Canadian Association of Geographers (CAG). CAG stated that Dr. Rouse pioneered field research on the evaporation of high latitude sites, including woodlands, tundra, wetland and lake surfaces (1998) and he received the 1998 CAG Award for Scholarly Distinction in Geography. The data were collected over a 14-year period, from 1984 to 1998, in multiple studies by Dr. Rouse and his teams. Sites were chosen in Churchill, Manitoba, Marantz Lake, Manitoba, and Inuvik, Northwest Territories, specifically to include a variety of terrains ranging from wetlands to boulders.

Introduction
ClimoBase is a dataset that consists of incremental measurements resulting from multiple surface-climate studies led by Dr. Wayne Rouse from 1984 to 1998. The data will be preserved for future research in the NOAA collection at the National Snow and Ice Data Center (NSIDC) in Boulder, Colorado.

Outcome
After the rescue process, ClimoBase has the potential to become a highly-valued reference dataset. Observational data, specifically regarding climate science, is of high value to a variety of individuals including scientists, politicians, students, and journalists. Migrate Tools: Python and Linux ClimoBase 2014 Migrated files to CSV format 7,008 CSV files / 67 MB Replaced space delimiters with Data files organized by study commas for easy processing Detailed README.txt files Standardized NULL values Separated values that were Descriptive metadata merged together by positive Stored securely or negative signs Accessed via NSIDC Replaced values merged together by multiple decimal points with NULL Describe Replaced previous flagged values of -6999 with NULL

Appraise

Reuse
Map Data 2014 Google

Plan

A plain text description of ClimoBase was submitted to NSIDCs technical writers in lieu of a specific metadata schema. Including provenance records, contact information, an accurate citation and detailed information about scope of the dataset.

Original state of the dataset


The original dataset was created as a central repository for Dr. Rouses research. The majority of the data are in daily ClimoBase archive files that consist of measure1999 ments made in regular increments over 24 hour periods. 7068 files / 62 MB Fortran extraction program Scope Encoded file naming convention Numeric NULL value (-6999) Surface-climate measurements No metadata including solar time, wind speed, Minimal explanation wind direction, dry-bulb, wet-bulb, Space delimited and vapor pressure in 24 unique sites. Preliminary Quality Analysis Flagged any erroneous or missing measurements -6999 Provenance A CD-ROM containing the data as well as ancillary information was given to former NSIDC Director Dr. Roger Barry.

Access

Data Rescue Workflow


Adapted from the DCC Curation Lifecycle Model

Preserve

The data files were divided and grouped by specific study and then described with README.txt files. Store and Access The migrated and preserved data will be stored on a server owned by the National Snow and Ice Data Center (NSIDC) The public will be allowed to access ClimoBase data through nsidc.org

Store

Migrate

Before:
16.93 -15.54 69.43 -17.91 66.94 -20.92 36.69 -16.07 45.22 77.78 109.08 186.19 -5.80 45.90 80.04 43.18 0.187 0.0161 0.0767 0.295 0.0178 0.0094 0.324 0.0159 0.0081 0.379 0.0150 0.0037 0.414 0.0176 0.0010 0.481 0.0194-0.0048 0.507 0.0160-0.0069 0.508 0.0202-0.0038 0.505 0.0140-0.0132 0.503 0.0172-0.0119 0.394 0.0151-0.0139 0.351 0.0144-0.0179 0.358 0.0149-0.0212 0.357 0.0186-0.0283 0.295 0.0134-0.0484 0.407 0.0172-0.0209

After:
16.93,-15.54,0.187,0.0161,0.0767 69.43,-17.91,0.295,0.0178,0.0094 66.94,-20.92,0.324,0.0159,0.0081 36.69,-16.07,0.379,0.0150,0.0037 45.22,-5.80,0.414,0.0176,0.0010 77.78,45.90,0.481,0.0194,0.0048 109.08,80.04,0.507,0.0160,0.0069 186.19,43.18,0.508,0.0202,-0.0038 165.91,161.35,0.505,0.0140,-0.0132 158.47,141.05,0.503,0.0172,-0.0119 104.02,81.32,0.394,0.0151,-0.0139 81.48,76.60,0.351,0.0144,-0.0179 118.83,99.21,0.358,0.0149,-0.0212 146.65,139.12,0.357,0.0186,-0.0283 150.01,156.52,0.295,0.0134,-0.0484 148.14,143.44,0.407,0.0172,-0.0209

165.91 161.35 158.47 141.05

Describe

104.02 81.48 118.83

81.32 76.60 99.21

146.65 139.12 150.01 156.52 148.14 143.44

Sample data from a ClimoBase daily archive file

Resources
1998 CAG Award for Scholarly Distinction in Geography. (1998). Retrieved September 15, 2013, from http://www.cag-acg.ca/en/wayne_robert_rouse.html DCC Curation Lifecycle Model. (n.d.). Retrieved September 15, 2013, from http://www.dcc.ac.uk/resources/curation-lifecycle-model National Science Board. (2005). The elements of the digital data collections universe. Long-lived Digital Data Collections. Available: http://www.nsf.gov/pubs/2005/nsb0540/ NOAA at NSIDC. (n.d.). Retrieved September 15, 2013, from http://nsidc.org/noaa/

Acknowledgments
The author would like to especially thank Florence Fetterer, Program Manager, and Ruth Duerr, Principal Investigator, at the National Snow and Ice Data Center (NSIDC) in Boulder, Colorado for their help and support. Thank you also to Dr. Wayne Rouse, Professor Emeritus at McMaster University, for the opportunity and experience.

Further information
ClimoBase data is expected to be available to the general public for research use in the summer of 2014. The dataset has the potential to show evidence of climate changes in Northern Canada throughout the time of Dr. Rouses studies and as compared to today. kmschmi2@illinois.edu @kmschmitt http://katieschmitt.me

You might also like