--------------------------------------------------------------------- Atmospheric Hydrogen Mixing Ratios from the NOAA GMD Carbon Cycle Cooperative Global Air Sampling Network, 1988-2005 Version: 2006-11-27.1229 -------------------------------------------------------------------- CONTENTS 1. Data source and contacts 2. Use of data 3. Reciprocity 4. Warnings 5. Update notes 6. Introduction 7. DATA - General Comments 7.1 DATA - Sampling Locations 7.2 DATA - File Name Description 7.3 DATA - Event 7.4 DATA - QC Flags 7.5 DATA - Monthly Averages 8. Data retrieval 9. References -------------------------------------------------------------------- 1. DATA SOURCE AND CONTACTS National Oceanic and Atmospheric Administration (NOAA) Earth System Research Laboratory (ESRL) Global Monitoring Division (GMD) Carbon Cycle Greenhouse Gases (CCGG) Correspondence concerning these data should be directed to: Paul C. Novelli NOAA ESRL Global Monitoring Division 325 Broadway, GMD-1 Boulder, CO 80305 U.S.A. Telephone: 303 497-6974 Electronic Mail: Paul.C.Novelli@noaa.gov -------------------------------------------------------------------- 2. USE OF DATA These data are made freely available to the public and the scientific community in the belief that their wide dissemination will lead to greater understanding and new scientific insights. The availability of these data does not constitute publication of the data. NOAA relies on the ethics and integrity of the user to assure that GMD receives fair credit for their work. If the data are obtained for potential use in a publication or presentation, GMD should be informed at the outset of the nature of this work. If the GMD data are essential to the work, or if an important result or conclusion depends on the GMD data, co-authorship may be appropriate. This should be discussed at an early stage in the work. Manuscripts using the GMD data should be sent to GMD for review before they are submitted for publication so we can insure that the quality and limitations of the data are accurately represented. -------------------------------------------------------------------- 3. RECIPROCITY Use of these data implies an agreement to reciprocate. Laboratories making similar measurements agree to make their own data available to the general public and to the scientific community in an equally complete and easily accessible form. Modelers are encouraged to make available to the community, upon request, their own tools used in the interpretation of the GMD data, namely well documented model code, transport fields, and additional information necessary for other scientists to repeat the work and to run modified versions. Model availability includes collaborative support for new users of the models. -------------------------------------------------------------------- 4. WARNINGS Every effort is made to produce the most accurate and precise measurements possible. However, we reserve the right to make corrections to the data based on recalibration of standard gases or for other reasons deemed scientifically justified. We are not responsible for results and conclusions based on use of these data without regard to this warning. -------------------------------------------------------------------- 5. UPDATE NOTES -------------------------------------------------------------------- 6. INTRODUCTION Individual site files provide H2 mixing ratios in parts per billion (ppb) (ppb = 1 part in 10^-9 by mole fraction = nmole/mole) based on measurements from the NOAA CMDL Carbon Cycle Cooperative Global Air Sampling Network. In this directory, data are provided through June 30, 1998 for stations from which the first sample was collected before January 1995. Table 1, accessed below, lists the flask network sampling location, the 3 letter codes used to identify these sites, their latitude, longitude and altitude. Data from sites not provided in this directory may be available from P. Novelli (CMDL). All air samples were analyzed for H2 at the NOAA CMDL laboratory in Boulder using gas chromatography with mercuric oxide reduction detection, with all measurements referenced to an internal H2 calibration scale (Novelli et al., 1999). During 1988-1991, one flask of a sample pair was analyzed for H2, and when there was suitable pressure remaining in the flask, two or more aliquots were measured. We have used the difference in H2 mixing ratios between the aliquots as an indication of the precision of the measurements. We used a single point calibration as the response characteristics of the instrument had been shown to be linear over a range of 480 to 600 ppb CO (Novelli et al., 1999). In October 1991 we began analyzing a single aliquot from both members of a flask pair (rather than 2 or more aliqouts from a single flask). The principle reasons for the change were to simplify flask handling procedures (The CMDL carbon monoxide and methane projects also measure both flasks of a sample pair) and to use pair agreement (the difference in mixing ratio between the two flasks collected simultaneously) as an additional diagnostic to use in evaluating the quality of the data. We believe that pair agreement provides quality constraints on the results from collection through analysis. Novelli et al. (1999) provide a detailed description of the history of H2 measurements at CMDL. ------------------------------------------------------------------- 7. DATA - GENERAL COMMENTS Measurements are reported in units of 10^-9 mol H2 per mol of dry air (nmol/mol) or parts per billion (ppb)) relative to the NOAA CMDL H2 scale (Novelli et al., 1999). Long-term measurement uncertainty is estimated to be 15 ppb (~3%). The absolute accuracy of our H2 scale is unknown. In 2004 a correction was applied to all measurements made between Dec 1998 and Nov 2002 to account for drift in a working standard. The drift rate was determined from multiple calibrations using other H2 standards that are believed stable. The Pacific Ocean Cruise (POC, travelling between the US west coast and New Zealand or Australia), Atlantic Ocean Cruise (AOC, US east coast to South Africa), and Western Pacific Cruise (WPC, New Zealand and Japan) data have been merged and grouped into 5 degree latitude bins. For the South China Sea cruises (SCS) the data are grouped in 3 degree latitude bins. Sampling frequencies are approximately weekly for the fixed sites and average one sample every 3 weeks per latitude zone for POC and about one sample every week per latitude for SCS. The air samples are collected by two general methods: flushing and then pressurizing glass flasks with a pump, or opening a stopcock on an evacuated glass flask. During each sampling event, a pair of flasks is filled. 7.1 DATA - SAMPLING LOCATIONS Table of NOAA GMD cooperative air sampling sites. The table includes the three letter code used to identify each site; the site name; latitude, longitude, and altitude (meters above sea level) of the sampling location; the number of samples; the first and last sample dates; and sampling status. Note: Data for all species may not be available for all sites listed in the table. To view near real-time data, manipulate and compare data, and create custom graphs, please visit http://www.cmdl.noaa.gov/ccgg/iadv/. ------------------------------------------------------------------- 7.2 DATA - FILE NAME DESCRIPTION Encoded into each file name is the sampling location, platform, and strategy; measurement laboratory; file content; and gas identifier. All file names use the following naming scheme: 1 2 3 4 5 6 7 [site][data grouping]_[lab#][strategy][platform]_[qualifier].[gas] 1. [Sampling site] (ex) brw_, poc_, car_, amt_ 2. [Grouping of data within the file] Data may be grouped by date, latitude, longitude, altitude, etc. If data are grouped by date then identifier has yyyy[mm][dd] format. (ex) brw2005_, amt200403_, lef20050315_ If the sampling platform is an aircraft then the identifier is a 3-character numeric field with units of 10^2 meters (hm) above sea level. (ex) car040_, haa005_ If the sampling platform is a ship then the identifier is a 3-character alphanumeric field with units of degrees (00-90). Bins in the northern and southern hemispheres are denoted as n## and s## respectively. The equatorial bin is denoted as 000. (ex) pocs25_, poc000_, scsn03_ 3. [Measurement laboratory] A two character numeric field identifies the measurement laboratory (01-99). NOAA GMD is lab number 01 (see http://www.cmdl.noaa.gov/ccgg/globalview/labs/). 4. [Sampling strategy] A single alphanumeric character (0-9,a-z,A-Z) indicates the sampling strategy. _??C Semi-continuous _??D Discrete 5. [Sampling platform] A single alphanumeric character (0-9,a-z,A-Z) indicates the sampling platform. _???0 Land _???1 Ship _???2 Aircraft _???3 Tower 6. [Qualifier] An alphanumeric string describes the type of data included in the file. _????_event Data from every collection event _????_mm Computed monthly mean values _????_hr Computed hourly averages (semi-continuous data only) _????_day Computed daily averages (semi-continuous data only) _????_all All Data 7. [Gas] Identifies the trace gas species. _????_???.co2 Carbon dioxide _????_???.ch4 Methane _????_???.co2c13 d13C (co2) ------------------------------------------------------------------- 7.3 DATA - EVENT The data files "ccg/h2/flask/event/" use the following naming scheme (see Section 7.2): xxx[data grouping]_01D[platform]_event.h2 (ex) pocn30_01D1_event.h2 contains all event data from POC samples collected from a moving ship and grouped at 30N +/- 2.5 degrees. (ex) brw_01D0_event.h2 contains all event data from from BRW samples collected from a fixed position. The data files contain multiple lines of header information followed by one line for each atmospheric measurement. Fields are defined as follows: Field 1: [SITE CODE] The three-character sampling location code (see above). Field 2: [YEAR] The sample collection date and time in UTC. Field 3: [MONTH] Field 4: [DAY] Field 5: [HOUR] Field 6: [MINUTE] Field 7: [FLASK ID] The sample container ID. Field 8: [METHOD] A single-character code that identifies the sample collection method. The codes are: P - Sample collected using a portable, battery powered pumping unit. Two flasks are connected in series, flushed with air, and then pressurized to 1.2 - 1.5 times ambient pressure. D - Similar to P but the air passes through a condenser cooled to about 5 deg C to partially dry the sample. G - Similar to D but with a gold-plated condenser. T - Evacuated flask filled by opening an O-ring sealed stopcock. S - Flasks filled at NOAA CMDL observatories by sampling air from the in situ CO2 measurement air intake system. N - Before 1981, flasks filled using a hand-held aspirator bulb. After 1981, flasks filled using a pump different from those used in method P, D, or G. F - Five liter evacuated flasks filled by opening a ground glass, greased stopcock. Field 9: [TRACE GAS NAME] Gas identifier (e.g., co2, co2c13). Field 10: [MEASURED VALUE] Dry air mole fraction or isotopic composition. Missing values are denoted by -999.99[9]. Field 11: [QC FLAG] A three-character field indicating the results of our data rejection and selection process, described in section 7.4. Field 12: [INSTRUMENT] A 2-character code that identifies the instrument used for the measurement. Field 13: [YEAR] The measurement date and time in LT. Field 14: [MONTH] Field 15: [DAY] Field 16: [HOUR] Field 17: [MINUTE] Field 18: [LATITUDE] The latitude where the sample was collected, (negative (-) numbers indicate samples collected in the southern hemipshere). Field 19: [LONGITUDE] The longitude where the sample was collected, (negative (-) numbers indicate samples collected in the western hemisphere). Field 20: [ALTITUDE] The altitude where the sample was collected (masl). Field 21: [WIND SPEED] Wind Speed (m/s) as reported by the field operator during sample collection. Please note: CCCG has no means by which to assess the reliability of these recorded observations. Field 22: [WIND DIRECTION] Wind Direction (degrees) as reported by the field operator during sample collection. Please note: CCGG has no means by which to assess the reliability of these recorded observations. Field 23: [EVENT NUMBER] A long integer that uniquely identifies the sampling event. Fields in each line are delimited by whitespace. (ex) KEY 2004 12 17 15 00 2185-99 D co2 400.170 -.. L3 2005 01 03 13 52 25.66 -80.20 3.00 4.6 90 193357 ------------------------------------------------------------------- 7.4 QC FLAGS NOAA GMD uses a 3-column quality control flag where each column is defined as follows: column 1 REJECTION flag. An alphanumeric other than a period (.) in the FIRST column indicates a sample with obvious problems during collection or analysis. This measurement should not be interpreted. column 2 SELECTION flag. An alphanumeric other than a period (.) in the SECOND column indicates a sample that is likely valid but does not meet selection criteria determined by the goals of a particular investigation. column 3 COMMENT flag. An alphanumeric other than a period (.) in the THIRD column provides additional information about the collection or analysis of the sample. WARNING: A "P" in the 3rd column of the QC flag indicates the measurement result is preliminary and has not yet been carefully examined by the PI. The "P" flag is removed once the quality of the measurement has been determined. If both the first and second column contain a period (e.g., "..." and "..I"), the sample is RETAINED. Since samples are collected in pairs, the pair difference is calculated, and samples with a pair difference greater than 15 ppb are flagged. Retained and rejected flasks are flagged as follows. (NOTE: If either the first or second of these characters is not a period, the sample has been rejected.) Flag Description Retained ... good pair (D <= 15 ppb) Rejected .X. Not representative of background conditions (> 3 sigma from a fitted curve), may include unidentified sampling/analysis problems. .N. High/low mixing ratio thought to not represent background condiitons. P.. Poor flask pair agreement (D > 15 ppb) Z.. Poor aliquot agreement (D > 7.5 ppb) *.. off scale result, broken flask or unacceptable analysis W.. Wrong flask type for H2 S.. Sample was collected through the CO2 intake for the in situ analyzer ..I Flask is one of the pair used in an intercomparison with the cooperating agency The retained values comprise the data set that we feel best represents the H2 distribution in the remote, well-mixed global troposphere. It is possible, and even likely, that some values flagged as non-background conditions are valid measurements, but represent poorly mixed air parcels influenced by local anthropogenic sources or strong local biospheric sources or sinks. Users of these data should be aware that data selection is a difficult but necessary aspect of the analysis and interpretation of atmospheric trace gas data sets, and the specific data selection scheme used may be determined by the goals of a of a particular investigation. ------------------------------------------------------------------- 7.5 DATA - MONTHLY AVERAGES The data files "ccg/h2/flask/month/" use the following naming scheme (see Section 7.2): xxx[data grouping]_01D[platform]_mm.h2 (ex) pocn30_01D1_mm.h2 contains computed monthly mean values from POC samples collected from a moving ship and grouped at 30N +/- 2.5 degrees. (ex) brw_01D0_mm.h2 contains computed monthly mean values from BRW samples collected from a fixed position. Monthly means are produced for each site by first averaging all valid measurement results in the event file with a unique sample date and time. Values are then extracted at weekly intervals from a smooth curve (Thoning et al., 1989) fitted to the averaged data and these weekly values are averaged for each month to give the monthly means recorded in the files. Flagged data are excluded from the curve fitting process. Some sites are excluded from the monthly mean directory because sparse data or a short record does not allow a reasonable curve fit. Also, if there are 3 or more consecutive months without data, monthly means are not calculated for these months. The data files contain multiple lines of header information followed by one line for each available month. Fields are defined as follows: Field 1: [SITE CODE] The three-character sampling location code (see above). Field 2: [YEAR] The sample collection year and month. Field 3: [MONTH] Field 4: [MEAN VALUE] Computed monthly mean value In these files a monthly mean value of -999.99 denotes months where there are insufficient data to calculate a monthly mean. ------------------------------------------------------------------- 8. DATA RETRIEVAL To transfer all files in a directory, it is more efficient to download the tar or zipped files. To transfer a tar file, use the following steps from the ftp prompt: 1. ftp> binary ! set transfer mode to binary 2. ftp> get filename.tar.Z ! transfer the file 3. ftp> bye ! leave ftp 4. $ uncompress filename.tar.Z ! uncompress your local copy 5. $ tar xvf filename.tar ! unpack the file To transfer a zipped file, use the following steps from the ftp prompt: 1. ftp> binary ! set transfer mode to binary 2. ftp> get filename.zip ! transfer the file 3. ftp> bye ! leave ftp 4. $ unzip filename.zip ! uncompress your local copy ------------------------------------------------------------------- 9. REFERENCES Lang, P.M., L.P. Steele, R.C. Martin, and K.A. Masarie, Atmospheric methane data for the period 1983-1985 from the NOAA/GMCC global cooperative flask sampling network, NOAA Technical Memorandum ERL CMDL-1, 1990a. Lang, P.M., L.P. Steele, and R.C. Martin, Atmospheric methane data for the period 1986-1988 from the NOAA/CMDL global cooperative flask sampling network, NOAA Technical Memorandum ERL CMDL-2, 1990b. Novelli, P.C., K.A. Masarie, and P.M. Lang, Molecular hydrogen in the troposhere: Global distributions and budget, J. Geophys Res., 104, 30,427-30,444, 1999. Thoning, K.W., P.P. Tans, and W.D. Komhyr, 1989, Atmospheric carbon dioxide at Mauna Loa Observatory 2. Analysis of the NOAA GMCC data, 1974-1985, J. Geophys. Res., 94, 8549-8565. -------------------------------------------------------------------