There is a vast amount of available station data. Versions have usually been adjusted and care has to be taken to be sure exactly which version has been used. Categories of information that I’ve tried to link here include both data and metadata. This is for orientation only. This does not include information on SST estimates or gridcell estimates.
Anthony Watts/ Surface Stations
http://www.surfacestations.org
John V http://www.opentemp.org/_results/_20070919_RuralCRN12_Revised.zip
Standards
http://www.campbellsci.com/documents/apnotes/siting.pdf
http://www.wmo.int/pages/prog/www/IMOP/meetings/Surface/ET-STMT1_Geneva2004/Doc4.3(2).pdf
http://www1.ncdc.noaa.gov/pub/data/uscrn/documentation/program/X030FullDocumentD0.pdf (CRN)
Station Data in the Major Gridcell Composites
There are three major global indices of temperatures that incorporate station data: CRU, GISS and NOAA. Each of these groups primarily relies on the GHCN (Global Historical Climatology Network) for their input data. GHCN has two versions: 1 and 2. Each version contains max, min, mean and adjusted mean. A large proportion of the GHCN network is composed of the USHCN (US Historical Climatology Network). The USHCN network has two versions: 1 and 2 (not yet released), which do not coincide with GHCN versions. USHCN version 1 has raw, time-of-observation adjusted, adjusted and urban adjusted variations. Daily information is available for a subset of GHCN. Identification numbers are not consistent between USHCN and GHCN (and elsewhere). I know of no official concordance of USHCN and GHCN identifications and have archived my own.
GHCN Monthly see also GHCN Daily and USHCN
Readme http://www.ncdc.noaa.gov/oa/climate/ghcn-monthly/index.php . See also http://www.ncdc.noaa.gov/oa/climate/research/ghcn/ghcngrid.html
Directory ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/v2
An inventory of 7280 stations together with particulars such as lat, long, altitude, population etc,: ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/v2/v2.temperature.inv
Look at the directory for a list of available versions. There is a medium-sized zipped data file that is updated all the time (my present download – June 20, 2007) ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/v2/v2.mean.Z
GHCN carries out their own adjustments: ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/v2/v2.mean_adj.Z – also updated all the time. About 25% smaller in size than the raw data.
GHCN has 12 identification numbers: the first 3 are country code, the next 5 the nearby WMO station and the next 3 a station identified. All 11 digits are needed to identify a station. This 11 digit code links back to the Station Inventory File. The 12th digit identifies the “duplicate number” since GHCN raw archives versions that are scribally distinct.
There is a mirror at KNMI: http://climexp.knmi.nl/selectstation.cgi?someone@somewhere (I haven’t verified that versions correspond yet.)
Academic references:
http://www.ncdc.noaa.gov/oa/climate/ghcn-monthly/index.php http://www.ncdc.noaa.gov/oa/climate/ratpac/index.php?name=ratpac-a Holder et al: CCSP chapter 3
ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/v2/zipd
GHCN Version 1 (1992)
Directory http://cdiac.ornl.gov/ftp/ndp041/
Station List http://cdiac.ornl.gov/ftp/ndp041/temp.statinv
Data http://cdiac.ornl.gov/ftp/ndp041/temp.data.Z
GSOD
Stations ftp://ftp.ncdc.noaa.gov/pub/data/gsod/ish-history.txt
CRU
Station list only – my collation http://data.climateaudit.org/data/station/cru/cru.info.dat
GISS
Readme http://data.giss.nasa.gov/gistemp/
Station Lists: There are two station lists at GISS. The list http://data.giss.nasa.gov/gistemp/station_data/v2.temperature.inv.txt has 7364 stations and is the more comprehensive. The 7280 GHCN stations are all present; the others are nearly all from Antarctica plus 2 Southern Ocean stations. The other list http://data.giss.nasa.gov/gistemp/station_data/station_list.txt are the stations “as used”; I have not located an explanation of the reason for the subsetting.
Data: There are 3 GISS datasets: 0: raw; 1- combined; 2- adjusted.
dset=0 contains multiple versions and for the most part, this information seems to match GHCN versions, including the duplicate number. As noted above, different versions of one station may only have scribal variations.
dset=1 is the “combined” time series at one station. Very occasionally, there are two dset=1 versions for one station. The form of “combining” records is very idiosyncratic, involving pervasive adjustments; there is a lengthy discussion at climateaudit on the topic. For USHCN stations, the “raw” GISS version is generally similar to the USHCN adjusted (FILNET) version.
dset=2 – “adjusted” using the GISS adjustment, which supposedly adjusts the trend of urban stations to the trend of rural stations within 1000 km.
There is no comprehensive version of the data. Each individual time series can be scraped from the GISS website
http://data.giss.nasa.gov/gistemp/station_data/ . It is VERY time-consuming to scrape the entire data set: I’ve done so on a high-speed cable connection and the downloading of 16 (net) MB of dset=0 data took nearly 24 hours.
I’ve posted up my collations of scraped data as follows:
dset0: http://data.climateaudit.org/data/station/giss/giss.dsete0.tab
References: http://data.giss.nasa.gov/gistemp/references.html http://pubs.giss.nasa.gov/
USHCN
USHCN: CDIAC Version
- this is not as updated as NOAA
Prefix http://cdiac.ornl.gov/ftp/ushcn_monthly/
Readme http://cdiac.ornl.gov/ftp/ushcn_monthly/ushcn_monthly_doc.html
Station inventory http://cdiac.ornl.gov/ftp/ushcn_monthly/station_inventory
Note: http://cdiac.ornl.gov/ftp/ushcn_monthly/hcn_doe_mean_data.Z (20 MB archived May 10, 2005; up to Dec 2002). Includes three versions for individual stations: Areal Edited (Raw), TOBS (Time of Observation adjusted), and FILNET (adjusted). Max and min are also available.Downloaded MAy 25, 2007
Calculated mean data hcn_calc_mean_data.Z (12 MB archived May 12, 2005 at CDIAC; different version at NOAAto Dec 2002) Areal Edited, TOBS and FILNET Tmean data calculated from max and min versions ( hcn_doe_max_data.Z and hcn_doe_min_data.Z). So far I’ve not compared this to the hcn_doe_mean version.
Urban heat-adjusted data http://cdiac.ornl.gov/ftp/ushcn_monthly/urban_mean_fahr.Z (archived – 5.2 MB Feb 2007) I haven’t examined this data set yet or the UHI adjustment yet.
Daily http://cdiac.ornl.gov/ftp/ndp070/ http://cdiac.ornl.gov/ftp/ndp070/invent.txt stations http://cdiac.ornl.gov/ftp/ndp070/ istory http://cdiac.ornl.gov/ftp/ndp070/history.txt
Station history http://cdiac.ornl.gov/ftp/ushcn_monthly/station_history . Further details at http://mi3.ncdc.noaa.gov/mi3qry/login.cfm
Landuse metadata http://cdiac.ornl.gov/ftp/ushcn_monthly/station_landuse
Population metadata http://cdiac.ornl.gov/ftp/ushcn_monthly/metrof_orig
Old version (Revision 3) http://cdiac.ornl.gov/r3d/ushcn/ushcn.htm
USHCN: NOAA Version
Readme http://www.ncdc.noaa.gov/oa/climate/research/ushcn/
Directory http://www1.ncdc.noaa.gov/pub/data/ushcn/ updated Oct 11, 2007.
Prefix ftp://ftp.ncdc.noaa.gov/pub/data/ushcn http://www1.ncdc.noaa.gov/pub/data/ushcn/
Monthly mean data: ftp://ftp.ncdc.noaa.gov/pub/data/ushcn/hcn_doe_mean_data.Z dated March 1, 2007 is different than the version with the same name at CDIAC and goes up to late 2006. Includes three versions for individual stations: Areal Edited (Raw), TOBS (Time of Observation adjusted), and FILNET (adjusted). Max and min are also available. Urban http://www1.ncdc.noaa.gov/pub/data/ushcn/urban_mean_fahr.Z
Downloaded June 13, 2007.
Versions at NOAA have same nomenclature as CDIAC.
History (archived 1995) http://www1.ncdc.noaa.gov/pub/data/ushcn/station.history.Z
USHCN:Old Version
Readme http://www.ncdc.noaa.gov/oa/climate/research/ushcn/ushcn.html
USHCN:Daily Version
Stations: http://www1.ncdc.noaa.gov/pub/data/ushcn/daily/invent.txt
GHCN Daily
Directory ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily
Three very large zipped files:
all: ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/ghcnd_all.tar.gz 1.0 GB
gsn: ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/ghcnd_gsn.tar.gz 59 MB
hcn: ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/ghcnd_hcn.tar.gz 153MB
These three files appear to bundle the contents of 3 large subdirectories:
all: ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/all
gsn: ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/gsn
hcn: ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/hcn
The file structure is: 10500065516.dly etc using GHCND identification
hcn: 42500455946.dly 890 KB 8/2/2007 1:38:00 PM
Station Lists:
http://www1.ncdc.noaa.gov/pub/data/ghcn/daily/ghcnd-stations.txt
Station history versions not necessarily consistent. For example information on Aberdeen WA is at cdiac, but not MI3.
GHCN Daily:OA Version
Readme http://www.ncdc.noaa.gov/oa/climate/ghcn-daily/index.php
Daily directory ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily
all: Directory with “.dly” files for all of GHCND
gsn: Directory with “.dly” files for the GCOS Surface Network
hcn: Directory with “.dly” files for U.S. HCN
Daily Station Inventory: http://www1.ncdc.noaa.gov/pub/data/ghcn/daily/ghcnd-stations.txt
Raw Data http://www1.ncdc.noaa.gov/pub/data/ghcn/daily/hcn/
GCOS stations ghcnd-gsn.tar.gz
USHCN ghcnd-hcn.tar.gz
All ghcnd-all.tar.gz
Academic references: http://www.ncdc.noaa.gov/oa/climate/ghcn-monthly/index.php
NOAA Climvis
Station list: http://www1.ncdc.noaa.gov/pub/data/climvis/ghcn/v2.comp.srt
GSN (part of GCOS)
GSN = GCOS Surface Network
About 1100 stations, mainly airports. This is very up-to-date. GCOS also has upper-air measurements. GCOS is described in Peterson et al 1997 as a network of “good” stations.
Directory http://www1.ncdc.noaa.gov/pub/data/gcos/
Readme http://www.ncdc.noaa.gov/oa/hofn/global-insitu.html See also http://www.wmo.ch/pages/prog/gcos/
Station List http://www.ncdc.noaa.gov/hofngsn/HOFNGsnStn See also http://cdo.ncdc.noaa.gov/cdo/gsnmonstn.txt ;
Also see info at http://www1.ncdc.noaa.gov/pub/data/ghcn/daily/ghcnd-stations.txt with collated IDs
List of 1086 station numbers http://www1.ncdc.noaa.gov/pub/data/gcos/GCOS_GSN_STN_LIST
Station ranges http://www1.ncdc.noaa.gov/pub/data/gcos/COUNTRY_XREF_LIST11302006.prn
Summary of availabilities in 7 regions: http://www1.ncdc.noaa.gov/pub/data/gcos/WW_REG7_POR_summary etc
Webpage http://gosic.org/ios/GCOS-main-page.htm
Historical http://cdo.ncdc.noaa.gov/pls/plclimprod/cdomain.DS3500
GSN: Ron Ray Version
Directory: ftp://ftp.ncdc.noaa.gov/pub/data/gsn
Daily Data: ftp.ncdc.noaa.gov/pub/data/gsn/gsndy.zip (Very large; over 80 MB zipped;’ over 500 MB unzipped)
I’ve extracted info from the gsndy zip file and it is generally not up to date. This appears to be the historical collection.
To red in R from zip file, use: data_handle < – unz(url, "\\project\\gsn\\data\\gsndy.dat", "rb");
Raw Data Directory: ftp://ftp.ncdc.noaa.gov/pub/data/gsn/rawdata (By country)
Reformated Directory: ftp://ftp.ncdc.noaa.gov/pub/data/gsn/reformated In files like /reformated/07130.dly.Z
GSN: GHCND Version
I haven’t determined yet what differences exist between Ron Ray and GHCN Versions.
Station Data Directory: http://www1.ncdc.noaa.gov/pub/data/ghcn/daily/gsn/
File nomenclature: 22200029612.dly ccciiiwwwww : country code; id; 5 -digit wmo number
For US stations, they use the wmo number and not the GSN number. A (partial) concordance of GSN and WMO numbers is
at http://www1.ncdc.noaa.gov/pub/data/ghcn/daily/ghcnd-stations.txt
Readme ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/readme.txt
Other:
http://www.ncdc.noaa.gov/oa/usgcos/renovationprojects.htm
http://www1.ncdc.noaa.gov/pub/data/gcos/GSN_sum_long_term.txt
Alaska
Karl readme: http://cdiac.ornl.gov/ftp/db1004/db1004.txt
US CRN
This is a recently (sites are vintage 2002-2003) set-up “high-quality” network.
Station list http://www.ncdc.noaa.gov/crn/newstations?sort_by=loc_state
Monthly data. No organized information but a little information can be scraped from http://cdo.ncdc.noaa.gov/pls/plclimprod/cdomain.abbrev2id?datasetabbv=GSNMON
RAWS
2200 stations
Station List http://www.fs.fed.us/raws/ http://www.raws.dri.edu/wraws/
MISC
Station List http://www.rap.ucar.edu/weather/surface/stations.txt
HOLLAND (KNMI)
About 8 Dutch stations here with both metadata and data.
Metadata http://www.knmi.nl/klimatologie/metadata
Monthly Data. Can be scraped from http://climexp.knmi.nl/getdutchstations.cgi?someone@somewhere+tg yielding webpages: http://climexp.knmi.nl/data/tgtg310.dat
Daily data: http://climexp.knmi.nl/selectdailyseries.cgi?someone@somewhere
http://www.knmi.nl/klimatologie/daggegevens/uitleg.html
EUROPE
European Climate Assessment Dataset http://eca.knmi.nl/dailydata/datadictionaryall.php
http://ingrid.ldeo.columbia.edu/SOURCES/.NOAA/.NCDC/.GCPS/.MONTHLY/.STATION/Name/?help+dataselection
Russia
Data ftp://ftp.meteo.ru/okldata
Switzerland
Webpage for homogenized series http://www.meteoswiss.ch/web/en/climate/climate_since_1864/homogeneous_data.html
Germany
http://www.dwd.de/en/FundE/Klima/KLIS/daten/online/nat/index_standardformat.htm
19 Stations http://www.dwd.de/de/FundE/Klima/KLIS/daten/online/wwr/stationstabelle.htm
http://www.ntsg.umt.edu/cgi-bin/show_good_ncdc_stations.pl
http://www.ocs.oregonstate.edu/page_links/climate_data_zones/station_climate/climate_stations.html#station_images
http://www.ocs.orst.edu/pub_ftp/climate_data/mme/Station_names
STATE
California
http://wwwcimis.water.ca.gov/cimis/data.jsp http://wwwcimis.water.ca.gov/cimis/frontStationListData.do
NCDC
Station Locator http://www.ncdc.noaa.gov/oa/climate/stationlocator.html
Climate Data Modernization Program http://www.ncdc.noaa.gov/oa/climate/cdmp/cdmp.html
Image Access Program http://www.ncdc.noaa.gov/oa/climate/cdmp/wssrd.html
Webpages NNDC http://cdo.ncdc.noaa.gov/CDO/cdo
Station Lists http://www.wrcc.dri.edu/inventory/ http://cdo.ncdc.noaa.gov/CDO/inventory.txt http://cdo.ncdc.noaa.gov/cdo/3500stn.txt
ASOS
http://www.nws.noaa.gov/asos/
http://www.aoml.noaa.gov/hrd/asos/
Airport stations (With photos) http://mi3.ncdc.noaa.gov/mi3report/MISC/ASOS-STATIONS.TXT
http://www.aoml.noaa.gov/hrd/asos/completed.txt (Dec 2003)
Coop
Coop Network http://www.nws.noaa.gov/om/coop/standard.htm
Active Stations List 1998 (from NCDC Catalog) http://www.wrcc.dri.edu/inventory/inventact.html Also see http://www.wrcc.dri.edu/inventory/sodct.html (ct = Connecticut)
Station History (archived 1994) http://www1.ncdc.noaa.gov/pub/data/stnhistory/old/stnhis_coop_us (Archived 1996) http://www1.ncdc.noaa.gov/pub/data/stnhistory/stationh.txt
Photos http://www1.ncdc.noaa.gov/pub/data/stations/photos/
CLIMAT Bulletins http://cdo.ncdc.noaa.gov/pls/plclimprod/cdomain.DS3500
CLIMAT 3500 stations http://cdo.ncdc.noaa.gov/cdo/gsnmondoc.txt http://www.ncdc.noaa.gov/oa/land.html
http://cdo.ncdc.noaa.gov/pls/plclimprod/cdomain.abbrev2id?datasetabbv=GSNMON&countryabbv=&georegionabbv=&forceoutside=
http://www1.ncdc.noaa.gov/pub/orders/CDO463711058580.html
Histories http://mrcc.sws.uiuc.edu/FORTS/histories/
New Zealand
NZ climate database description
Description of free subscription
http://cliflo.niwa.co.nz/
http://cliflo.niwa.co.nz/pls/niwp/wgenf.genform1


11 Comments
Is there anywhere I can find the true raw temperature data, what proxies (and/or actual stations) are used for what years, and a taxonomy of the various datasets? What I’m trying to do is find out how many different historical sets of temperature data there are and what organizations archive and rely on what data, and what is the basis for that data. I’m aware that there are thousands of temperature stations all over the world; are there multiple organizations that collect that data, or is it collected and dessiminated by only one organization? Assuming that more than one organization has this data, where is it kept, and is there a detailed audit trail of what the true raw readings were, and how and by what algorithms they may have been adjusted for movement of the station, changing environmental factors, etc?
I apologize if this is OT or there is some simple place on this site where this is located, but I have looked at length to no avail.
TIA,
Allen
Maybe what I’m looking for, at least for the raw data, is the GHCN data?
Sorry for all the questions, but what is the relationship between the GHCN data and the CRU, GISS and NOAA “indices”?
Thanks again!
Allen
P. S. What prompted this is a discussion with someone who claimed there were “thousands” of data sets that all showed warming over the last hundred years, and I took issue with that. Then, later, I saw the same claim in the comments section of a newspaper article on climategate. I was under the impression there were only one or two datasets all the other researchers used, and, if they were corrupted, in would have very far reaching ramifications.
Steve: found a nice Google Earth tool for viewing GISS station data. Not sure if it’s raw of adjusted (probably adjusted), but still handy.
http://edgcm.columbia.edu/~mankoff/StationData/
Steve anyone
All the raw data I’m finding is in text form. How the hell do I convert to a form suirable for use in R.
The obvious, very tedious way I can work out but is there another. Please
Steve: I’ve posted utilities for converting station data to R. See climateaudit.info/scripts. Which station data are you looking at? PS – I’ve converted the Met Office dump into a more organized R format and could place this online.
A google world map with clickable stations leading to raw temperature measurements would be tremendously powerful in helping people look with their own eyes, rather than go for pre chewed data…
Steve, anyone,
I have downloaded the Jones et al Model data for Oxford UK from “http://image.guardian.co.uk/sys-files/Guardian/documents/2009/12/08/uk.csv” (an extract from “http://www.metoffice.gov.uk/climatechange/science/monitoring/reference/All.zip”) and compared it to the Actual temperatures as recorded at “http://www.metoffice.gov.uk/climate/uk/stationdata/oxforddata.txt” for the relevant dates, 1900-1980.
These two series are believed to be the same station.
Model.
Number= 038900
Name= OXFORD
Country= UK
Lat= 51.7
Long= 1.2
Height= 63
and
Actual.
Oxford
Location: 4509E 2072N 63 Meters
I have caclulated the tMonthlyMean values from the Actual data and compared it to the Model tMonthlyMean figures on a month by month basis. These mostly show a +-0.05 difference (which is presumably due to some rounding errors to get to 0.1 degree published values either by me or by others).
Can anyone tell me why, then, the last fews years of the Model data (1978-1980) differs so widely from the Actual recorded temperatures? The Model is out by up to 2.2 degress C and an average of 0.36 degress C of warming compared to the Actual temperatures for just these last few years.
Actual – Model Oxford 1978-1980
Year Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
1978 0.15 0.35 0.35 0.35 0.35 0.15 0.25 0.40 0.35 0.25 0.05 0.45
1979 0.15 0.25 0.35 0.35 0.30 0.15 0.25 0.40 0.90 0.25 0.20 0.05
1980 0.20 0.25 0.70 0.40 0.35 0.15 0.35 0.40 0.40 0.30 0.15 2.20
Anyone with any insights into this divergence of the Jones et al Model from Actual?
I read somewhere recently (unfortunately cannot remember where) that the reason that the temperature figures are changed or updated, is because only thermometers that have been in existence for 20 years are taken into account initially, but then as they reach 20 years, they are added into the measurements as are their measurements which go back 20 years.
Firstly, is this correct?
Secondly, if this is so, how many of these thermometers in waiting are there, and where are they situated.
In view of the long term planning to hoodwink the public (in my view of course), I would not put it past some people to have placed thermometers in locations which will favour the increase.
This is also related to the demise of many thermometers in rural locations as already documented.
I look forward to some comment on this.
“In view of the long term planning to hoodwink the public (in my view of course), I would not put it past some people to have placed thermometers in locations which will favour the increase.”
It may be your view, but that doesn’t mean you don’t need evidence. My experience of climate scientists is that they couldn’t give a toss whether the public believes them or not – and nor should they. It’s good if your work turns out useful, but you don’t know that until you’ve done it. As to placing the thermometers – how would you choose sites that “favour the increase”? I’ve been a meteorologist for 30 years and I would have thought it was impossible. Also, thermometers are located to aid weather forecasting, usually up to 5 days ahead at most and mostly for aviation, military, or agricultural purposes. The use of the data by climatologists is incidental; they have no influence over the siting of most instruments, even in the UK Met Office there is no overlap.
The USHCN files for 1-2 and 3-5 station site ratings include 5 of the Pennsylvanian stations. On the survey results maps it looks as if most, if not all of them, were rated.
Is there a table of the surface station ratings (1-5) for all the surface stations rated? While I found the documentation for the surveys at http://surfacestations.org I was unable to locate such a table.
While you, personally may find it inconceivable, the climategate emails, WATTS observations and the diarizations on this blog suggest to ME that such a practice is at lesat negligent, and may be both deliberate and widespread.
I posted these over at WUWT & chiefio; some readers may find it useful.
——————————————————————-
Previous discussions (the ‘lost’ stations in Honolulu and Dutch Harbour) have already called attention to the limited intelligence of available data searches. Trivial errors lead to blind alleys. Type in ‘MCMILLIN’, instead of ‘MC MILLIN’, and MMS will simply report that it couldn’t find a match. The list of version 2 stations available at:
http://cdiac.ornl.gov/ftp/ushcn_v2_monthly/ushcn-stations.txt
contains over 250 station names that do not match the MMS data. For whatever use it may have, I have used the COOP numbers to look up and ‘correct’ these names. The resulting file is posted at:
http://members.dslextreme.com/users/juanslayton/v2_stations.txt
Of course, I don’t really know which names are ‘correct,’ but MMS has more information, so I go with their names.
——————————————————————–
…and here’s the list of closed v2 stations. Many have been closed since the 90’s. One does wonder what motivates creating a new network incorporating stations that have been closed ten or fifteen years?
http://members.dslextreme.com/users/juanslayton/closedstations