Comments on: Some Gridcell and Station Utilities

By: Bob Koss

Bob Koss — Wed, 14 Mar 2007 21:56:57 +0000

I took the 2x2 gridded surface temperature files for the years 1880-2004 and put them in a bar chart by latitude. I did it two different ways.

In the first, I simply totaled all the data over the years by latitude and divided by the data count for that latitude to arrive at a mean value. That way no individual data point gets more weight than any other.

In the second, I took the mean for each year by latitude, totaled those yearly means, and divided by the number of years of data for that latitude. That way each year gets the same weight.

One point of interest: 62% of the data points are in the last half of the record.

I've posted links to the charts below. To avoid biasing your perspective, I suggest you decide which way you think would be the more appropriate rendering of the data before looking at them.

Using total mean: link
Using yearly mean: link
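For anyone who wants to see the difference between the two approaches concretely, here is a minimal Python sketch. The function names, the `(year, latitude, value)` record layout, and the sample numbers are all made up for illustration; this is not the code used to produce the charts.

```python
from collections import defaultdict


def pooled_mean_by_latitude(records):
    """Total mean: every data point in a latitude band gets equal weight."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for _year, lat, value in records:
        totals[lat] += value
        counts[lat] += 1
    return {lat: totals[lat] / counts[lat] for lat in totals}


def yearly_mean_by_latitude(records):
    """Yearly mean: average within each year first, so every year gets equal weight."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for year, lat, value in records:
        totals[(lat, year)] += value
        counts[(lat, year)] += 1

    year_sums = defaultdict(float)
    year_counts = defaultdict(int)
    for (lat, year), total in totals.items():
        year_sums[lat] += total / counts[(lat, year)]
        year_counts[lat] += 1
    return {lat: year_sums[lat] / year_counts[lat] for lat in year_sums}


# Hypothetical data: one sparse early year and one dense late year
# in the same latitude band.
records = [
    (1880, 51, 1.0),
    (2004, 51, 2.0),
    (2004, 51, 3.0),
    (2004, 51, 4.0),
]

pooled = pooled_mean_by_latitude(records)
yearly = yearly_mean_by_latitude(records)
```

With these made-up numbers the total mean is pulled toward the year with more data points, while the yearly mean weights 1880 and 2004 equally. The more uneven the data coverage over time, the more the two results can differ.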

By: Steve McIntyre

Steve McIntyre — Mon, 12 Mar 2007 18:38:14 +0000

This reminds me of another comment by Keynes on Tinbergen's multiple correlations:

I infer that he considers independence of no importance. But my mind goes back to the days when Mr Yule sprang a mine under the contraptions of optimistic statisticians by his discovery of spurious correlation. In plain terms, it is evident that, if what is really the same factor is appearing in several places under various disguises, a free choice of regression coefficients can lead to strange results. It becomes like those puzzles for children where you write down your age, multiply, add this and that, subtract something else and eventually end up with the number of the Beast in Revelation.

Keynes and Yule were coauthors around 1910. I've posted up Keynes' comment

By: James Erlandson

James Erlandson — Mon, 12 Mar 2007 18:18:49 +0000

Josiah Charles Stamp, a director of the Bank of England and chairman of the London, Midland and Scottish Railway:

The government are very keen on amassing statistics. They collect them, add them, raise them to the nth power, take the cube root and prepare wonderful diagrams. But you must never forget that every one of these figures comes in the first instance from the village watchman, who just puts down what he damn pleases. (Quoting an anonymous English judge.)

Source: Wikipedia

By: Sinan Unur

Sinan Unur — Mon, 12 Mar 2007 13:08:22 +0000

OK, so I got interested in decoding the binary data sets at ftp://data.giss.nasa.gov/pub/gistemp/download/ as well. I wrote some Perl to slice and dice the data set into various series. I now have fully 1.6 GB less free hard drive space and I cannot figure out where my Sunday went 🙂

I’ll tidy up the various scripts and post on my web site when I get a chance. The result of my attempt at visualizing TSurf1200 and SSTHadR2 combined is available on Google Video.

Enjoy.

Sinan

By: Bob Koss

Bob Koss — Wed, 07 Mar 2007 20:40:08 +0000

Downloaded the data from files found at ftp://data.giss.nasa.gov/pub/gistemp/bin/
I assume the data is the same GISS data as that which Steve linked to, just saved
in binary format.

I converted the binary yearly Fortran files into individual monthly text files.
Each file contains 3 columns: latitude, longitude, and anomaly.

All data was 2×2 degree cell size. 16200 total cells.
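
The cell count follows directly from the 2-degree spacing. Here is a small Python sketch of the arithmetic. The cell numbering used in `cell_center` (row by row from the South Pole, starting at 180W) is purely an assumption for illustration and may not match the ordering in the GISS files.

```python
CELL_DEG = 2                  # cell size in degrees, from the comment above
N_LAT = 180 // CELL_DEG       # 90 latitude bands
N_LON = 360 // CELL_DEG       # 180 longitude bands
N_CELLS = N_LAT * N_LON       # 16200 cells in total


def cell_center(index):
    """Return the (lat, lon) center of a cell.

    ASSUMPTION: cells are numbered row by row, south to north,
    and west to east starting at 180W. The real files may differ.
    """
    row, col = divmod(index, N_LON)
    lat = -90 + CELL_DEG * row + CELL_DEG / 2
    lon = -180 + CELL_DEG * col + CELL_DEG / 2
    return lat, lon


first = cell_center(0)
last = cell_center(N_CELLS - 1)
```

Under that assumed ordering, the first cell is centered at 89S, 179W and the last at 89N, 179E.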

From the 1880-2004 Ts files (surface air temperature): 1500 months of data.
1674 cells have one month or less with data. The next lowest count is 187 months.
3217 cells have data for all months. I get an anomaly from 1880-2004 of 0.046C.

From the 1950-2004 LOTI files (land ocean temperature index): 660 months of data.
78 cells have 11 months or less with data. The next lowest count is 148 months.
10885 cells have data for all months. I get an anomaly of 0.135C.

I calculated the anomaly by keeping a running total of monthly anomalies for each cell
that had 120 months of data. Calculated the mean for each cell. Totaled those values
and divided by the number of cells with 120 months.
If that’s not the correct way to do it, someone please speak up and clue me in.
I don’t have great statistical skills.
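
For reference, the calculation described above can be sketched in Python as follows. This is only an illustration of the description, not the program actually used. The function name and the sample rows are hypothetical, and reading "120 months of data" as "at least 120 months" is an assumption.

```python
from collections import defaultdict

MIN_MONTHS = 120  # threshold from the description; "at least" is an assumption


def mean_anomaly(rows, min_months=MIN_MONTHS):
    """Average the per-cell mean anomalies over cells with enough data.

    rows: iterable of (lat, lon, anomaly) triples, one per cell per month
    that has data, as in the three-column monthly text files.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for lat, lon, anomaly in rows:
        totals[(lat, lon)] += anomaly      # running total per cell
        counts[(lat, lon)] += 1            # months with data per cell

    # Mean for each cell that meets the minimum month count.
    cell_means = [
        totals[cell] / counts[cell]
        for cell in totals
        if counts[cell] >= min_months
    ]
    if not cell_means:
        raise ValueError("no cell meets the minimum month count")

    # Every qualifying cell counts equally in the final mean.
    return sum(cell_means) / len(cell_means)


# Hypothetical rows, with the threshold lowered to 2 so the example stays small.
rows = [
    (-89, -179, 0.5),
    (-89, -179, 1.5),
    (-87, -179, -1.0),
    (-87, -179, 0.0),
    (-85, -179, 9.0),   # only one month of data, so this cell is skipped
]
result = mean_anomaly(rows, min_months=2)
```

In the example, the two qualifying cells have means of 1.0 and -0.5, and the third cell is excluded, so the outlier value of 9.0 never enters the result.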

Can’t say I’m surprised by the color of the LOTI image. The data starts during the cold part
of the 20th century. Still nothing extraordinary.

I created a couple of maps of the data.
Color coded: yellow > 0.5C. Red > 0.0C. Green 0 to -0.5C. Blue < -0.5C.
LOTI 1950-2004
TS 1880-2004

Full-size 3600 pixels wide.
LOTI 1950-2004
TS 1880-2004
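
The color coding for the maps can be written as a small lookup. This is a sketch in Python, not the code that drew the maps. The function name is made up, the handling of values exactly on a boundary is an assumption, and so is treating blue as everything below -0.5C.

```python
def anomaly_color(anomaly_c):
    """Map an anomaly in degrees C to a map color.

    ASSUMPTIONS: boundary values (0.0 and -0.5) fall in the green bin,
    and blue covers everything below -0.5C.
    """
    if anomaly_c > 0.5:
        return "yellow"
    if anomaly_c > 0.0:
        return "red"
    if anomaly_c >= -0.5:
        return "green"
    return "blue"


# The two overall anomalies quoted above fall in the same bin.
ts_color = anomaly_color(0.046)
loti_color = anomaly_color(0.135)
```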

By: Wolfgang Flamme

Wolfgang Flamme — Wed, 07 Mar 2007 17:04:24 +0000

Thank you very much, Steve!

I certainly don’t have your R-skills… so I’m still fiddling with the re-collated data part (GHCN Station + HadCRUT2 *.tab).
The rest however is a very straightforward thing to manage … Good work!

By: Jean S

Jean S — Tue, 06 Mar 2007 23:47:07 +0000

Thanks, Steve!

If someone is interested, there is a Scandinavian (up to 2002) data collection available here:
http://www.smhi.se/hfa_coord/nordklim/

A collection of excellent Denmark/Greenland data sets is available from
http://www.dmi.dk/dmi/index/viden/dmi-publikationer/tekniskerapporter.htm