After years of effort, the chronologies of Briffa et al 2001 were recently made public, although the date on which these became public is itself clouded in mystery. [Update – this minor mystery is clarified: it looks like the data was unlocked on Sep 9, 2008, the day after my FOI request but before my followup request.]
The MXD data from Briffa et al 2001 had been displayed in IPCC 2001 (and again in IPCC 2007); this data had also been used in Rutherford et al 2005 (an “independent” contributor to the IPCC 2007 spaghetti graph) and again, via the Rutherford et al gridded version, in Mann et al 2008. For years, Briffa refused to identify the locations of the sites used in Briffa et al 2001. (Much of the data had been archived at WDCP by Schweingruber, but, without knowing which Schweingruber sites were used, one couldn’t get a foothold.)
Briffa had made information available to the initiate on a password-protected basis:
The focus of the SO&P project is the climate of the last 1000 years, and data that have been collected for use in the project or that have been produced by SO&P are accessible from here. Some data are password-protected because they are not publicly available yet.
My efforts to obtain access to the password-protected site were rebuffed in 2005, which I satirized at CA here, observing:
I can somewhat understand the argument for data being private for a limited period (although I would be pretty tough on enforcing the terms of the contract), but I’m having trouble understanding the rationale for password protected sites with access limited to the initiate. I’ve tried unsuccessfully to get access to the European data at SO&P, managed by Briffa and Osborn. We can only hope that Briffa’s concept of a reasonable period of exclusive use will be less than 22 years.
From time to time, I re-visited this, always without success. Briffa, after all, is Phil Jones’ closest colleague at CRU, the Phil Jones of “We have 25 years invested in this. Why should we make our data available to you, when your objective is to find something wrong with it?” – a comment, by the way, not made to me but to another person and long before the start of CA.
In the wake of Mann et al 2008, I re-visited the matter, this time using the FOI act. Mann et al referred to gridded MXD data, which proved to derive from Rutherford (Mann) et al 2005. Although Rutherford et al 2005 promised in the Journal of Climate text that their data was available, the MXD data was not actually available at the designated website, which instead said to “contact Tim Osborn” (Briffa’s colleague). I wrote to CRU in September 2008 as follows:
In the Supporting Information to Mann et al (PNAS 2008), in particular http://www.pnas.org/content/suppl/2008/09/02/0805721105.DCSupplemental/SD1.xls , a number of “Schweingruber” series are listed, with nomenclature such as schweingruber_mxdabd_grid11, which I presume were provided by Keith Briffa or Tim Osborn of the UEA.
Pursuant to the Freedom of Information Act and/or Environmental Information REgulations, whichever is aplicable, would you please provide me with a digital version of these data sets in the form provided to Dr Mann, together with any relevant meta-data, manuals or literature describing the grid locations of the series and the method of their calculation.
A few weeks later, as reported at CA, the gridded data versions were posted up at CRU together with meta-data providing the lat-longs of the gridcells. (These had been misreported in one of the Mann et al 2008 SI datasets, which has now been altered to the correct values; the original error notice reporting the change has now been deleted.)
Unfortunately, this still left some puzzles and some gaps in the data. While the majority of the sites could be cross-identified against Schweingruber data archived at WDCP, a number of sites that contributed to the gridded data had not been archived at WDCP/ITRDB. There were some other puzzles, which I’ll discuss on another occasion. In any event, on October 31, 2008, I sent the following followup inquiry asking for the data that remained unavailable:
5) Not all series listed at the Osborn webpage are in the ITRDB data set. Some examples are:
id   name                 type  long   lat    start  end
327  gartogfi Gartog      PCBA  98.52  29.67  1709   1993
328  haizefi Haize Shan   PCBA  99.50  30.30  1777   1993
329  lhamafi Lhamcoka     PCBA  99.12  31.82  1784   1994
330  lhambfi Lhamcoka     PCBA  99.13  31.80  1669   1994
331  lhamcfi Lhamcoka     PCBA  99.10  31.82  1768   1994
332  lhamdfi Lhamcoka     PCBA  99.10  31.82  1630   1994
333  qamdofi Qamdo        PCBA  96.95  31.08  1406   1994
334  riwofi1 Riwoqe       PCBA  96.48  31.23  1709   1994
335  riwofi2 Riwoqe       PCBA  96.48  31.30  1673   1994
Can you please provide this data.
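For readers who want to repeat this sort of cross-check, here is a minimal sketch in Python of how a site list like the one above might be compared against an archive listing. It is an illustration only, not the procedure actually used: the split of each row into a short code plus a site name, the `Site` fields, the helper names and the `archived_codes` set are all my assumptions, and the archive set is a placeholder rather than a real ITRDB listing.

```python
from typing import NamedTuple


class Site(NamedTuple):
    site_id: int
    code: str       # short identifier, e.g. "gartogfi" (assumed to be the lookup key)
    name: str       # site name, may contain spaces ("Haize Shan")
    type_code: str  # the "type" column, e.g. "PCBA"
    lon: float
    lat: float
    start: int
    end: int


# Rows copied from the inquiry quoted above.
ROWS = """\
327 gartogfi Gartog PCBA 98.52 29.67 1709 1993
328 haizefi Haize Shan PCBA 99.50 30.30 1777 1993
329 lhamafi Lhamcoka PCBA 99.12 31.82 1784 1994
330 lhambfi Lhamcoka PCBA 99.13 31.80 1669 1994
331 lhamcfi Lhamcoka PCBA 99.10 31.82 1768 1994
332 lhamdfi Lhamcoka PCBA 99.10 31.82 1630 1994
333 qamdofi Qamdo PCBA 96.95 31.08 1406 1994
334 riwofi1 Riwoqe PCBA 96.48 31.23 1709 1994
335 riwofi2 Riwoqe PCBA 96.48 31.30 1673 1994
"""


def parse_site(line: str) -> Site:
    """Parse one whitespace-separated row into a Site."""
    parts = line.split()
    # The name can be more than one word, so take the fixed fields
    # from both ends and treat whatever is left in the middle as the name.
    site_id, code = parts[0], parts[1]
    type_code, lon, lat, start, end = parts[-5:]
    name = " ".join(parts[2:-5])
    return Site(int(site_id), code, name, type_code,
                float(lon), float(lat), int(start), int(end))


def missing_from_archive(sites, archived_codes):
    """Return the sites whose code does not appear in the archive listing."""
    return [s for s in sites if s.code not in archived_codes]


sites = [parse_site(line) for line in ROWS.splitlines() if line.strip()]

# Hypothetical placeholder: in practice this set would be built from the
# archive's own index of site codes.
archived_codes = {"placeholder_site"}

missing = missing_from_archive(sites, archived_codes)
for s in missing:
    print(f"{s.site_id} {s.code} ({s.name}) {s.start}-{s.end}: not in archive listing")
```

Matching on the short code is only one possible choice; if codes are not used consistently between the two sources, matching on lat-long and date range would be the obvious fallback.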
This inquiry, which I had not made public, had previously been the topic of a derogatory email by Phil Jones to the 17 Santer coauthors on Nov 11, saying:
Don’t feel picked on – we in CRU had another FOI request related to tree-ring data yesterday as well. It is in a similar vein. We put up all the individual tree-ring series (widths, densities) – i.e. what we consider the raw data. He already had the chronologies. He now wants to know why some individual series were excluded from the chronologies and why some chronologies were excluded in subsequent analyses. This time they have asked for manuals, computer code and correspondence explaining the exclusions! It seems neverending.
If they just did some paleo fieldwork with trees, corals, sediment cores they might understand why some samples are excluded.
I would urge the 4 NOAA people on the paper to make a joint response to the FOI request when it filters through that the raw data for our paper are all publically available. I know it’s not in their (skeptic) make up, but the sooner they get their hands dirty with the sorts of analyses we/you’ve done for this and many other papers the better. They seem only to want to come in at the interpretational end, particularly on the statistical side.
On other occasions, CRU has used confidentiality of correspondence as a reason to refuse FOI requests, but it’s interesting that in a case of perceived adverse interest, their policies did not seem to require them to preserve the confidentiality of my inquiry.
In any event, a few weeks later, notwithstanding Phil Jones’ complaint to the Santer 17, on Dec 3, 2008, I was informed:
These chronologies are in fact already available elsewhere on our website — see: http://www.cru.uea.ac.uk/cru/projects/soap/data/proxy/
In order to lessen the number of multiple archives of the same data set on the internet, it is preferred that the ITRDB be used as the primary source wherever possible. However, as some of the chronologies that were used are apparently not available at the ITRDB, the above webpage holds a copy of the chronology data that were actually used. Important information regarding the standardisation applied in the construction of these chronologies is given at this webpage and should be read and considered carefully when using these data.
The cited webpage proved to be the SO&P webpage where the data had previously been password-protected. Later on Dec 3, I reverted to the CRU FOI officer as follows:
Thank you very much for this. I’m glad the password protection for the SO&P tree ring data has been removed (this data was password protection at one point). I presume that the password protection was done in the past month in response to the present request and I appreciate this. The covering webpage http://www.cru.uea.ac.uk/cru/projects/soap/ still refers to password-protection and you might want to suggest that that be changed. In addition, the webpage http://www.cru.uea.ac.uk/cru/projects/soap/data/proxy/ currently says “Last updated: November 2005, Tim Osborn”. I don’t think that this is correct, since, as far as I know, the page showed that the data was password protected well after that date.
I asked David Holland about this, who said on Dec 3, 2008:
SO&P was protected only a few days ago when I last looked.
When I revisited the site a couple of days later, the site now said:
Last updated: August 2008, Tim Osborn
This claim, that they updated the site in August 2008, yields a date which, if true, conveniently precedes both Mann et al 2008 and my FOI request on Sep 8, 2008 and validates their assertion that the data was “already” available when they responded to me. I must admit that I’m getting a bit cynical about these folks and I don’t believe that the webpage was “last updated” in August 2008, particularly given David Holland’s evidence on the matter. Additional evidence against this date is that the link to the zipped file refers to a directory structure that did not exist until September 9, 2008. The webpage with information on the gridded sites was changed on Nov 16, 2008, a few days after Phil Jones’ complaint to the Santer 17. However, I didn’t personally check the site on October 31, 2008 and, as we know from our experience with Gavin Schmidt at Mann’s SI, even if you checked the website at 11.30 am, the data might have changed by 12.15 pm the same day, so you have to watch pretty carefully. As noted above, David Holland says that he checked a few days ago and, unless he erred, the site was still protected then. Maybe I’ll send an FOI request asking for the exact date on which the password protection was actually lifted. [Update – a commenter below observes that the Google cache of this page taken on Sep 12 shows that the passwords have been removed. My guess is that the password protection was removed on Sep 9, the day after my FOI request on Sep 8, perhaps by coincidence. The dating is a small curiosity and I think that the Sep 9 date is pretty much established.]
Aside from being an important chapter in a data request that has been going on for years now, there are some interesting features to the new version of the data (which differs in important aspects from other versions), which I’ll discuss in another post.
And, oh yes, Another Brick in the Wall, which nicely articulates the Team’s attitude towards questions.