<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Wilson on Yamal Substitution</title>
	<atom:link href="http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/feed/" rel="self" type="application/rss+xml" />
	<link>http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/</link>
	<description>by Steve McIntyre</description>
	<lastBuildDate>Fri, 24 May 2013 07:48:03 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Re-Visiting the &#34;Yamal Substitution&#34; &#171; Climate Audit</title>
		<link>http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/#comment-259538</link>
		<dc:creator><![CDATA[Re-Visiting the &#34;Yamal Substitution&#34; &#171; Climate Audit]]></dc:creator>
		<pubDate>Thu, 31 Mar 2011 16:35:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=541#comment-259538</guid>
		<description><![CDATA[[...] in any sense that Polar Urals did not have as well. (Rob Wilson&#8217;s own comparison (see here used a different (his own) version of Urals RCS, where the rolling variance of the MWP is more [...]]]></description>
		<content:encoded><![CDATA[<p>[...] in any sense that Polar Urals did not have as well. (Rob Wilson&#8217;s own comparison (see here used a different (his own) version of Urals RCS, where the rolling variance of the MWP is more [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John S</title>
		<link>http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/#comment-44186</link>
		<dc:creator><![CDATA[John S]]></dc:creator>
		<pubDate>Thu, 23 Feb 2006 20:16:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=541#comment-44186</guid>
		<description><![CDATA[Steve,

I&#039;m not familiar with mixed effects. Ny naive approach to this would probably involve using a Kalman filter or equivalent to extract the two &#039;unobserved&#039; components at a given site. I&#039;ll have a bit more of a think...]]></description>
		<content:encoded><![CDATA[<p>Steve,</p>
<p>I&#8217;m not familiar with mixed effects. Ny naive approach to this would probably involve using a Kalman filter or equivalent to extract the two &#8216;unobserved&#8217; components at a given site. I&#8217;ll have a bit more of a think&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Sadlov</title>
		<link>http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/#comment-44185</link>
		<dc:creator><![CDATA[Steve Sadlov]]></dc:creator>
		<pubDate>Thu, 23 Feb 2006 16:25:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=541#comment-44185</guid>
		<description><![CDATA[My real statistical education began at this site. ;)]]></description>
		<content:encoded><![CDATA[<p>My real statistical education began at this site. <img src='http://s1.wp.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve McIntyre</title>
		<link>http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/#comment-44184</link>
		<dc:creator><![CDATA[Steve McIntyre]]></dc:creator>
		<pubDate>Thu, 23 Feb 2006 13:38:03 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=541#comment-44184</guid>
		<description><![CDATA[#36 -John S,
let me answer this for Rob rather than using up his limited time on this particular question. Ring widths decline as trees get older. Methods of age adjustment are a big topic in dendro literature, which uses the trade term &quot;standardization&quot;. I&#039;ll mention 4 methods that you see from time to time. At a site, you will have N cores which start and end at different times, have different mean growth rates and ultimate ages and you&#039;re trying to get a growth index representing the site.

1) An older method common in the 1980s was to fit a cubic spline to each core; obtain the deltas from the fit and average the deltas. Dendrochronologists in the 1990s observed that this removed any long-term information (although series formed in this way were used in the programmatic article Hughes and Diaz 1994, still cited, to criticize earlier work which did not rely on tree rings.)

2) The individual cores, viewed s time series, are heavily autocorrelated. Some dendrochronologists did ARMA fitting to individual cores (prewhitening), obtained the deltas from that and averaged them. Removing the autocorrelation in the cores tends to leave a chronology with little long-term variation (e.g. Stahle chronologies used in MBH98).

3) Some chronologists (e.g. Jacoby, Graybill) fitted each core with a generalized negative exponential (neg exp plus a constant) or, if this didn&#039;t fit, a horizontal or negative-trending straight line. This is option 2 in COFECHA. It&#039;s usually what is meant by &quot;conservative standardization&quot;. There are some curious numerical issues in how this is implemented in COFECHA, but they don&#039;t have a big impact. Cook and Peters 1997 criticized this method as potentially producing biased end effects - their examples ironically were a bristlecone (Campito Mt) and Gaspe.

Virtually every chronology archived at WDCP as a *.crn series is one of the above 3 types.

4) The term &quot;RCS&quot; standardization describes a procedure which has become popular in the 1990s - calculating a generalized negative exponential fit for the entire site, and calculating the deltas for each core from the one fit. This results in more centennial scale variability. In some of the sites of interest to Briffa (e.g. Polar    Urals), the cores are very short on average (median - 150 years) and so standardizing individual cores eliminates any changes in the population mean.

I think that there are some interesting statistical issues involved in dendro standardization. A couple of years ago - is it that long that I&#039;ve been doing this - I experimented with doing ring width standardization using mixed effects methods and got some really excellent results. The advantage of this is that I was able to put some of these calculations in a more general statistical framework and show how the little ad hoc dendro recipes fit into this framework and how to use the diagnostics available from the mixed effects modeling for dendro purposes.

In mixed effects terms (say the nlme package of Pinheiro and Bates), &quot;conservative&quot; standardization is simply a type of nlsList fit; while RCS standardization is an nls-fit. I can do either chronology in a couple of lines of code and replicate standard results applying a more general statistical method. The dialectic between nlsList and nls modeling (or lmList and lm modeling) observed in mixed effects texts is exactly and I mean EXACTLY identical to the tension between RCS and conservative standardization in dendro.

In mixed effects terms, you have interesting issues: there is an individual effect as mean growth rates vary by tree; there is an aging effect; there is a random effect from annual climate (reserving temperature/precipitation issues); you have autocorrelation in the residuals; you&#039;ve got every issue contemplated in mixed effects and then some.

In addition, the dendro data sets are large and rich. While the mixed effects software in 2004 could deal with combinations of these issues, it didn&#039;t quite deal with all of them. However, you could get a lot done with what was there.  It would be an interesting project to follow up on this. It&#039;s a big project; it would work like a champ; but I&#039;ve got overtaken with too many other things. It would be a great project for a grad statistics student.

However, since this issue has come up in the context of a &quot;practical&quot; issue, maybe I&#039;ll re-visit it. So much to do, so little time.]]></description>
		<content:encoded><![CDATA[<p>#36 -John S,<br />
let me answer this for Rob rather than using up his limited time on this particular question. Ring widths decline as trees get older. Methods of age adjustment are a big topic in dendro literature, which uses the trade term &#8220;standardization&#8221;. I&#8217;ll mention 4 methods that you see from time to time. At a site, you will have N cores which start and end at different times, have different mean growth rates and ultimate ages and you&#8217;re trying to get a growth index representing the site.</p>
<p>1) An older method common in the 1980s was to fit a cubic spline to each core; obtain the deltas from the fit and average the deltas. Dendrochronologists in the 1990s observed that this removed any long-term information (although series formed in this way were used in the programmatic article Hughes and Diaz 1994, still cited, to criticize earlier work which did not rely on tree rings.)</p>
<p>2) The individual cores, viewed s time series, are heavily autocorrelated. Some dendrochronologists did ARMA fitting to individual cores (prewhitening), obtained the deltas from that and averaged them. Removing the autocorrelation in the cores tends to leave a chronology with little long-term variation (e.g. Stahle chronologies used in MBH98).</p>
<p>3) Some chronologists (e.g. Jacoby, Graybill) fitted each core with a generalized negative exponential (neg exp plus a constant) or, if this didn&#8217;t fit, a horizontal or negative-trending straight line. This is option 2 in COFECHA. It&#8217;s usually what is meant by &#8220;conservative standardization&#8221;. There are some curious numerical issues in how this is implemented in COFECHA, but they don&#8217;t have a big impact. Cook and Peters 1997 criticized this method as potentially producing biased end effects &#8211; their examples ironically were a bristlecone (Campito Mt) and Gaspe.</p>
<p>Virtually every chronology archived at WDCP as a *.crn series is one of the above 3 types.</p>
<p>4) The term &#8220;RCS&#8221; standardization describes a procedure which has become popular in the 1990s &#8211; calculating a generalized negative exponential fit for the entire site, and calculating the deltas for each core from the one fit. This results in more centennial scale variability. In some of the sites of interest to Briffa (e.g. Polar    Urals), the cores are very short on average (median &#8211; 150 years) and so standardizing individual cores eliminates any changes in the population mean.</p>
<p>I think that there are some interesting statistical issues involved in dendro standardization. A couple of years ago &#8211; is it that long that I&#8217;ve been doing this &#8211; I experimented with doing ring width standardization using mixed effects methods and got some really excellent results. The advantage of this is that I was able to put some of these calculations in a more general statistical framework and show how the little ad hoc dendro recipes fit into this framework and how to use the diagnostics available from the mixed effects modeling for dendro purposes.</p>
<p>In mixed effects terms (say the nlme package of Pinheiro and Bates), &#8220;conservative&#8221; standardization is simply a type of nlsList fit; while RCS standardization is an nls-fit. I can do either chronology in a couple of lines of code and replicate standard results applying a more general statistical method. The dialectic between nlsList and nls modeling (or lmList and lm modeling) observed in mixed effects texts is exactly and I mean EXACTLY identical to the tension between RCS and conservative standardization in dendro.</p>
<p>In mixed effects terms, you have interesting issues: there is an individual effect as mean growth rates vary by tree; there is an aging effect; there is a random effect from annual climate (reserving temperature/precipitation issues); you have autocorrelation in the residuals; you&#8217;ve got every issue contemplated in mixed effects and then some.</p>
<p>In addition, the dendro data sets are large and rich. While the mixed effects software in 2004 could deal with combinations of these issues, it didn&#8217;t quite deal with all of them. However, you could get a lot done with what was there.  It would be an interesting project to follow up on this. It&#8217;s a big project; it would work like a champ; but I&#8217;ve got overtaken with too many other things. It would be a great project for a grad statistics student.</p>
<p>However, since this issue has come up in the context of a &#8220;practical&#8221; issue, maybe I&#8217;ll re-visit it. So much to do, so little time.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John S</title>
		<link>http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/#comment-44183</link>
		<dc:creator><![CDATA[John S]]></dc:creator>
		<pubDate>Thu, 23 Feb 2006 10:30:06 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=541#comment-44183</guid>
		<description><![CDATA[Rob,

I&#039;d really like to understand.

Perhaps the first step is for me to understand the processing you do to the raw series better. I understand that it is to adjust for variable growth by age of the tree - anything else? Can you direct me to any references that would be accessible via the Internet? I&#039;m also missing something on this detrending process - aren&#039;t you ultimately looking for a series with a &#039;trend&#039; i.e. the signal of interest? Is the concern with heteroskedasticity really a concern about trends rather than variability per se?]]></description>
		<content:encoded><![CDATA[<p>Rob,</p>
<p>I&#8217;d really like to understand.</p>
<p>Perhaps the first step is for me to understand the processing you do to the raw series better. I understand that it is to adjust for variable growth by age of the tree &#8211; anything else? Can you direct me to any references that would be accessible via the Internet? I&#8217;m also missing something on this detrending process &#8211; aren&#8217;t you ultimately looking for a series with a &#8216;trend&#8217; i.e. the signal of interest? Is the concern with heteroskedasticity really a concern about trends rather than variability per se?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob Wilson</title>
		<link>http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/#comment-44182</link>
		<dc:creator><![CDATA[Rob Wilson]]></dc:creator>
		<pubDate>Thu, 23 Feb 2006 09:01:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=541#comment-44182</guid>
		<description><![CDATA[Most of you have missed the point
the high variance in my RCS chronology was an artefact of ineffectual detrending of the data.
I am sure I said that over processing of data was a bad thing.
Look at our paper and you will see that the variance of some of the time-series is not that time stable. This could be corrected for, BUT as you say, there is no rationale for it.
signing off.]]></description>
		<content:encoded><![CDATA[<p>Most of you have missed the point<br />
the high variance in my RCS chronology was an artefact of ineffectual detrending of the data.<br />
I am sure I said that over processing of data was a bad thing.<br />
Look at our paper and you will see that the variance of some of the time-series is not that time stable. This could be corrected for, BUT as you say, there is no rationale for it.<br />
signing off.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John S</title>
		<link>http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/#comment-44181</link>
		<dc:creator><![CDATA[John S]]></dc:creator>
		<pubDate>Thu, 23 Feb 2006 02:55:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=541#comment-44181</guid>
		<description><![CDATA[Re #32
Worse, it sounds more like:
RW=min(B1*Temp,B2*Precip,B3*CO2, ...)

Which is an entirely different level of &#039;non-linear&#039; (you can&#039;t even make first order Taylor expansion/log-linearisation arguments with this one). It requires temperature to be the limiting factor throughout the entire (or near enough) reconstruction history for them to be valid.]]></description>
		<content:encoded><![CDATA[<p>Re #32<br />
Worse, it sounds more like:<br />
RW=min(B1*Temp,B2*Precip,B3*CO2, &#8230;)</p>
<p>Which is an entirely different level of &#8216;non-linear&#8217; (you can&#8217;t even make first order Taylor expansion/log-linearisation arguments with this one). It requires temperature to be the limiting factor throughout the entire (or near enough) reconstruction history for them to be valid.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Sadlov</title>
		<link>http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/#comment-44180</link>
		<dc:creator><![CDATA[Steve Sadlov]]></dc:creator>
		<pubDate>Thu, 23 Feb 2006 02:37:42 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=541#comment-44180</guid>
		<description><![CDATA[RE: 32. I would add that there is ample, overt evidence that with the possible exeption of trees in well watered mid latitude places with lower annual variation in climate (pretty much, that means solely a Marine West Coast climate) and *perhaps* certain cases in well watered placed at similar latitudes with greater temperature range (e.g Humid Continental) there is no correlation whatever between tree ring width and the actual variation of temperature experienced at real sites. The US SW - probably either correlated with moisture or with some range of multvariate optima (my hunch is the former). Subarctic boreal - all over the map with postive and negative responders to &quot;whatever&quot; drives their growth. Humid subtropical - moisture. Mediterranean - moisture. Tropical climes - N/A.]]></description>
		<content:encoded><![CDATA[<p>RE: 32. I would add that there is ample, overt evidence that with the possible exeption of trees in well watered mid latitude places with lower annual variation in climate (pretty much, that means solely a Marine West Coast climate) and *perhaps* certain cases in well watered placed at similar latitudes with greater temperature range (e.g Humid Continental) there is no correlation whatever between tree ring width and the actual variation of temperature experienced at real sites. The US SW &#8211; probably either correlated with moisture or with some range of multvariate optima (my hunch is the former). Subarctic boreal &#8211; all over the map with postive and negative responders to &#8220;whatever&#8221; drives their growth. Humid subtropical &#8211; moisture. Mediterranean &#8211; moisture. Tropical climes &#8211; N/A.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Paul Linsay</title>
		<link>http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/#comment-44179</link>
		<dc:creator><![CDATA[Paul Linsay]]></dc:creator>
		<pubDate>Thu, 23 Feb 2006 02:25:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=541#comment-44179</guid>
		<description><![CDATA[#25:  It&#039;s definitely not a linear relationship and much worse than what you guess.  If you think about it for a few minutes the relation is more like

RW = A1*X*Y*(....)

because if the soil is infertile the tree dies, if it&#039;s too dry the tree dies,...  It&#039;s not possible to get that kind of result from a simple linear relationship.

This whole episode illustrates the trouble people get into when doing a statistical analysis.  Statistics isn&#039;t magic.  Statistics can&#039;t extract information that isn&#039;t there.  Unless there is a prior understanding of the mathematical relation among all the variables affecting a proxy and independent control or measurement of the uninteresting variables, there is no way to honestly extract the variable of interest, temperature in this case.  With all the knowledgeable people contributing to this blog, I have yet to see any indication that the relation between tree rings/density and tree growth factors(temperature, water, sunlight,fertility,...)  is understood in any quantitative way.  Wilson&#039;s use of low variance is unjustified unless it can be shown, a priori, that it somehow keeps the uninteresting variables constant.  Absent that, it&#039;s just a form of cherry picking applied after the fact to get time series that have the desired correlation with temperature.  In this context, both the Polar Urals and Yamal time series are irrelevant as thermometers since we don&#039;t know all the conditions that controlled the tree growth.]]></description>
		<content:encoded><![CDATA[<p>#25:  It&#8217;s definitely not a linear relationship and much worse than what you guess.  If you think about it for a few minutes the relation is more like</p>
<p>RW = A1*X*Y*(&#8230;.)</p>
<p>because if the soil is infertile the tree dies, if it&#8217;s too dry the tree dies,&#8230;  It&#8217;s not possible to get that kind of result from a simple linear relationship.</p>
<p>This whole episode illustrates the trouble people get into when doing a statistical analysis.  Statistics isn&#8217;t magic.  Statistics can&#8217;t extract information that isn&#8217;t there.  Unless there is a prior understanding of the mathematical relation among all the variables affecting a proxy and independent control or measurement of the uninteresting variables, there is no way to honestly extract the variable of interest, temperature in this case.  With all the knowledgeable people contributing to this blog, I have yet to see any indication that the relation between tree rings/density and tree growth factors(temperature, water, sunlight,fertility,&#8230;)  is understood in any quantitative way.  Wilson&#8217;s use of low variance is unjustified unless it can be shown, a priori, that it somehow keeps the uninteresting variables constant.  Absent that, it&#8217;s just a form of cherry picking applied after the fact to get time series that have the desired correlation with temperature.  In this context, both the Polar Urals and Yamal time series are irrelevant as thermometers since we don&#8217;t know all the conditions that controlled the tree growth.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Martin Ringo</title>
		<link>http://climateaudit.org/2006/02/22/wilson-on-yamal-substitution/#comment-44178</link>
		<dc:creator><![CDATA[Martin Ringo]]></dc:creator>
		<pubDate>Thu, 23 Feb 2006 01:42:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=541#comment-44178</guid>
		<description><![CDATA[Re #27  Yes.  I was offering the explanation in the sense of the &quot;Classical Linear Model,&quot; i.e. without serially correlated errors.  For the case of proxies and temperature where we have serial correlated variables, running the Y or X regression without some form of correction (be it a full ARMA model or just an old fashioned Cochrane-Orcutt iterative, rho hat estimator), will produce varying degrees of spurious correlation /significance.  However, ye old Ordinary Least Squares estimators are still unbiased with first order serial correlation.  (Technical note for those who care:  unbiased does not mean the expected value of the absolute difference of the estimator and its mean is zero.  Thus, serially correlated variables case, say both variables are AR1, will increase the absolute value vis-àÆà➭vis the null hypothesis of zero effect.)  What OLS estimators have is a much greater variance than the good, corrective estimators.  So ala the Ferson case (data mining with amongst serially correlated variables), selecting only the positive correlations (the data mining) when there are serial correlations on both independent and dependent variables (the spurious correlation issue) will presumably reinforce the omitted variables effect of the same type selection by correlation.   The magnitude might be kind of interesting to figure out when you get back from the conference on deconstructing reconstructing.]]></description>
		<content:encoded><![CDATA[<p>Re #27  Yes.  I was offering the explanation in the sense of the &#8220;Classical Linear Model,&#8221; i.e. without serially correlated errors.  For the case of proxies and temperature where we have serial correlated variables, running the Y or X regression without some form of correction (be it a full ARMA model or just an old fashioned Cochrane-Orcutt iterative, rho hat estimator), will produce varying degrees of spurious correlation /significance.  However, ye old Ordinary Least Squares estimators are still unbiased with first order serial correlation.  (Technical note for those who care:  unbiased does not mean the expected value of the absolute difference of the estimator and its mean is zero.  Thus, serially correlated variables case, say both variables are AR1, will increase the absolute value vis-àÆà➭vis the null hypothesis of zero effect.)  What OLS estimators have is a much greater variance than the good, corrective estimators.  So ala the Ferson case (data mining with amongst serially correlated variables), selecting only the positive correlations (the data mining) when there are serial correlations on both independent and dependent variables (the spurious correlation issue) will presumably reinforce the omitted variables effect of the same type selection by correlation.   The magnitude might be kind of interesting to figure out when you get back from the conference on deconstructing reconstructing.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
