<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: TTLS in a Steig Context</title>
	<atom:link href="http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/feed/" rel="self" type="application/rss+xml" />
	<link>http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/</link>
	<description>by Steve McIntyre</description>
	<lastBuildDate>Thu, 20 Jun 2013 02:13:38 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Notes on RegEM &#171; Climate Audit</title>
		<link>http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/#comment-306791</link>
		<dc:creator><![CDATA[Notes on RegEM &#171; Climate Audit]]></dc:creator>
		<pubDate>Sun, 16 Oct 2011 22:27:43 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=6071#comment-306791</guid>
		<description><![CDATA[[...] eigenvector matrix V is used in the TTLS calculation. See previous CA discussion here. Let’s suppose that there are p proxies and one instrumental series in the network. V will [...]]]></description>
		<content:encoded><![CDATA[<p>[...] eigenvector matrix V is used in the TTLS calculation. See previous CA discussion here. Let’s suppose that there are p proxies and one instrumental series in the network. V will [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan O</title>
		<link>http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/#comment-184510</link>
		<dc:creator><![CDATA[Ryan O]]></dc:creator>
		<pubDate>Sat, 04 Jul 2009 06:11:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=6071#comment-184510</guid>
		<description><![CDATA[Re: &lt;a href=&quot;#comment-346833&quot; rel=&quot;nofollow&quot;&gt;RomanM (#80)&lt;/a&gt;, You are indeed correct . . . and after I figured it out, I feel a bit stupid.
.
I had thought that such a correction improved the verification statistics.  However, I mis-named the variable that I applied the correction to, so all of my test reconstructions were run with just the stations scaled by the eigenvector weights and no correction.  When I figured that out and actually applied the correction, the verification statistics were degraded a bit.
.
And then, after thinking about it, I started wondering why I thought that would work in the first place.
.
Sometimes . . . not so smart is I.]]></description>
		<content:encoded><![CDATA[<p>Re: <a href="#comment-346833" rel="nofollow">RomanM (#80)</a>, You are indeed correct . . . and after I figured it out, I feel a bit stupid.<br />
.<br />
I had thought that such a correction improved the verification statistics.  However, I mis-named the variable that I applied the correction to, so all of my test reconstructions were run with just the stations scaled by the eigenvector weights and no correction.  When I figured that out and actually applied the correction, the verification statistics were degraded a bit.<br />
.<br />
And then, after thinking about it, I started wondering why I thought that would work in the first place.<br />
.<br />
Sometimes . . . not so smart is I.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan O</title>
		<link>http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/#comment-184509</link>
		<dc:creator><![CDATA[Ryan O]]></dc:creator>
		<pubDate>Thu, 25 Jun 2009 16:07:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=6071#comment-184509</guid>
		<description><![CDATA[Ignore all of the above crap.  Working with Latex apparently drained my brain.
.
I redid this explanation in Word and posed it as a question to Dr. Beckers.  The .pdf - with a MUCH improved explanation of what I did - is here:
.
http://www.mediafire.com/file/eczactfnzyw/Dr_Beckers.pdf]]></description>
		<content:encoded><![CDATA[<p>Ignore all of the above crap.  Working with Latex apparently drained my brain.<br />
.<br />
I redid this explanation in Word and posed it as a question to Dr. Beckers.  The .pdf &#8211; with a MUCH improved explanation of what I did &#8211; is here:<br />
.<br />
<a href="http://www.mediafire.com/file/eczactfnzyw/Dr_Beckers.pdf" rel="nofollow">http://www.mediafire.com/file/eczactfnzyw/Dr_Beckers.pdf</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan O</title>
		<link>http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/#comment-184508</link>
		<dc:creator><![CDATA[Ryan O]]></dc:creator>
		<pubDate>Wed, 24 Jun 2009 17:24:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=6071#comment-184508</guid>
		<description><![CDATA[Okay . . . I&#039;ll have to learn Latex better.  Here&#039;s the equations using Word.  Some of the symbols are a tad different due to Word&#039;s limitations:
.
]]></description>
		<content:encoded><![CDATA[<p>Okay . . . I&#8217;ll have to learn Latex better.  Here&#8217;s the equations using Word.  Some of the symbols are a tad different due to Word&#8217;s limitations:<br />
.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan O</title>
		<link>http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/#comment-184507</link>
		<dc:creator><![CDATA[Ryan O]]></dc:creator>
		<pubDate>Wed, 24 Jun 2009 15:47:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=6071#comment-184507</guid>
		<description><![CDATA[Re: &lt;a href=&quot;#comment-346863&quot; rel=&quot;nofollow&quot;&gt;Ryan O (#84)&lt;/a&gt;, I give up.  Eqn. 8 describes a matrix with the scaled station data times $latex \alpha $ placed beside PC 1.]]></description>
		<content:encoded><![CDATA[<p>Re: <a href="#comment-346863" rel="nofollow">Ryan O (#84)</a>, I give up.  Eqn. 8 describes a matrix with the scaled station data times <img src='http://s0.wp.com/latex.php?latex=%5Calpha+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha ' title='&#92;alpha ' class='latex' /> placed beside PC 1.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan O</title>
		<link>http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/#comment-184506</link>
		<dc:creator><![CDATA[Ryan O]]></dc:creator>
		<pubDate>Wed, 24 Jun 2009 15:46:06 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=6071#comment-184506</guid>
		<description><![CDATA[Re: &lt;a href=&quot;#comment-346861&quot; rel=&quot;nofollow&quot;&gt;Ryan O (#83)&lt;/a&gt;,
.
Stupid human.
.
Eqn. 8:  $latex \begin{array}\alpha*Y_1 &amp; x_1\end{array} $]]></description>
		<content:encoded><![CDATA[<p>Re: <a href="#comment-346861" rel="nofollow">Ryan O (#83)</a>,<br />
.<br />
Stupid human.<br />
.<br />
Eqn. 8:  <img src='http://s0.wp.com/latex.php?latex=%5Cbegin%7Barray%7D%5Calpha%2AY_1+%26+x_1%5Cend%7Barray%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;begin{array}&#92;alpha*Y_1 &amp; x_1&#92;end{array} ' title='&#92;begin{array}&#92;alpha*Y_1 &amp; x_1&#92;end{array} ' class='latex' /></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan O</title>
		<link>http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/#comment-184505</link>
		<dc:creator><![CDATA[Ryan O]]></dc:creator>
		<pubDate>Wed, 24 Jun 2009 15:43:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=6071#comment-184505</guid>
		<description><![CDATA[Re: &lt;a href=&quot;#comment-346859&quot; rel=&quot;nofollow&quot;&gt;Ryan O (#82)&lt;/a&gt;,
.
Eqn. in #2 should read:  $latex x_N $
.
#4 should read:  Place PC 1 ( $latex x_1 $ ) next to the completely imputed station data.
.
Eqn. in #8 should read: ( $latex \alpha*Y_1,PCscaled x_n $ )
.
Those &quot;PCscaled&quot; and &quot;eigenscaled&quot; notes should be part of the subscripts.  They&#039;re just descriptors.]]></description>
		<content:encoded><![CDATA[<p>Re: <a href="#comment-346859" rel="nofollow">Ryan O (#82)</a>,<br />
.<br />
Eqn. in #2 should read:  <img src='http://s0.wp.com/latex.php?latex=x_N+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_N ' title='x_N ' class='latex' /><br />
.<br />
#4 should read:  Place PC 1 ( <img src='http://s0.wp.com/latex.php?latex=x_1+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_1 ' title='x_1 ' class='latex' /> ) next to the completely imputed station data.<br />
.<br />
Eqn. in #8 should read: ( <img src='http://s0.wp.com/latex.php?latex=%5Calpha%2AY_1%2CPCscaled+x_n+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha*Y_1,PCscaled x_n ' title='&#92;alpha*Y_1,PCscaled x_n ' class='latex' /> )<br />
.<br />
Those &#8220;PCscaled&#8221; and &#8220;eigenscaled&#8221; notes should be part of the subscripts.  They&#8217;re just descriptors.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan O</title>
		<link>http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/#comment-184504</link>
		<dc:creator><![CDATA[Ryan O]]></dc:creator>
		<pubDate>Wed, 24 Jun 2009 15:37:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=6071#comment-184504</guid>
		<description><![CDATA[Re: &lt;a href=&quot;#comment-346833&quot; rel=&quot;nofollow&quot;&gt;RomanM (#80)&lt;/a&gt;, I don&#039;t know whether this will change your opinion, but I think I need to get off my ass and write down the math for the latest algorithm.  The algorithm no longer does a simple regression of station data and the PCs; it explicitly uses the eigenvector to weight the station data and performs the regression on the weighted matrix.  The steps to get to this point are:
.
1.  Perform RegEM on the station data without the PCs.
.
2.  Find the PCs ($latex x_(1,...,N) $) for the AVHRR data:
.
$latex SVD(AVHRR\prime)=USV\prime $
$latex x_n=S_n V_n\prime $
.
3.  Extract the spatial weights for each PC at the station locations.  Let&#039;s call this matrix $latex \tilde{U} $
.
4.  Place PC 1 $latex x_1 $ next to the completely imputed station data.
.
The next few steps take place within my modified RegEM algorithm:
.
5.  Scale all series to unit variance.
.
6.  Scale the station data by the appropriate vector from $latex \tilde{U} $ and rescale such that the range of $latex \tilde{U_1} $ equals 1.  This is for consistency - it maintains the same scaling of station data relative to PC regardless of which PC you are imputing.
.
$latex Y_1(eigenscaled) = Y_1 * \tilde{U_1} $
$latex Y_1(PCscaled) = \frac{Y_1(eigenscaled)}{\sum{&#124;\tilde{U_1}&#124;}} $
.
7.  Multiply $latex Y_1(PCscaled) $ by a weighting factor $latex \alpha $ (default to 1).  The reason for this is to provide a means for performing the reconstruction as an extrapolation of the PCs (small $latex \alpha $) by de-emphasizing the station data, or as an interpolation (large $latex \alpha $) by emphasizing the station data.  How this works is clear from the next step.
.
8.  In RegEM, we now have the following matrix:
.
$latex ( \alpha Y_1(PCscaled) x_1 ) $
.
We perform the SVD &lt;i&gt;directly&lt;/i&gt; on this matrix.  Because each series is scaled by the appropriate constant from $latex \tilde{U_1} $, series where the PC explains the most variation will be selected as the low-order PCs, while series where the PC explains little variation will be relegated to the higher-order PCs.  This ensures that the stations representing high-weight regions are represented in the SVD of the matrix.  It also maximizes the contribution to the total error of the high-weight stations during the OLS regression, which causes the high-weight stations to drive the imputation.
.
Now the purpose of $latex \alpha $ is apparent:  small $latex \alpha $ weights errors in the PC high, so minimizing imputation error in the PC minimizes the overall error.  This results in an extrapolation of the PCs.  High $latex \alpha $ weights errors in the stations high, so minimizing imputation error in the stations minimizes the overall error.  The latter forces the PC to the station solution.  This results in anchoring the original eigenvector $latex U $ at the station values during the reconstruction, and using the coefficients to predict temperature at locations where the stations are not present - in other words, $latex U $ becomes analogous to the kriging estimator for kriging.
.
9.  Once the algorithm converges, we extract the imputed PC, $latex \hat{x_1} $.
.
10.  At this point, we can recover the temperature contribution from the imputed PC 1:
.
$latex \tilde(T) = U_1 \hat{x_1} $
.
The reason I felt it was necessary to remove the contribution from PC 1 before calculating PC 2 is because of the $latex Y_1(eigenscaled) = Y_1 * \tilde{U_1} $ step.  If you did not do this step, then I agree that removing the contribution is unnecessary.  However, weighting the stations changes the regression results.  As each PC has its own set of station weights in $latex \tilde{U} $ - and these station weights were derived by explaining &lt;i&gt;remaining&lt;/i&gt; variation after the prior PCs had been calculated - then for each subsequent regression to be valid, you need to remove the contribution of the prior PCs.  Failure to do so results in convergence to a different (albeit similar) answer.
.
When I do this by removing the contribution of prior PCs, the net effect is to give a reconstruction solution that takes higher regpars to achieve if the contribution is not removed.  I&#039;m not totally sure I&#039;m on the right track, though.
.
Re: &lt;a href=&quot;#comment-346836&quot; rel=&quot;nofollow&quot;&gt;Steve McIntyre (#81)&lt;/a&gt;, This latest algorithm does explicitly use the geographic information.  Also, it&#039;s faster than the previous ones - so I&#039;ve done test reconstructions using 100+ PCs.  We could easily do reconstructions using ALL of the PCs, which makes determining the truncation point simple.  After ~13 it doesn&#039;t matter.  You don&#039;t have to worry about including too many - because you &lt;i&gt;can&#039;t&lt;/i&gt; include too many.
.
Also, while I am interested in the average, I am also interested in the geographic distribution of the trends since it was the odd geographical distribution that made the Steig paper so noteworthy.  The last 7 methods (or thereabouts) Jeff Id and I have done all show the same geographic distribution.  The only outlier is Steig&#039;s.
.
*presses submit and prays the Latex worked...*]]></description>
		<content:encoded><![CDATA[<p>Re: <a href="#comment-346833" rel="nofollow">RomanM (#80)</a>, I don&#8217;t know whether this will change your opinion, but I think I need to get off my ass and write down the math for the latest algorithm.  The algorithm no longer does a simple regression of station data and the PCs; it explicitly uses the eigenvector to weight the station data and performs the regression on the weighted matrix.  The steps to get to this point are:<br />
.<br />
1.  Perform RegEM on the station data without the PCs.<br />
.<br />
2.  Find the PCs (<img src='http://s0.wp.com/latex.php?latex=x_%281%2C...%2CN%29+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_(1,...,N) ' title='x_(1,...,N) ' class='latex' />) for the AVHRR data:<br />
.<br />
<img src='http://s0.wp.com/latex.php?latex=SVD%28AVHRR%5Cprime%29%3DUSV%5Cprime+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='SVD(AVHRR&#92;prime)=USV&#92;prime ' title='SVD(AVHRR&#92;prime)=USV&#92;prime ' class='latex' /><br />
<img src='http://s0.wp.com/latex.php?latex=x_n%3DS_n+V_n%5Cprime+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_n=S_n V_n&#92;prime ' title='x_n=S_n V_n&#92;prime ' class='latex' /><br />
.<br />
3.  Extract the spatial weights for each PC at the station locations.  Let&#8217;s call this matrix <img src='http://s0.wp.com/latex.php?latex=%5Ctilde%7BU%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde{U} ' title='&#92;tilde{U} ' class='latex' /><br />
.<br />
4.  Place PC 1 <img src='http://s0.wp.com/latex.php?latex=x_1+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='x_1 ' title='x_1 ' class='latex' /> next to the completely imputed station data.<br />
.<br />
The next few steps take place within my modified RegEM algorithm:<br />
.<br />
5.  Scale all series to unit variance.<br />
.<br />
6.  Scale the station data by the appropriate vector from <img src='http://s0.wp.com/latex.php?latex=%5Ctilde%7BU%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde{U} ' title='&#92;tilde{U} ' class='latex' /> and rescale such that the range of <img src='http://s0.wp.com/latex.php?latex=%5Ctilde%7BU_1%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde{U_1} ' title='&#92;tilde{U_1} ' class='latex' /> equals 1.  This is for consistency &#8211; it maintains the same scaling of station data relative to PC regardless of which PC you are imputing.<br />
.<br />
<img src='http://s0.wp.com/latex.php?latex=Y_1%28eigenscaled%29+%3D+Y_1+%2A+%5Ctilde%7BU_1%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Y_1(eigenscaled) = Y_1 * &#92;tilde{U_1} ' title='Y_1(eigenscaled) = Y_1 * &#92;tilde{U_1} ' class='latex' /><br />
<img src='http://s0.wp.com/latex.php?latex=Y_1%28PCscaled%29+%3D+%5Cfrac%7BY_1%28eigenscaled%29%7D%7B%5Csum%7B%7C%5Ctilde%7BU_1%7D%7C%7D%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Y_1(PCscaled) = &#92;frac{Y_1(eigenscaled)}{&#92;sum{|&#92;tilde{U_1}|}} ' title='Y_1(PCscaled) = &#92;frac{Y_1(eigenscaled)}{&#92;sum{|&#92;tilde{U_1}|}} ' class='latex' /><br />
.<br />
7.  Multiply <img src='http://s0.wp.com/latex.php?latex=Y_1%28PCscaled%29+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Y_1(PCscaled) ' title='Y_1(PCscaled) ' class='latex' /> by a weighting factor <img src='http://s0.wp.com/latex.php?latex=%5Calpha+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha ' title='&#92;alpha ' class='latex' /> (default to 1).  The reason for this is to provide a means for performing the reconstruction as an extrapolation of the PCs (small <img src='http://s0.wp.com/latex.php?latex=%5Calpha+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha ' title='&#92;alpha ' class='latex' />) by de-emphasizing the station data, or as an interpolation (large <img src='http://s0.wp.com/latex.php?latex=%5Calpha+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha ' title='&#92;alpha ' class='latex' />) by emphasizing the station data.  How this works is clear from the next step.<br />
.<br />
8.  In RegEM, we now have the following matrix:<br />
.<br />
<img src='http://s0.wp.com/latex.php?latex=%28+%5Calpha+Y_1%28PCscaled%29+x_1+%29+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='( &#92;alpha Y_1(PCscaled) x_1 ) ' title='( &#92;alpha Y_1(PCscaled) x_1 ) ' class='latex' /><br />
.<br />
We perform the SVD <i>directly</i> on this matrix.  Because each series is scaled by the appropriate constant from <img src='http://s0.wp.com/latex.php?latex=%5Ctilde%7BU_1%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde{U_1} ' title='&#92;tilde{U_1} ' class='latex' />, series where the PC explains the most variation will be selected as the low-order PCs, while series where the PC explains little variation will be relegated to the higher-order PCs.  This ensures that the stations representing high-weight regions are represented in the SVD of the matrix.  It also maximizes the contribution to the total error of the high-weight stations during the OLS regression, which causes the high-weight stations to drive the imputation.<br />
.<br />
Now the purpose of <img src='http://s0.wp.com/latex.php?latex=%5Calpha+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha ' title='&#92;alpha ' class='latex' /> is apparent:  small <img src='http://s0.wp.com/latex.php?latex=%5Calpha+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha ' title='&#92;alpha ' class='latex' /> weights errors in the PC high, so minimizing imputation error in the PC minimizes the overall error.  This results in an extrapolation of the PCs.  High <img src='http://s0.wp.com/latex.php?latex=%5Calpha+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;alpha ' title='&#92;alpha ' class='latex' /> weights errors in the stations high, so minimizing imputation error in the stations minimizes the overall error.  The latter forces the PC to the station solution.  This results in anchoring the original eigenvector <img src='http://s0.wp.com/latex.php?latex=U+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U ' title='U ' class='latex' /> at the station values during the reconstruction, and using the coefficients to predict temperature at locations where the stations are not present &#8211; in other words, <img src='http://s0.wp.com/latex.php?latex=U+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='U ' title='U ' class='latex' /> becomes analogous to the kriging estimator for kriging.<br />
.<br />
9.  Once the algorithm converges, we extract the imputed PC, <img src='http://s0.wp.com/latex.php?latex=%5Chat%7Bx_1%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;hat{x_1} ' title='&#92;hat{x_1} ' class='latex' />.<br />
.<br />
10.  At this point, we can recover the temperature contribution from the imputed PC 1:<br />
.<br />
<img src='http://s0.wp.com/latex.php?latex=%5Ctilde%28T%29+%3D+U_1+%5Chat%7Bx_1%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde(T) = U_1 &#92;hat{x_1} ' title='&#92;tilde(T) = U_1 &#92;hat{x_1} ' class='latex' /><br />
.<br />
The reason I felt it was necessary to remove the contribution from PC 1 before calculating PC 2 is because of the <img src='http://s0.wp.com/latex.php?latex=Y_1%28eigenscaled%29+%3D+Y_1+%2A+%5Ctilde%7BU_1%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='Y_1(eigenscaled) = Y_1 * &#92;tilde{U_1} ' title='Y_1(eigenscaled) = Y_1 * &#92;tilde{U_1} ' class='latex' /> step.  If you did not do this step, then I agree that removing the contribution is unnecessary.  However, weighting the stations changes the regression results.  As each PC has its own set of station weights in <img src='http://s0.wp.com/latex.php?latex=%5Ctilde%7BU%7D+&amp;bg=ffffff&amp;fg=000&amp;s=0' alt='&#92;tilde{U} ' title='&#92;tilde{U} ' class='latex' /> &#8211; and these station weights were derived by explaining <i>remaining</i> variation after the prior PCs had been calculated &#8211; then for each subsequent regression to be valid, you need to remove the contribution of the prior PCs.  Failure to do so results in convergence to a different (albeit similar) answer.<br />
.<br />
When I do this by removing the contribution of prior PCs, the net effect is to give a reconstruction solution that takes higher regpars to achieve if the contribution is not removed.  I&#8217;m not totally sure I&#8217;m on the right track, though.<br />
.<br />
Re: <a href="#comment-346836" rel="nofollow">Steve McIntyre (#81)</a>, This latest algorithm does explicitly use the geographic information.  Also, it&#8217;s faster than the previous ones &#8211; so I&#8217;ve done test reconstructions using 100+ PCs.  We could easily do reconstructions using ALL of the PCs, which makes determining the truncation point simple.  After ~13 it doesn&#8217;t matter.  You don&#8217;t have to worry about including too many &#8211; because you <i>can&#8217;t</i> include too many.<br />
.<br />
Also, while I am interested in the average, I am also interested in the geographic distribution of the trends since it was the odd geographical distribution that made the Steig paper so noteworthy.  The last 7 methods (or thereabouts) Jeff Id and I have done all show the same geographic distribution.  The only outlier is Steig&#8217;s.<br />
.<br />
*presses submit and prays the Latex worked&#8230;*</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve McIntyre</title>
		<link>http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/#comment-184503</link>
		<dc:creator><![CDATA[Steve McIntyre]]></dc:creator>
		<pubDate>Wed, 24 Jun 2009 12:37:58 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=6071#comment-184503</guid>
		<description><![CDATA[One point about infilling using E/M methods (or PCs or this sort of thing) - they don&#039;t explicitly use geographic information.

On the other hand, area-averaging methods such as ore reserves explicitly use the geometry.

Here are a couple of thoughts.

I think that we can say that there is very strong evidence for spatial autocorrelation with more or less negative exponential decorrelation. (Far more evidence for this than Steig&#039;s 3 &lt;strong&gt;physical&lt;/strong&gt; modes).  This should enable some analytic simplifications that simply cut through the PC problem altogether.  I don&#039;t think that there&#039;s any evidence of real modes. And I don&#039;t see any purpose in truncating things merely for computational reasons without trying to think through the math. Maybe there&#039;s an analytical sum of series somewhere.

Second, our interest at any given time is not really in the infilled values at particular stations but in the Antarctic continental average at any given time given spotty measurements. On a spatial autocorrelation basis, this is an inverse of the Fourier heat equation (which also makes sense from first principles) - which is going to be approximated by the disk case where you have a value at the center and some values on the boundary. My guess is that the analytic solution to this sort of problem is a form of area-averaging.  These things usually end up with a geometric connection.]]></description>
		<content:encoded><![CDATA[<p>One point about infilling using E/M methods (or PCs or this sort of thing) &#8211; they don&#8217;t explicitly use geographic information.</p>
<p>On the other hand, area-averaging methods such as ore reserves explicitly use the geometry.</p>
<p>Here are a couple of thoughts.</p>
<p>I think that we can say that there is very strong evidence for spatial autocorrelation with more or less negative exponential decorrelation. (Far more evidence for this than Steig&#8217;s 3 <strong>physical</strong> modes).  This should enable some analytic simplifications that simply cut through the PC problem altogether.  I don&#8217;t think that there&#8217;s any evidence of real modes. And I don&#8217;t see any purpose in truncating things merely for computational reasons without trying to think through the math. Maybe there&#8217;s an analytical sum of series somewhere.</p>
<p>Second, our interest at any given time is not really in the infilled values at particular stations but in the Antarctic continental average at any given time given spotty measurements. On a spatial autocorrelation basis, this is an inverse of the Fourier heat equation (which also makes sense from first principles) &#8211; which is going to be approximated by the disk case where you have a value at the center and some values on the boundary. My guess is that the analytic solution to this sort of problem is a form of area-averaging.  These things usually end up with a geometric connection.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: RomanM</title>
		<link>http://climateaudit.org/2009/06/02/ttls-in-a-steig-context/#comment-184502</link>
		<dc:creator><![CDATA[RomanM]]></dc:creator>
		<pubDate>Wed, 24 Jun 2009 11:12:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=6071#comment-184502</guid>
		<description><![CDATA[Re: &lt;a href=&quot;#comment-346794&quot; rel=&quot;nofollow&quot;&gt;Ryan O (#79)&lt;/a&gt;,
Ryan, it&#039;s not clear to me that, under the circumstances, any &quot;correction&quot; is necessary.

I assume that what you are talking about here is as follows:

-- The station data is infilled for the entire period first (either by itself or by using the satellite PCS to infill the post 1982 period and then extended to the earlier times).

-- This infilled station data is used to extend the PCs.

If this is the case, then the station data is being used merely to predict the PCs and predicting the first PC does not &quot;use up&quot; information which is no longer available to predict the others.  In a simple regression situation, if I am predicting two variables using a fixed set of predictors, the least squares solution for the simultaneous prediction is numerically the same as that of the separate individual regressions.  Of course, this latter fact is not true for total least squares regression, but the underlying concepts appear to be the same.

Where your notion of adjusting the predictors comes from is the decomposition of the variability of the station data in which quite correctly, if a PC summarizes some of the variability  of the station temperature structure, it should be removed from the data before the next PC is calculated.  That is not quite the situation you are looking at.

You could also try the following sequential approach as a sort of in-between approach:  Take the station data and impute the first PC,  Then attach that to the station data matrix and use the new structure to impute the second PC (keeping the firsr PC imputuation fixed), etc.  This way, previous PCs have an effect on the later imputations (but not vice versa).

If I missed a step on what you are doing, please correct me and I will comment further.

Good stuff on getting a better handle on a not-so-transparent technique.]]></description>
		<content:encoded><![CDATA[<p>Re: <a href="#comment-346794" rel="nofollow">Ryan O (#79)</a>,<br />
Ryan, it&#8217;s not clear to me that, under the circumstances, any &#8220;correction&#8221; is necessary.</p>
<p>I assume that what you are talking about here is as follows:</p>
<p>&#8211; The station data is infilled for the entire period first (either by itself or by using the satellite PCS to infill the post 1982 period and then extended to the earlier times).</p>
<p>&#8211; This infilled station data is used to extend the PCs.</p>
<p>If this is the case, then the station data is being used merely to predict the PCs and predicting the first PC does not &#8220;use up&#8221; information which is no longer available to predict the others.  In a simple regression situation, if I am predicting two variables using a fixed set of predictors, the least squares solution for the simultaneous prediction is numerically the same as that of the separate individual regressions.  Of course, this latter fact is not true for total least squares regression, but the underlying concepts appear to be the same.</p>
<p>Where your notion of adjusting the predictors comes from is the decomposition of the variability of the station data in which quite correctly, if a PC summarizes some of the variability  of the station temperature structure, it should be removed from the data before the next PC is calculated.  That is not quite the situation you are looking at.</p>
<p>You could also try the following sequential approach as a sort of in-between approach:  Take the station data and impute the first PC,  Then attach that to the station data matrix and use the new structure to impute the second PC (keeping the firsr PC imputuation fixed), etc.  This way, previous PCs have an effect on the later imputations (but not vice versa).</p>
<p>If I missed a step on what you are doing, please correct me and I will comment further.</p>
<p>Good stuff on getting a better handle on a not-so-transparent technique.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
