<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Martin Ringo on Principal Components</title>
	<atom:link href="http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/feed/" rel="self" type="application/rss+xml" />
	<link>http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/</link>
	<description>by Steve McIntyre</description>
	<lastBuildDate>Fri, 24 May 2013 22:47:12 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: David Stockwell</title>
		<link>http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/#comment-43950</link>
		<dc:creator><![CDATA[David Stockwell]]></dc:creator>
		<pubDate>Tue, 21 Feb 2006 15:33:01 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=532#comment-43950</guid>
		<description><![CDATA[Re #30.  There are three possible outcomes of plotting error against number of PCs. (1) errors decrease with increasing PCs (Mann&#039;s maintained hypothesis) (2) errors are unchanged with increasing PC&#039;s (Martin&#039;s hypotheisi) and (3) errors increase with increasing PC&#039;s (David&#039;s hypothesis).  This seems like a nice experiment and I will be interested in seeing the result!

Re #30 re #28.  I am the same way.  I have generally found z tests to underestimate errors, and like to construct simulations to understand what is going on.  As to means, I have a theory that the temperature of the MWP from the reconstructions is related to the normalizing temperature for the series.  If I get a chance I will collate and plot the zero anomoly temperature against the MWP temperature for a number of series and see if there is a positive slope! Once again, not proof, but suggestive of these proxies are simply expressing hidden &#039;maintained hyportheses&#039;.]]></description>
		<content:encoded><![CDATA[<p>Re #30.  There are three possible outcomes of plotting error against number of PCs. (1) errors decrease with increasing PCs (Mann&#8217;s maintained hypothesis) (2) errors are unchanged with increasing PC&#8217;s (Martin&#8217;s hypotheisi) and (3) errors increase with increasing PC&#8217;s (David&#8217;s hypothesis).  This seems like a nice experiment and I will be interested in seeing the result!</p>
<p>Re #30 re #28.  I am the same way.  I have generally found z tests to underestimate errors, and like to construct simulations to understand what is going on.  As to means, I have a theory that the temperature of the MWP from the reconstructions is related to the normalizing temperature for the series.  If I get a chance I will collate and plot the zero anomoly temperature against the MWP temperature for a number of series and see if there is a positive slope! Once again, not proof, but suggestive of these proxies are simply expressing hidden &#8216;maintained hyportheses&#8217;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Martin Ringo</title>
		<link>http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/#comment-43949</link>
		<dc:creator><![CDATA[Martin Ringo]]></dc:creator>
		<pubDate>Sun, 19 Feb 2006 06:57:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=532#comment-43949</guid>
		<description><![CDATA[Re #27, David,
I do not believe that the will a great deal of sensitivity in &quot;most cases&quot; for changes in the number of principal components used.  The reason is that in most cases (my that I mean most of the parameter space for a given model of the Data Generating Process, &quot;DGP,&quot;) the relevant statistics are close to 1 or 0.  I have been playing around (euphemism for I&#039;m still debugging parts of the model) and there are specifications that will bring the &quot;full&quot; and &quot;short&quot; reconstructions closer together.  I will run the sensitivity to the number of PCs that in the near future.

Re #28, because I am a simple minded old guy (or maybe because I live most of my working life in a business world), I like to see the numbers and the math.  So I try to build my simulations in spreadsheets first with a much simplified version. The &quot;Can one reconstruct from principal components with short sample weighting&quot; issue -- that&#039;s a mouthful -- becomes very clear in the spreadsheet:  no REs, no rho-squareds, not even Theil&#039;s Us, just some simple graphs and the interocular test.  The results hit you between the eyes:  with weightings (the post-multiplying matrix) from the full sample (which can&#039;t be done in reality), one can reconstruct; with weightings from the short sample, one can&#039;t.

Note, this does not rule out reconstruction from one or a few proxies to one temperature series.  But it cast considerable doubt on the reconstruction from many proxies to a many temperature or to a temperature aggregate series.  Making this claim truly applicable resolves on how well one can model the range of reasonable specifications of temperature.  If temperature were strictly periodic with, say, white noise and with constant spatial correlation, then short sampling would do pretty well.  It is easy to go to a fully randomly periodic variation, but how does one move on the continuum between the two?  Or how much should the spatial correlations be allowed to change over time?  All these things make a big difference in the results.  The MBH98 type of reconstruction is a complicated way of saying that the relevant temperature patterns of the past are sufficiently strong in the current instrument record that even if we had the full record, we would change how much to weight PC1, PC2, etc. to reconstruct temperature series A and so forth for series B, C ...  That condition is a &quot;maintained hypothesis&quot; (untested and assumed a priori to be true, and I think unstated) of the MBH98 model.  They would say my type simulations only apply if the simulations could produce short sample weightings that reconstructed temperature approximately correctly.  They would probably phrase it in the form that the work is irrelevant unless it simulated strong forcings that affected all series approximately the same way. {Ever think about why someone would have to run a climate model to test if the MBH98 transformation mined for hockey stick?  Running a tree growth model (e.g. TreeGrOSS, PTAEDA, STIM, TREGRO -- go take a look at it to see what the denrochronology guys are missing by not teaming up with the commercial forestry guys) or maybe something that David figured out in his kind of modeling might make sense as the underlying DGP, but not a climate model. It merely serves to restrict the DGP of the proxies, not estimate it from empirical data.}

Finally note, while we can never test the maintained hypothesis on temperature, we can do a &quot;proxy&quot; test, in both senses of the word.  Can the proxy variable series be reconstructed from the short sample?  If the proxy variable sets can&#039;t be reconstructed by principal component analysis, then there is more than reasonable doubt as to efficacy of doing so on the temperatures.  Any interest?]]></description>
		<content:encoded><![CDATA[<p>Re #27, David,<br />
I do not believe that the will a great deal of sensitivity in &#8220;most cases&#8221; for changes in the number of principal components used.  The reason is that in most cases (my that I mean most of the parameter space for a given model of the Data Generating Process, &#8220;DGP,&#8221;) the relevant statistics are close to 1 or 0.  I have been playing around (euphemism for I&#8217;m still debugging parts of the model) and there are specifications that will bring the &#8220;full&#8221; and &#8220;short&#8221; reconstructions closer together.  I will run the sensitivity to the number of PCs that in the near future.</p>
<p>Re #28, because I am a simple minded old guy (or maybe because I live most of my working life in a business world), I like to see the numbers and the math.  So I try to build my simulations in spreadsheets first with a much simplified version. The &#8220;Can one reconstruct from principal components with short sample weighting&#8221; issue &#8212; that&#8217;s a mouthful &#8212; becomes very clear in the spreadsheet:  no REs, no rho-squareds, not even Theil&#8217;s Us, just some simple graphs and the interocular test.  The results hit you between the eyes:  with weightings (the post-multiplying matrix) from the full sample (which can&#8217;t be done in reality), one can reconstruct; with weightings from the short sample, one can&#8217;t.</p>
<p>Note, this does not rule out reconstruction from one or a few proxies to one temperature series.  But it cast considerable doubt on the reconstruction from many proxies to a many temperature or to a temperature aggregate series.  Making this claim truly applicable resolves on how well one can model the range of reasonable specifications of temperature.  If temperature were strictly periodic with, say, white noise and with constant spatial correlation, then short sampling would do pretty well.  It is easy to go to a fully randomly periodic variation, but how does one move on the continuum between the two?  Or how much should the spatial correlations be allowed to change over time?  All these things make a big difference in the results.  The MBH98 type of reconstruction is a complicated way of saying that the relevant temperature patterns of the past are sufficiently strong in the current instrument record that even if we had the full record, we would change how much to weight PC1, PC2, etc. to reconstruct temperature series A and so forth for series B, C &#8230;  That condition is a &#8220;maintained hypothesis&#8221; (untested and assumed a priori to be true, and I think unstated) of the MBH98 model.  They would say my type simulations only apply if the simulations could produce short sample weightings that reconstructed temperature approximately correctly.  They would probably phrase it in the form that the work is irrelevant unless it simulated strong forcings that affected all series approximately the same way. {Ever think about why someone would have to run a climate model to test if the MBH98 transformation mined for hockey stick?  Running a tree growth model (e.g. TreeGrOSS, PTAEDA, STIM, TREGRO &#8212; go take a look at it to see what the denrochronology guys are missing by not teaming up with the commercial forestry guys) or maybe something that David figured out in his kind of modeling might make sense as the underlying DGP, but not a climate model. It merely serves to restrict the DGP of the proxies, not estimate it from empirical data.}</p>
<p>Finally note, while we can never test the maintained hypothesis on temperature, we can do a &#8220;proxy&#8221; test, in both senses of the word.  Can the proxy variable series be reconstructed from the short sample?  If the proxy variable sets can&#8217;t be reconstructed by principal component analysis, then there is more than reasonable doubt as to efficacy of doing so on the temperatures.  Any interest?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark</title>
		<link>http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/#comment-43948</link>
		<dc:creator><![CDATA[Mark]]></dc:creator>
		<pubDate>Thu, 16 Feb 2006 16:25:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=532#comment-43948</guid>
		<description><![CDATA[Steve S.  Agreed.  At least, I think that &quot;related&quot; fields provide a different perspective.  There&#039;s a common math underlying any form of signal processing (that&#039;s all this is), but a different viewpoint for each area of interest.

I&#039;m still thinking about ways to extract what we want, btw.  What I do for radar (ECM as noted) correlates noisy elements against a known signal or primary element, which is something not necessarily present in climate science (if we had a 1000 year true temperature record we could, but we don&#039;t).  perhaps we could &quot;test&quot; the known flawed proxies against the noisy ones to show the correlation between them?

Mark]]></description>
		<content:encoded><![CDATA[<p>Steve S.  Agreed.  At least, I think that &#8220;related&#8221; fields provide a different perspective.  There&#8217;s a common math underlying any form of signal processing (that&#8217;s all this is), but a different viewpoint for each area of interest.</p>
<p>I&#8217;m still thinking about ways to extract what we want, btw.  What I do for radar (ECM as noted) correlates noisy elements against a known signal or primary element, which is something not necessarily present in climate science (if we had a 1000 year true temperature record we could, but we don&#8217;t).  perhaps we could &#8220;test&#8221; the known flawed proxies against the noisy ones to show the correlation between them?</p>
<p>Mark</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Stockwell</title>
		<link>http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/#comment-43947</link>
		<dc:creator><![CDATA[David Stockwell]]></dc:creator>
		<pubDate>Thu, 16 Feb 2006 15:11:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=532#comment-43947</guid>
		<description><![CDATA[The more I look at these graphs I think Martin is right.  This method couldn&#039;t even predict the mean.  You can see this by imagining a process where the proxies are random series with a given mean, from which a small number are chosen based on correlation with temperatures at the end.  Then the reconstruction of those selected series is simply the mean - which is an arbitrary parameter!  Looking at the reconstructions, including Moberg, Esper and others, you could see (reading backwards in time) the height of the MWP as a reversion of a random series to a predetermined mean, after a purturbation due to the biased selection for a downward impulse matching temperatures.]]></description>
		<content:encoded><![CDATA[<p>The more I look at these graphs I think Martin is right.  This method couldn&#8217;t even predict the mean.  You can see this by imagining a process where the proxies are random series with a given mean, from which a small number are chosen based on correlation with temperatures at the end.  Then the reconstruction of those selected series is simply the mean &#8211; which is an arbitrary parameter!  Looking at the reconstructions, including Moberg, Esper and others, you could see (reading backwards in time) the height of the MWP as a reversion of a random series to a predetermined mean, after a purturbation due to the biased selection for a downward impulse matching temperatures.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Stockwell</title>
		<link>http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/#comment-43946</link>
		<dc:creator><![CDATA[David Stockwell]]></dc:creator>
		<pubDate>Thu, 16 Feb 2006 14:22:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=532#comment-43946</guid>
		<description><![CDATA[Re: #19 It would be interesting to see the error predicting the known long series with the short clibration series, plotted against number of Neofs. That would tell us the number that maximizes accuracy.  I would do it myself if the script was in R.]]></description>
		<content:encoded><![CDATA[<p>Re: #19 It would be interesting to see the error predicting the known long series with the short clibration series, plotted against number of Neofs. That would tell us the number that maximizes accuracy.  I would do it myself if the script was in R.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Harry G</title>
		<link>http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/#comment-43945</link>
		<dc:creator><![CDATA[Harry G]]></dc:creator>
		<pubDate>Thu, 16 Feb 2006 12:42:41 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=532#comment-43945</guid>
		<description><![CDATA[Oops sorry - my apologies - I meant in the original article by Martin Ringo - getting old and dyslexic.

&lt;strong&gt;&lt;em&gt;John responds:&lt;/em&gt; Quite right! Equation updated. At least someone is awake on this blog....&lt;/strong&gt;]]></description>
		<content:encoded><![CDATA[<p>Oops sorry &#8211; my apologies &#8211; I meant in the original article by Martin Ringo &#8211; getting old and dyslexic.</p>
<p><strong><em>John responds:</em> Quite right! Equation updated. At least someone is awake on this blog&#8230;.</strong></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Harry G</title>
		<link>http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/#comment-43944</link>
		<dc:creator><![CDATA[Harry G]]></dc:creator>
		<pubDate>Thu, 16 Feb 2006 12:38:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=532#comment-43944</guid>
		<description><![CDATA[John

re your interjection in the original article by Mark - I think you missed out a square on the second Y hat.]]></description>
		<content:encoded><![CDATA[<p>John</p>
<p>re your interjection in the original article by Mark &#8211; I think you missed out a square on the second Y hat.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Sadlov</title>
		<link>http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/#comment-43943</link>
		<dc:creator><![CDATA[Steve Sadlov]]></dc:creator>
		<pubDate>Thu, 16 Feb 2006 02:03:37 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=532#comment-43943</guid>
		<description><![CDATA[Mark, no comment yet from Steve M, but here are my two pence. The more we can get people such as yourself, with experience in things like ECM, or folks with experience in noise management in high speed digital circuits, and the like, the more we can bring to bear the best the world has to offer in order to not only challenge the current &quot;climatological&quot; orthodoxy, but also, to come up with entirely new methodologies for analyzing the - pardon the pun - real climate!]]></description>
		<content:encoded><![CDATA[<p>Mark, no comment yet from Steve M, but here are my two pence. The more we can get people such as yourself, with experience in things like ECM, or folks with experience in noise management in high speed digital circuits, and the like, the more we can bring to bear the best the world has to offer in order to not only challenge the current &#8220;climatological&#8221; orthodoxy, but also, to come up with entirely new methodologies for analyzing the &#8211; pardon the pun &#8211; real climate!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark</title>
		<link>http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/#comment-43942</link>
		<dc:creator><![CDATA[Mark]]></dc:creator>
		<pubDate>Wed, 15 Feb 2006 23:21:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=532#comment-43942</guid>
		<description><![CDATA[The generalized sidelobe canceller is nothing more than the top-level description of a beamformer.  The underlying principle is simply to find the correlated elements across a signal set.  Assuming your primary signal (main-lobe from a directional antenna) is pointing at your desired signal, and each canceller element is omni-directional, the GSC actually steers nulls (spatially) towards sidelobe interferers.

That there are lobes in the directional portion of the desired signal is irrelevant to this discussion.  Where I noted similarity is the concept of finding the correlation across an array of proxies.  In this sort of system, the correlated bits are actually the desired signals (as opposed to the noise we want to reject in a radar application, for example), sort of a reverse of the canceller problem.  I.e. we are in a sense pulling the correlated signals out of the noise.

I typically used a form of a mofified Gram-Schmidt orthogonalization to implement the GSC and generate the weights - one per element or proxy in this case (do a search at the USPTO for &quot;Gerlach, Karl&quot;
 and you&#039;ll see the actual algorithm I&#039;m running).  The cascaded structure I use has computational benefits that aren&#039;t important when the process is not real-time.  However, I&#039;m wondering if a similar approach can be used, without a main-lobe analogy, to find a correlation across a proxy set and &quot;pull out&quot; the signals from the background noise.

Hopefully we haven&#039;t derailed to far off topic, Steve. :)

Mark]]></description>
		<content:encoded><![CDATA[<p>The generalized sidelobe canceller is nothing more than the top-level description of a beamformer.  The underlying principle is simply to find the correlated elements across a signal set.  Assuming your primary signal (main-lobe from a directional antenna) is pointing at your desired signal, and each canceller element is omni-directional, the GSC actually steers nulls (spatially) towards sidelobe interferers.</p>
<p>That there are lobes in the directional portion of the desired signal is irrelevant to this discussion.  Where I noted similarity is the concept of finding the correlation across an array of proxies.  In this sort of system, the correlated bits are actually the desired signals (as opposed to the noise we want to reject in a radar application, for example), sort of a reverse of the canceller problem.  I.e. we are in a sense pulling the correlated signals out of the noise.</p>
<p>I typically used a form of a mofified Gram-Schmidt orthogonalization to implement the GSC and generate the weights &#8211; one per element or proxy in this case (do a search at the USPTO for &#8220;Gerlach, Karl&#8221;<br />
 and you&#8217;ll see the actual algorithm I&#8217;m running).  The cascaded structure I use has computational benefits that aren&#8217;t important when the process is not real-time.  However, I&#8217;m wondering if a similar approach can be used, without a main-lobe analogy, to find a correlation across a proxy set and &#8220;pull out&#8221; the signals from the background noise.</p>
<p>Hopefully we haven&#8217;t derailed to far off topic, Steve. <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>Mark</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ian</title>
		<link>http://climateaudit.org/2006/02/15/martin-ringo-on-principal-components/#comment-43941</link>
		<dc:creator><![CDATA[Ian]]></dc:creator>
		<pubDate>Wed, 15 Feb 2006 23:06:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=532#comment-43941</guid>
		<description><![CDATA[Sorry that should say:

Even if the flawed proxies do not show up as noise at least they will be given reasonable weightings

The dangers of using voice recognition system.]]></description>
		<content:encoded><![CDATA[<p>Sorry that should say:</p>
<p>Even if the flawed proxies do not show up as noise at least they will be given reasonable weightings</p>
<p>The dangers of using voice recognition system.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
