<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Smoothing Dongge</title>
	<atom:link href="http://climateaudit.org/2008/12/06/smoothing-dongge/feed/" rel="self" type="application/rss+xml" />
	<link>http://climateaudit.org/2008/12/06/smoothing-dongge/</link>
	<description>by Steve McIntyre</description>
	<lastBuildDate>Wed, 19 Jun 2013 05:32:09 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: David Stockwell</title>
		<link>http://climateaudit.org/2008/12/06/smoothing-dongge/#comment-169487</link>
		<dc:creator><![CDATA[David Stockwell]]></dc:creator>
		<pubDate>Tue, 14 Jul 2009 04:59:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=4539#comment-169487</guid>
		<description><![CDATA[Does anyone know of a publication comparing the statistics of binned vs smoothed data? This impacts on the recent discussions regarding methods of smoothing, as you would think that binning would always be more reliable than smoothing on autocorrelated dta.  But I can&#039;t find any study or advice on it.  thanks]]></description>
		<content:encoded><![CDATA[<p>Does anyone know of a publication comparing the statistics of binned vs smoothed data? This impacts on the recent discussions regarding methods of smoothing, as you would think that binning would always be more reliable than smoothing on autocorrelated dta.  But I can&#8217;t find any study or advice on it.  thanks</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tony</title>
		<link>http://climateaudit.org/2008/12/06/smoothing-dongge/#comment-169486</link>
		<dc:creator><![CDATA[Tony]]></dc:creator>
		<pubDate>Mon, 22 Dec 2008 17:16:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=4539#comment-169486</guid>
		<description><![CDATA[Off topic:

Working in the digital signal processing area, I think what is missing in handling climate data is a better understanding (and knowledge) of sampled data. After all what we have here is discrete data series sampled from a continuous signal. Besides obvious errors with the sampling process (UHI et al) I wonder if people working in climate science fundamentally understand what it means to &quot;take a temperature sample&quot;. Basic Nyquist and Shannon stuff.

Besides, I think taking a &quot;true&quot; daily-average &quot;temperature sample&quot; at a weather-station is quite pointless (for climate science IMHO) if you don&#039;t have records about the accompanying daily sunshine duration and energy transport via wind, convection, and so on. Even than I have difficulties to fathom how one could create a representation of &quot;real climate&quot; from this data - but maybe I just lack imagination.]]></description>
		<content:encoded><![CDATA[<p>Off topic:</p>
<p>Working in the digital signal processing area, I think what is missing in handling climate data is a better understanding (and knowledge) of sampled data. After all what we have here is discrete data series sampled from a continuous signal. Besides obvious errors with the sampling process (UHI et al) I wonder if people working in climate science fundamentally understand what it means to &#8220;take a temperature sample&#8221;. Basic Nyquist and Shannon stuff.</p>
<p>Besides, I think taking a &#8220;true&#8221; daily-average &#8220;temperature sample&#8221; at a weather-station is quite pointless (for climate science IMHO) if you don&#8217;t have records about the accompanying daily sunshine duration and energy transport via wind, convection, and so on. Even than I have difficulties to fathom how one could create a representation of &#8220;real climate&#8221; from this data &#8211; but maybe I just lack imagination.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: genejockey</title>
		<link>http://climateaudit.org/2008/12/06/smoothing-dongge/#comment-169485</link>
		<dc:creator><![CDATA[genejockey]]></dc:creator>
		<pubDate>Tue, 09 Dec 2008 01:39:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=4539#comment-169485</guid>
		<description><![CDATA[Re: &lt;a href=&quot;#comment-314819&quot; rel=&quot;nofollow&quot;&gt;Steve McIntyre (#14)&lt;/a&gt;,

&lt;blockquote&gt;Given that PNAS knew that these calculations would be scrutinized, you&#039;d think that they&#039;d have made a better effort to have stuff that was a little less embarrassing.&lt;/blockquote&gt;

As you are probably aware, &lt;em&gt;PNAS&lt;/em&gt; has a mixed reputation because of the different tracks of peer review that can taken to have a paper published in the journal.  The &quot;Communicated by Lonnie G. Thompson&quot; on Mann (2008) indicates that the paper was submitted through Track I.  In this path to publication, Dr. Thompson served as editor for the article and obtained at least 2 reviews of the paper from individuals at other institutions, and not from those of any of the authors.  The peer review process is not conducted at the level of the PNAS editorial board, but at the level of the communicator.  In principle, those peers and their reviews should be anonymous, but in practice, they are often not.  Once the concerns of the reviewers are answered, the editor advises the editorial board of whether to publish it.  While the editorial board reserves the right to reject any manuscript, this rarely happens on Track I publications.  I have been witness to conversations between scientists that sounded like the following: &quot;We&#039;ll get Dr. X down the hall to communicate it, and he&#039;ll lean on 2 of his peers to bless it, and it&#039;ll get published.&quot;

My mentor, a NAS member themselves, refused to publish my work in PNAS because of the stigma attached when you communicate your own work.  I don&#039;t enjoy the fact that I am immediately more skeptical of manuscripts submitted through Track I, but I am.]]></description>
		<content:encoded><![CDATA[<p>Re: <a href="#comment-314819" rel="nofollow">Steve McIntyre (#14)</a>,</p>
<blockquote><p>Given that PNAS knew that these calculations would be scrutinized, you&#8217;d think that they&#8217;d have made a better effort to have stuff that was a little less embarrassing.</p></blockquote>
<p>As you are probably aware, <em>PNAS</em> has a mixed reputation because of the different tracks of peer review that can taken to have a paper published in the journal.  The &#8220;Communicated by Lonnie G. Thompson&#8221; on Mann (2008) indicates that the paper was submitted through Track I.  In this path to publication, Dr. Thompson served as editor for the article and obtained at least 2 reviews of the paper from individuals at other institutions, and not from those of any of the authors.  The peer review process is not conducted at the level of the PNAS editorial board, but at the level of the communicator.  In principle, those peers and their reviews should be anonymous, but in practice, they are often not.  Once the concerns of the reviewers are answered, the editor advises the editorial board of whether to publish it.  While the editorial board reserves the right to reject any manuscript, this rarely happens on Track I publications.  I have been witness to conversations between scientists that sounded like the following: &#8220;We&#8217;ll get Dr. X down the hall to communicate it, and he&#8217;ll lean on 2 of his peers to bless it, and it&#8217;ll get published.&#8221;</p>
<p>My mentor, a NAS member themselves, refused to publish my work in PNAS because of the stigma attached when you communicate your own work.  I don&#8217;t enjoy the fact that I am immediately more skeptical of manuscripts submitted through Track I, but I am.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve McIntyre</title>
		<link>http://climateaudit.org/2008/12/06/smoothing-dongge/#comment-169484</link>
		<dc:creator><![CDATA[Steve McIntyre]]></dc:creator>
		<pubDate>Sun, 07 Dec 2008 16:37:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=4539#comment-169484</guid>
		<description><![CDATA[#12. I think that the most sensible approach (as Hu concurs) is to do the analysis with binned data - perhaps varying the binning approach along the lines suggested in a comment by Hu. What is the correct number of df for a Mannian calculation, considering all the steps? Who knows. Remember you start off with irregular data spaced about 10 years apart on average. You then do a linear interpolation. You then apply a Butterworth filter with f=.05 to both the instrumental and proxy data and you get a set of residuals with autocorrelation off the charts.  The setup doesn&#039;t fit the assumptions to do a linear regression (which is equivalent here to a correlation).

You have no basis for asserting that the effective N is about 14 merely because &quot;Mann is using a 20-yr lowpass filter&quot;.  I don&#039;t know how many df are really in this setup; but there&#039;s more to this set up than the Butterworth.  You haven;&#039;t shown that Slutsky-Yule isn&#039;t present here, for example.

You can&#039;t ignore the binned results, which are arguably more faithful to the data as it exists. All in all, the Mannian calculation is seriously screwed up way of doing things. Given that PNAS knew that these calculations would be scrutinized, you&#039;d think that they&#039;d have made a better effort to have stuff that was a little less embarrassing.]]></description>
		<content:encoded><![CDATA[<p>#12. I think that the most sensible approach (as Hu concurs) is to do the analysis with binned data &#8211; perhaps varying the binning approach along the lines suggested in a comment by Hu. What is the correct number of df for a Mannian calculation, considering all the steps? Who knows. Remember you start off with irregular data spaced about 10 years apart on average. You then do a linear interpolation. You then apply a Butterworth filter with f=.05 to both the instrumental and proxy data and you get a set of residuals with autocorrelation off the charts.  The setup doesn&#8217;t fit the assumptions to do a linear regression (which is equivalent here to a correlation).</p>
<p>You have no basis for asserting that the effective N is about 14 merely because &#8220;Mann is using a 20-yr lowpass filter&#8221;.  I don&#8217;t know how many df are really in this setup; but there&#8217;s more to this set up than the Butterworth.  You haven;&#8217;t shown that Slutsky-Yule isn&#8217;t present here, for example.</p>
<p>You can&#8217;t ignore the binned results, which are arguably more faithful to the data as it exists. All in all, the Mannian calculation is seriously screwed up way of doing things. Given that PNAS knew that these calculations would be scrutinized, you&#8217;d think that they&#8217;d have made a better effort to have stuff that was a little less embarrassing.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hu McCulloch</title>
		<link>http://climateaudit.org/2008/12/06/smoothing-dongge/#comment-169483</link>
		<dc:creator><![CDATA[Hu McCulloch]]></dc:creator>
		<pubDate>Sun, 07 Dec 2008 16:07:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=4539#comment-169483</guid>
		<description><![CDATA[Jesper #12 writes,
&lt;blockquote&gt;
I don&#039;t think this is quite right. Mann is using a 20-yr lowpass filter which makes the effective sampling rate about once every 10 yrs, for N ~ 14. Applying the Quenouille formula on an annual autocorrelation basis in this case produces a meaningless number. It would be more suitable to calculate the autocorrelation at lag ~11. A correlation greater than 0.5 for N=14, given serial independence, does reach 95% significance.
&lt;/blockquote&gt;
But if you use the autocorrelation at lag 11, the &quot;N&quot; in the formula should be 14 or so, not 146, so that as long as there is still a fair amount of autocorrelation, the adjusted sample size will still be well under 14.

But the AR(1) model probably isn&#039;t very good for this complicated doubly smoothed data that may have been autocorrelated to start with.  I would therefore just take the Quenouille adjustment as an indication of cause for extreme concern, rather than as definitive.  I think Steve did right to just regress the raw Dongge data on binned temperature, though as I noted in #3 above, there might be other reasonable ways of binning.]]></description>
		<content:encoded><![CDATA[<p>Jesper #12 writes,</p>
<blockquote><p>
I don&#8217;t think this is quite right. Mann is using a 20-yr lowpass filter which makes the effective sampling rate about once every 10 yrs, for N ~ 14. Applying the Quenouille formula on an annual autocorrelation basis in this case produces a meaningless number. It would be more suitable to calculate the autocorrelation at lag ~11. A correlation greater than 0.5 for N=14, given serial independence, does reach 95% significance.
</p></blockquote>
<p>But if you use the autocorrelation at lag 11, the &#8220;N&#8221; in the formula should be 14 or so, not 146, so that as long as there is still a fair amount of autocorrelation, the adjusted sample size will still be well under 14.</p>
<p>But the AR(1) model probably isn&#8217;t very good for this complicated doubly smoothed data that may have been autocorrelated to start with.  I would therefore just take the Quenouille adjustment as an indication of cause for extreme concern, rather than as definitive.  I think Steve did right to just regress the raw Dongge data on binned temperature, though as I noted in #3 above, there might be other reasonable ways of binning.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jesper</title>
		<link>http://climateaudit.org/2008/12/06/smoothing-dongge/#comment-169482</link>
		<dc:creator><![CDATA[Jesper]]></dc:creator>
		<pubDate>Sun, 07 Dec 2008 15:34:24 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=4539#comment-169482</guid>
		<description><![CDATA[&lt;blockquote&gt;Y&#039;see, the number of years is 146 (1850-1995). The autocorrelation of the residuals in a linear regression is 0.9945544 and the resulting degrees of freedom using the Quenouille formula used in Santer et al 2008 (N(1-r)/(1+r) is only 0.399, something that must have worried Gavin Schmidt.&lt;/blockquote&gt;

I don&#039;t think this is quite right.  Mann is using a 20-yr lowpass filter which makes the effective sampling rate about once every 10 yrs, for N ~ 14.  Applying the Quenouille formula on an annual autocorrelation basis in this case produces a meaningless number.  It would be more suitable to calculate the autocorrelation at lag ~11.  A correlation greater than 0.5 for N=14, given serial independence, does reach 95% significance.]]></description>
		<content:encoded><![CDATA[<blockquote><p>Y&#8217;see, the number of years is 146 (1850-1995). The autocorrelation of the residuals in a linear regression is 0.9945544 and the resulting degrees of freedom using the Quenouille formula used in Santer et al 2008 (N(1-r)/(1+r) is only 0.399, something that must have worried Gavin Schmidt.</p></blockquote>
<p>I don&#8217;t think this is quite right.  Mann is using a 20-yr lowpass filter which makes the effective sampling rate about once every 10 yrs, for N ~ 14.  Applying the Quenouille formula on an annual autocorrelation basis in this case produces a meaningless number.  It would be more suitable to calculate the autocorrelation at lag ~11.  A correlation greater than 0.5 for N=14, given serial independence, does reach 95% significance.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Id</title>
		<link>http://climateaudit.org/2008/12/06/smoothing-dongge/#comment-169481</link>
		<dc:creator><![CDATA[Jeff Id]]></dc:creator>
		<pubDate>Sun, 07 Dec 2008 15:13:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=4539#comment-169481</guid>
		<description><![CDATA[I understand the argument for nonexistence of teleconnection. If you have a timeseries and you average a section, placing the result at the one side causes a phase change in the filtered signal.  Variable frequency data (red dots) results in a variable phase change which if there is a signal would make correlation worse.]]></description>
		<content:encoded><![CDATA[<p>I understand the argument for nonexistence of teleconnection. If you have a timeseries and you average a section, placing the result at the one side causes a phase change in the filtered signal.  Variable frequency data (red dots) results in a variable phase change which if there is a signal would make correlation worse.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Clark</title>
		<link>http://climateaudit.org/2008/12/06/smoothing-dongge/#comment-169480</link>
		<dc:creator><![CDATA[Clark]]></dc:creator>
		<pubDate>Sun, 07 Dec 2008 05:40:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=4539#comment-169480</guid>
		<description><![CDATA[&lt;blockquote&gt;Steve:  Why are you asking me what the rationale is? Ask Mann or Gavin Schmidt.&lt;/blockquote&gt;

A great idea, only Gavin would censor everything at RC, Hansen is busy on the Discovery channel claiming Armageddon from 2 C temp rise, and Mann is busy with stealth deletions and alterations on the Penn State web site.  In all seriousness, why isn&#039;t the paleoclimate crowd demanding these answers, if not in the review process, then in later commentary?  (I know, rhetorical and OT).]]></description>
		<content:encoded><![CDATA[<blockquote><p>Steve:  Why are you asking me what the rationale is? Ask Mann or Gavin Schmidt.</p></blockquote>
<p>A great idea, only Gavin would censor everything at RC, Hansen is busy on the Discovery channel claiming Armageddon from 2 C temp rise, and Mann is busy with stealth deletions and alterations on the Penn State web site.  In all seriousness, why isn&#8217;t the paleoclimate crowd demanding these answers, if not in the review process, then in later commentary?  (I know, rhetorical and OT).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jorge kafkazar</title>
		<link>http://climateaudit.org/2008/12/06/smoothing-dongge/#comment-169479</link>
		<dc:creator><![CDATA[jorge kafkazar]]></dc:creator>
		<pubDate>Sun, 07 Dec 2008 04:12:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=4539#comment-169479</guid>
		<description><![CDATA[Re: &lt;a href=&quot;#comment-314767&quot; rel=&quot;nofollow&quot;&gt;Jim Norvell (#7)&lt;/a&gt;,
Hmm. Seems to me as if much of accepted warming theory has a very low signal to nits ratio.

Regarding the Dongge O18 data, it&#039;s been a long time since my last statistics course, but the notion of linear interpolation between irregular-year data points raises a flag. This seems to add additional smoothing at the front end. Depending on the ratio of filled-in to raw-data years, this could be a &lt;strong&gt;very&lt;/strong&gt; heavy-handed smoothing. But maybe that&#039;s what you already said in another way. If so, sorry.]]></description>
		<content:encoded><![CDATA[<p>Re: <a href="#comment-314767" rel="nofollow">Jim Norvell (#7)</a>,<br />
Hmm. Seems to me as if much of accepted warming theory has a very low signal to nits ratio.</p>
<p>Regarding the Dongge O18 data, it&#8217;s been a long time since my last statistics course, but the notion of linear interpolation between irregular-year data points raises a flag. This seems to add additional smoothing at the front end. Depending on the ratio of filled-in to raw-data years, this could be a <strong>very</strong> heavy-handed smoothing. But maybe that&#8217;s what you already said in another way. If so, sorry.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: MarkB</title>
		<link>http://climateaudit.org/2008/12/06/smoothing-dongge/#comment-169478</link>
		<dc:creator><![CDATA[MarkB]]></dc:creator>
		<pubDate>Sun, 07 Dec 2008 02:32:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=4539#comment-169478</guid>
		<description><![CDATA[Re: &lt;a href=&quot;#comment-314767&quot; rel=&quot;nofollow&quot;&gt;Jim Norvell (#7)&lt;/a&gt;,

Two space shuttles burned up because engineers failed to &quot;pick nits.&quot;]]></description>
		<content:encoded><![CDATA[<p>Re: <a href="#comment-314767" rel="nofollow">Jim Norvell (#7)</a>,</p>
<p>Two space shuttles burned up because engineers failed to &#8220;pick nits.&#8221;</p>
]]></content:encoded>
	</item>
</channel>
</rss>
