Comments on: Ross McKitrick on Mann et al 2007

By: Robert Wood

Robert Wood — Fri, 23 Nov 2007 00:26:09 +0000

Interesting comments about filling in missing gaps of time series. As a hardware EE, I have found the best implementation, from both a cost and effect perspective, is just to use random noise or repeat the previous data packet(s). These had the least effect upon the spectral characteristics

By: Peter D. Tillman

Peter D. Tillman — Thu, 22 Nov 2007 20:20:41 +0000

Ross, I hope you and Steve write up your criticisms of Mann et al (2007) and submit them to JGR. I think your findings, and Steve’s remarks on their idiotic (and repeated) scrambling of their data set, should be put into the formal literature.

[shakes head in disbelief]
Peter D. Tillman
Consulting Geologist, Arizona and New Mexico (USA)

By: Jean S

Jean S — Thu, 22 Nov 2007 19:16:04 +0000

if we werent doing what we are doing, we be doing something else, which might be optimal if what we were doing happened to fit the optimality conditions.

… which is nice since this purely hypothetical advantage is not offered by other current CFR methods (and also we might bull a reviewer or two with these fancy sounding sentences) 🙂

By: DAV

DAV — Thu, 22 Nov 2007 18:55:29 +0000

tree growth doesnt drive the climate

There could be feedback but, yes, it’s unlikely that tree ring data would represent a cause in Pearl’s sense.

thus was born regEM. Chances are (just a guess on my part) people developing computer algorithms to fill in random holes in data matrices werent thinking about tree rings and climate when they developed the recursive data algorithm

One of the pitfalls with the EM algorithm. Its results are best used when the data omissions are not meaningful ,i.e., caused by random events such as coding error vs. say data collection ceasing upon subject’s death. A second pitfall is failing to realize that it only produces expected results by filling in the most likely value. This is useful say when training a Bayes Net but I agree it’s of questionable value for learning something new. One exception though, might be learning the values of a hidden variable.

Speaking of omission, I admit I haven’t read the paper yet. I’ll have to correct that soon. It’s a bit hard to imagine a cause of meaningful omissions in tree ring data.

regularization process introduces a bias in the estimated missing values

Yeah. That’s bizarre. If anything, the missing value is replaced with an estimate biased by the other data.

By: Ross McKitrick

Ross McKitrick — Thu, 22 Nov 2007 18:42:11 +0000

I think what they’re calling “regularization”is usually referred to as ridge regression (though maybe there’s a difference I didn’t pick up on). Ridge regression introduces a bias in the slope estimator as a tradeoff for a reduction in the trace of the variance matrix, which makes the standard errors smaller. But you have to make a case why the tradeoff is valid since the ridge parameter can be arbitrary. As far as I know it’s usually associated with collinearity problems, and if you use it you’re expected to show that the size of the introduced bias is small.

The claim that RegEM’s properties are “demonstrably optimal in the limit of no regularization” amounts to saying that ridge regression has the advantage that if you don’t do ridge regression it reduces to OLS, and OLS is optimal, in those cases where OLS is the optimal estimator. In other words, if we weren’t doing what we are doing, we be doing something else, which might be optimal if what we were doing happened to fit the optimality conditions.

By: Jean S

Jean S — Thu, 22 Nov 2007 18:12:10 +0000

would raise alarm bells in econometrics

My alarm bells went wild after reading this (my bold):

As explained by Schneider [2001], under normality assumptions, the conventional EM algorithm without regularization converges to the maximum likelihood estimates of the mean values, covariance matrices and missing values, which thus enjoy the optimality properties common to maximum likelihood estimates [Little and Rubin, 1987]. In the limit of no regularization, as Schneider [2001] further explains, the RegEM algorithm reduces to the conventional EM algorithm and thus enjoys the same optimality properties. While the regularization process introduces a bias in the estimated missing values as the price for a reduced variance (the bias/variance trade-off common to all regularized regression approaches), it is advisable in the potentially ill-posed problems common to CFR. Unlike other current CFR methods, RegEM offers the theoretical advantage that its properties are demonstrably optimal in the limit of no regularization.