<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Reading GISS Station Data</title>
	<atom:link href="http://climateaudit.org/2007/03/03/reading-giss-station-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://climateaudit.org/2007/03/03/reading-giss-station-data/</link>
	<description>by Steve McIntyre</description>
	<lastBuildDate>Fri, 24 May 2013 15:59:03 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: John Baltutis</title>
		<link>http://climateaudit.org/2007/03/03/reading-giss-station-data/#comment-80746</link>
		<dc:creator><![CDATA[John Baltutis]]></dc:creator>
		<pubDate>Thu, 17 May 2007 07:10:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=1217#comment-80746</guid>
		<description><![CDATA[Re: #36
Steve M:

You asked for e-mail addresses in response to this from the webmaster:
&lt;blockquote&gt;GISTEMP research group to explain your needs. E-mail addresses for the GISTEMP research group are located at the bottom of the page at &lt;a href=&quot;http://data.giss.nasa.gov/gistemp/&quot; rel=&quot;nofollow&quot;&gt;GISTEMP Research Group&lt;/a&gt;&lt;/blockquote&gt;
Not exactly true. There are links to their bio pages which do have their e-mail addressess. However, good luck with getting a response from them. I wish thee well.

&lt;b&gt;&lt;i&gt;Contacts
Please address scientific inquiries about the GISTEMP analysis to Dr. James Hansen. (jhansen@giss.nasa.gov)

Please address technical questions about these GISTEMP webpages to Dr. Reto Ruedy (rruedy@giss.nasa.gov)

Also participating in the GISTEMP analysis are Dr. Makiko Sato (makikosato@giss.nasa.gov) and Dr. Ken Lo (klo@giss.nasa.gov)&lt;/i&gt;&lt;/b&gt;]]></description>
		<content:encoded><![CDATA[<p>Re: #36<br />
Steve M:</p>
<p>You asked for e-mail addresses in response to this from the webmaster:</p>
<blockquote><p>GISTEMP research group to explain your needs. E-mail addresses for the GISTEMP research group are located at the bottom of the page at <a href="http://data.giss.nasa.gov/gistemp/" rel="nofollow">GISTEMP Research Group</a></p></blockquote>
<p>Not exactly true. There are links to their bio pages which do have their e-mail addressess. However, good luck with getting a response from them. I wish thee well.</p>
<p><b><i>Contacts<br />
Please address scientific inquiries about the GISTEMP analysis to Dr. James Hansen. (jhansen@giss.nasa.gov)</p>
<p>Please address technical questions about these GISTEMP webpages to Dr. Reto Ruedy (rruedy@giss.nasa.gov)</p>
<p>Also participating in the GISTEMP analysis are Dr. Makiko Sato (makikosato@giss.nasa.gov) and Dr. Ken Lo (klo@giss.nasa.gov)</i></b></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: fFreddy</title>
		<link>http://climateaudit.org/2007/03/03/reading-giss-station-data/#comment-80745</link>
		<dc:creator><![CDATA[fFreddy]]></dc:creator>
		<pubDate>Thu, 17 May 2007 07:05:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=1217#comment-80745</guid>
		<description><![CDATA[Twit. Not much help if he&#039;s blocked ...

Dr. James E. Hansen : jhansen@giss.nasa.gov
Dr. Reto A. Ruedy   : rruedy@giss.nasa.gov
Dr. Makiko Sato     : makikosato@giss.nasa.gov
Dr. Kwok-Wai Ken Lo : klo@giss.nasa.gov]]></description>
		<content:encoded><![CDATA[<p>Twit. Not much help if he&#8217;s blocked &#8230;</p>
<p>Dr. James E. Hansen : <a href="mailto:jhansen@giss.nasa.gov">jhansen@giss.nasa.gov</a><br />
Dr. Reto A. Ruedy   : <a href="mailto:rruedy@giss.nasa.gov">rruedy@giss.nasa.gov</a><br />
Dr. Makiko Sato     : <a href="mailto:makikosato@giss.nasa.gov">makikosato@giss.nasa.gov</a><br />
Dr. Kwok-Wai Ken Lo : <a href="mailto:klo@giss.nasa.gov">klo@giss.nasa.gov</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: fFreddy</title>
		<link>http://climateaudit.org/2007/03/03/reading-giss-station-data/#comment-80744</link>
		<dc:creator><![CDATA[fFreddy]]></dc:creator>
		<pubDate>Thu, 17 May 2007 06:55:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=1217#comment-80744</guid>
		<description><![CDATA[Re #39, Steve
Bottom of the page &lt;a href=&quot;http://data.giss.nasa.gov/gistemp/&quot; rel=&quot;nofollow&quot;&gt;http://data.giss.nasa.gov/gistemp/&lt;/a&gt; :

&lt;b&gt;Contacts&lt;/b&gt;
Please address scientific inquiries about the GISTEMP analysis to
&lt;a href=&quot;http://www.giss.nasa.gov/staff/jhansen.html&quot; rel=&quot;nofollow&quot;&gt;Dr. James Hansen&lt;/a&gt;.
Please address technical questions about these GISTEMP webpages to
&lt;a href=&quot;http://www.giss.nasa.gov/staff/rruedy.html&quot; rel=&quot;nofollow&quot;&gt;Dr. Reto Ruedy&lt;/a&gt;.
Also participating in the GISTEMP analysis are &lt;a href=&quot;http://www.giss.nasa.gov/staff/makiko_sato.html&quot; rel=&quot;nofollow&quot;&gt;Dr. Makiko Sato&lt;/a&gt; and &lt;a href=&quot;http://www.giss.nasa.gov/staff/klo.html&quot; rel=&quot;nofollow&quot;&gt;Dr. Ken Lo&lt;/a&gt;.]]></description>
		<content:encoded><![CDATA[<p>Re #39, Steve<br />
Bottom of the page <a href="http://data.giss.nasa.gov/gistemp/" rel="nofollow">http://data.giss.nasa.gov/gistemp/</a> :</p>
<p><b>Contacts</b><br />
Please address scientific inquiries about the GISTEMP analysis to<br />
<a href="http://www.giss.nasa.gov/staff/jhansen.html" rel="nofollow">Dr. James Hansen</a>.<br />
Please address technical questions about these GISTEMP webpages to<br />
<a href="http://www.giss.nasa.gov/staff/rruedy.html" rel="nofollow">Dr. Reto Ruedy</a>.<br />
Also participating in the GISTEMP analysis are <a href="http://www.giss.nasa.gov/staff/makiko_sato.html" rel="nofollow">Dr. Makiko Sato</a> and <a href="http://www.giss.nasa.gov/staff/klo.html" rel="nofollow">Dr. Ken Lo</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: rv</title>
		<link>http://climateaudit.org/2007/03/03/reading-giss-station-data/#comment-80743</link>
		<dc:creator><![CDATA[rv]]></dc:creator>
		<pubDate>Thu, 17 May 2007 04:30:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=1217#comment-80743</guid>
		<description><![CDATA[#39 I&#039;m getting a 403 Forbidden error when going to any page under http://data.giss.nasa.gov/ site from all networks that I have access to, so it looks like they have a server error right now, and it&#039;s not just us being blocked.

How would you like me to get you the data set?  Direct e-mail&#039;s out since it&#039;s too big, but perhaps we can switch to e-mail to coordinate an upload?

Let me know,

-rv]]></description>
		<content:encoded><![CDATA[<p>#39 I&#8217;m getting a 403 Forbidden error when going to any page under <a href="http://data.giss.nasa.gov/" rel="nofollow">http://data.giss.nasa.gov/</a> site from all networks that I have access to, so it looks like they have a server error right now, and it&#8217;s not just us being blocked.</p>
<p>How would you like me to get you the data set?  Direct e-mail&#8217;s out since it&#8217;s too big, but perhaps we can switch to e-mail to coordinate an upload?</p>
<p>Let me know,</p>
<p>-rv</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve McIntyre</title>
		<link>http://climateaudit.org/2007/03/03/reading-giss-station-data/#comment-80742</link>
		<dc:creator><![CDATA[Steve McIntyre]]></dc:creator>
		<pubDate>Thu, 17 May 2007 04:25:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=1217#comment-80742</guid>
		<description><![CDATA[#38 sounds good. CAn somebody post up the email addresses for the GISTEMP research group so that I can contact them directly as well.]]></description>
		<content:encoded><![CDATA[<p>#38 sounds good. CAn somebody post up the email addresses for the GISTEMP research group so that I can contact them directly as well.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: rv</title>
		<link>http://climateaudit.org/2007/03/03/reading-giss-station-data/#comment-80741</link>
		<dc:creator><![CDATA[rv]]></dc:creator>
		<pubDate>Thu, 17 May 2007 04:07:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=1217#comment-80741</guid>
		<description><![CDATA[#36 Steve,

On March 5, soon after you posted the original entry, I successfully downloaded all three datasets for all stations using a variation of the process you described.  I throttled my requests to a couple every few seconds so that I would not hammer the site (and hopefully wouldn&#039;t get disconnected) and ended up taking 36 hours to grab everything.  I figured it was only a matter of time before the multi-step process stopped working, or they started blocking people using your technique, given the obvious lengths that they went to to make downloading difficult.

I would be happy to send you the combined and compressed station data if you like.  As a tar.bz2 file it&#039;s a 26 MB archive containing ~27,000 separate files.  I originally intended to combine and process it into one file with a WMO number column, but the day job interfered.  If that would be helpful, I could try to get to it maybe this weekend.

-rv]]></description>
		<content:encoded><![CDATA[<p>#36 Steve,</p>
<p>On March 5, soon after you posted the original entry, I successfully downloaded all three datasets for all stations using a variation of the process you described.  I throttled my requests to a couple every few seconds so that I would not hammer the site (and hopefully wouldn&#8217;t get disconnected) and ended up taking 36 hours to grab everything.  I figured it was only a matter of time before the multi-step process stopped working, or they started blocking people using your technique, given the obvious lengths that they went to to make downloading difficult.</p>
<p>I would be happy to send you the combined and compressed station data if you like.  As a tar.bz2 file it&#8217;s a 26 MB archive containing ~27,000 separate files.  I originally intended to combine and process it into one file with a WMO number column, but the day job interfered.  If that would be helpful, I could try to get to it maybe this weekend.</p>
<p>-rv</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve McIntyre</title>
		<link>http://climateaudit.org/2007/03/03/reading-giss-station-data/#comment-80740</link>
		<dc:creator><![CDATA[Steve McIntyre]]></dc:creator>
		<pubDate>Thu, 17 May 2007 04:04:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=1217#comment-80740</guid>
		<description><![CDATA[Here&#039;s the program that I was running. It failed a couple of times when there was no information and had to be re-started by increasing the start point. I could use the try(...) function to work around this, but it seemed just as easy to occasionally restart it. It&#039;s a slow retrieval process. I&#039;m on a high-speed network and this has taken about 8 hours. The grandkids were over and it was just running in the background.

&lt;blockquote&gt;    source(“http://data.climateaudit.org/scripts/gridcell/read.giss.station.txt”)
    idgiss=scan(&quot;http://data.climateaudit.org/data/giss/idgiss.dat&quot;);N=length(idgiss)
    giss=rep(list(NA),N) #
            ##I got stopped at 1615
     i=1614

     K=i+1
     for (i in K:N) {
          station.giss&lt; -download_giss_data(idgiss[i])
          giss[[i]]=station.giss
          names(giss)[i]=id[i]
          }
&lt;/blockquote&gt;&lt;/blockquote&gt;]]></description>
		<content:encoded><![CDATA[<p>Here&#8217;s the program that I was running. It failed a couple of times when there was no information and had to be re-started by increasing the start point. I could use the try(&#8230;) function to work around this, but it seemed just as easy to occasionally restart it. It&#8217;s a slow retrieval process. I&#8217;m on a high-speed network and this has taken about 8 hours. The grandkids were over and it was just running in the background.</p>
<blockquote><p>    source(“http://data.climateaudit.org/scripts/gridcell/read.giss.station.txt”)<br />
    idgiss=scan(&#8220;http://data.climateaudit.org/data/giss/idgiss.dat&#8221;);N=length(idgiss)<br />
    giss=rep(list(NA),N) #<br />
            ##I got stopped at 1615<br />
     i=1614</p>
<p>     K=i+1<br />
     for (i in K:N) {<br />
          station.giss&lt; -download_giss_data(idgiss[i])<br />
          giss[[i]]=station.giss<br />
          names(giss)[i]=id[i]<br />
          }
</p></blockquote>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve McIntyre</title>
		<link>http://climateaudit.org/2007/03/03/reading-giss-station-data/#comment-80739</link>
		<dc:creator><![CDATA[Steve McIntyre]]></dc:creator>
		<pubDate>Thu, 17 May 2007 03:48:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=1217#comment-80739</guid>
		<description><![CDATA[I sent an inquiry to the webmaster. I was indeed blocked. The webmaster sent me a reply in amazingly quick time:

&lt;blockquote&gt;
Steve,

Although you did not provide any further details about
your problem, I will assume that you are the person on
the ***.com network who has been running a
robot for the past several hours trying to scrape
GISTEMP station data and who has made over 16000 (!)
requests to the data.giss.nasa.gov website.

Please note that the robots.txt file on that website
includes a list of directories which any legitimate
web robot is _forbidden_ from trying to index. That
list of off-limits directories includes the /work/
and /cgi-bin/ directories.

Because the robot running on the ***.com network
has rather obviously and blatantly violated those rules,
I placed a block on our server restricting its access
to the server.

If you are indeed the person who has been running that particular web robot, and if you do need access to some large amount of the GISTEMP station data for a scientific purpose, then you should contact the GISTEMP research group to explain your needs. E-mail addresses for the GISTEMP research group are located at the bottom of the page at http://data.giss.nasa.gov/gistemp/

rbs&lt;/blockquote&gt;


I was not running a robot but an R-project. I&#039;m blocked from the webpage in question so maybe someone can send me the email address.

Nicholas, I&#039;ll email you with what I was running.]]></description>
		<content:encoded><![CDATA[<p>I sent an inquiry to the webmaster. I was indeed blocked. The webmaster sent me a reply in amazingly quick time:</p>
<blockquote><p>
Steve,</p>
<p>Although you did not provide any further details about<br />
your problem, I will assume that you are the person on<br />
the ***.com network who has been running a<br />
robot for the past several hours trying to scrape<br />
GISTEMP station data and who has made over 16000 (!)<br />
requests to the data.giss.nasa.gov website.</p>
<p>Please note that the robots.txt file on that website<br />
includes a list of directories which any legitimate<br />
web robot is _forbidden_ from trying to index. That<br />
list of off-limits directories includes the /work/<br />
and /cgi-bin/ directories.</p>
<p>Because the robot running on the ***.com network<br />
has rather obviously and blatantly violated those rules,<br />
I placed a block on our server restricting its access<br />
to the server.</p>
<p>If you are indeed the person who has been running that particular web robot, and if you do need access to some large amount of the GISTEMP station data for a scientific purpose, then you should contact the GISTEMP research group to explain your needs. E-mail addresses for the GISTEMP research group are located at the bottom of the page at <a href="http://data.giss.nasa.gov/gistemp/" rel="nofollow">http://data.giss.nasa.gov/gistemp/</a></p>
<p>rbs</p></blockquote>
<p>I was not running a robot but an R-project. I&#8217;m blocked from the webpage in question so maybe someone can send me the email address.</p>
<p>Nicholas, I&#8217;ll email you with what I was running.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nicholas</title>
		<link>http://climateaudit.org/2007/03/03/reading-giss-station-data/#comment-80738</link>
		<dc:creator><![CDATA[Nicholas]]></dc:creator>
		<pubDate>Thu, 17 May 2007 03:36:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=1217#comment-80738</guid>
		<description><![CDATA[Do you need someone else to run a script for you? Perhaps if you break the job up into a few chunks, you could distribute those chunks to us, we could run them, and then you could collate the results into one big set.

We might end up getting blocked too, eventually, but hopefully after we&#039;re each finished 1/8th or 1/4 or some fraction of the sites.]]></description>
		<content:encoded><![CDATA[<p>Do you need someone else to run a script for you? Perhaps if you break the job up into a few chunks, you could distribute those chunks to us, we could run them, and then you could collate the results into one big set.</p>
<p>We might end up getting blocked too, eventually, but hopefully after we&#8217;re each finished 1/8th or 1/4 or some fraction of the sites.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve McIntyre</title>
		<link>http://climateaudit.org/2007/03/03/reading-giss-station-data/#comment-80737</link>
		<dc:creator><![CDATA[Steve McIntyre]]></dc:creator>
		<pubDate>Thu, 17 May 2007 03:09:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.climateaudit.org/?p=1217#comment-80737</guid>
		<description><![CDATA[I started downloading and collating all the GISS station data. I got about half-way through and now my access to the GISS site is blocked. It took about 8 hours so far and I&#039;m about halfway through. It&#039;s very slow because there is no organized data set that can be downloaded. They may have assumed it was a robot, but I&#039;ve been blocked by the Team before (MAnn blocked me from his ftp site at UVA and Rutherford blocked from his SI).]]></description>
		<content:encoded><![CDATA[<p>I started downloading and collating all the GISS station data. I got about half-way through and now my access to the GISS site is blocked. It took about 8 hours so far and I&#8217;m about halfway through. It&#8217;s very slow because there is no organized data set that can be downloaded. They may have assumed it was a robot, but I&#8217;ve been blocked by the Team before (MAnn blocked me from his ftp site at UVA and Rutherford blocked from his SI).</p>
]]></content:encoded>
	</item>
</channel>
</rss>
