<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: The Legality of Web Crawling</title>
	<atom:link href="http://anonymousprof.com/the-legality-of-web-crawling/feed/" rel="self" type="application/rss+xml" />
	<link>http://anonymousprof.com/the-legality-of-web-crawling/</link>
	<description>Ramblings and Ravings of an Academic</description>
	<pubDate>Wed, 07 Jan 2009 12:59:05 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
		<item>
		<title>By: Aaron Schiff</title>
		<link>http://anonymousprof.com/the-legality-of-web-crawling/#comment-183</link>
		<dc:creator>Aaron Schiff</dc:creator>
		<pubDate>Wed, 26 Mar 2008 00:00:33 +0000</pubDate>
		<guid isPermaLink="false">http://anonymousprof.com/the-legality-of-web-crawling/#comment-183</guid>
		<description>The bit about the basic data not being copyrighted is interesting. I know that copyright only protects the form of expression and not the content itself. But I'm wondering where the limits are. What if someone has a table of data and you reformat it a bit, is that ok? Or what if you turn the table into a graph?

In terms of the TOS, I guess a lawyer could argue that way. On the other hand the risk is that the website doesn't see things the same way, and wants to go to court to find out the answer, which could be expensive. 

As far as ethics go, if they really didn't want their site crawled, they should have put something in robots.txt. If not then Google's already got all their data and done any number of things to it. And Google does no evil, right? :)</description>
		<content:encoded><![CDATA[<p>The bit about the basic data not being copyrighted is interesting. I know that copyright only protects the form of expression and not the content itself. But I&#8217;m wondering where the limits are. What if someone has a table of data and you reformat it a bit, is that ok? Or what if you turn the table into a graph?</p>
<p>In terms of the TOS, I guess a lawyer could argue that way. On the other hand the risk is that the website doesn&#8217;t see things the same way, and wants to go to court to find out the answer, which could be expensive. </p>
<p>As far as ethics go, if they really didn&#8217;t want their site crawled, they should have put something in robots.txt. If not then Google&#8217;s already got all their data and done any number of things to it. And Google does no evil, right? <img src='http://anonymousprof.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /></p>
]]></content:encoded>
	</item>
</channel>
</rss>
