<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Processing Unstructured Text with OpenCalais</title>
	<atom:link href="http://digiorgio.com/blog/?feed=rss2&#038;p=181" rel="self" type="application/rss+xml" />
	<link>http://digiorgio.com/blog/?p=181</link>
	<description>Rinaldo Di Giorgio's blog</description>
	<lastBuildDate>Thu, 26 Aug 2010 13:22:13 -0400</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.5</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Klezio</title>
		<link>http://digiorgio.com/blog/?p=181&#038;cpage=1#comment-34</link>
		<dc:creator>Klezio</dc:creator>
		<pubDate>Mon, 02 Mar 2009 11:41:11 +0000</pubDate>
		<guid isPermaLink="false">http://digiorgio.com/blog/?p=181#comment-34</guid>
		<description>OpenCalais is great; I use it in a new website I created recently: http://www.klezio.com
News items are automatically classified and their metadata extracted; contextual information is fetched from services such as Wikipedia, Flickr, Twitter, or Delicious.
Hope it&#039;ll be useful.
Regards,</description>
		<content:encoded><![CDATA[<p>OpenCalais is great; I use it in a new website I created recently: <a href="http://www.klezio.com" rel="nofollow">http://www.klezio.com</a><br />
News items are automatically classified and their metadata extracted; contextual information is fetched from services such as Wikipedia, Flickr, Twitter, or Delicious.<br />
Hope it&#8217;ll be useful.<br />
Regards,</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bostjan Spetic</title>
		<link>http://digiorgio.com/blog/?p=181&#038;cpage=1#comment-20</link>
		<dc:creator>Bostjan Spetic</dc:creator>
		<pubDate>Thu, 08 Jan 2009 15:07:46 +0000</pubDate>
		<guid isPermaLink="false">http://digiorgio.com/blog/?p=181#comment-20</guid>
		<description>Hi, it would be interesting to hear your thoughts on the Zemanta API as well, because it is more focused on analyzing user-generated content...</description>
		<content:encoded><![CDATA[<p>Hi, it would be interesting to hear your thoughts on the Zemanta API as well, because it is more focused on analyzing user-generated content&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rinaldo Di Giorgio</title>
		<link>http://digiorgio.com/blog/?p=181&#038;cpage=1#comment-18</link>
		<dc:creator>Rinaldo Di Giorgio</dc:creator>
		<pubDate>Wed, 07 Jan 2009 00:12:06 +0000</pubDate>
		<guid isPermaLink="false">http://digiorgio.com/blog/?p=181#comment-18</guid>
		<description>I have tried it and I will try it again. I have a few thousand pages, and semanticproxy failed to parse quite a few of them at all. I will send you the results privately if you want them. I am also doing cleansing, so it will be informative to compare the results; perhaps we can learn something from the differences, if any.</description>
		<content:encoded><![CDATA[<p>I have tried it and I will try it again. I have a few thousand pages, and semanticproxy failed to parse quite a few of them at all. I will send you the results privately if you want them. I am also doing cleansing, so it will be informative to compare the results; perhaps we can learn something from the differences, if any.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom Tague</title>
		<link>http://digiorgio.com/blog/?p=181&#038;cpage=1#comment-17</link>
		<dc:creator>Tom Tague</dc:creator>
		<pubDate>Tue, 06 Jan 2009 18:22:58 +0000</pubDate>
		<guid isPermaLink="false">http://digiorgio.com/blog/?p=181#comment-17</guid>
		<description>Rinaldo:

Tom Tague from Calais here. 

Just a quick suggestion. If you&#039;re trying to work directly with formatted HTML pages and Calais, you might want to take a look at semanticproxy.com. This tool fetches the page for you and attempts basic HTML cleansing before handing it to Calais for processing.

As I&#039;m sure you&#039;re aware, cleansing is hard, but it&#039;s doing fairly well and we&#039;ll continue to improve it over time.

Regards,</description>
		<content:encoded><![CDATA[<p>Rinaldo:</p>
<p>Tom Tague from Calais here. </p>
<p>Just a quick suggestion. If you&#8217;re trying to work directly with formatted HTML pages and Calais, you might want to take a look at <a href="http://semanticproxy.com" title="http://semanticproxy.com" class="autohyperlink" target="_blank">semanticproxy.com</a>. This tool fetches the page for you and attempts basic HTML cleansing before handing it to Calais for processing.</p>
<p>As I&#8217;m sure you&#8217;re aware, cleansing is hard, but it&#8217;s doing fairly well and we&#8217;ll continue to improve it over time.</p>
<p>Regards,</p>
]]></content:encoded>
	</item>
</channel>
</rss>
