<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments for Topify!</title>
	<atom:link href="http://topify.wordpress.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://topify.wordpress.com</link>
	<description>Giving natural topics a second chance</description>
	<lastBuildDate>Fri, 06 Mar 2009 21:31:06 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>Comment on Blog search broken? DEAD?! Let&#8217;s just call it &#8220;hybernating&#8221;. by Marty</title>
		<link>http://topify.wordpress.com/2009/03/04/blog-search-broken-or-even-fully-dead/#comment-51</link>
		<dc:creator>Marty</dc:creator>
		<pubDate>Fri, 06 Mar 2009 21:31:06 +0000</pubDate>
		<guid isPermaLink="false">http://topify.wordpress.com/?p=43#comment-51</guid>
		<description>Jonathan, Scott, 

Thank you very much for your comments. I must admit I probably made it all a little confusing by condemning user-generated solutions first and then saying they are the cure :) What I meant was that when looking for relevant stories, one shouldn&#039;t rely on user votes/tags/link etc, because you will get gamed or at least very biased results.

On the other hand, when all you need to do is filter out a source that is pure spam or not a blog, that&#039;s where the amazing classifier which is the human brain comes up trumps. So if you only let the users report bad sources and do semantic search on the stories from the acceptable sources, that&#039;s already a big step towards having a broad, yet good range of sources and stories.

Hope this clears our idea up a little, if you have any further questions or thoughts, let me know :)

Cheers,
Martin</description>
		<content:encoded><![CDATA[<p>Jonathan, Scott, </p>
<p>Thank you very much for your comments. I must admit I probably made it all a little confusing by condemning user-generated solutions first and then saying they are the cure :) What I meant was that when looking for relevant stories, one shouldn&#8217;t rely on user votes/tags/link etc, because you will get gamed or at least very biased results.</p>
<p>On the other hand, when all you need to do is filter out a source that is pure spam or not a blog, that&#8217;s where the amazing classifier which is the human brain comes up trumps. So if you only let the users report bad sources and do semantic search on the stories from the acceptable sources, that&#8217;s already a big step towards having a broad, yet good range of sources and stories.</p>
<p>Hope this clears our idea up a little, if you have any further questions or thoughts, let me know :)</p>
<p>Cheers,<br />
Martin</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Blog search broken? DEAD?! Let&#8217;s just call it &#8220;hybernating&#8221;. by Scott Lockhart</title>
		<link>http://topify.wordpress.com/2009/03/04/blog-search-broken-or-even-fully-dead/#comment-50</link>
		<dc:creator>Scott Lockhart</dc:creator>
		<pubDate>Wed, 04 Mar 2009 22:15:44 +0000</pubDate>
		<guid isPermaLink="false">http://topify.wordpress.com/?p=43#comment-50</guid>
		<description>I feel your pain. I&#039;m one of the co-founders of Regator, our site was mentioned in Jonathan&#039;s post over on the Blog Herald. I think your idea of a crowd-sourced Wikipedia style isn&#039;t a bad one. The angle that we&#039;re taking at Regator is to ensure quality content. So we have real people as well curate the site, we&#039;re just really selective and want a lot of the control of the quality. Once control that is dispersed into the crowd, you can get quality hiccups like on wikipedia. Idefinitely has it good side and bad side. That is not to say that we will not someday have a more social component to blog selection on our site, but we feel that through some of the things we&#039;re doing by hand and also some of the semantic stuff we&#039;v got on the backend and we&#039;re working on, we will continue to provide a solid resource for people looking for good blog content. We&#039;ll never have everything under the sun and that&#039;s okay by us. Just as long as we have relevant stuff that people find useful and interesting, without all the crap.

Twingly does well for search but if you take a look at our site, blog search is really just one part of it, it&#039;s more Blog Browsing (with bonus search!). 
Anyways, good followup post and if you have any feedback please let me know, we love to get it. You can email or call me at 404-493-5121 to discuss. Cheers!</description>
		<content:encoded><![CDATA[<p>I feel your pain. I&#8217;m one of the co-founders of Regator, our site was mentioned in Jonathan&#8217;s post over on the Blog Herald. I think your idea of a crowd-sourced Wikipedia style isn&#8217;t a bad one. The angle that we&#8217;re taking at Regator is to ensure quality content. So we have real people as well curate the site, we&#8217;re just really selective and want a lot of the control of the quality. Once control that is dispersed into the crowd, you can get quality hiccups like on wikipedia. Idefinitely has it good side and bad side. That is not to say that we will not someday have a more social component to blog selection on our site, but we feel that through some of the things we&#8217;re doing by hand and also some of the semantic stuff we&#8217;v got on the backend and we&#8217;re working on, we will continue to provide a solid resource for people looking for good blog content. We&#8217;ll never have everything under the sun and that&#8217;s okay by us. Just as long as we have relevant stuff that people find useful and interesting, without all the crap.</p>
<p>Twingly does well for search but if you take a look at our site, blog search is really just one part of it, it&#8217;s more Blog Browsing (with bonus search!).<br />
Anyways, good followup post and if you have any feedback please let me know, we love to get it. You can email or call me at 404-493-5121 to discuss. Cheers!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Blog search broken? DEAD?! Let&#8217;s just call it &#8220;hybernating&#8221;. by Jonathan Bailey</title>
		<link>http://topify.wordpress.com/2009/03/04/blog-search-broken-or-even-fully-dead/#comment-49</link>
		<dc:creator>Jonathan Bailey</dc:creator>
		<pubDate>Wed, 04 Mar 2009 20:38:45 +0000</pubDate>
		<guid isPermaLink="false">http://topify.wordpress.com/?p=43#comment-49</guid>
		<description>First off, I am definitely interested in checking out Topify when the time comes so I am very happy to be on that list of people. Definitely include me if you can.

As I said on my post on the Blog Herald, I think the semantic Web has potential to really improve things in this area, but I am not certain it will really fix the problem as it doesn&#039;t address a lot of the issues. 

One thing I do find kind of odd is that you hail a user-generated solution to the problem, namely Wikipedia-style voting and recommendations, while condemning other user-generated solutions such as inbound links, tags and voting as non-answers. Perhaps I&#039;m misreading those two paragraphs, I get the feeling I&#039;m missing something, but I don&#039;t see how one user-generated solution can be a fix for a mess created by three others. 

We will see if the semantic Web can help. I&#039;m going to remain a skeptic until I see the progress. Right now though, if blog search isn&#039;t dead, it&#039;s on life support. Doesn&#039;t mean it won&#039;t come back. If I&#039;ve learned one thing from my love affair with horror movies, it&#039;s the being dead really doesn&#039;t mean all that much.</description>
		<content:encoded><![CDATA[<p>First off, I am definitely interested in checking out Topify when the time comes so I am very happy to be on that list of people. Definitely include me if you can.</p>
<p>As I said on my post on the Blog Herald, I think the semantic Web has potential to really improve things in this area, but I am not certain it will really fix the problem as it doesn&#8217;t address a lot of the issues. </p>
<p>One thing I do find kind of odd is that you hail a user-generated solution to the problem, namely Wikipedia-style voting and recommendations, while condemning other user-generated solutions such as inbound links, tags and voting as non-answers. Perhaps I&#8217;m misreading those two paragraphs, I get the feeling I&#8217;m missing something, but I don&#8217;t see how one user-generated solution can be a fix for a mess created by three others. </p>
<p>We will see if the semantic Web can help. I&#8217;m going to remain a skeptic until I see the progress. Right now though, if blog search isn&#8217;t dead, it&#8217;s on life support. Doesn&#8217;t mean it won&#8217;t come back. If I&#8217;ve learned one thing from my love affair with horror movies, it&#8217;s the being dead really doesn&#8217;t mean all that much.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Topics on a rise? Launch of FeedVis certainly suggests so. by Marty</title>
		<link>http://topify.wordpress.com/2009/01/20/topics-on-a-rise-launch-of-feedvis-certainly-suggests-so/#comment-44</link>
		<dc:creator>Marty</dc:creator>
		<pubDate>Wed, 21 Jan 2009 18:06:06 +0000</pubDate>
		<guid isPermaLink="false">http://topify.wordpress.com/?p=32#comment-44</guid>
		<description>Hey Jason, thanks for stepping by. Yeah, I noticed a bit later that FeedVis is an open source project, but anyway it&#039;s really great some people think in this direction.

As for the &quot;detection of topics&quot;, word-count/total posts should be actually equal (in distribution) to word-count/total words, since total-words = total-posts*average-words-per-post, or, to put it bluntly, both are constant for a given corpus. What I was referring to is a measure like word-count/posts-where-word-occurred-count. This way if you have say 100 posts and both &quot;learning&quot; and &quot;classroom&quot; have a total count of 50, your current approach would rank them the same. But imagine if &quot;learning&quot; appears in 50 posts (i.e. always once in a post, never more) and &quot;classroom&quot; only in 10 posts (so average of 5 per post). This would hint to the conclusion that &quot;classroom&quot; is an important topic within the posts it comes up in, while &quot;learning&quot; is just a &#039;background-term&#039; that&#039;s parasitic to the field.

This is obviously a very simple idea, one can take this further to consider average counts per post where a word occurs, spread over time chunks etc. Hope you can catch my drift.

And yea, let&#039;s hope people recognize the power of AI and NLP, as we need more ideas here. There are some pretty powerful statistical basics in the field, what we need now are some inventive applications to push it further.</description>
		<content:encoded><![CDATA[<p>Hey Jason, thanks for stepping by. Yeah, I noticed a bit later that FeedVis is an open source project, but anyway it&#8217;s really great some people think in this direction.</p>
<p>As for the &#8220;detection of topics&#8221;, word-count/total posts should be actually equal (in distribution) to word-count/total words, since total-words = total-posts*average-words-per-post, or, to put it bluntly, both are constant for a given corpus. What I was referring to is a measure like word-count/posts-where-word-occurred-count. This way if you have say 100 posts and both &#8220;learning&#8221; and &#8220;classroom&#8221; have a total count of 50, your current approach would rank them the same. But imagine if &#8220;learning&#8221; appears in 50 posts (i.e. always once in a post, never more) and &#8220;classroom&#8221; only in 10 posts (so average of 5 per post). This would hint to the conclusion that &#8220;classroom&#8221; is an important topic within the posts it comes up in, while &#8220;learning&#8221; is just a &#8216;background-term&#8217; that&#8217;s parasitic to the field.</p>
<p>This is obviously a very simple idea, one can take this further to consider average counts per post where a word occurs, spread over time chunks etc. Hope you can catch my drift.</p>
<p>And yea, let&#8217;s hope people recognize the power of AI and NLP, as we need more ideas here. There are some pretty powerful statistical basics in the field, what we need now are some inventive applications to push it further.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Topics on a rise? Launch of FeedVis certainly suggests so. by Jason Priem</title>
		<link>http://topify.wordpress.com/2009/01/20/topics-on-a-rise-launch-of-feedvis-certainly-suggests-so/#comment-43</link>
		<dc:creator>Jason Priem</dc:creator>
		<pubDate>Tue, 20 Jan 2009 22:17:21 +0000</pubDate>
		<guid isPermaLink="false">http://topify.wordpress.com/?p=32#comment-43</guid>
		<description>Hey, it&#039;s Jason, the guy who built FeedVis.  It&#039;s cool that you&#039;re working on something similar.  To be honest, FeedVis isn&#039;t really a competitor to anything; it&#039;s just something banged out in my spare time for fun.  It&#039;s all open source, so you&#039;re welcome to take all or any of the code you like--so no &quot;head start&quot; :)  

You wrote one thing I didn&#039;t quite understand, about detecting &quot;topics.&quot;  FeedVis measures overall frequency, as in word-uses/total words for a particular word.  I originally was doing word-uses/total posts, which I think is something like what you were saying.  I was very surprised to see that these measures actually came out very similar.  It would be interesting to look more into this.

I think we&#039;re agreed that however it comes about, this kind of automatic metadata generation is the future: tags are nice, but no one is going to sit around tagging the entire internet.  That&#039;s what we&#039;ve got computers for.</description>
		<content:encoded><![CDATA[<p>Hey, it&#8217;s Jason, the guy who built FeedVis.  It&#8217;s cool that you&#8217;re working on something similar.  To be honest, FeedVis isn&#8217;t really a competitor to anything; it&#8217;s just something banged out in my spare time for fun.  It&#8217;s all open source, so you&#8217;re welcome to take all or any of the code you like&#8211;so no &#8220;head start&#8221; :)  </p>
<p>You wrote one thing I didn&#8217;t quite understand, about detecting &#8220;topics.&#8221;  FeedVis measures overall frequency, as in word-uses/total words for a particular word.  I originally was doing word-uses/total posts, which I think is something like what you were saying.  I was very surprised to see that these measures actually came out very similar.  It would be interesting to look more into this.</p>
<p>I think we&#8217;re agreed that however it comes about, this kind of automatic metadata generation is the future: tags are nice, but no one is going to sit around tagging the entire internet.  That&#8217;s what we&#8217;ve got computers for.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on What&#8217;s the matter with tag clouds? by robcottingham</title>
		<link>http://topify.wordpress.com/2008/08/05/whats-the-matter-with-tag-clouds/#comment-41</link>
		<dc:creator>robcottingham</dc:creator>
		<pubDate>Thu, 18 Sep 2008 07:59:29 +0000</pubDate>
		<guid isPermaLink="false">http://topify.wordpress.com/?p=14#comment-41</guid>
		<description>I pretty much agree with everything you say here. What&#039;s dead isn&#039;t the tag cloud as such - it&#039;s the tag cloud as nifty kewl gimmick tacked onto a web site.

Thanks for linking to the cartoon - I&#039;m glad it&#039;s sparking some discussion!</description>
		<content:encoded><![CDATA[<p>I pretty much agree with everything you say here. What&#8217;s dead isn&#8217;t the tag cloud as such &#8211; it&#8217;s the tag cloud as nifty kewl gimmick tacked onto a web site.</p>
<p>Thanks for linking to the cartoon &#8211; I&#8217;m glad it&#8217;s sparking some discussion!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Content is King. Or should I say Tyrant? by More on &#34;Too Many Choices, Too Much Content&#34; &#124; sarahintampa</title>
		<link>http://topify.wordpress.com/2008/05/14/content-is-king-or-should-i-say-tyrant/#comment-9</link>
		<dc:creator>More on &#34;Too Many Choices, Too Much Content&#34; &#124; sarahintampa</dc:creator>
		<pubDate>Wed, 14 May 2008 19:18:36 +0000</pubDate>
		<guid isPermaLink="false">http://topify.wordpress.com/?p=9#comment-9</guid>
		<description>[...] Content is King. Or should I say Tyrant? [...]</description>
		<content:encoded><![CDATA[<p>[...] Content is King. Or should I say Tyrant? [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Content is King. Or should I say Tyrant? by Sarah Perez</title>
		<link>http://topify.wordpress.com/2008/05/14/content-is-king-or-should-i-say-tyrant/#comment-8</link>
		<dc:creator>Sarah Perez</dc:creator>
		<pubDate>Wed, 14 May 2008 16:16:56 +0000</pubDate>
		<guid isPermaLink="false">http://topify.wordpress.com/?p=9#comment-8</guid>
		<description>Even filtering the duplicates would be a good start - I agree. Thanks for the link! :)</description>
		<content:encoded><![CDATA[<p>Even filtering the duplicates would be a good start &#8211; I agree. Thanks for the link! :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Tags and Topics &#8211; Yin and Yang or a Dysfunctional Marriage? by Marty</title>
		<link>http://topify.wordpress.com/2008/05/07/tags-and-topics-yin-and-yang-or-a-dysfunctional-marriage/#comment-5</link>
		<dc:creator>Marty</dc:creator>
		<pubDate>Sat, 10 May 2008 09:48:03 +0000</pubDate>
		<guid isPermaLink="false">http://topify.wordpress.com/?p=7#comment-5</guid>
		<description>Hi and thanks for your comment. :) I don&#039;t think I proposed to reduce the number of tags, and if I did, let me clarify myself. I suggest that we don&#039;t need two word layers for textual content (for non-textual content like videos tags are still the number one tool). 

What we have right now in texts is the standard layer - the _words_ themselves, which are never 100% precise, since words are employed to mean exactly what the writer wants them to and not as some universal concept. And then we have the covering tag-layer, which is also comprised of _words_ that are not unambiguous. So why not just have one layer, where you have the text, from which you take the key words to &quot;tag&quot; itself. 

Again, I&#039;m only suggesting this for textual content and this way the number of tags would not be reduced (the number of tags equals the number of all words now and will be the same after, ie. infinite).</description>
		<content:encoded><![CDATA[<p>Hi and thanks for your comment. :) I don&#8217;t think I proposed to reduce the number of tags, and if I did, let me clarify myself. I suggest that we don&#8217;t need two word layers for textual content (for non-textual content like videos tags are still the number one tool). </p>
<p>What we have right now in texts is the standard layer &#8211; the _words_ themselves, which are never 100% precise, since words are employed to mean exactly what the writer wants them to and not as some universal concept. And then we have the covering tag-layer, which is also comprised of _words_ that are not unambiguous. So why not just have one layer, where you have the text, from which you take the key words to &#8220;tag&#8221; itself. </p>
<p>Again, I&#8217;m only suggesting this for textual content and this way the number of tags would not be reduced (the number of tags equals the number of all words now and will be the same after, ie. infinite).</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Tags and Topics &#8211; Yin and Yang or a Dysfunctional Marriage? by Ben</title>
		<link>http://topify.wordpress.com/2008/05/07/tags-and-topics-yin-and-yang-or-a-dysfunctional-marriage/#comment-4</link>
		<dc:creator>Ben</dc:creator>
		<pubDate>Sat, 10 May 2008 07:47:47 +0000</pubDate>
		<guid isPermaLink="false">http://topify.wordpress.com/?p=7#comment-4</guid>
		<description>One thing I think is always worth thinking about when considering the role of tags is their relationship to non-textual content.  Tags are not just about stories, they are about anything.  In fact, they are much more important when the content that is tagged can&#039;t be interpreted by a normal text indexing algorithm.  The idea of using NLP techniques in the tagging process is good (see Twine.com), but there is more that can be done by engaging the viewer of the thing that is tagged because people really are, still, much smarter - or at least much more interesting, than machines.  

Why do you think it would be useful to reduce the number of possible tags ?</description>
		<content:encoded><![CDATA[<p>One thing I think is always worth thinking about when considering the role of tags is their relationship to non-textual content.  Tags are not just about stories, they are about anything.  In fact, they are much more important when the content that is tagged can&#8217;t be interpreted by a normal text indexing algorithm.  The idea of using NLP techniques in the tagging process is good (see Twine.com), but there is more that can be done by engaging the viewer of the thing that is tagged because people really are, still, much smarter &#8211; or at least much more interesting, than machines.  </p>
<p>Why do you think it would be useful to reduce the number of possible tags ?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
