<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: simpler-dup-finder</title>
	<atom:link href="http://blogs.gnome.org/newren/2006/06/01/simpler-dup-finder/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.gnome.org/newren/2006/06/01/simpler-dup-finder/</link>
	<description>Just another GNOME Blogs weblog</description>
	<lastBuildDate>Sun, 11 Jan 2009 23:03:47 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Jamie McCracken</title>
		<link>http://blogs.gnome.org/newren/2006/06/01/simpler-dup-finder/comment-page-1/#comment-55</link>
		<dc:creator>Jamie McCracken</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.gnome.org/newren/2006/06/01/simpler-dup-finder/#comment-55</guid>
		<description>I&#039;ve spent quite a bit of time optimizing fulltext queries while developing Tracker (&lt;a href=&quot;http://freedesktop.org/wiki/Software/Tracker&quot;&gt;http://freedesktop.org/wiki/Software/Tracker&lt;/a&gt;)&lt;p/&gt;The slowness is caused by having a too complicated query with fulltext in it so its a good idea to split things up if thats the case (especially with group bys).&lt;p/&gt;Mysql has a nifty temp table support which should make it easier and more efficient to get a unique column and separate out the fulltext results.&lt;p/&gt;Try the following statements (if the distinct dont work you can also try a group by as the temp table will only be 50 rows max it will still be fast):&lt;p/&gt;DROP TEMPORARY TABLE IF EXISTS DUPE_SEARCH;&lt;p/&gt;CREATE TEMPORARY TABLE DUPE_SEARCH &lt;br/&gt;(&lt;br/&gt;bug_id long,&lt;br/&gt;relevance double&lt;br/&gt;);&lt;p/&gt;insert into DUPE_SEARCH&lt;br/&gt;select distinct bugs.bug_id, $fulltext_search&lt;br/&gt;from blah where blah limit 50;&lt;p/&gt;select distinct bugs.bug_id,&lt;br/&gt;substr(thetext, 1, 5000) AS comment,&lt;br/&gt;bug_status AS status,&lt;br/&gt;resolution,&lt;br/&gt;products.name AS product,&lt;br/&gt;substr(short_desc, 1, 60) AS summary&lt;br/&gt;From Blah, DUPE_SEARCH D...  where D.bug_id = Blah.bug_id&lt;br/&gt;Order by D.relevance;&lt;p/&gt;Drop Temporary table if exists DUPE_SERACH; &lt;br/&gt;</description>
		<content:encoded><![CDATA[<p>I&#8217;ve spent quite a bit of time optimizing fulltext queries while developing Tracker (<a href="http://freedesktop.org/wiki/Software/Tracker">http://freedesktop.org/wiki/Software/Tracker</a>)
<p />The slowness is caused by having a too complicated query with fulltext in it so its a good idea to split things up if thats the case (especially with group bys).
<p />Mysql has a nifty temp table support which should make it easier and more efficient to get a unique column and separate out the fulltext results.
<p />Try the following statements (if the distinct dont work you can also try a group by as the temp table will only be 50 rows max it will still be fast):
<p />DROP TEMPORARY TABLE IF EXISTS DUPE_SEARCH;
<p />CREATE TEMPORARY TABLE DUPE_SEARCH <br />(<br />bug_id long,<br />relevance double<br />);
<p />insert into DUPE_SEARCH<br />select distinct bugs.bug_id, $fulltext_search<br />from blah where blah limit 50;
<p />select distinct bugs.bug_id,<br />substr(thetext, 1, 5000) AS comment,<br />bug_status AS status,<br />resolution,<br />products.name AS product,<br />substr(short_desc, 1, 60) AS summary<br />From Blah, DUPE_SEARCH D&#8230;  where D.bug_id = Blah.bug_id<br />Order by D.relevance;
<p />Drop Temporary table if exists DUPE_SERACH; </p>
]]></content:encoded>
	</item>
</channel>
</rss>
