comment search
Let's face it, comment search still sucks. I don't really want to deal with it, but it's a real problem so here's a bug to annoy me.
cor recommends checking out http://www.elasticsearch.org/
http://www.rallydev.com/engblog/2011/03/18/search-that-sizzles-experiences-transitioning-from-sql-like-to-solr/
We found out tonight that Xapian isn't in Debian anymore. That makes this even more important.
I'm really liking what I'm seeing with Elasticsearch. Next week I'm going to see if it will work for us.
We use Elasticsearch at our company now to search through hundreds of millions of log entries. It has some scaling issues at that size, but for tmbo it should be great.
not that it matters at our scale, but are you clustering?
also, in case anyone's curious:
mysql> SELECT COUNT(*) FROM offensive_comments WHERE comment <> "";
+----------+
| COUNT(*) |
+----------+
|  2344658 |
+----------+
1 row in set (4.88 sec)
I don't think we're clustering. I haven't set it up, but as far as I can see we have 1 machine (although a kickass one).
I can't make progress on this unless we get a memory upgrade or something. Back to the back burner for now.
Oh hey. I might have fixed it. Take a look at b63a8486e0d30e72ae2b697ed372cdb0819adfbd. (I also handled empty results correctly with 7fb838c08174845d57068ff3d34737e1efb03114). Deployed to sandbox for testing.
XapianQueryParser::FLAG_DEFAULT was what was missing? D'oh.