Grant Ingersoll
Track when users click on the "Read More/Less" links. Add a call into the SnowplowService to support this. This may need a callback function on View.
Add signals for time to first click, time between clicks, and time on page.
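A minimal sketch of how these three timing signals could be derived from raw event timestamps. The function name `timing_signals`, its parameters, and the use of timestamps in seconds are assumptions made for illustration; they are not part of the existing signal pipeline.

```python
from typing import Dict, List, Optional, Sequence, Union


def timing_signals(
    page_load: float, clicks: Sequence[float], page_exit: float
) -> Dict[str, Union[Optional[float], List[float]]]:
    """Derive timing signals from event timestamps (hypothetical helper, seconds)."""
    ordered = sorted(clicks)
    # Time to first click is undefined when the user never clicked.
    first = ordered[0] - page_load if ordered else None
    # Gaps between consecutive clicks, in click order.
    gaps = [later - earlier for earlier, later in zip(ordered, ordered[1:])]
    return {
        "time_to_first_click": first,
        "time_between_clicks": gaps,
        "time_on_page": page_exit - page_load,
    }


signals = timing_signals(page_load=100.0, clicks=[103.5, 110.0, 112.0], page_exit=130.0)
```

Keeping the between-click gaps as a list, rather than a single average, leaves the choice of aggregation to whatever consumes the signals.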
It would be nice to detect, as JIRA does, when a user just wants to search within a particular project, by adding query parsing to support this. For...
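One possible shape for this query parsing, sketched under assumptions: the rule used here (treat a leading token that matches a known project name as a project filter) and the names `split_project_query` and `known_projects` are hypothetical, and the actual detection rule is not specified in this issue.

```python
from typing import Optional, Set, Tuple


def split_project_query(query: str, known_projects: Set[str]) -> Tuple[Optional[str], str]:
    """Split a query into (project, remaining terms); hypothetical parsing rule."""
    tokens = query.split()
    # Assumption: only a leading token is considered a project scope.
    if tokens and tokens[0].lower() in known_projects:
        return tokens[0].lower(), " ".join(tokens[1:])
    # No project detected: search everything with the query unchanged.
    return None, query


projects = {"lucene", "solr"}
scoped = split_project_query("Lucene faceting", projects)
unscoped = split_project_query("faceting in search", projects)
```

The detected project would then be applied as a filter while the remaining terms form the main query.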
It would be interesting to calculate the "lifetime value" of contributors to a project. We should be able to use a variety of metrics from each project to determine such...
Train on an equally weighted downsample of the project messages, then apply that model to the full index, indexing the top 3 topics for each message, and set this field...
We'd like to showcase training and running classification of content as it flows into the system by writing a classifier that predicts what mailing list a message best belongs to...
When displaying results, roll up related threads, bot messages, blog articles and comments.
When continually crawling and indexing into Solr, as we'll do in production for searchhub, we need to make sure the threading batch job can read from a consistent point from...
Currently, the mail threading jobs do a *:* query on Solr, but this retrieves all documents. We should add a "type" field that identifies the type of content (mail, github,...
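As a rough illustration, the threading job could keep the match-all query and narrow it with a Solr filter query on the proposed field. The field name `type` and the value `mail` come from the description above; the helper name `threading_query_params` and the `rows` default are assumptions.

```python
from urllib.parse import parse_qs, urlencode


def threading_query_params(content_type: str = "mail", rows: int = 100) -> str:
    """Build a URL-encoded Solr query string restricted to one content type."""
    params = [
        ("q", "*:*"),                    # match-all main query, as the job does today
        ("fq", f"type:{content_type}"),  # filter on the proposed "type" field
        ("rows", str(rows)),             # page size (assumed value)
    ]
    return urlencode(params)


query_string = threading_query_params()
# Decode again to show the parameters Solr would receive.
decoded = parse_qs(query_string)
```

Using `fq` rather than folding the restriction into `q` lets Solr cache the filter independently of the main query.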
In order to prevent signal spamming, we should add some spam detection to the proxy. For starters, perhaps we could watch for a high volume of clicks or other signals...
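A minimal sketch of the volume check, assuming a sliding time window per client. The class name `SignalRateMonitor`, the client identifier, and the threshold values are all hypothetical; the proxy's real interface is not described here.

```python
from collections import defaultdict, deque
from typing import Deque, Dict


class SignalRateMonitor:
    """Flag clients that send more than max_signals within window_seconds."""

    def __init__(self, max_signals: int, window_seconds: float) -> None:
        self.max_signals = max_signals
        self.window_seconds = window_seconds
        self._events: Dict[str, Deque[float]] = defaultdict(deque)

    def record(self, client_id: str, timestamp: float) -> bool:
        """Record one signal; return True if the client now exceeds the limit."""
        events = self._events[client_id]
        events.append(timestamp)
        # Drop timestamps that have fallen out of the sliding window.
        while events and timestamp - events[0] > self.window_seconds:
            events.popleft()
        return len(events) > self.max_signals


monitor = SignalRateMonitor(max_signals=3, window_seconds=10.0)
flags = [monitor.record("client-a", t) for t in (0.0, 1.0, 2.0, 3.0, 20.0)]
```

Here the fourth signal within ten seconds is flagged, and the flag clears once the earlier signals age out of the window. What the proxy does with a flagged client (drop, throttle, or log) is left open.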