Chase Gilliam
Chase Gilliam
@smpallen99 Yeah, supporting both with an option on the generator sounds very nice.
@inoas are you still working on this? If so I'd be interested in pitching in.
This seems like a nice feature. Are there downsides to changing the token format?
This would be really nice!
The LSI functionality/implementation is a pretty tricky to follow. Most of it hasn't been changed since it was first written. I'm not sure I could adequately explain how to go...
We've discussed TF-IDF, but I haven't had time to dig into it.
We could probably make this configurable. I’ll happily review a PR for this.
@Christophy We just merged https://github.com/jekyll/classifier-reborn/pull/162, which allows for custom tokenizers. Could you let us know if this helps?
@tra38 could you elaborate on which part(s) you're interested in?
@tra38 I think we could expose the lsi data. It'll probably take some careful refactoring, but should be doable.