Shahrukh Khan
Shahrukh Khan
Hi @akar5h, @lalitpagaria, As discussed previously the goal here is to have more advanced document embeddings filterable by facets giving multiple views for a query document, this can be achieved...
When user stars a repo -> extract read me of repo -> index in ES
**Is your feature request related to a problem? Please describe.** Ability to create labeled datasets out of the categories present of Wikipedia pages both from category explorer and Wikipedia project...
@lalitpagaria Please feel free to add more things to this list. - Multi-language tokenization - functionality there but not tested yet - URL remove - grammar correction Multiple features can...
@lalitpagaria for getting document vectors we can use this https://github.com/UKPLab/sentence-transformers
**Is your feature request related to a problem? Please describe.** Have extendable and standard UI for the drag and drop interface. **Describe the solution you'd like** Write a small wrapper...
Hi @MaartenGr, I have fine-tuned a fuzzy transformer for char level similarity to do fuzzy matching, you can read about how I did here: LinkedIn post explanation: https://www.linkedin.com/feed/update/urn:li:activity:6819456033992253440/ Model on...
I am trying to deploy sharded LLMs to multiple RTX 3090s. So, far I have tried TGI by HF and it works fine. However, I came across Ray LLM at...