mathislucka

Results 21 comments of mathislucka

Can we reformulate the issue to something like "Provide tokenization options to limit document or text length in pipelines"? I could see multiple places where this applies and multiple strategies...

I was actually too fast with that :D Ranker only gets one document at a time.

Yes, came around on that too. My thoughts were something like: `DocumentsTokenTruncater` (probably not a great name) that accepts a list of documents and can truncate them according to different...
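
To make the idea of a component that takes a list of documents and truncates them by token count more concrete, here is a minimal sketch. Everything in it is an assumption for illustration: the `Document` dataclass is a hypothetical stand-in (not Haystack's class), `truncate_documents` is an invented name, and whitespace splitting replaces a real model tokenizer. It shows only one possible strategy (keep the first `max_tokens` tokens of each document); the strategies the comment alludes to are not specified in the thread.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Document:
    # Hypothetical stand-in for a pipeline document; not Haystack's Document class.
    content: str


def whitespace_tokenize(text: str) -> List[str]:
    # Assumption: whitespace splitting stands in for a real model tokenizer.
    return text.split()


def truncate_documents(
    documents: List[Document],
    max_tokens: int,
    tokenize: Callable[[str], List[str]] = whitespace_tokenize,
) -> List[Document]:
    """Return copies of the documents cut to at most max_tokens tokens each."""
    if max_tokens < 0:
        raise ValueError("max_tokens must be non-negative")
    truncated = []
    for doc in documents:
        tokens = tokenize(doc.content)
        # Rejoining with spaces is only faithful for the whitespace tokenizer;
        # a real tokenizer would need its own decode step.
        truncated.append(Document(content=" ".join(tokens[:max_tokens])))
    return truncated


docs = [Document("one two three four five"), Document("short text")]
result = truncate_documents(docs, max_tokens=3)
print([d.content for d in result])
```

Passing the tokenizer in as a parameter keeps the sketch independent of any particular model, which matters because the right token budget depends on the tokenizer the downstream component uses.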

> One other (small) issue I foresee when using a separate component is that it's not easily possible to know how much you should truncate the documents by. To precisely...

Hey Joshua, if you just want to use the models, then you could start with this notebook: https://github.com/mathislucka/kaggle_clrp_1st_place_solution/blob/main/notebooks/05_clrp_inference.ipynb. You'd need to download the models from Kaggle (links in README). However,...

We're working on a new product for the Haystack community that will allow users to build their pipelines in a visual editor. It's currently in development and we'll announce it...

For reference: I reached out to @Deepan-kishore and he'll participate in the beta phase.

@lohit8846 happy to add you to the list. How can I best reach you?

That's great to hear @Bellk17. We now have an early access form where you can sign up to get access to the private beta: https://landing.deepset.ai/deepset-studio-waitlist. We'll start onboarding people from...

@PaulBFB you can just sign up here: https://landing.deepset.ai/deepset-studio-signup