François Massot
François Massot
Please @nigel-andrews resolve the comments that you took into account. If relevant, it's nice to provide a feedback to the reviewer directly in the PR.
> Is it always ok to merge splits from different sources (source_id)? I'm not sure of that right now but this question seems unrelated to the fundamental problem we want...
Actually my solution is not working because of the split cleanup we are doing when we start the server. I need to rethink about that.
Thanks @machete-michael for the report. More info on this issue. Here is a request made on a default dynamic mapping (see docs example) that shows the same error: ```bash curl...
@machete-michael sorry for the long silence. A new eye on this issue made me think that you may be interested in a uuid friendly tokenizer. We have open an issue...
Good point, I forgot to open an issue on this, thanks :)
@gnufree Concerning the error, currently a field marked as "fast" must be present in the document. If not, quickwit will not index the document and log this error `RequiredFastField("id")`.
> Can an indexer consume records with the same ID? Yes, no problem with that.
@gnufree it seems like your documents are pretty small, can you share an extract of the documents you are indexing? Also, can you share your index config too?
Interesting to see how databend is doing it: https://github.com/datafuselabs/databend/issues/3084