manticoresearch-php icon indicating copy to clipboard operation
manticoresearch-php copied to clipboard

Method to upload a large batch at max speed

Open marclaporte opened this issue 1 year ago • 3 comments

Could / should the Manticore PHP client be adapted?

2024-09-23_170943

marclaporte avatar Sep 23 '24 21:09 marclaporte

Hi @marclaporte

  1. disable binlog

This has been mitigated here https://github.com/manticoresoftware/manticoresearch/issues/879

  1. write to multiple tables at once

This will be resolved once auto-sharding is completed:

  • https://github.com/manticoresoftware/manticoresearch-buddy/issues/150 - done
  • https://github.com/manticoresoftware/manticoresearch/issues/1087 - yet to do

Also, I believe the client shouldn't manage the schema directly anyway.

  1. write with a higher concurrency
  2. write with a larger batch

It might make sense to integrate this into the client. The idea would be that when the client receives a large batch of documents, it starts sending them to Manticore at the maximum possible speed, automatically adjusting concurrency and batch size. This could be based on factors like the number of CPU cores and workers available in the Manticore instance, increasing until further gains are no longer feasible.

sanikolaev avatar Sep 24 '24 03:09 sanikolaev

Many things have been addressed.

One remaining blocker: https://github.com/manticoresoftware/manticoresearch/pull/3073

marclaporte avatar Feb 19 '25 13:02 marclaporte

AFAICT, the last blocker is https://github.com/manticoresoftware/manticoresearch/issues/1087

marclaporte avatar Mar 21 '25 17:03 marclaporte