milvus icon indicating copy to clipboard operation
milvus copied to clipboard

[Bug]: [benchmark] insert 100 million data, 2 replica , concurrent operations, IVF_SQ8 index search latency is twice that of November 2022

Open elstic opened this issue 1 year ago • 3 comments

Is there an existing issue for this?

  • [X] I have searched the existing issues

Environment

- Milvus version: 2.2.0-20230427-c82ec0b1
- Deployment mode(standalone or cluster):cluster
- MQ type(rocksmq, pulsar or kafka):    
- SDK version(e.g. pymilvus v2.0.0rc2):
- OS(Ubuntu or CentOS): 
- CPU/Memory: 
- GPU: 
- Others:

Current Behavior

Same scenario, insert 100 million data, concurrent operations, IVF_SQ8 index search latency is twice as high as in November 2022 .

image: 2.2.0-20230427-c82ec0b1 search latency : about 0.233s image

image : 2.2.0-20221118-e7429f88 search latency : about 0.109s image

Expected Behavior

No response

Steps To Reproduce

1. create a collection or use an existing collection
        2. build index on vector column
        3. insert a certain number of vectors
        4. flush collection
        5. build index on vector column with the same parameters
        6. build index on on scalars column or not
        7. count the total number of rows
        8. load collection
        9. perform concurrent operations (search、load、query、ddl)
        10. clean all collections or not

Milvus Log

No response

Anything else?

No response

elstic avatar Apr 28 '23 05:04 elstic

/assign @MrPresent-Han /unassign

yanliang567 avatar Apr 28 '23 06:04 yanliang567

/assign @cydrain

xiaofan-luan avatar Apr 28 '23 17:04 xiaofan-luan

@cydrain pls help on investigate it

xiaofan-luan avatar Apr 28 '23 17:04 xiaofan-luan

image: 2.2.0-20230427-c82ec0b1 (knowhere-1.3.12) Screenshot from 2023-05-22 11-41-48

image : 2.2.0-20221118-e7429f88 (knowhere-1.3.2) Screenshot from 2023-05-22 11-40-14

test with sift1m dataset, see no obvious performance difference

cydrain avatar May 22 '23 02:05 cydrain

case :test_concurrent_locust_100m_ivf_sq8_ddl_dql_filter_replica2_cluster using dataset: SIFT , metirc_type:L2

search (weight=20):

 nq=10, top_k=10, search_param={"nprobe": 16},
 expr=eval("{'float_1': {'GT': -1.0, 'LT': parser_data_size(data_size) * 0.5}}")),

query (weight=10):

ids=[i for i in range(10)]

load(weight=1) ddl (weight=2)

create collection,insert vectors, flush, build index ,time.sleep(59), drop collection 

elstic avatar May 22 '23 03:05 elstic

Synchronized information: ddl operations have slowed down more : image between 112s and 114s

In the previous version v2.2.4: image

case: test_concurrent_locust_100m_ivf_sq8_ddl_dql_filter_replica2_cluster

elstic avatar May 22 '23 08:05 elstic

/assign @jiaoew1991 ddl is slowing down, please help troubleshoot

elstic avatar May 22 '23 08:05 elstic

/unassign

cydrain avatar May 23 '23 06:05 cydrain

Is this still a problem, @elstic?

jiaoew1991 avatar Jun 25 '23 07:06 jiaoew1991

Is this still a problem, @elstic?

This issue no longer exists. Please see the search latency of this case recently run, it and our benchmark data has been about the same, according to my judgment, should be this issue under the pr repair: https://github.com/milvus-io/milvus/issues/24022 image

elstic avatar Jun 25 '23 08:06 elstic