milvus
milvus copied to clipboard
[Bug]: Query performance is very low (below 150qps) despite important query node number
Is there an existing issue for this?
- [X] I have searched the existing issues
Environment
- Milvus version: 2.2.2
- Deployment mode(standalone or cluster): cluster AWS
- MQ type(rocksmq, pulsar or kafka): pulsar
- SDK version(e.g. pymilvus v2.0.0rc2): github.com/milvus-io/milvus-sdk-go/v2 v2.2.0
- OS(Ubuntu or CentOS): CentOS
- CPU/Memory: c5.4xlarge => 16 VCPU, 32GB
- GPU:
- Others:
Current Behavior
Querying on a new sending node (other AWS server) and doing some random benchmark I see very low throughput.
I'm doing 2 machine (1 local + 1 aws server) 4 processes, batches of 1000query with 10 go-routines. Each go routines as a single connection to a single Proxy node (using 2 proxy nodes randomly).
Here's my collection info:
- name: embedding
- FloatVector: 384
- _default_idx: IVF_FLAT
- metric_type: IP nlist:1800
- Approx Entity Count :11,032,326
- Collection is loaded and replicated on 30 query nodes.
- Collection gains about 3k-4k vectors/second (through another process inserting new records)
Collection ID: 438693529927026545 Collection Name: mycollection Partitions: Fields:
- Field ID: 0 Field Name: RowID Field Type: Int64
- Field ID: 1 Field Name: Timestamp Field Type: Int64
- Field ID: 100 Field Name: entity_id Field Type: Int64 - Primary Key, AutoID: false
- Field ID: 101 Field Name: text_count Field Type: Int16
- Field ID: 102 Field Name: hash_count Field Type: Int8
- Field ID: 103 Field Name: other_count Field Type: Int8
- Field ID: 104 Field Name: mention_count Field Type: Int8
- Field ID: 105 Field Name: media_count Field Type: Int8
- Field ID: 106 Field Name: is_bool Field Type: Bool
- Field ID: 107 Field Name: has_quote Field Type: Bool
- Field ID: 108 Field Name: creation_timestamp Field Type: Int32
- Field ID: 109 Field Name: embedding Field Type: FloatVector - Type Param dim: 384 Consistency Level: Bounded Start position for channel by-dev-rootcoord-dml_32: [8 42 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_35: [8 45 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_56: [8 66 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_30: [8 40 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_47: [8 57 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_48: [8 58 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_51: [8 61 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_52: [8 62 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_34: [8 44 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_38: [8 48 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_42: [8 52 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_43: [8 53 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_44: [8 54 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_55: [8 65 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_37: [8 47 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_46: [8 56 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_50: [8 60 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_53: [8 63 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_54: [8 64 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_41: [8 51 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_40: [8 50 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_45: [8 55 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_49: [8 59 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_59: [8 69 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_36: [8 46 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_39: [8 49 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_57: [8 67 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_31: [8 41 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_58: [8 68 16 0 24 0 32 0] Start position for channel by-dev-rootcoord-dml_33: [8 43 16 0 24 0 32 0]
here's the query:
vec2search := []entity.Vector{searchParams.vector}
start := time.Now()
sp, _ := entity.NewIndexIvfFlatSearchParam(1)
sRet, err := milvusClient.Search(ctx, "mycollection", nil, "", []string{"entity_id"},
vec2search, "embedding", entity.IP, 20, sp,
client.WithSearchQueryConsistencyLevel(entity.ClBounded), client.WithLimit(20))
if err != nil {
fmt.Println(err)
return false, 0, 0, fmt.Errorf("failed to search collection, err: %v", err)
}

Process 1:
Queried nbSearch: 1000, nbError: 0 records in 29.460374951s (total:4m53.267963685s) => 33.94 qps (avg latency: 293.27ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 29.839000831s (total:4m55.789648876s) => 33.51 qps (avg latency: 295.79ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 30.076635863s (total:4m59.234612841s) => 33.25 qps (avg latency: 299.23ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 29.815182548s (total:4m56.846613236s) => 33.54 qps (avg latency: 296.85ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 30.022772186s (total:4m58.777589485s) => 33.31 qps (avg latency: 298.78ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 29.117218885s (total:4m49.991001462s) => 34.34 qps (avg latency: 289.99ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 27.628981886s (total:4m34.572110472s) => 36.20 qps (avg latency: 274.57ms, avg results: 20.00)
Process 2
Queried nbSearch: 1000, nbError: 0 records in 27.778894894s (total:4m36.510751041s) => 36.00 qps (avg latency: 276.51ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 27.79649825s (total:4m36.765207525s) => 35.98 qps (avg latency: 276.76ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 27.916158123s (total:4m38.357692297s) => 35.82 qps (avg latency: 278.36ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 27.888368544s (total:4m37.913993133s) => 35.86 qps (avg latency: 277.91ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 28.066101082s (total:4m39.291657502s) => 35.63 qps (avg latency: 279.29ms, avg results: 20.00)
Process 3
Queried nbSearch: 1000, nbError: 0 records in 29.68494171s (total:4m55.683594435s) => 33.69 qps (avg latency: 295.68ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 29.485262268s (total:4m53.881224462s) => 33.92 qps (avg latency: 293.88ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 28.993143544s (total:4m48.817353677s) => 34.49 qps (avg latency: 288.82ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 29.305538861s (total:4m51.002148977s) => 34.12 qps (avg latency: 291.00ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 29.322604115s (total:4m50.888327916s) => 34.10 qps (avg latency: 290.89ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 29.495849313s (total:4m53.244181178s) => 33.90 qps (avg latency: 293.24ms, avg results: 20.00)
Process 4
Queried nbSearch: 1000, nbError: 0 records in 1m14.986758708s (total:6m13.327321304s) => 13.34 qps (avg latency: 373.33ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 1m17.293226959s (total:6m24.871092199s) => 12.94 qps (avg latency: 384.87ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 1m16.322780041s (total:6m19.836449932s) => 13.10 qps (avg latency: 379.84ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 1m16.321409708s (total:6m19.74849057s) => 13.10 qps (avg latency: 379.75ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records in 1m17.547682833s (total:6m25.682184784s) => 12.90 qps (avg latency: 385.68ms, avg results: 20.00)
Expected Behavior
No response
Steps To Reproduce
No response
Milvus Log
No response
Anything else?
No response
I looked into QueryCoord/Query node no logs stands out that would point to a problem. I'm sure what I'm doing wrong here, would love some help!
I looked into QueryCoord/Query node no logs stands out that would point to a problem. I'm sure what I'm doing wrong here, would love some help!
Question, if you already know the entity_id and embedding it self, why do you do search?
I actually don't know the entity_id, I'm trying to get back an entity id given a vector in my embedding
column (search the associated entity_id with this vector). Is this not what this do?
milvusClient.Search(ctx, "mycollection", nil, "", []string{"entity_id"}, vec2search, "embedding", entity.IP, 20, sp,
client.WithSearchQueryConsistencyLevel(entity.ClBounded), client.WithLimit(20))
- :
Gotcha, 30 * 16 vcpu with 10m data should be much faster than hundred QPS. Few things to check:
- If all the querynodes in the cluster are balanced? I mean the CPU usage.
- If balanced
- if you are using too large nprobes? try to change nprobe to 16-32 and retry
- If Cpu usage high enough -> change to HNSW index and retyr
- If not -> check prometheus metrics and logs, see if it's because timetick delay and search can not be issued due to consistency limited.
- if not balanced
- if there are only shard numbers querynode with very high cpu usage, might be related to growing data, use more shards or smaller growing segment ratio would help. (dataCoord.segment.sealProportion)
- otherwise you will need to check the balance policy, why there are some querynode with less load
- Yes node are well balanced CPU wise.
- as you can see in my query
sp, _ := entity.NewIndexIvfFlatSearchParam(1)
I also tried with 20 & 32 - makes no difference.
here's what I see on the query node during a query:
[2023/01/12 18:19:19.103 +00:00] [DEBUG] [querynode/impl.go:724] ["Received SearchRequest"] [traceID=431d4bef9128619a] [vChannels="[by-dev-rootcoord-dml_37_438693529927026545v7,by-dev-rootcoord-dml_40_438693529927026545v10]"] [segmentIDs="[]"] [guaranteeTimestamp=438710450014650372] [timeTravel=18446744073709551615]
[2023/01/12 18:19:19.103 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=8942b1907336b65] [msgID=438710451299155969] [segmentID=438693529942273577] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.104 +00:00] [DEBUG] [querynode/impl.go:855] ["start do search"] [traceID=431d4bef9128619a] [vChannel=by-dev-rootcoord-dml_37_438693529927026545v7] [segmentIDs="[]"]
[2023/01/12 18:19:19.104 +00:00] [INFO] [querynode/shard_cluster.go:927] ["cluster segment distribution"] [len=0] [partitionIDs="[]"]
[2023/01/12 18:19:19.104 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(431d4bef9128619a)] [msg="enqueue done"] [duration=33.088µs]
[2023/01/12 18:19:19.105 +00:00] [DEBUG] [querynode/task_read.go:173] ["query msg can do"] [collectionID=438693529927026545] [sm.GuaranteeTimestamp=2023/01/12 18:19:14.072 +00:00] [serviceTime=2023/01/12 18:19:18.622 +00:00] ["delta milliseconds"=-4550] [channel=by-dev-rootcoord-dml_37_438693529927026545v7] [msgID=438710451325370372]
[2023/01/12 18:19:19.105 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=2e56946eefb79f1f] [msgID=438710451299155970] [segmentID=438693529939670033] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.105 +00:00] [DEBUG] [querynode/task_read.go:180] [tr/waitTSafeTimeRecorder] [msg="wait for tsafe done"] [duration=22.024µs]
[2023/01/12 18:19:19.105 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=1d5b183b1015a12f] [msgID=438710451312263169] [segmentID=438693529934662297] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.105 +00:00] [DEBUG] [querynode/impl.go:855] ["start do search"] [traceID=431d4bef9128619a] [vChannel=by-dev-rootcoord-dml_40_438693529927026545v10] [segmentIDs="[]"]
[2023/01/12 18:19:19.105 +00:00] [INFO] [querynode/shard_cluster.go:927] ["cluster segment distribution"] [len=0] [partitionIDs="[]"]
[2023/01/12 18:19:19.105 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(431d4bef9128619a)] [msg="enqueue done"] [duration=104.974µs]
[2023/01/12 18:19:19.106 +00:00] [DEBUG] [querynode/task_read.go:173] ["query msg can do"] [collectionID=438693529927026545] [sm.GuaranteeTimestamp=2023/01/12 18:19:14.072 +00:00] [serviceTime=2023/01/12 18:19:18.622 +00:00] ["delta milliseconds"=-4550] [channel=by-dev-rootcoord-dml_40_438693529927026545v10] [msgID=438710451325370372]
[2023/01/12 18:19:19.106 +00:00] [DEBUG] [querynode/task_read.go:180] [tr/waitTSafeTimeRecorder] [msg="wait for tsafe done"] [duration=52.144µs]
[2023/01/12 18:19:19.109 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=8942b1907336b65] [msgID=438710451299155969] [segmentID=438693529944677347] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.109 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=2e56946eefb79f1f] [msgID=438710451299155970] [segmentID=438693529932258033] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.113 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=2e56946eefb79f1f] [msgID=438710451299155970] [segmentID=438693529927228394] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.114 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=8942b1907336b65] [msgID=438710451299155969] [segmentID=438693529942273388] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.119 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=1d5b183b1015a12f] [msgID=438710451312263169] [segmentID=438693529944677492] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.123 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=1d5b183b1015a12f] [msgID=438710451312263169] [segmentID=438693529929654236] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.123 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529932258189] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.124 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=8942b1907336b65] [msgID=438710451299155969] [segmentID=438693529937266493] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.130 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=2e56946eefb79f1f] [msgID=438710451299155970] [segmentID=438693529934662647] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.132 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=8942b1907336b65] [msgID=438710451299155969] [segmentID=438693529932258004] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.132 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=8942b1907336b65] [segmentIDs="[438693529939669728,438693529927228416,438693529942273577,438693529944677347,438693529929654539,438693529934662254,438693529932258004,438693529937266058]"]
[2023/01/12 18:19:19.132 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=2e56946eefb79f1f] [msgID=438710451299155970] [segmentID=438693529937266551] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.132 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529939670178] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.134 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=1d5b183b1015a12f] [msgID=438710451312263169] [segmentID=438693529939669756] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.135 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=8942b1907336b65] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_37_438693529927026545v7, segmentIDs = []"] [duration=162.210773ms]
[2023/01/12 18:19:19.135 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=8942b1907336b65] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.135 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=8942b1907336b65] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_37_438693529927026545v7, segmentIDs = []"] [duration=162.316607ms]
[2023/01/12 18:19:19.136 +00:00] [DEBUG] [querynode/task_read.go:80] [tr/searchTask(5b3d10f134cb77fd)] [msg="start to process"] [duration=67.02202ms]
[2023/01/12 18:19:19.136 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529944677129] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.136 +00:00] [DEBUG] [querynode/validate.go:115] ["read target partitions on stream replica"] [traceID=5b3d10f134cb77fd] [collectionID=438693529927026545] [partitionIDs="[438693529927026546]"]
[2023/01/12 18:19:19.137 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529934662649] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.137 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529929654422] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.137 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529944676752] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.137 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529927228405] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.137 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529932258194] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.137 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529942273246] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.137 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529937266403] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.137 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529939669757] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.139 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529939670223] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.140 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=2e56946eefb79f1f] [msgID=438710451299155970] [segmentID=438693529929654714] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.140 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=2e56946eefb79f1f] [msgID=438710451299155970] [segmentID=438693529942273810] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.140 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=2e56946eefb79f1f] [segmentIDs="[438693529942273810,438693529929654714,438693529932258033,438693529927228394,438693529934662647,438693529944677289,438693529937266551,438693529939670033]"]
[2023/01/12 18:19:19.141 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529932258130] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.141 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=2e56946eefb79f1f] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_54_438693529927026545v24, segmentIDs = []"] [duration=157.180052ms]
[2023/01/12 18:19:19.141 +00:00] [DEBUG] [querynode/task_read.go:80] [tr/searchTask(40171f661264bf69)] [msg="start to process"] [duration=55.340656ms]
[2023/01/12 18:19:19.141 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=2e56946eefb79f1f] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.141 +00:00] [DEBUG] [querynode/validate.go:115] ["read target partitions on stream replica"] [traceID=40171f661264bf69] [collectionID=438693529927026545] [partitionIDs="[438693529927026546]"]
[2023/01/12 18:19:19.141 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=2e56946eefb79f1f] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_54_438693529927026545v24, segmentIDs = []"] [duration=157.559681ms]
[2023/01/12 18:19:19.141 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529939670004] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.141 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=2e56946eefb79f1f] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.142 +00:00] [DEBUG] [querynode/impl.go:773] [tr/Search] [traceID=2e56946eefb79f1f] [msg="search done in all shards"] [duration=158.008352ms]
[2023/01/12 18:19:19.142 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529927228413] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.142 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529929654333] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.142 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=8942b1907336b65] [msgID=438710451299155969] [segmentID=438693529927228403] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.142 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529937266158] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.142 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529929654715] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.142 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529944677535] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.142 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529934662341] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.142 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529932258221] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.142 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529942273913] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.143 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529942273518] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.144 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=1d5b183b1015a12f] [msgID=438710451312263169] [segmentID=438693529937266085] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.145 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529944677202] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.147 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=8942b1907336b65] [msgID=438710451299155969] [segmentID=438693529934662603] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.148 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=8942b1907336b65] [segmentIDs="[438693529927228403,438693529929654162,438693529944677115,438693529939670049,438693529932257989,438693529937266493,438693529942273388,438693529934662603]"]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/task_read.go:80] [tr/searchTask(40171f661264bf69)] [msg="start to process"] [duration=62.192697ms]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529934662501] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/validate.go:115] ["read target partitions on stream replica"] [traceID=40171f661264bf69] [collectionID=438693529927026545] [partitionIDs="[438693529927026546]"]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=8942b1907336b65] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_34_438693529927026545v4, segmentIDs = []"] [duration=175.108159ms]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=8942b1907336b65] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=8942b1907336b65] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_34_438693529927026545v4, segmentIDs = []"] [duration=175.189877ms]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=8942b1907336b65] [numbers=2] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529932257802] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/impl.go:773] [tr/Search] [traceID=8942b1907336b65] [msg="search done in all shards"] [duration=175.696861ms]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529942273333] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529929654340] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529939669729] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.149 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529929654339] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529937265955] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529927228415] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529934662051] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.148 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529944676941] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.153 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529942273547] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.158 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529937266406] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.164 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529927228398] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.166 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529937266245] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.167 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=7780a69791e7d9a9] [segmentIDs="[438693529929654339,438693529942273518,438693529927228398,438693529932258130,438693529937266245,438693529944677202,438693529934662501,438693529939670178]"]
[2023/01/12 18:19:19.167 +00:00] [DEBUG] [querynode/task_read.go:80] [tr/searchTask(40171f661264bf69)] [msg="start to process"] [duration=77.657551ms]
[2023/01/12 18:19:19.167 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=7780a69791e7d9a9] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_35_438693529927026545v5, segmentIDs = []"] [duration=116.915274ms]
[2023/01/12 18:19:19.167 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=7780a69791e7d9a9] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.167 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=7780a69791e7d9a9] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_35_438693529927026545v5, segmentIDs = []"] [duration=117.01358ms]
[2023/01/12 18:19:19.167 +00:00] [DEBUG] [querynode/validate.go:115] ["read target partitions on stream replica"] [traceID=40171f661264bf69] [collectionID=438693529927026545] [partitionIDs="[438693529927026546]"]
[2023/01/12 18:19:19.170 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529939669583] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.170 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529927228401] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.170 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529929654104] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.172 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529934662427] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.173 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529944676883] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.173 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529932257816] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.174 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529927228421] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.174 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529942273143] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.174 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529937266102] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.177 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=1d5b183b1015a12f] [msgID=438710451312263169] [segmentID=438693529942273594] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.183 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529939669757] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.187 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529942273913] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.189 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529929654333] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.192 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529937266403] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.194 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=7780a69791e7d9a9] [msgID=438710451312263172] [segmentID=438693529934662605] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.194 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529942273246] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.194 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=7780a69791e7d9a9] [segmentIDs="[438693529929654715,438693529942273547,438693529927228401,438693529932258189,438693529934662605,438693529937266406,438693529944677129,438693529939670223]"]
[2023/01/12 18:19:19.196 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529937265955] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=7780a69791e7d9a9] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_49_438693529927026545v19, segmentIDs = []"] [duration=146.894074ms]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/task_read.go:80] [tr/searchTask(431d4bef9128619a)] [msg="start to process"] [duration=92.209216ms]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=7780a69791e7d9a9] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/validate.go:115] ["read target partitions on stream replica"] [traceID=431d4bef9128619a] [collectionID=438693529927026545] [partitionIDs="[438693529927026546]"]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529942273577] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529927228416] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529929654539] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529944677535] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=7780a69791e7d9a9] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_49_438693529927026545v19, segmentIDs = []"] [duration=147.574214ms]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=7780a69791e7d9a9] [numbers=2] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/impl.go:773] [tr/Search] [traceID=7780a69791e7d9a9] [msg="search done in all shards"] [duration=147.766256ms]
[2023/01/12 18:19:19.197 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529944677347] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.198 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529934662254] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.198 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529939669728] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.198 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529937266058] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.198 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529932258004] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.201 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=1d5b183b1015a12f] [msgID=438710451312263169] [segmentID=438693529932257847] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.203 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=1d5b183b1015a12f] [segmentIDs="[438693529939669756,438693529942273594,438693529944677492,438693529932257847,438693529927228395,438693529934662297,438693529929654236,438693529937266085]"]
[2023/01/12 18:19:19.203 +00:00] [DEBUG] [querynode/task_read.go:80] [tr/searchTask(431d4bef9128619a)] [msg="start to process"] [duration=97.804711ms]
[2023/01/12 18:19:19.203 +00:00] [DEBUG] [querynode/validate.go:115] ["read target partitions on stream replica"] [traceID=431d4bef9128619a] [collectionID=438693529927026545] [partitionIDs="[438693529927026546]"]
[2023/01/12 18:19:19.203 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529934662559] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.204 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529944677028] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.204 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529927228399] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.204 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529929654685] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.205 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529932258369] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.206 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529939669840] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.206 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529942273491] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.206 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529937266261] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.208 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529944676941] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.211 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529927228413] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.213 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529927228415] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.215 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529937266158] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.221 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529942273333] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.223 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529939670004] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.225 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529929654422] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.229 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529934662649] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.204 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=1d5b183b1015a12f] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_53_438693529927026545v23, segmentIDs = []"] [duration=160.314339ms]
[2023/01/12 18:19:19.229 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=1d5b183b1015a12f] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.230 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529944677347] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.229 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=1d5b183b1015a12f] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_53_438693529927026545v23, segmentIDs = []"] [duration=185.749807ms]
[2023/01/12 18:19:19.231 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=1d5b183b1015a12f] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.231 +00:00] [DEBUG] [querynode/impl.go:773] [tr/Search] [traceID=1d5b183b1015a12f] [msg="search done in all shards"] [duration=187.342214ms]
[2023/01/12 18:19:19.233 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529929654340] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.236 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529944676752] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.237 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529939669729] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.237 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529934662341] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.238 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529927228405] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.239 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529937266102] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.239 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529932258221] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.239 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=40171f661264bf69] [segmentIDs="[438693529939670004,438693529927228413,438693529929654333,438693529937266158,438693529944677535,438693529934662341,438693529932258221,438693529942273913]"]
[2023/01/12 18:19:19.240 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=40171f661264bf69] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_46_438693529927026545v16, segmentIDs = []"] [duration=154.480867ms]
[2023/01/12 18:19:19.240 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=40171f661264bf69] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.240 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=40171f661264bf69] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_46_438693529927026545v16, segmentIDs = []"] [duration=154.577629ms]
[2023/01/12 18:19:19.240 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529942273143] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.243 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529927228399] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.244 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529944676883] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.244 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529934662051] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.245 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529927228421] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.247 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529929654685] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.247 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529934662427] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.247 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529937266261] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.248 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529932257802] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.248 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=40171f661264bf69] [segmentIDs="[438693529934662051,438693529932257802,438693529942273333,438693529929654340,438693529939669729,438693529937265955,438693529927228415,438693529944676941]"]
[2023/01/12 18:19:19.248 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529944677028] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.248 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=40171f661264bf69] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_38_438693529927026545v8, segmentIDs = []"] [duration=162.762245ms]
[2023/01/12 18:19:19.248 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=40171f661264bf69] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.248 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=40171f661264bf69] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_38_438693529927026545v8, segmentIDs = []"] [duration=162.854185ms]
[2023/01/12 18:19:19.248 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529942273577] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.249 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529942273491] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.252 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529932258004] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.254 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=5b3d10f134cb77fd] [msgID=438710451312263173] [segmentID=438693529932258194] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.254 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=5b3d10f134cb77fd] [segmentIDs="[438693529934662649,438693529929654422,438693529944676752,438693529927228405,438693529942273246,438693529932258194,438693529937266403,438693529939669757]"]
[2023/01/12 18:19:19.254 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=5b3d10f134cb77fd] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_55_438693529927026545v25, segmentIDs = []"] [duration=186.050473ms]
[2023/01/12 18:19:19.254 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=5b3d10f134cb77fd] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.254 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=5b3d10f134cb77fd] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_55_438693529927026545v25, segmentIDs = []"] [duration=186.137393ms]
[2023/01/12 18:19:19.255 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=5b3d10f134cb77fd] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.255 +00:00] [DEBUG] [querynode/impl.go:773] [tr/Search] [traceID=5b3d10f134cb77fd] [msg="search done in all shards"] [duration=186.356188ms]
[2023/01/12 18:19:19.255 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529939669728] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.255 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529932258369] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.256 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529939669840] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.258 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529929654104] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.260 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529939669583] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.261 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529937266058] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.262 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=40171f661264bf69] [msgID=438710451325370371] [segmentID=438693529932257816] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.262 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=40171f661264bf69] [segmentIDs="[438693529939669583,438693529929654104,438693529934662427,438693529944676883,438693529932257816,438693529927228421,438693529942273143,438693529937266102]"]
[2023/01/12 18:19:19.262 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=40171f661264bf69] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_56_438693529927026545v26, segmentIDs = []"] [duration=176.558838ms]
[2023/01/12 18:19:19.262 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=40171f661264bf69] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.262 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=40171f661264bf69] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_56_438693529927026545v26, segmentIDs = []"] [duration=176.649541ms]
[2023/01/12 18:19:19.262 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=40171f661264bf69] [numbers=3] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.262 +00:00] [DEBUG] [querynode/impl.go:773] [tr/Search] [traceID=40171f661264bf69] [msg="search done in all shards"] [duration=177.038322ms]
[2023/01/12 18:19:19.264 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529927228416] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.264 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529929654539] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.264 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529934662254] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.264 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=431d4bef9128619a] [segmentIDs="[438693529942273577,438693529927228416,438693529929654539,438693529944677347,438693529934662254,438693529932258004,438693529937266058,438693529939669728]"]
[2023/01/12 18:19:19.264 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=431d4bef9128619a] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_37_438693529927026545v7, segmentIDs = []"] [duration=160.057002ms]
[2023/01/12 18:19:19.264 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=431d4bef9128619a] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.265 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=431d4bef9128619a] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_37_438693529927026545v7, segmentIDs = []"] [duration=160.125463ms]
[2023/01/12 18:19:19.265 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529934662559] [segmentType=2] [loadIndex=false]
[2023/01/12 18:19:19.265 +00:00] [INFO] [querynode/search.go:97] ["search growing/sealed segments without indexes"] [traceID=431d4bef9128619a] [segmentIDs="[438693529934662559,438693529944677028,438693529927228399,438693529929654685,438693529932258369,438693529939669840,438693529942273491,438693529937266261]"]
[2023/01/12 18:19:19.265 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=431d4bef9128619a] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_40_438693529927026545v10, segmentIDs = []"] [duration=160.712528ms]
[2023/01/12 18:19:19.265 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=431d4bef9128619a] [numbers=1] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.265 +00:00] [DEBUG] [querynode/impl.go:915] [tr/SearchShard] [traceID=431d4bef9128619a] [msg="do reduce done in shard cluster, vChannel = by-dev-rootcoord-dml_40_438693529927026545v10, segmentIDs = []"] [duration=160.766868ms]
[2023/01/12 18:19:19.265 +00:00] [DEBUG] [querynode/result.go:86] [reduceSearchResultData] [traceID=431d4bef9128619a] [numbers=2] [targetNq=1] [targetTopk=20]
[2023/01/12 18:19:19.265 +00:00] [DEBUG] [querynode/impl.go:773] [tr/Search] [traceID=431d4bef9128619a] [msg="search done in all shards"] [duration=162.035403ms]
[2023/01/12 18:19:19.319 +00:00] [INFO] [gc/gc_tuner.go:81] ["GC Tune done"] ["previous GOGC"=30] ["heapuse "=71] ["total memory"=29595] ["next GC"=70] ["new GOGC"=30]
we can see many lines of logs show:
[2023/01/12 18:19:19.255 +00:00] [DEBUG] [querynode/segment.go:344] ["do search on segment done"] [traceID=431d4bef9128619a] [msgID=438710451325370372] [segmentID=438693529939669728] [segmentType=2] [loadIndex=false]
which indicates the searches are doing without indexes. Please make sure all the index tasks were completed, and then you can release the collection and reload the collection.
/assign @ThomasAlxDmy please try as suggested above, and feel free to keep us posted.
Thanks for helping @yanliang567 @xiaofan-luan
So I started to spin up a smaller environment so we can iterate faster and understand what's going on:

Collection Loaded => 1,265,800 records. I try creating/recreating the index and reloading the collection same effect, I still see some segment not being indexed.
In this setting I have 4 queries nodes (c6in.8xlarge): 32CPU | 64 GB RAM | 50 Gbps

I max at 850qps =>
Queried nbSearch: 1000, nbError: 0 records(1 vectors) in 1.187706286s (total:15m6.458642827s) => 842.46 vps (avg latency: 906.46ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records(1 vectors) in 1.211963839s (total:15m17.097020959s) => 825.76 vps (avg latency: 917.10ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records(1 vectors) in 1.185255229s (total:15m7.283401702s) => 843.88 vps (avg latency: 907.28ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records(1 vectors) in 1.181801638s (total:15m29.827552142s) => 846.74 vps (avg latency: 929.83ms, avg results: 20.00)
Queried nbSearch: 1000, nbError: 0 records(1 vectors) in 1.186050791s (total:14m25.938503893s) => 843.17 vps (avg latency: 865.94ms, avg results: 20.00)
Index node:
[2023/01/15 06:46:52.555 +00:00] [INFO] [indexcoord/garbage_collector.go:270] ["recycle index files"] [buildID=438763080441655109] ["meta files num"=6] ["chunkManager files num"=6]
[2023/01/15 06:46:52.555 +00:00] [INFO] [indexcoord/garbage_collector.go:283] ["index files recycle success"] [buildID=438763080441655109] ["delete index files num"=0]
[2023/01/15 06:46:54.898 +00:00] [DEBUG] [indexcoord/index_coord.go:772] ["IndexCoord GetIndexInfos"] [collectionID=438763080440024570] [indexName=] [segIDs="[438763080441653218]"]
[2023/01/15 06:46:54.899 +00:00] [INFO] [indexcoord/index_coord.go:829] ["IndexCoord DescribeIndex"] [collectionID=438763080440024570] [indexName=]
[2023/01/15 06:46:54.902 +00:00] [INFO] [indexcoord/index_coord.go:588] ["IndexCoord completeIndexInfo"] [collID=438763080440024570] [indexName=test]
[2023/01/15 06:46:54.902 +00:00] [DEBUG] [indexcoord/meta_table.go:651] ["IndexCoord get index states success"] [indexID=438763080441655077] [total=32] [None=0] [Unissued=0] [InProgress=0] [Finished=32] [Failed=0]
Query node:
[2023/01/15 06:54:30.654 +00:00] [INFO] [querynode/segment_loader.go:826] ["All data has been read, there is no more data"] [collectionID=438763080440024570] [channel=by-dev-rootcoord-dml_83] [msgID="CLkOEJO9ARgAIAA="]
[2023/01/15 06:54:30.654 +00:00] [INFO] [querynode/segment_loader.go:854] ["from dml check point load done"] [subName=querynode-delta-loader-15-438763080440024570-4678754740599929807] [timeTake=175.590902ms]
[2023/01/15 06:54:30.654 +00:00] [INFO] [msgstream/mq_msgstream.go:182] ["start to close mq msg stream"] ["producer num"=0] ["consumer num"=1]
time="2023-01-15T06:54:30Z" level=info msg="The consumer[264] successfully unsubscribed" consumerID=264 name=ahddj subscription=querynode-delta-loader-15-438763080440024570-4678754740599929807 topic="persistent://public/default/by-dev-rootcoord-dml_83"
time="2023-01-15T06:54:30Z" level=info msg="Closing consumer=264" consumerID=264 name=ahddj subscription=querynode-delta-loader-15-438763080440024570-4678754740599929807 topic="persistent://public/default/by-dev-rootcoord-dml_83"
time="2023-01-15T06:54:30Z" level=info msg="[Closed consumer]" consumerID=264 name=ahddj subscription=querynode-delta-loader-15-438763080440024570-4678754740599929807 topic="persistent://public/default/by-dev-rootcoord-dml_83"
[2023/01/15 06:54:30.658 +00:00] [INFO] [querynode/load_segment_task.go:130] ["LoadSegmentTask Execute done"] [collectionID=438763080440024570] [replicaID=438763104202653712] [msgID=2669]
[2023/01/15 06:54:30.658 +00:00] [INFO] [querynode/impl.go:501] ["loadSegmentsTask WaitToFinish done"] [collectionID=438763080440024570] [segmentIDs="[438763080440024583,438763080440229698]"] [nodeID=15]
[2023/01/15 06:54:30.658 +00:00] [INFO] [querynode/shard_cluster.go:281] ["ShardCluster update segment"] [collectionID=438763080440024570] [channel=by-dev-rootcoord-dml_83_438763080440024570v20] [replicaID=438763104202653712] [nodeID=15] [segmentID=438763080440024583] [state=3]
[2023/01/15 06:54:30.658 +00:00] [INFO] [querynode/shard_cluster.go:281] ["ShardCluster update segment"] [collectionID=438763080440024570] [channel=by-dev-rootcoord-dml_83_438763080440024570v20] [replicaID=438763104202653712] [nodeID=15] [segmentID=438763080440229698] [state=3]
[2023/01/15 06:54:30.658 +00:00] [INFO] [querynode/shard_cluster_version.go:91] ["Update shard cluster version"] [newVersionID=2] [newAllocation="{Node 15: [438763080440024583 438763080440229698] }"]
[2023/01/15 06:54:30.658 +00:00] [INFO] [querynode/impl_utils.go:48] ["LoadSegment transfer load done"] [traceID=208cd9180900ff18] [shard=by-dev-rootcoord-dml_83_438763080440024570v20] [segmentIDs="[438763080440024583,438763080440229698]"]
[2023/01/15 06:54:36.588 +00:00] [DEBUG] [querynode/flow_graph_service_time_node.go:70] ["update tSafe:"] [collectionID=438763080440024570] [tSafe=438767629472890881] [tSafe_p=2023/01/15 06:54:36.395 +00:00] [tsLag=193.300787ms] [channel=by-dev-rootcoord-dml_92_438763080440024570v29]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(768cc85123de985f)] [msg="enqueue done"] [duration=7.291µs]
[2023/01/15 06:56:09.321 +00:00] [INFO] [querynode/shard_cluster.go:929] ["segments distribution"] [nodeID=15] [segments="[438763080441653034]"]
[2023/01/15 06:56:09.320 +00:00] [DEBUG] [querynode/task_read.go:80] [tr/searchTask(523f606e4fa23962)] [msg="start to process"] [duration=1.809204ms]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(768cc85123de985f)] [msg="enqueue done"] [duration=4.01µs]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/impl.go:855] ["start do search"] [traceID=66d3b50ae876ac1b] [vChannel=by-dev-rootcoord-dml_79_438763080440024570v16] [segmentIDs="[]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(768cc85123de985f)] [msg="enqueue done"] [duration=15.015µs]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:173] ["query msg can do"] [collectionID=438763080440024570] [sm.GuaranteeTimestamp=2023/01/15 06:56:04.295 +00:00] [serviceTime=2023/01/15 06:56:08.845 +00:00] ["delta milliseconds"=-4550] [channel=by-dev-rootcoord-delta_91_438763080440024570v28] [msgID=438767653826068514]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(66d3b50ae876ac1b)] [msg="enqueue done"] [duration=441.318µs]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/impl.go:724] ["Received SearchRequest"] [traceID=4c6daa0bd718d9b3] [vChannels="[by-dev-rootcoord-dml_91_438763080440024570v28,by-dev-rootcoord-dml_85_438763080440024570v22,by-dev-rootcoord-dml_72_438763080440024570v9]"] [segmentIDs="[]"] [guaranteeTimestamp=438767652515348536] [timeTravel=18446744073709551615]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/impl.go:855] ["start do search"] [traceID=66d3b50ae876ac1b] [vChannel=by-dev-rootcoord-dml_68_438763080440024570v5] [segmentIDs="[]"]
[2023/01/15 06:56:09.321 +00:00] [INFO] [querynode/shard_cluster.go:927] ["cluster segment distribution"] [len=1] [partitionIDs="[]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/validate.go:50] ["read target partitions on historical replica"] [traceID=713d3e5adf43d7aa] [collectionID=438763080440024570] [partitionIDs="[438763080440024571]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/impl.go:816] ["start do subsearch"] [traceID=66d3b50ae876ac1b] [fromShardLeader=true] [vChannel=by-dev-rootcoord-dml_74_438763080440024570v11] [segmentIDs="[438763080441653265]"]
[2023/01/15 06:56:09.321 +00:00] [INFO] [querynode/shard_cluster.go:929] ["segments distribution"] [nodeID=15] [segments="[438763080441653118]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/impl.go:816] ["start do subsearch"] [traceID=5037a6c77dfb4690] [fromShardLeader=true] [vChannel=by-dev-rootcoord-dml_71_438763080440024570v8] [segmentIDs="[438763080441652639]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=6fcc0ec2ae51b0d4] [msgID=438767653826068505] [segmentID=438763080441653035] [segmentType=3] [loadIndex=true]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=713d3e5adf43d7aa] [msgID=438767653826068502] [segmentID=438763080440229696] [segmentType=3] [loadIndex=false]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/validate.go:50] ["read target partitions on historical replica"] [traceID=523f606e4fa23962] [collectionID=438763080440024570] [partitionIDs="[438763080440024571]"]
[2023/01/15 06:56:09.320 +00:00] [DEBUG] [querynode/validate.go:50] ["read target partitions on historical replica"] [traceID=523f606e4fa23962] [collectionID=438763080440024570] [partitionIDs="[438763080440024571]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(66d3b50ae876ac1b)] [msg="enqueue done"] [duration=278.024µs]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/impl.go:816] ["start do subsearch"] [traceID=768cc85123de985f] [fromShardLeader=true] [vChannel=by-dev-rootcoord-dml_89_438763080440024570v26] [segmentIDs="[438763080441653034]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=713d3e5adf43d7aa] [msgID=438767653826068502] [segmentID=438763080440024596] [segmentType=3] [loadIndex=true]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(5037a6c77dfb4690)] [msg="enqueue done"] [duration=9.245µs]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(5037a6c77dfb4690)] [msg="enqueue done"] [duration=14.553µs]
[2023/01/15 06:56:09.320 +00:00] [INFO] [querynode/shard_cluster.go:927] ["cluster segment distribution"] [len=1] [partitionIDs="[]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/impl.go:855] ["start do search"] [traceID=4c6daa0bd718d9b3] [vChannel=by-dev-rootcoord-dml_85_438763080440024570v22] [segmentIDs="[]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=523f606e4fa23962] [msgID=438767653826068503] [segmentID=438763080441652843] [segmentType=3] [loadIndex=true]
[2023/01/15 06:56:09.320 +00:00] [DEBUG] [querynode/impl.go:855] ["start do search"] [traceID=5037a6c77dfb4690] [vChannel=by-dev-rootcoord-dml_87_438763080440024570v24] [segmentIDs="[]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/impl.go:855] ["start do search"] [traceID=4c6daa0bd718d9b3] [vChannel=by-dev-rootcoord-dml_91_438763080440024570v28] [segmentIDs="[]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/impl.go:816] ["start do subsearch"] [traceID=66d3b50ae876ac1b] [fromShardLeader=true] [vChannel=by-dev-rootcoord-dml_79_438763080440024570v16] [segmentIDs="[438763080441653118]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(768cc85123de985f)] [msg="enqueue done"] [duration=7.449µs]
[2023/01/15 06:56:09.321 +00:00] [INFO] [querynode/shard_cluster.go:929] ["segments distribution"] [nodeID=15] [segments="[438763080441652637]"]
[2023/01/15 06:56:09.321 +00:00] [INFO] [querynode/shard_cluster.go:927] ["cluster segment distribution"] [len=1] [partitionIDs="[]"]
[2023/01/15 06:56:09.321 +00:00] [INFO] [querynode/shard_cluster.go:927] ["cluster segment distribution"] [len=1] [partitionIDs="[]"]
[2023/01/15 06:56:09.321 +00:00] [INFO] [querynode/shard_cluster.go:927] ["cluster segment distribution"] [len=1] [partitionIDs="[]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(66d3b50ae876ac1b)] [msg="enqueue done"] [duration=5.595µs]
[2023/01/15 06:56:09.321 +00:00] [INFO] [querynode/shard_cluster.go:929] ["segments distribution"] [nodeID=15] [segments="[438763080441653183]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/segment.go:330] ["start do search on segment"] [traceID=523f606e4fa23962] [msgID=438767653826068503] [segmentID=438763080441652972] [segmentType=3] [loadIndex=true]
[2023/01/15 06:56:09.321 +00:00] [INFO] [querynode/shard_cluster.go:929] ["segments distribution"] [nodeID=15] [segments="[438763080441653160]"]
[2023/01/15 06:56:09.321 +00:00] [INFO] [querynode/shard_cluster.go:929] ["segments distribution"] [nodeID=15] [segments="[438763080441652972]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(768cc85123de985f)] [msg="enqueue done"] [duration=27.569µs]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/impl.go:816] ["start do subsearch"] [traceID=5037a6c77dfb4690] [fromShardLeader=true] [vChannel=by-dev-rootcoord-dml_81_438763080440024570v18] [segmentIDs="[438763080441652637]"]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(5037a6c77dfb4690)] [msg="enqueue done"] [duration=11.049µs]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(5037a6c77dfb4690)] [msg="enqueue done"] [duration=4.598µs]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/task_read.go:78] [tr/searchTask(66d3b50ae876ac1b)] [msg="enqueue done"] [duration=18.769µs]
[2023/01/15 06:56:09.321 +00:00] [DEBUG] [querynode/impl.go:816] ["start do subsearch"] [traceID=4c6daa0bd718d9b3] [fromShardLeader=true] [vChannel=by-dev-rootcoord-dml_85_438763080440024570v22] [segmentIDs="[438763080441653183]"]
[2023/01/15 06:56:09.320 +00:00] [DEBUG] [querynode/impl.go:906] [tr/SearchShard] [traceID=2a01ed2e4445a893] [msg="do search done in shard cluster, vChannel = by-dev-rootcoord-dml_77_438763080440024570v14, segmentIDs = []"] [duration=2.010617ms]
CPU:
Something I don't understand is that I can't get more QPS out of the systems and limit doesn't seem to be either CPU or I/O:
Network I/O out:
Network I/O in:
Any idea what's going on?
@ThomasAlxDmy thank you for the detailed feedbacks. According to the logs above, it looks like the bottleneck is not on querynode parts, but we still need to double confirm that with full milvus logs. So
- could you please offer all the miluvs pod logs, including proxy, querycoord, querynode, indexcoord, indexnode, rootcoord, etcd.
- Do you happen to have Milvus metrics with grafana, so that we can help you to address which phase the search costs most in milvus.
- could you please share the etcd backup with us to help us understand the segments status. please read https://github.com/milvus-io/birdwatcher/blob/main/README.md to backup etcd metadata
/assign @ThomasAlxDmy
Here's the etcd backup! (unfortunately cluster is ansible/terraform I haven't set up monitoring yet). bw_etcd_ALL.230118-181249.bak.gz
/assign @liliu-z pls investigate on this issue when you have time
@ThomasAlxDmy Just give a quick check if we still tracking here. Since I noticed we are communicating offline through email,
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Rotten issues close after 30d of inactivity. Reopen the issue with /reopen
.
/unassign
/assign @zhagnlu
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Rotten issues close after 30d of inactivity. Reopen the issue with /reopen
.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Rotten issues close after 30d of inactivity. Reopen the issue with /reopen
.