RyanHTR
Results: 1 comment by RyanHTR
> 2. (a) We only use masks for detection data that have annotations. For other data, we do not separate phrases in one sentence. (b) We clip the sentence to...