
Add sikuli actions associated with ocr

Open R-Loic opened this issue 7 years ago • 5 comments

Hello, could you please have a look at the OCR technology associated with Sikuli? OCR makes it possible to read text in a picture, and it would be great if we could, for example, click on that text. Thank you!

R-Loic avatar Jul 13 '17 13:07 R-Loic

Hello @R-Loic, you can now use 2 identifiers with Sikuli: picture and text. If you set picture=, it will focus on the picture. If you set text=, it will find the text in the page and then focus on it.

bcivel avatar Aug 02 '17 23:08 bcivel

Hello @bcivel, the implementation with "text=" doesn't work every time. For some words Sikuli returns this error: [image] The action is Click on text=Monsieur, yet Cerberus/Sikuli tries to get an image. Could you have a look? Thanks!

R-Loic avatar Aug 22 '17 08:08 R-Loic

I got some weird behavior with "text=":

[image]

The control is VerifyElementPresent. It returns KO with the following message: Element 'Excel' is present on the page. The execution time is 0 ms. It seems Sikuli didn't search for it.

R-Loic avatar Aug 22 '17 12:08 R-Loic

When Sikuli doesn't find the text, it returns: [image] It seems that text made of several words is not handled well.

R-Loic avatar Aug 22 '17 12:08 R-Loic

Hello @R-Loic, in fact the two identifiers (text and picture) are used by Cerberus to distinguish between actions that need to upload and forward a picture (picture=) and actions that only forward text (text=).
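
To illustrate the split described above, here is a minimal sketch in Python. It is not the Cerberus implementation: the names `Identifier`, `parse_identifier` and `needs_upload` are hypothetical and only show how a "picture=" or "text=" prefix could decide whether a picture has to be uploaded before the action is forwarded.

```python
from typing import NamedTuple


class Identifier(NamedTuple):
    """Hypothetical holder for a parsed identifier."""
    kind: str   # "picture" or "text"
    value: str  # what follows the "=" sign


def parse_identifier(raw: str) -> Identifier:
    """Split 'picture=...' or 'text=...' into its kind and value."""
    kind, sep, value = raw.partition("=")
    kind = kind.strip().lower()
    if not sep or kind not in ("picture", "text"):
        raise ValueError(f"expected 'picture=' or 'text=' prefix, got {raw!r}")
    return Identifier(kind, value.strip())


def needs_upload(identifier: Identifier) -> bool:
    """Only picture identifiers require a file to be uploaded and forwarded."""
    return identifier.kind == "picture"
```

With this sketch, `parse_identifier("text=Monsieur")` yields a text identifier that is forwarded as-is, while a "picture=" identifier is flagged for upload first.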

Once forwarded to Sikuli, you only have 1 API (for example: s.click(element)). By default, Sikuli first tries to load the picture; if that fails, it logs the error you got and then tries to find the text. This is the internal behavior of Sikuli.
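
The picture-first, text-second behavior described above can be sketched as follows. This is an illustration in Python, not Sikuli's code: `locate`, `ImageNotFound`, `find_image` and `find_text` are hypothetical stand-ins for whatever Sikuli does internally, and the finders are passed in as plain callables so the example stays self-contained.

```python
from typing import Callable, List, Optional, Tuple

Region = Tuple[int, int, int, int]  # x, y, width, height


class ImageNotFound(Exception):
    """Raised by the hypothetical image finder when no picture can be loaded."""


def locate(
    target: str,
    find_image: Callable[[str], Region],
    find_text: Callable[[str], Optional[Region]],
    log: List[str],
) -> Optional[Region]:
    """Try the target as a picture first; on failure, log it and fall back to text."""
    try:
        return find_image(target)
    except ImageNotFound as exc:
        # This logged failure corresponds to the error seen in the report,
        # even though the text search that follows may still succeed.
        log.append(f"image lookup failed for {target!r}: {exc}")
    return find_text(target)
```

Under these assumptions, an action on text=Monsieur would always produce one image-lookup error in the log before the text search runs, which matches the error reported above.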

For the other issues, I'll have a look. But I can already confirm that Sikuli has a lot of difficulty working with multiple words.

bcivel avatar Aug 22 '17 20:08 bcivel