spiral icon indicating copy to clipboard operation
spiral copied to clipboard

Ronin splits

Open rsuhaibani opened this issue 6 years ago • 2 comments

The identifier [unloadAssemblies] has been divided as ['unload', 'Ass', 'embl', 'ies']

rsuhaibani avatar Oct 08 '18 18:10 rsuhaibani

Thanks for your report. Sometimes, some identifiers will inevitably be split suboptimally, and I guess this is an example of that. As mentioned in the README section on performance, the heuristic nature of Ronin makes it hard to be perfect for all cases. I think the only thing that could be done here is to retrain it on more/other examples or to come up with a different algorithm altogether.

mhucka avatar Oct 09 '18 15:10 mhucka

Clear! So, I can train it by myself on other examples and correct the results? I am working on analyzing more than 100K identifiers. Is it easy to do that?

rsuhaibani avatar Oct 09 '18 16:10 rsuhaibani