SmartEmbed
SmartEmbed copied to clipboard
Function level tokenizer
Hi, great repo. Could you share the Normalizer and Java Parser code for function_level
to regenerate token list similar to your function_normalized_tokens
and function_tokens
in the original processed dataset ?