html-agility-pack icon indicating copy to clipboard operation
html-agility-pack copied to clipboard

HtmlEntity.DeEntitize not working for more than two byte entities (like emojis)

Open tompazourek opened this issue 7 years ago • 2 comments

Example:

HtmlAgilityPack.HtmlEntity.DeEntitize("👍")

Current output (using version 1.8.4):

"&##128077;"

Expected output:

"👍" // aka "\uD83D\uDC4D"

Note: The DeEntitize works for two-byte HTML entities, e.g. Ԁ gets converted to Ԁ, which is correct. But when it's more bytes, it no longer works, and it actually malforms the input by adding additional # character.

tompazourek avatar Jun 04 '18 10:06 tompazourek

Hello @tompazourek ,

Thank you for reporting, we will look at it.

Best Regards,

Jonathan


Performance Libraries context.BulkInsert(list, options => options.BatchSize = 1000); Entity Framework ExtensionsBulk OperationsDapper PlusLinqToSql Plus

Runtime Evaluation Eval.Execute("x + y", new {x = 1, y = 2}); // return 3 C# Eval FunctionSQL Eval Function

JonathanMagnan avatar Jun 07 '18 22:06 JonathanMagnan

Hello,

Is there any update to this one?

simmikava avatar Oct 10 '22 12:10 simmikava