future_cxx icon indicating copy to clipboard operation
future_cxx copied to clipboard

cXXXX - Unicode Conversions, Encoding Registries, and More

Open ThePhD opened this issue 2 years ago • 2 comments

ThePhD avatar Jul 25 '22 15:07 ThePhD

@h-vetinari Take a look: https://thephd.dev/_vendor/future_cxx/papers/C%20-%20Restartable%20and%20Non-Restartable%20Character%20Functions%20for%20Efficient%20Conversions.html

It's got new sections to (hopefully) kill of the wchar_t requirement issues voiced last meeting. I'm getting to work on the benchmarks, and hopefully it'll be published in the Journal of Open Source Software (JOSS) before our next meeting in January.

ThePhD avatar Aug 10 '22 19:08 ThePhD

Some initial benchmarks, since I'm gonna need these for the updated paper. I plan on improving cuneicode but, for now, it's easy enough to prove that cuneicode_registry_fast can compete (by a slight sleight-of-hand, but still works nevertheless).

Conversion Tests - UTF-16 to UTF-8 (Well-Formed) Conversion Tests - UTF-16 to UTF-32 (Well-Formed) Conversion Tests - UTF-32 to UTF-8 (Well-Formed) Conversion Tests - UTF-32 to UTF-16 (Well-Formed) Conversion Tests - UTF-8 to UTF-16 (Well-Formed) Conversion Tests - UTF-8 to UTF-32 (Well-Formed)

ThePhD avatar Aug 16 '22 03:08 ThePhD