pucu
pucu copied to clipboard
PUCU Pascal UniCode Utils Libary
Decomposing and normalizing the letter Å (Latin Capital Letter A with Ring Above, codepoint `$00C5`) produces the sequence `$0041 $030A`. This is correct. However, composing the sequence `$0041 $030A` produces...
This is hard to build on Linux I always have to do it like this: ``` $ fpc PUCUConvertUnicode.dpr .. $ ./PUCUConvertUnicode .. $ fpc -Twin32 PUCUGenCodePages.dpr Error: Illegal parameter:...
the W3C defines \w as [#X0000-#x10FFFF]-[\p{P}\p{Z}\p{C}], see flre/#73
In Delphi LongInt and LongWord are platform dependant types: 32-bit on Windows and 64-bit on Linux, iOS, and Android. In FreePascal LongInt and LongWord are 32-bit on all platforms. I...
Some characters need special handling because they turn into multiple characters rather than one when case converting, as in "SpecialCasing.txt". E.g. when converting to lower case `İ` becomes something like`0069...
``` function PUCUUnicodeIsWhiteSpace(c:TPUCUUInt32):boolean; {$ifdef caninline}inline;{$endif} begin //result:=UnicodeGetCategoryFromTable(c) in [PUCUUnicodeCategoryZs,PUCUUnicodeCategoryZp,PUCUUnicodeCategoryZl]; result:=((c>=$0009) and (c=$2000) and (c
I went through PUCU.pas as carefully as I could, but I couldn't find an answer to changing case (i.e. upper/lower/title) of a codepoint based on language. Let me try to...