Fork me on GitHub

Charsets experiments

/!\ CPU intensive /!\

Comparing non-ASCII characters on legacy charsets


Unicode: all code points (0x0 - 0x10FFFF)


Unicode 6.3: all assigned code points



How many bytes are used by each assigned Unicode character in each encoding


Unicode 8 glyphs names


kDefinitions of all Han code points in Unicode 8