
Saturday, May 16, 2020

UNUM 3.2: Updated to Unicode 13

Version 3.2 of UNUM is now available for downloading. Version 3.2 incorporates the Unicode 13.0.0 standard, released on March 10th, 2020. The update to Unicode adds support for four new scripts, additional CJK (Chinese, Japanese, and Korean) symbols, 55 new emoji, and symbols from legacy computer and teletext systems and Creative Commons licenses. There are a total of 143,859 characters in 13.0.0, of which 5,930 are new since 12.1.0. (UNUM also supports an additional 65 ASCII control characters, which are not assigned graphic code points in the Unicode database.)

This is an incremental update to Unicode. There are no structural changes in how characters are defined in the databases, and other than the presence of the new characters, the operation of UNUM is unchanged.

UNUM also contains a database of HTML named character references (the sequences like “&lt;” you use in HTML source code when you need to represent a character which has a syntactic meaning in HTML or which can't be directly included in a file with the character encoding you're using to write it). There have been no changes to this standard since UNUM 2.2 was released in September 2017, so UNUM 3.2 will behave identically when querying these references except, of course, that numerical references to the new Unicode characters will be interpreted correctly.
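To make the distinction between named and numeric references concrete, here is a minimal sketch in Python. It does not use UNUM itself or reflect how UNUM is implemented; it relies only on Python's standard `html` and `unicodedata` modules, and the helper name `describe_reference` is made up for this illustration. Whether the Unicode 13 character gets a name depends on the Unicode version bundled with your Python, which the last line prints.

```python
import html
import unicodedata
from html.entities import html5  # standard table of HTML5 named references


def describe_reference(ref: str) -> str:
    """Resolve an HTML character reference and describe each resulting character.

    Hypothetical helper for illustration only; not part of UNUM.
    """
    text = html.unescape(ref)
    parts = []
    for ch in text:
        # Characters newer than this Python's Unicode tables have no name here.
        name = unicodedata.name(ch, "(no name in this Python's Unicode tables)")
        parts.append(f"U+{ord(ch):04X} {name}")
    return "; ".join(parts)


# A named reference: looked up in the HTML named character reference table.
print(describe_reference("&lt;"))

# A numeric reference to U+1FAD0, a code point first assigned in Unicode 13.0.
# The code point always resolves; only the name lookup depends on table version.
print(describe_reference("&#x1FAD0;"))

# The Unicode version this Python's character database was built from.
print("Unicode tables:", unicodedata.unidata_version)
```

The named reference depends on the HTML table, which has not changed, while the numeric reference depends only on the Unicode database, which is why an update like this one affects the latter and not the former.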

UNUM Documentation and Download Page

Posted at 13:26