ICT Assignment ASCII Table and UNI Code
ICT Assignment ASCII Table and UNI Code
3/24/2024
Unicode
ASCII, although widely adopted, is a very old standard (devised in early 1960s).
More recently, Unicode has become more prevalent. Unicode enables you to
handle character data and express that data across platforms in a uniform way.
Unicode comprises: set of code charts that handle vispual reference, a data
encoding method, a set of standard character encodings, a set of reference data
files, additional properties, including: Character properties, Rules to handle normalization,
rendering, display order (for languages that display right to left instead of left to right).
Unicode has largely succeeded the previous environment of myriad incompatible character sets,
each used within different locales and on different computer architectures. Unicode is used to
encode the vast majority of text on the Internet, including most web pages, and relevant
Unicode support has become a common consideration in contemporary software development.
History
The origins of Unicode can be traced back to the 1980s, to a group of individuals with
connections to Xerox's Character Code Standard (XCCS). In 1987, Xerox employee Joe
Becker, along with Apple employees Lee Collins and Mark Davis, started investigating the
practicalities of creating a universal character set. With additional input from Peter Fenwick
and Dave Opstad, Becker published a draft proposal for an "international/multilingual text
character encoding system in August 1988, tentatively called Unicode". He explained that "the
name 'Unicode' is intended to suggest a unique, unified, universal encoding."
Unicode text is processed and stored as binary data using one of several encodings, which
define how to translate the standard's abstracted codes for characters into sequences of
bytes. The Unicode Standard itself defines three encodings: UTF-8, UTF-16, and UTF-32,
though several others exist. Of these, UTF-8 is the most widely used by a large margin, in part
due to its backwards-compatibility with ASCII.
Versions
Unicode 15.1, the latest version, was released on 12 September 2023. It is a minor version
update to version 15.0—released on 13 September 2022—which added a total of 4,489 new
characters, including two new scripts, an extension to the CJK Unified Ideographs block, and
multiple additions to existing blocks. 33 new emoji were added, such as the "wireless" (network)
symbol and additional colored hearts.
Version 15.1 of the standard defines 149813 characters and 161 scripts used in various
ordinary, literary, academic, and technical contexts. Unicode is ultimately capable of encoding
more than 1.1 million characters.
Version 15.1 characters’ summary
Character type Count Character type count Character type count
Uppercase letter 1831 Number other 915 Symbol, modifier 125
Lowercase letter 2233 Punctuation connector 10 Symbol, other 6639
Title case 31 Punctuation dash 26 Separator, space 17
modifier 397 Punctuation open 79 Separator, line 1
Lower, other 132234 Punctuation close 77 Separator, paragraph 1
Mark, no spacing 1985 Punctuation initial quote 12 Other, control 65
Mark, spacing 452 Punctuation final quote 10 Other, format 170
Mark, enclosing 13 Punctuation other 628 Other, surrogate 2048
Number, decimal 680 Symbol, math 948 Other, private use reserved
Number, letter 236 Symbol, currency 63 Other, not assigned reserved