The concept of the Unicode character set began in 1987, thanks to Joe Becker from Xerox and Mark Davis from Apple. The following year, Becker, Davis, and Lee Collins (currently of Xerox; formerly of Apple) began investigating the design and soon made the case for Han unification to ANSI, ISO. Unicode is, indeed, based on the historic evolution of the Chinese character set (Han). Several people from various high tech companies began holding bimonthly meetings in 1989. By the end of 1990 , an initial, full-review draft was created. In 1991, the group became the Unicode Consortium, a non-profit organization incorporated as Unicode, Inc. Version 1.0 became available to the public for the first time in 1992.
Vine, Andrea and Bill Hall. SDL International (1998). Articles>Language>Localization>Unicode
ISO 10646 ist ein Zeichensatz, der Schriftzeichen binären Codenummern zuordnet. Unicode hingegen ordnet den 2- bzw. 4-Byte-Code denselben Schriftzeichen zu, ergänzt die Definition aber um Zeicheneigenschaften, Implementationsregeln und Hinweise. Unicode ist eine private Organisation verschiedener kommerzieller Unternehmen, akademischer Einrichtungen und Anwendergruppen. ISO (International Standards Organisation, eine Unterorganisation der UNO) und Unicode arbeiten seit 1991 zusammen, um Diskrepanzen zwischen ISO 10646 und Unicode zu vermeiden.
Use the Unicode Database to Find Characters for XML Documents
The Unicode consortium is dedicated to maintaining a character set that allows computers to deal with the vast array of human writing systems. When you think of computers that manage such a large and complex data set, you think databases, and this is precisely what the consortium provides for computer access to versions of the Unicode standard. The Unicode Character Database comprises files that present detailed information for each character and class of character. The strong tie between XML and Unicode means this database is very valuable to XML developers and authors. In this article Uche Ogbuji introduces the Unicode Character Database and shows how XML developers can put it to use.
Ogbuji, Uche. IBM (2006). Articles>Language>Localization>Unicode
Understanding Bidirectional (BIDI) Text in Unicode
A little-understood corner of Unicode is its handling for bidirectional text (The spec is a little dry). While English languages are read left-to-right, plenty of scripts (notably Arabic and Hebrew) are read from right to left. When only a single direction of text is used in a document, it's fairly straight forward, but when texts with different directions are mixed in one document, some difficulty arises in determining direction. This document attempts to explain how bidirectional text in Unicode works and what this means for the web. In the Unicode standard, characters have a representational order in memory (which English speakers tend to think of as left to right, but is really start-to-finish in a file), which the bidirectional algorithm then operates on to determine the display characteristics.
Henderson, Cal. Iamcal (2009). Articles>Language>Localization>Unicode
This document is mainly intended for “ordinary” people who read the Unicode standard in order to get information about some particular characters or character processing issues that are important to them. The standard, though available online, is difficult to use without some help, and you can easily miss essential information when looking up things in it.
Tampereen Teknillinen Yliopisto (2005). Articles>Language>Localization>Unicode
There are 19 readers currently online: 1 registered user and 18 guests. Register.

![]()
![]()


![]()
![]()
![]()