Back-End/Java_1

[Java] UnicodeSystem

리딩리드 2016. 6. 10. 15:30

UnicodeSystem

- Unicode is a universal international standard character encoding that is capable of representing most of the world's written languages


1. Why java uses Unicode System

There were many language standards

- ASCII : for the United States

- ISO 8859-1 : for Western European Language

- KOI-8 : for Russian

- GB18030 and BIG-5 : for chinese and so on


2. Problem

- A particular code value corresponds to different letters in the various language standards

- The encodings for languages with large character sets have variable length, Some common characters are encoded as single bytes, other require two or more byte


3. To solve the problem

A new language standard was developed i.e. Unicode System

In unicode, character holds 2 byte, so java also uses 2byte for charaters


4. etc

- lowest value : \u0000

- highest value : \uFFFF