A 7-bit transformation format of ISO 10646 for Internet mails

Autor: Pei-Chi Wu
Rok vydání: 2002
Předmět:
Zdroj: Computer Standards & Interfaces. 24:247-255
ISSN: 0920-5489
DOI: 10.1016/s0920-5489(01)00106-4
Popis: ISO 10646 Universal Character Set (UCS) is a 31-bit character-encoding scheme. UTF-7 is a UCS transformation format (UTF) designed mainly for 7-bit mail transports. For compatibility with character set such as EBCDIC, it utilizes only a safe subset of ASCII. When this compatibility is not needed, such a 7-bit encoding can be much improved. In this paper, we propose a transformation format that utilizes the full set of ASCII. The format is designed to be a character set in text/plain media type of Internet mails. Our results show that this format outperforms UTF-7 in space efficiency.
Databáze: OpenAIRE