Description: |
The ISO 10646 Universal Character Set (UCS) is a 31-bit character-encoding scheme. UTF-7 is a UCS transformation format (UTF) designed mainly for 7-bit mail transports. For compatibility with character sets such as EBCDIC, it uses only a safe subset of ASCII. When this compatibility is not needed, such a 7-bit encoding can be improved considerably. In this paper, we propose a transformation format that uses the full ASCII set. The format is designed to serve as a character set for the text/plain media type of Internet mail. Our results show that this format is more space-efficient than UTF-7. |