Multicode: A Truly Multilingual Approach to Text Encoding

Authors:
Muhammad F. Mudawwar
Affiliations:
-
Venue:
Computer
Year:
1997

Citing 2
Cited 5

The Unicode standard, version 2.0

The Unicode standard, version 2.0
Inside MacIntosh

Inside MacIntosh

Simplifying array processing languages

APL '98 Proceedings of the APL98 conference on Array processing language
A 7-bit transformation format of ISO 10646 for internet mails

Computer Standards & Interfaces
Chinese-Japanese cross language information retrieval: a Han character based approach

WWSM '00 Proceedings of the ACL-2000 workshop on Word senses and multi-linguality - Volume 8
On the cost of multilingualism in database systems

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Chinese-Japanese cross language information retrieval: a Han character based approach

WorkSense '00 Proceedings of the ACL-2000 Workshop on Word Senses and Multi-Linguality

Quantified Score

Hi-index	4.10

Visualization

Abstract

The global market for software written in native languages has spurred the interest in multilingual text encoding. Although ASCII is the accepted standard and works well for English, its 7-bit structure provides codes for only 128 characters, a number insufficient to represent some languages. The limited number of character codes also presents problems when you want to mix languages. Unicode is one proposed solution. A consortium met in 1991 to develop and promote Unicode. They chose a 16-bit structure and, for simplicity, did not include information on what language the code represents. Although this does provide a general, comprehensive encoding structure, Unicode's attempt to unify the character representation for all languages can lead to unwieldy files and more work for programmers. Because of these problems, the author created Multicode in 1996. Its most important feature is the use of multiple character sets, each of which can represent a particular language. Multicode allows 8-bit representations-adequate for most languages-as well as the 16-bit representations required for languages with more characters, such as Japanese, Chinese, and Korean. Through switch characters, Multicode can support files that incorporate more than one language.