XEmacs -- Emacs: The Next Generation

Unicode internal encoding

Stephen J. Turnbull <stephen@xemacs.org>

The traditional Mule leading-byte representation causes a number of problems. Because the internal representation is very close to the ISO 2022 family, the most common external coding system, it conceals difficulties in the implementation of display and I/O streams. (ISO 2022 can be considered a superset of ISO 8859, and is also used by most multibyte encodings, e.g. those for East Asian languages.) It also tends to lead to bugs in autosave files and the like. A Unicode internal representation would alleviate both problems and would be directly interchangeable with many external programs as well. "UTF-2000" refers to the implementation developed by Tomohiko Morioka, which is likely to be a source of much code.
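As a rough illustration of the difference between an ISO 2022 external coding system and a Unicode one, the sketch below encodes the same short Japanese string both ways using Python's standard codecs. It is not XEmacs or UTF-2000 code and does not show Mule's internal leading-byte format; the helper name encode_both and the sample string are assumptions made up for this example.

```python
# Illustrative sketch only: compares an ISO 2022 encoding with UTF-8
# using Python's built-in codecs. Not taken from XEmacs or UTF-2000.

ESC = b"\x1b"  # ISO 2022 switches character sets with escape sequences


def encode_both(text):
    """Return (iso2022_bytes, utf8_bytes) for the given string.

    encode_both is a hypothetical helper named for this example.
    """
    iso = text.encode("iso2022_jp")  # stateful: charset designated by escapes
    utf8 = text.encode("utf-8")      # stateless: no escape sequences
    return iso, utf8


if __name__ == "__main__":
    sample = "日本語"  # arbitrary sample text
    iso, utf8 = encode_both(sample)

    # The ISO 2022 stream carries charset-switching escapes around the text.
    print("ISO-2022-JP:", iso)
    print("contains ESC:", ESC in iso)

    # The UTF-8 stream encodes each character directly.
    print("UTF-8:", utf8)
    print("contains ESC:", ESC in utf8)

    # Both decode back to the same text.
    print(iso.decode("iso2022_jp") == utf8.decode("utf-8") == sample)
```

The point of the comparison is only that the ISO 2022 byte stream depends on shift state set by escape sequences, while the Unicode encoding does not; how that maps onto the Mule and UTF-2000 internals is outside what this sketch shows.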

Status

UTF-2000 (Unicode as internal encoding). I've been puttering away at this, but as maintained by Morioka, it's a moving target and a research vehicle, not a production design. Not ready for prime time.

Open bugs

None.

Other open issues

None.

Discussion

None.

Closed bugs

None.

 
 
