Indic-Computing Logo

Character Encodings

SourceForge Logo
Home Project Documentation Mailing Lists Site Map

The Indic-Computing Project > Indic-Computing: Frequently Asked Questions > Character Encodings

Chapter 4 Character Encodings

Frequently asked questions about character encodings.

1. ISCII
4.1.1. What is ISCII?
2. Unicode
4.2.1. What is Unicode?
4.2.2. How do I display a vowel sign without attaching it to a consonant?
3. KSCLP
4.3.1. What is KSCLP?
4. General information
4.4.1. Where can I find out more about character encodings?

1. ISCII

Questions about ISCII

4.1.1. What is ISCII?

2. Unicode

Questions about Unicode

4.2.1. What is Unicode?

Unicode is a character set encoding that attempts to encode nearly all the living languages of the world.

More information about Unicode can be found at the unicode.org website.

4.2.2. How do I display a vowel sign without attaching it to a consonant?

The canonical means of exhibiting a combining mark such as a vowel sign in isolation is to apply it to U+00A0 NO-BREAK SPACE.

On Microsoft operating systems the required sequence is U+0020 SPACE, U+200D ZERO WIDTH JOINER, followed by the vowel sign.

3. KSCLP

Questions about the KSCLP encoding

4.3.1. What is KSCLP?

KSCLP stands for ``Kannada Standard Code for Language Processing''. It is a character set encoding promoted by the Kannada Ganaka Parishat.

4. General information

4.4.1. Where can I find out more about character encodings?

You could start at the Open Directory Project's resource page on character encodings or at Google's enhanced version of the same page.

This, and other project documentation, can be downloaded from [ http://indic-computing.sourceforge.net/documentation.html ].


Copyright © 2001--2009 The Indic-Computing Project.
Contact: jkoshy
View document revision history
Built With WebMake
Site Search Google