Indic-Computing Logo

Tutorial: Computers and Indian Languages

SourceForge Logo
Home Project Documentation Mailing Lists Site Map

The Indic-Computing Project > Indic-Computing Handbook > Getting Started > Tutorial: Computers and Indian Languages

Chapter 2 Tutorial: Computers and Indian Languages

Table of Contents
2.1 Introduction
2.2 Character Encodings
2.3 Fonts
2.4 The User Interface: Data Input and Output
2.5 Locales
2.6 Input methods
2.7 Output methods
2.8 Representing Indian Languages on a Computer
Author: Joseph Koshy.

Caution: This tutorial is in the process of being rewritten and revised. It has been published on our site to facilitate reviews--it is not yet at a stage where it is ready for public consumption. Please pardon the errors and please help us improve the tutorial.

In this tutorial we will take a peek ``under the hood'' and examine how computers actually process text. We will talk about character encodings, input methods, fonts, locales and many other things that the casual user is not aware of when using computers.

However, these concepts are very important for software developers wanting to add support for new languages to computers.

This tutorial covers the basics; the special issues that come up when processing Indian scripts and languages get detailed coverage in other parts of the Handbook.

After reading and digesting this tutorial, you should be able to:

After reading and digesting this tutorial you will be in a position to use the rest of the material in this Handbook in an effective manner.

The tutorial is organized as follows:

2.1 Introduction

Computers are introduced to neophytes as ``number crunching machines'', as devices related to the ancient abacus, only more electronic. Yet, for the most part, the modern computer spends an insignificant portion of its time doing numeric calculation. Instead, the great utility of computers comes from the fact that they are excellent symbol manipulation machines.

To a computer, human readable text is merely a sequence of symbols; symbols that it can process and transform at blinding speed, in the precise manner written down in the software program that it is executing.

Here is a list of a few typical text transformation processes that computers are called on to do:

Clearly, these diverse tasks are possible only if the underlying software architecture has been designed to enable these in the first place. In subsequent sections of this tutorial we will examine the foundations over which computer processing of text is built.

This, and other project documentation, can be downloaded from [ http://indic-computing.sourceforge.net/documentation.html ].


Copyright © 2001--2009 The Indic-Computing Project.
Contact: jkoshy
View document revision history
Built With WebMake
Site Search Google