Unicode is essentially a map or catalogue where characters used by the languages of the world have been assigned a number. This number is unique for this character: ‘A’ is 41, ‘!’ is 21, and ‘ə’ is 259. Your computer does not deal internally with letters and punctuation, only these reference numbers. The problem is that there are several competing cataloguing systems out there. Some of the earliest Latin-based catalogues (or code pages) were ASCII and ANSI. Each could only contain a maximum of 256 characters, obviously not enough for all of the world’s languages. But these two systems are still with us today; many older operating systems do not accomodate Unicode at all. Most Western European languages can be sufficiently typed in ANSI; thus many webpages and software applications do not bother with Unicode. For the majority of Native languages, using Unicode is essential. Cherokee and Syllabics language materials can not be consistently typed any other way, and ASCII/ANSI fonts do not allow for many of the characters found in Native languages, such as ƛ θ k̓ ʷ etc. Because of the legacy of ASCII/ANSI, much web material and e-mail services use these older code pages as default. This isn’t a big deal if the text is in English or Icelandic, but most languages require characters outside ASCII/ANSI: the French ligatured ‘œ’ is unavailable. The most obvious sign that your web browser is reading a Unicode page incorrectly is when you see this type of gobble-dee-gook: |
a) “VÌḿâ€á‘²á”á˜ë½€ë½€í•´ b) ᏣᎳᎩᓀᐦ c) ???????????????????????? d) empty boxes The garbage in type a) means that your browser is reading Unicode characters as ASCII/ANSI. This can be fixed on your end. In the View drop-down menu in your browser, select (Character) Encoding. A list of possible encodings appears; it is probably already set for ‘Western European’. Change this to ‘Unicode’ Type b) means that the text was typed in software which is not Unicode compliant. I don’t know of a way to make this legible. Type c) means that the text was typed in software which is not Unicode compliant. I don’t know of a way to make this legible. Type d) indicates that the text was typed in Unicode, your browser is set properly for Unicode, but you are using a font which doesn’t contain the required characters. The solution is to install an appropriate font. Most Native languages can be viewed using a languagegeek font. |
Microsoft Internet Explorer |
Microsoft Windows XP |
Netscape |
Opera |
Microsoft Internet Explorer
OR
Microsoft Windows XP
OR Netscape
Opera
|
|
|
|
Last Update: |