Welcome to SP!  -
Rendering special characters in SP pages
Article
 
Parents 
Articles
 

Rendering special characters in SP pages

  Featured on the Front Page
Rendering special characters in SP pages

Page Type: Article

Object Title: Rendering special characters in SP pages

 

Page By: peterbud

Created/Edited: Aug 2, 2007 / Sep 8, 2008

Object ID: 319129

Hits: 5849 

Page Score: 91.14%  - 34 Votes 

Vote: Log in to vote

 

Introduction


Several of us have been working with mountains/areas/etc. having other than English names. Given the variety of languages and the multitude of special characters they use, sometimes it gives a hard time to get "native" names appear right on the screen. Of course, there is the option to neglect the special characters and take what's closest in the "default" Latin character set. While this is practical for saving time and effort, I think we loose quite a bit of "spice" this way. So, I've tried to hunt down all the special characters I needed for my pages so far - some were easy to get by, some not.

Several special characters can be copy-pasted directly into the SP editor without any problem, from another webpage written in the given language (for this you have to search one on the web and find the character you wanted). If you hit the submit button, and then open the page for editing again, a couple of times you'll see the special characters in the text without any change. In other cases, they will get automatically converted to a code, having the following format: &#number; ("number" is a three or four digit number). In the remaining cases, copy-pasting does not work properly for some reason (you get something different from the character you wanted) and you will have to insert the code yourself manually.

To save you from having to look up native pages and hunting down codes on your own, here's a summary of special characters for a few languages. There are some overlappings between some of them - for good orientation I listed these "shared" characters at each language that uses them. Upper and lower case characters are listed in parallel within the tables. If you need one of them, first try just to simply copy-paste it from here. When that does not work, use the code format mentioned above, inserting the appropriate numbers. In case of Cyrillic alphabets, there is an extra column for the romanization of characters. Further resources on alphabets and encoding are listed in the External Links section

Some further languages would be useful to include. If you have information about those that are missing at the moment, you're very welcome to share it, just drop me a PM and we'll tailor the page further. Thanks to Lolli and Dinaric-ZG for already contributing to the page (Scandinavian and Albanian sections).

Suggestion on use: While I would encourage using the appropriate characters in the body text of pages, it is better to avoid using them in page titles and headers, as those are used to generate automatic links, and special characters can occasionally mess them up.


Special characters collection

Albanian


characterinsert numbercharacterinsert number
Ç199ç231
Ë203ë235


Bulgarian Cyrillic (complete)


characterinsert numbercharacterinsert numberromanization*
А1040а1072a
Б1041б1073b
В1042в1074v
Г1043г1075g
Д1044д1076d
Е1045е1077e
Ж1046ж1078zh
З1047з1079z
И1048и1080i
Й1049й1081y
К1050к1082k
Л1051л1083l
М1052м1084m
Н1053н1085n
О1054о1086o
П1055п1087p
Р1056р1088r
С1057с1089s
Т1058т1090t
У1059у1091u
Ф1060ф1092f
Х1061х1093h
Ц1062ц1094ts
Ч1063ч1095ch
Ш1064ш1096sh
Щ1065щ1097sht
Ъ1066ъ1098a
Ь1068ь1100y
Ю1070ю1102yu
Я1071я1103ya


*according to the official Bulgarian transliteration


Croatian


characterinsert numbercharacterinsert number
Ć262ć263
Č268č269
Đ272đ273
Š352š353
Ž381ž382


Czech


characterinsert numbercharacterinsert number
Á193á225
Č268č269
Ď270ď271
É201é233
Ě282ě283
Í205í237
Ň327ň328
Ó211ó243
Ř344ř345
Š352š353
Ť356ť357
Ú218ú250
Ů366ů367
Ý221ý253
Ž381ž382


German


characterinsert numbercharacterinsert number
Ä196ä228
Ö214ö246
Ü220ü252
ß223


Szet (ß) is used only at the end of words.


Greek


characterinsert numbercharacterinsert number
Α913α945
Β914β946
Γ915γ947
Δ916δ948
Ε917ε949
Ζ918ζ950
Η919η951
Θ920θ952
Ι921ι953
Κ922κ954
Λ923λ955
Μ924μ956
Ν925ν957
Ξ926ξ958
Ο927ο959
Π928π960
Ρ929ρ961
Σ931σ , ς963, 962
Τ932τ964
Υ933υ965
Φ934φ966
Χ935χ967
Ψ936ψ968
Ω937ω969


Second lowercase 's' (ς) is used only at the end of words.


Hungarian


characterinsert numbercharacterinsert number
Á193á225
É201é233
Í205í237
Ó211ó243
Ö214ö246
Ő336ő337
Ú218ú250
Ü220ü252
Ű368ű369


Polish


characterinsert numbercharacterinsert number
Ą260ą261
Ć262ć263
Ę280ę281
Ł321ł322
Ń323ń324
Ó211ó243
Ś346ś347
Ź377ź378
Ż379ż380


Portuguese


characterinsert numbercharacterinsert number
À192à224
Á193á225
Â194â226
É201é233
Ê202ê234
Ç199ç231
Í205í237
Ó211ó243
Ô212ô244
Õ213õ245
Ú218ú250
Ü220ü252


Romanian


characterinsert numbercharacterinsert number
Ă258ă259
Â194â226
Î206î238
Ş350ş351
Ţ354ţ355


Russian Cyrillic (complete)


characterinsert numbercharacterinsert numberromanization*
А1040а1072a
Б1041б1073b
В1042в1074v
Ґ1168ґ1169g
Д1044д1076d
Е1045е1077e, ye
Ё1045ё1077ё, yё
Ж1046ж1078ž, zh
З1047з1079z
И1048и1080i
Й1049й1081j
К1050к1082k
Л1051л1083l
М1052м1084m
Н1053н1085n
О1054о1086o
П1055п1087p
Р1056р1088r
С1057с1089s
Т1058т1090t
У1059у1091u
Ф1060ф1092f
Х1061х1093h, kh
Ц1062ц1094c, ts
Ч1063ч1095č, ch
Ш1064ш1096š, sh
Щ1065щ1097šč, ŝ, shch
Ъ1065ъ1097"
Ы1065ы1097y
Ь1065ь1097'
Э1065э1097è, e
Ю1065ю1097ju, yu, û
Я1065я1097ja, ya, â


*according to the UN / national transliteration standars / ALA-LC


Scandinavian languages (Danish, Finnish, Norwegian, Swedish)


characterinsert numbercharacterinsert number
Ä196ä228
Å197å229
Ø216ø248
Æ198æ230
Ö214ö246


Slovak


characterinsert numbercharacterinsert number
Á193á225
Ä196ä228
Č268č269
Ď270ď271
É201é233
Í205í237
Ĺ313ĺ314
Ľ317ľ318
Ň327ň328
Ó211ó243
Ô212ô244
Ŕ340ŕ341
Š352š353
Ť356ť357
Ú218ú250
Ý221ý253
Ž381ž382


Slovenian


characterinsert numbercharacterinsert number
Č268č269
Š352š353
Ž381ž382


Turkish


characterinsert numbercharacterinsert number
Ç199ç231
Ğ286ğ287
Iı305
İ304i
Ö214ö246
Ş350ş351
Ü220ü252


Uppercase dotless "I" and lowercase dotted "i" are characters of the usual Latin alphabet.


Ukrainian Cyrillic (complete)


characterinsert numbercharacterinsert numberromanization*
А1040а1072a
Б1041б1073b
В1042в1074v
Г1043г1075h, gh
Ґ1168ґ1169g
Д1044д1076d
Е1045е1077e
Є1028є1108ie, ye
Ж1046ж1078zh
З1047з1079zh
И1048и1080y
І1030і1110i
Ї1031ї1111i, yi
Й1049й1081i, y
К1050к1082k
Л1051л1083l
М1052м1084m
Н1053н1085n
О1054о1086o
П1055п1087p
Р1056р1088r
С1057с1089s
Т1058т1090t
У1059у1091u
Ф1060ф1092f
Х1061х1093kh
Ц1062ц1094ts
Ч1063ч1095ch
Ш1064ш1096sh
Щ1065щ1097sch
Ю1070ю1102iu, yu
Я1071я1103ia, ya
Ь1068ь1100'


*according to the Ukrainian National transliteration (1996)


External Links


You can find more (pronunciation, complete alphabets for these languages and some other) on this external page.

Wikipedia alphabet pages: Albanian, Bulgarian, Croatian, Czech, German, Greek, Hungarian, Polish, Portuguese, Romanian, Russian, Danish and Norwegian, Finnish, Swedish, Slovak, Slovenian, Turkish, Ukrainian

Wikipedia pages on romanization of Cyrillic scripts: Bulgarian, Ukrainian , Russian

Images

for the special characters article

Comments


[ Post a Comment ]
Viewing: 1-14 of 14    

AndinoGood...

Andino

Voted 10/10

initiative !

Any nice picture for the Primary Image ?
Posted Aug 2, 2007 1:15 pm

peterbudRe: Good...

peterbud

Hasn't voted

Thanks! I sticked a sign from Slovakia as primary image for now.
Posted Aug 3, 2007 4:07 am

AndinoRe: Good...

Andino

Voted 10/10

Perfect ! ;o)
Posted Aug 6, 2007 6:11 am

rpcsomething useful

rpc

Voted 10/10

very nice - thanks for the effort.
Posted Aug 2, 2007 4:49 pm

peterbudRe: something useful

peterbud

Hasn't voted

Thanks, and welcome :)
Posted Aug 3, 2007 4:09 am

NanulsGood job

Nanuls

Voted 10/10

Excellent work here, a very good idea

Thanks
Dan
Posted Aug 3, 2007 9:00 am

dmikiGratula!

dmiki

Voted 10/10

You can find some more info about Windows-1250 / Windows-1252 code pages, for example at http://en.wikipedia.org/wiki/Windows-1250
Posted Aug 6, 2007 7:40 am

Dan DaltonSuper informative...

Dan Dalton

Voted 10/10

I thank you for your time in making this and I am sure that it will benifit everyone.

Dan
Posted Aug 6, 2007 2:35 pm

dalton1Fantastic work!

dalton1

Voted 10/10

Helped me a lot!!!
Posted Aug 8, 2007 3:55 am

Rick BNice work

Rick B

Voted 10/10

Hey nice work! How about Greek? (if you have time to spare ;)

Cheers,

Rick
Posted Aug 8, 2007 5:44 am

peterbudRe: Nice work

peterbud

Hasn't voted

Done - no time, but couldn't stand the challenge :)
Posted Aug 14, 2007 4:53 am

Tomek LodowyGreat job

Tomek Lodowy

Voted 10/10

Congrats for the idea.

Tomek
Posted Aug 10, 2007 6:51 am

peterbudThanks...

peterbud

Hasn't voted

...for the praise :) and the addititions. Glad if I could help.
Posted Aug 12, 2007 7:57 am

CoraxGood!

Corax

Voted 10/10

Great to have all this info in one place!
Posted Aug 13, 2007 3:11 pm

Viewing: 1-14 of 14