KOI8-F or KOI8 Unified is an 8-bit character set.[1] It was designed by Peter Cassetta[2] of Fingertip Software (now defunct) as an attempt to support all the encoded letters from both KOI8-E (ISO-IR-111) and KOI8-RU (and hence also, KOI8-U and KOI8-R), along with some of the pseudographics from KOI8-R,[3][2] with some additional punctuation in the remaining space, sourced partly from Windows-1251.[2] This encoding was only used in the software of that company. FreeDOS calls it code page 60270.
Alias(es) | KOI8-F |
---|---|
Language(s) | Belarusian, Ukrainian, Russian, Bulgarian, Serbian Cyrillic, Macedonian |
Created by | Peter Cassetta (Fingertip Software) |
Classification | 8-bit KOI, extended ASCII |
Extends | KOI8-B |
Based on | KOI8-RU, KOI8-E |
Other related encoding(s) | KOI8-R, KOI8-U |
Character set
editThe following table shows the KOI8-F encoding. Each character is shown with its equivalent Unicode code point. Differences from ISO-IR-111 are boxed; other relevant encodings which are matched, if any, are noted in footnotes.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | ||||||||||||||||
1x | ||||||||||||||||
2x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | , | - | . | / | |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | |
8x | ─[a] 2500 |
│[a] 2502 |
┌[a] 250C |
┐[a] 2510 |
└[a] 2514 |
┘[a] 2518 |
├[a] 251C |
┤[a] 2524 |
┬[a] 252C |
┴[a] 2534 |
┼[a] 253C |
▀[a] 2580 |
▄[a] 2584 |
█[a] 2588 |
▌[a] 258C |
▐[a] 2590 |
9x | ░[a] 2591 |
‘[b] 2018 |
’[b] 2019 |
“[b] 201C |
”[b] 201D |
∙/•[c] | –[b] 2013 |
—[b] 2014 |
© 00A9 |
™[b] 2122 |
NBSP[d] | » 00BB |
® 00AE |
« 00AB |
·[a] 00B7 |
¤ 00A4 |
Ax | NBSP[d] | ђ 0452 |
ѓ 0453 |
ё 0451 |
є 0454 |
ѕ 0455 |
і 0456 |
ї 0457 |
ј 0458 |
љ 0459 |
њ 045A |
ћ 045B |
ќ 045C |
ґ[e] 0491 |
ў 045E |
џ 045F |
Bx | № 2116 |
Ђ 0402 |
Ѓ 0403 |
Ё 0401 |
Є 0404 |
Ѕ 0405 |
І 0406 |
Ї 0407 |
Ј 0408 |
Љ 0409 |
Њ 040A |
Ћ 040B |
Ќ 040C |
Ґ[e] 0490 |
Ў 040E |
Џ 040F |
Cx | ю 044E |
а 0430 |
б 0431 |
ц 0446 |
д 0434 |
е 0435 |
ф 0444 |
г 0433 |
х 0445 |
и 0438 |
й 0439 |
к 043A |
л 043B |
м 043C |
н 043D |
о 043E |
Dx | п 043F |
я 044F |
р 0440 |
с 0441 |
т 0442 |
у 0443 |
ж 0436 |
в 0432 |
ь 044C |
ы 044B |
з 0437 |
ш 0448 |
э 044D |
щ 0449 |
ч 0447 |
ъ 044A |
Ex | Ю 042E |
А 0410 |
Б 0411 |
Ц 0426 |
Д 0414 |
Е 0415 |
Ф 0424 |
Г 0413 |
Х 0425 |
И 0418 |
Й 0419 |
К 041A |
Л 041B |
М 041C |
Н 041D |
О 041E |
Fx | П 041F |
Я 042F |
Р 0420 |
С 0421 |
Т 0422 |
У 0423 |
Ж 0416 |
В 0412 |
Ь 042C |
Ы 042B |
З 0417 |
Ш 0428 |
Э 042D |
Щ 0429 |
Ч 0427 |
Ъ 042A |
- ^ a b c d e f g h i j k l m n o p q r Matching KOI8-R, KOI8-U, KOI8-RU.
- ^ a b c d e f g Matching Windows-1251 and Windows-1252.
- ^ May be U 2219, which matches RFC 1489 (KOI8-R),[4] or U 2022, which matches Windows-1251 and Windows-1252.
- ^ a b The non-breaking space is encoded twice: first at 0x9A matching KOI8-R, and then at 0xA0 matching KOI8-E (the latter of which also happens to be its location in Windows-1251 and Windows-1252).
- ^ a b Matching KOI8-U and KOI8-RU.
KOI8-C/KOI8-CA
editA variant is KOI8-C, also known as KOI8-CA, is an 8-bit character set. It is a modification of KOI8-F to support Caucasian languages while retaining support in the same languages as KOI8-F. FreeDOS calls it code page 61294. It has hardly ever been used. KOI8-C once referred to what is now known as KOI8-O.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
8x | ғ 0493 |
җ 0497 |
қ 049B |
ҝ 049D |
ң 04A3 |
ү 04AF |
ұ 04B1 |
ҳ 04B3 |
ҷ 04B7 |
ҹ 04B9 |
һ 04BB |
▀ 2580 |
ә 04D9 |
ӣ 04E3 |
ө 04E9 |
ӯ 04EF |
9x | Ғ 0492 |
Җ 0496 |
Қ 049A |
Ҝ 049C |
Ң 04A2 |
Ү 04AE |
Ұ 04B0 |
Ҳ 04B2 |
Ҷ 04B6 |
Ҹ 04B8 |
Һ 04BA |
⌡ 2321 |
Ә 04D8 |
Ӣ 04E2 |
Ө 04E8 |
Ӯ 04EE |
See also
editReferences
edit- ^ Nechayev, Valentin (2013) [2001]. "Review of 8-bit Cyrillic encodings universe". Archived from the original on 2016-12-05. Retrieved 2016-12-05.
- ^ a b c Czyborra, Roman (1998-11-30) [1998-05-25]. "The Cyrillic Charset Soup". Archived from the original on 2016-12-03. Retrieved 2016-12-03.
- ^ "KOI8 Unified". Fingertip Software. Archived from the original on 1998-01-09. Retrieved 2020-02-11.
- ^ a b Leisher, Mark (2008) [1998-03-05]. "KOI8 Unified Cyrillic to Unicode 2.1 mapping table". Department of Mathematical Sciences, New Mexico State University. Retrieved 2020-05-02.
- ^ Discussion