T.61 is an ITU-T Recommendation for a Teletex character set. T.61 predated Unicode, and was the primary character set in ASN.1 used in early versions of X.500 and X.509 for encoding strings containing characters used in Western European languages.[1] It is also used by older versions of LDAP.[2] While T.61 continues to be supported in modern versions of X.500 and X.509, it has been deprecated in favor of Unicode. It is also called Code page 1036, CP1036, or IBM 01036.

T.61
Character repertoire and coded character sets for the international teletex service
StatusWithdrawn
Year started1980
Latest version(03/93)
March 1993
OrganizationITU-T
CommitteeStudy Group VIII
Related standardsT.51, ASN.1, X.500, X.509
Domainencoding
LicenseFreely available
Websitehttps://www.itu.int/rec/T-REC-T.61

While ASN.1 does see wide use and the T.61 character set is used on some standards using ASN.1 (for example in RSA Security's PKCS #9), the 1988-11 version of the T.61 standard itself was superseded by a never-published 1993-03 version; the 1993-03 version was withdrawn by the ITU-T.[3] The 1988-11 version is still available.[3]

T.61 was one of the encodings supported by Mozilla software in email and HTML until 2014, when the supported encodings were limited to those in the WHATWG Encoding Standard (although T.61 remained supported for LDAP).[4]

Code page layout

edit
T.61
Alias(es)
  • Code page 1036
  • Code page 20261
  • ISO-IR-103
StandardITU T.61
Other related encoding(s)

The following table maps the T.61 characters to their equivalent Unicode code points.

T.61-8bit[5][6][7][8]
0 1 2 3 4 5 6 7 8 9 A B C D E F
0x BS LF FF CR LS1 LS0
1x SS2[a] SUB ESC SS3[a]
2x  SP  ! " % & ' ( ) * , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ ] _
6x a b c d e f g h i j k l m n o
7x p q r s t u v w x y z | DEL
8x PLD PLU
9x CSI
Ax NBSP ¡ ¢ £ $ ¥ # § ¤ «
Bx ° ± ² ³ × µ · ÷ » ¼ ½ ¾ ¿
Cx ◌̀ ◌́ ◌̂ ◌̃ ◌̄ ◌̆ ◌̇ ◌̈ ◌̈[b] ◌̊ ◌̧ ◌̲ ◌̋ ◌̨ ◌̌
Dx
Ex Æ Ð ª Ħ IJ Ŀ Ł Ø Œ º Þ Ŧ Ŋ ʼn
Fx ĸ æ đ ð ħ ı ij ŀ ł ø œ ß þ ŧ ŋ

See ITU T.51 for a description of how the accents at 0xC0..CF worked. They prefix the letters, as opposed to postfix used by Unicode.

See also

edit

Footnotes

edit
  1. ^ a b The ISO/IEC 2022 C0 control set defined by T.61 (ISO-IR-106) encodes SS2 and SS3 at 0x19 and 0x1D respectively.[6] It is permitted to use ISO/IEC 2022 sequences for switching control sets (as opposed to graphical sets) within ISO/IEC 10646 (UCS/Unicode) in contexts where processing ANSI escape codes is appropriate, provided that each byte in the sequence is padded to the code unit size of the encoding,[9] in which case SS2 and SS3 would be available at U 0019 and U 001D. However, the more common ISO/IEC 6429 control set, the names of which are listed for reference in the Unicode code chart,[10] encodes SS2 and SS3 at U 008E and U 008F respectively.
  2. ^ Noted in the 1988 revision of the specification as having been allocated to the umlaut in the 1980 revision of the specification, but having since been deprecated.[5] It was initially used to distinguish between umlaut and diaeresis.[11]

References

edit
  1. ^ "T.61 : Character repertoire and coded character sets for the international teletex service". www.itu.int. Retrieved 2019-11-14.
  2. ^ "ldap_8859_to_t61". PHP Manual. The PHP Group.
  3. ^ a b "Recommendation T.61". ITU-T.
  4. ^ Sivonen, Henri (2014-09-26). "Character encoding changes in m-c require c-c action". mozilla.dev.apps.thunderbird.
  5. ^ a b CCITT (1988-11-25). Character repertoire and coded character sets for the international teletex service (1988 ed.). Recommendation T.61.
  6. ^ a b ITU (1985-08-01). Teletex Primary Set of Control Functions (PDF). ITSCJ/IPSJ. ISO-IR-106.
  7. ^ ITU (1985-08-01). Teletex Supplementary Set of Control Functions (PDF). ITSCJ/IPSJ. ISO-IR-107.
  8. ^ RFC 1345
  9. ^ ISO/IEC JTC 1/SC 2 (2017). "12.4: Identification of control function set". Information technology — Universal Coded Character Set (UCS) (5th ed.). ISO. pp. 19–20. ISO/IEC 10646.{{citation}}: CS1 maint: numeric names: authors list (link)
  10. ^ Unicode Consortium (2019). "C1 Controls and Latin-1 Supplement" (PDF). The Unicode Standard, Version 12.1.
  11. ^ ITU-T (1994-11-11). "Table C.9/T.101 – The supplementary set of graphic characters (default G2 set)". International interworking for Videotex services. p. 248. T.101:1994. (Difference between the codes is explained in the first note below table. Caveats: Is in second of the segmented PDFs in the download. The table itself is displayed in the PDF with severe mojibake (hence why the displayed table does not appear to correspond to the notes), and is supposed to look roughly like this.
edit