Latin-1 Supplement (Unicode block)
C1 Controls and Latin-1 Supplement | |
---|---|
Range | U+0080..U+00FF (128 code points) |
Plane | BMP |
Scripts | Latin (64 char.), Common (64 char.) |
Major alphabets | French, German, Spanish, Icelandic, Vietnamese |
Symbol sets | Punctuation, Mathematics, Currency |
Assigned | 128 code points, 33 Control or Format |
Unused | 0 reserved code points |
Source standards | ISO/IEC 8859-1 |
Unicode version history | |
1.0.0 (1991) | 128 (+128) |
Note: [1][2] |
The Latin-1 Supplement (also called C1 Controls and Latin-1 Supplement) is the second block in the Unicode Standard. It encodes the upper range of ISO 8859-1, byte values 80 to FF, as the code points U+0080 to U+00FF. The C1 controls (U+0080–U+009F) are not graphic characters.
The C1 Controls and Latin-1 Supplement block has been included in its present form, with the same character repertoire, since version 1.0 of the Unicode Standard, where it was known as Latin 1.[2]
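The correspondence with ISO 8859-1 can be checked with a short sketch. It assumes Python's built-in `latin-1` codec as the ISO 8859-1 decoder; the helper name `latin1_byte_to_code_point` is made up for this illustration.

```python
# Sketch: ISO 8859-1 bytes 0x80..0xFF map one-to-one onto U+0080..U+00FF.
# Assumption: Python's "latin-1" codec is used as the ISO 8859-1 decoder.

def latin1_byte_to_code_point(byte_value: int) -> int:
    """Decode a single ISO 8859-1 byte and return its Unicode code point."""
    return ord(bytes([byte_value]).decode("latin-1"))

# Every byte in the upper half decodes to the code point with the same value.
upper_half_matches = all(
    latin1_byte_to_code_point(b) == b for b in range(0x80, 0x100)
)

print(upper_half_matches)
print(f"U+{latin1_byte_to_code_point(0xE9):04X}")  # byte E9 is the letter é
```

Because the mapping is the identity on values, converting ISO 8859-1 text to Unicode code points needs no lookup table.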
Character table
Code | Result | Description |
---|---|---|
C1 Controls | ||
U+0080 | PAD – Padding Character | |
U+0081 | HOP – High Octet Preset | |
U+0082 | BPH – Break Permitted Here | |
U+0083 | NBH – No Break Here | |
U+0084 | IND – Index | |
U+0085 | NEL – Next Line | |
U+0086 | SSA – Start of Selected Area | |
U+0087 | ESA – End of Selected Area | |
U+0088 | HTS – Character (Horizontal) Tabulation Set | |
U+0089 | HTJ – Character (Horizontal) Tabulation with Justification | |
U+008A | VTS – Line (Vertical) Tabulation Set | |
U+008B | PLD – Partial Line Forward (Down) | |
U+008C | PLU – Partial Line Backward (Up) | |
U+008D | RI – Reverse Line Feed (Index) | |
U+008E | SS2 – Single-Shift Two | |
U+008F | SS3 – Single-Shift Three | |
U+0090 | DCS – Device Control String | |
U+0091 | PU1 – Private Use 1 | |
U+0092 | PU2 – Private Use 2 | |
U+0093 | STS – Set Transmit State | |
U+0094 | CCH – Cancel Character | |
U+0095 | MW – Message Waiting | |
U+0096 | SPA – Start of Protected Area | |
U+0097 | EPA – End of Protected Area | |
U+0098 | SOS – Start of String | |
U+0099 | SGCI – Single Graphic Character Introducer | |
U+009A | SCI – Single Character Introducer | |
U+009B | CSI – Control Sequence Introducer | |
U+009C | ST – String Terminator | |
U+009D | OSC – Operating System Command | |
U+009E | PM – Privacy Message | |
U+009F | APC – Application Program Command | |
Latin-1 Punctuation and Symbols | ||
U+00A0 | Non-breaking space | |
U+00A1 | ¡ | Inverted exclamation mark |
U+00A2 | ¢ | Cent sign |
U+00A3 | £ | Pound sign |
U+00A4 | ¤ | Currency sign |
U+00A5 | ¥ | Yen sign |
U+00A6 | ¦ | Broken bar |
U+00A7 | § | Section sign |
U+00A8 | ¨ | Diaeresis |
U+00A9 | © | Copyright sign |
U+00AA | ª | Feminine ordinal indicator |
U+00AB | « | Left-pointing double angle quotation mark |
U+00AC | ¬ | Not sign |
U+00AD | Soft hyphen | |
U+00AE | ® | Registered sign |
U+00AF | ¯ | Macron |
U+00B0 | ° | Degree sign |
U+00B1 | ± | Plus-minus sign |
U+00B2 | ² | Superscript two |
U+00B3 | ³ | Superscript three |
U+00B4 | ´ | Acute accent |
U+00B5 | µ | Micro sign |
U+00B6 | ¶ | Pilcrow sign |
U+00B7 | · | Middle dot |
U+00B8 | ¸ | Cedilla |
U+00B9 | ¹ | Superscript one |
U+00BA | º | Masculine ordinal indicator |
U+00BB | » | Right-pointing double angle quotation mark |
U+00BC | ¼ | Vulgar fraction one quarter |
U+00BD | ½ | Vulgar fraction one half |
U+00BE | ¾ | Vulgar fraction three quarters |
U+00BF | ¿ | Inverted question mark |
Letters | ||
U+00C0 | À | Latin Capital Letter A with grave |
U+00C1 | Á | Latin Capital Letter A with acute |
U+00C2 | Â | Latin Capital Letter A with circumflex |
U+00C3 | Ã | Latin Capital Letter A with tilde |
U+00C4 | Ä | Latin Capital Letter A with diaeresis |
U+00C5 | Å | Latin Capital Letter A with ring above |
U+00C6 | Æ | Latin Capital Letter AE |
U+00C7 | Ç | Latin Capital Letter C with cedilla |
U+00C8 | È | Latin Capital Letter E with grave |
U+00C9 | É | Latin Capital Letter E with acute |
U+00CA | Ê | Latin Capital Letter E with circumflex |
U+00CB | Ë | Latin Capital Letter E with diaeresis |
U+00CC | Ì | Latin Capital Letter I with grave |
U+00CD | Í | Latin Capital Letter I with acute |
U+00CE | Î | Latin Capital Letter I with circumflex |
U+00CF | Ï | Latin Capital Letter I with diaeresis |
U+00D0 | Ð | Latin Capital Letter Eth |
U+00D1 | Ñ | Latin Capital Letter N with tilde |
U+00D2 | Ò | Latin Capital Letter O with grave |
U+00D3 | Ó | Latin Capital Letter O with acute |
U+00D4 | Ô | Latin Capital Letter O with circumflex |
U+00D5 | Õ | Latin Capital Letter O with tilde |
U+00D6 | Ö | Latin Capital Letter O with diaeresis |
Mathematical Operator | ||
U+00D7 | × | Multiplication sign |
Letters | ||
U+00D8 | Ø | Latin Capital Letter O with stroke |
U+00D9 | Ù | Latin Capital Letter U with grave |
U+00DA | Ú | Latin Capital Letter U with acute |
U+00DB | Û | Latin Capital Letter U with circumflex |
U+00DC | Ü | Latin Capital Letter U with diaeresis |
U+00DD | Ý | Latin Capital Letter Y with acute |
U+00DE | Þ | Latin Capital Letter Thorn |
U+00DF | ß | Latin Small Letter sharp S |
U+00E0 | à | Latin Small Letter A with grave |
U+00E1 | á | Latin Small Letter A with acute |
U+00E2 | â | Latin Small Letter A with circumflex |
U+00E3 | ã | Latin Small Letter A with tilde |
U+00E4 | ä | Latin Small Letter A with diaeresis |
U+00E5 | å | Latin Small Letter A with ring above |
U+00E6 | æ | Latin Small Letter AE |
U+00E7 | ç | Latin Small Letter C with cedilla |
U+00E8 | è | Latin Small Letter E with grave |
U+00E9 | é | Latin Small Letter E with acute |
U+00EA | ê | Latin Small Letter E with circumflex |
U+00EB | ë | Latin Small Letter E with diaeresis |
U+00EC | ì | Latin Small Letter I with grave |
U+00ED | í | Latin Small Letter I with acute |
U+00EE | î | Latin Small Letter I with circumflex |
U+00EF | ï | Latin Small Letter I with diaeresis |
U+00F0 | ð | Latin Small Letter Eth |
U+00F1 | ñ | Latin Small Letter N with tilde |
U+00F2 | ò | Latin Small Letter O with grave |
U+00F3 | ó | Latin Small Letter O with acute |
U+00F4 | ô | Latin Small Letter O with circumflex |
U+00F5 | õ | Latin Small Letter O with tilde |
U+00F6 | ö | Latin Small Letter O with diaeresis |
Mathematical Operator | ||
U+00F7 | ÷ | Division sign |
Letters | ||
U+00F8 | ø | Latin Small Letter O with stroke |
U+00F9 | ù | Latin Small Letter U with grave |
U+00FA | ú | Latin Small Letter U with acute |
U+00FB | û | Latin Small Letter U with circumflex |
U+00FC | ü | Latin Small Letter U with diaeresis |
U+00FD | ý | Latin Small Letter Y with acute |
U+00FE | þ | Latin Small Letter Thorn |
U+00FF | ÿ | Latin Small Letter Y with diaeresis |
Subheadings
The C1 Controls and Latin-1 Supplement block has four subheadings within its character collection: C1 Controls, Latin-1 Punctuation and Symbols, Letters, and Mathematical Operator(s).[3]
C1 controls
The C1 Controls subheading contains 32 supplementary control codes inherited from ISO/IEC 8859-1 and many other 8-bit character standards. The alias names for the C0 and C1 control codes are taken from ISO/IEC 6429:1992.[3]
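As a rough check of the count, the sketch below queries Python's standard `unicodedata` module for the 32 code points U+0080 to U+009F. The names `C1_RANGE`, `c1_categories`, `has_formal_name`, and `named_c1` are chosen for this illustration only.

```python
import unicodedata

# The 32 C1 control code points, U+0080 through U+009F.
C1_RANGE = range(0x80, 0xA0)

# Collect the General_Category value of every code point in the range.
c1_categories = {unicodedata.category(chr(cp)) for cp in C1_RANGE}

def has_formal_name(cp: int) -> bool:
    """Return True if unicodedata.name() knows a character name for cp."""
    try:
        unicodedata.name(chr(cp))
        return True
    except ValueError:
        return False

# Control codes carry alias names only, so this list is expected to be empty.
named_c1 = [cp for cp in C1_RANGE if has_formal_name(cp)]

print(len(C1_RANGE), c1_categories, named_c1)
```

All 32 code points report the category `Cc` (control), and `unicodedata.name()` returns no name for any of them, which is why tables such as the one above list alias names instead.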
Latin-1 punctuation and symbols
The Latin-1 Punctuation and Symbols subheading contains 32 characters: common international punctuation, such as the inverted exclamation and question marks and the middle dot, and symbols such as foreign currency signs, spacing diacritic marks, vulgar fractions, and superscript numbers.[3]
Letters
The Letters subheading contains 30 pairs of majuscule and minuscule accented or novel Latin characters for western European languages, and two extra minuscule characters not commonly used word-initially.[3]
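The count of 30 pairs plus two extra letters can be reproduced with Python's built-in case mapping. This is only a sketch; the names `LETTERS`, `lowercase_partner`, `case_pairs`, and `unpaired` are invented for the illustration.

```python
# Letters occupy U+00C0..U+00FF except the two mathematical operators.
MATH_OPERATORS = {0xD7, 0xF7}  # multiplication and division signs
LETTERS = [cp for cp in range(0xC0, 0x100) if cp not in MATH_OPERATORS]
LETTER_SET = set(LETTERS)

def lowercase_partner(cp: int):
    """Return the code point of cp's lowercase form if it lies in the block."""
    lowered = chr(cp).lower()
    if len(lowered) == 1 and lowered != chr(cp) and ord(lowered) in LETTER_SET:
        return ord(lowered)
    return None

# Map each capital letter to its small partner inside the block.
case_pairs = {
    cp: lowercase_partner(cp)
    for cp in LETTERS
    if lowercase_partner(cp) is not None
}

# Letters that are neither a capital with a partner nor such a partner.
paired_small = set(case_pairs.values())
unpaired = [cp for cp in LETTERS if cp not in case_pairs and cp not in paired_small]

print(len(case_pairs))
print([f"U+{cp:04X}" for cp in unpaired])
```

Each small letter sits exactly 0x20 above its capital. The two unpaired letters are U+00DF (ß) and U+00FF (ÿ), whose uppercase forms are not in this block.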
Mathematical operator
The Mathematical Operator subheading is used for the multiplication and division signs.[3]
Compact table
C1 Controls and Latin-1 Supplement[1] Official Unicode Consortium code chart (PDF) |
 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
U+008x | XXX | XXX | BPH | NBH | IND | NEL | SSA | ESA | HTS | HTJ | VTS | PLD | PLU | RI | SS2 | SS3 |
U+009x | DCS | PU1 | PU2 | STS | CCH | MW | SPA | EPA | SOS | XXX | SCI | CSI | ST | OSC | PM | APC |
U+00Ax | NBSP | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | SHY | ® | ¯ |
U+00Bx | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
U+00Cx | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
U+00Dx | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß |
U+00Ex | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
U+00Fx | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ |
Emoji
The Latin-1 Supplement block contains two emoji: U+00A9 and U+00AE.[4][5]
The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the two emoji, both of which default to a text presentation.[6][7]
U+ | 00A9 | 00AE |
base codepoint | © | ® |
base+VS15 (text) | ©︎ | ®︎ |
base+VS16 (emoji) | ©️ | ®️ |
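The four sequences in the table can be built by appending a variation selector to each base character. The following is a minimal sketch; the names `VS15`, `VS16`, `EMOJI_BASES`, and `presentation_sequences` are chosen for this illustration.

```python
VS15 = "\uFE0E"  # variation selector-15: requests text presentation
VS16 = "\uFE0F"  # variation selector-16: requests emoji presentation

# The two emoji in the block: copyright sign and registered sign.
EMOJI_BASES = [0x00A9, 0x00AE]

def presentation_sequences(base_cp: int) -> dict:
    """Return the text and emoji variation sequences for one base code point."""
    base = chr(base_cp)
    return {"text": base + VS15, "emoji": base + VS16}

# Two bases times two selectors gives the four standardized sequences.
all_sequences = [
    seq for cp in EMOJI_BASES for seq in presentation_sequences(cp).values()
]

for seq in all_sequences:
    print(" ".join(f"U+{ord(ch):04X}" for ch in seq))
```

Whether a sequence is actually drawn in emoji or text style depends on the font and renderer; the selector only states a preference.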