From Wikipedia, the free encyclopedia
This article is about the character encoding commonly mislabeled as «ANSI». For the actual ANSI character encoding, see ASCII. For the actual «ANSI extended Latin» encoding, see ANSEL.
Windows-1252
MIME / IANA | windows-1252[1] |
---|---|
Alias(es) | cp1252 (code page 1252) |
Language(s) | All supported by ISO/IEC 8859-1 plus full support for French[a] and Finnish and ligature forms for English; e.g. Danish (except for a rare exceptional letter), Irish, Italian, Norwegian, Portuguese, Spanish, Swedish, German (missing uppercase ẞ[b]), Icelandic, Faroese, Luxembourgish, Albanian, Estonian, Swahili, Tswana, Catalan, Basque, Occitan, Rotokas, Toki Pona, Lojban, Romansh, Dutch (except the IJ/ij character, substituted by IJ/ij or ÿ), and Slovene (except the č character, substituted by ç). Some languages lack their standard quotation marks (such as German „quotes“). |
Created by | Microsoft |
Standard | WHATWG Encoding Standard |
Classification | extended ASCII, Windows-125x |
Extends | ISO 8859-1 (excluding C1 controls) |
Transforms / Encodes | ISO 8859-15 |
Succeeded by | Unicode (UTF-8, UTF-16) |
Windows-1252 or CP-1252 (Windows code page 1252) is a legacy single-byte character encoding[2] that is used by default (as the «ANSI code page») in Microsoft Windows throughout the Americas, Western Europe, Oceania, and much of Africa.[3]
Initially identical to ISO 8859-1, it began to diverge in Windows 2.0 by adding characters in the 0x80 to 0x9F (hex) range (the ISO standards reserve this range for C1 control codes). Notable additions include curly quotation marks and all printable characters from ISO 8859-15.
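The statement about ISO 8859-15 can be checked mechanically. The following is a minimal sketch, assuming that Python's built-in `cp1252` and `iso8859-15` codecs follow the published mappings; the variable names are illustrative only.

```python
# Decode the upper half (0xA0-0xFF) of ISO 8859-15, then re-encode every
# character as Windows-1252. encode() raises UnicodeEncodeError if any
# character is missing from the target encoding.
latin9_upper = bytes(range(0xA0, 0x100)).decode("iso8859-15")
as_cp1252 = latin9_upper.encode("cp1252")

# The same character can sit at different byte values in the two encodings:
# the euro sign is 0xA4 in ISO 8859-15 but 0x80 in Windows-1252.
euro_latin9 = "\u20ac".encode("iso8859-15")
euro_cp1252 = "\u20ac".encode("cp1252")
```

Every character is present, but not at the same byte value, so converting between the two encodings still requires a real transcoding step.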
It is the most-used single-byte character encoding in the world. Although almost all websites now use the multi-byte character encoding UTF-8, as of April 2025, 1.1%[4] of websites declared ISO 8859-1, which is treated as Windows-1252 by all modern browsers (as required by the HTML5 standard[5]), and 0.3% declared Windows-1252 directly,[4][6] for a total of 1.4%. Some countries and languages show higher usage than the global average: in 2025, the share was 2.9% for Brazil[7] and 2.4% for Germany[8][9] (each figure is the sum of ISO-8859-1 and CP-1252 declarations).
It is known to Windows by the code page number 1252, and by the IANA-approved name «windows-1252».
Historically, the phrase «ANSI Code Page» was used in Windows to refer to non-DOS encodings; the intention was that most of these would be ANSI standards such as ISO-8859-1. Even though Windows-1252 was the first and by far the most popular code page so named in Microsoft Windows parlance, the code page has never been an ANSI standard. Microsoft explains, «The term ANSI as used to signify Windows code pages is a historical reference, but is nowadays a misnomer that continues to persist in the Windows community.»[10]
LaTeX can input Windows-1252 by using inputenc.sty with parameter ansinew (and more recently cp1252).[11][12]
IBM uses code page 1252 (CCSID 1252 and euro sign extended CCSID 5348) for Windows-1252.[13][14][15]
It is called «WE8MSWIN1252» by Oracle Database.[16]
- The first version of the codepage was used in Microsoft Windows 1.0. It matched the ISO-8859-1 standard (including leaving code points 0xD7 and 0xF7 undefined, as they were not in the standard at that time).
- The second version of the codepage was introduced in Microsoft Windows 2.0. In this version, code points 0xD7, 0xF7, 0x91, and 0x92 are defined.
- The third version of the codepage was introduced in Microsoft Windows 3.1. It defined all code points used in the final version except the euro sign and the Z with caron character pair.
- The final version (shown below) was introduced in Microsoft Windows 98.
Starting in the 1990s, many Microsoft products that could produce HTML included Windows-1252-exclusive characters, but marked the encoding as ISO-8859-1, ASCII, or undeclared.[citation needed] Characters exclusive to Windows-1252 would render incorrectly on non-Windows operating systems (often as question marks).[17][18] In particular, typographers’ quotes—curly variants of the standard straight apostrophes and quotation marks in US-ASCII—were commonly used in files produced in Windows applications such as Microsoft Word due to the smart quotes feature, which can automatically convert straight apostrophes and quotation marks to the curly variants.[19] To fix this, by 2000 most web browsers and e-mail clients treated the charsets ISO-8859-1 and US-ASCII as Windows-1252[citation needed]—this behavior is now required by the HTML5 specification.[5] Undeclared charsets in HTML are also assumed to be Windows-1252.[20][21]
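The mislabeling problem can be reproduced in a few lines. This is a minimal sketch that uses Python's built-in `latin-1` and `cp1252` codecs as stand-ins for a strict ISO-8859-1 reader and a browser-style reader; the sample bytes are made up for illustration.

```python
# Bytes as a Windows application might have written them: curly quotes
# at 0x93 and 0x94, which exist only in Windows-1252.
raw = b"\x93smart quotes\x94"

# A strict ISO-8859-1 reader sees invisible C1 control characters.
as_latin1 = raw.decode("latin-1")

# A reader that treats the "ISO-8859-1" label as Windows-1252
# recovers the intended curly quotation marks.
as_cp1252 = raw.decode("cp1252")

print(repr(as_latin1))
print(repr(as_cp1252))
```

Treating the label as Windows-1252 loses nothing for genuine ISO-8859-1 text unless it relies on C1 control codes, which is presumably why the lenient reading became standard.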
Although Windows NT supported Unicode and attempted to encourage programs to use it, it only provided the 16-bit code units of UCS-2/UTF-16, despite the existing support for other multibyte character encodings such as Shift-JIS. As many applications preferred to use 8-bit strings, Windows-1252 remained the most popular encoding on Windows.[citation needed] UTF-8 has been supported since Windows 10 so this is gradually changing.[citation needed]
The following table shows Windows-1252. Differences from ISO-8859-1 have the Unicode code point number below the character, based on the Unicode.org mapping of Windows-1252 with «best fit». A tooltip, generally available only when one points to the immediate right of the character, shows the Unicode code point name and the decimal Alt code.
Windows-1252 (CP1252)[22][23][24][25][26]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0_ | NUL | SOH | STX | ETX | EOT | ENQ | ACK | BEL | BS | HT | LF | VT | FF | CR | SO | SI |
1_ | DLE | DC1 | DC2 | DC3 | DC4 | NAK | SYN | ETB | CAN | EM | SUB | ESC | FS | GS | RS | US |
2_ | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
3_ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4_ | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5_ | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6_ | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7_ | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | DEL |
8_ | € 20AC |  | ‚ 201A | ƒ 0192 | „ 201E | … 2026 | † 2020 | ‡ 2021 | ˆ 02C6 | ‰ 2030 | Š 0160 | ‹ 2039 | Œ 0152 |  | Ž 017D |  |
9_ |  | ‘ 2018 | ’ 2019 | “ 201C | ” 201D | • 2022 | – 2013 | — 2014 | ˜ 02DC | ™ 2122 | š 0161 | › 203A | œ 0153 |  | ž 017E | Ÿ 0178 |
A_ | NBSP | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | SHY | ® | ¯ |
B_ | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
C_ | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
D_ | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß |
E_ | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
F_ | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ |
According to the information on Microsoft’s and the Unicode Consortium’s websites, positions 81, 8D, 8F, 90, and 9D are unused; however, the Windows API MultiByteToWideChar maps these to the corresponding C1 control codes. The «best fit» mapping documents this behavior, too.[22]
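Not every decoder follows this lenient mapping. The sketch below assumes Python's built-in `cp1252` codec, which rejects the five unused bytes in strict mode; `decode_cp1252_best_fit` and `strict_decode_fails` are hypothetical helper names, and the fallback only imitates the C1 mapping described above rather than calling any Windows API.

```python
UNUSED = (0x81, 0x8D, 0x8F, 0x90, 0x9D)

def strict_decode_fails(byte: int) -> bool:
    """Return True if Python's strict cp1252 codec rejects this byte."""
    try:
        bytes([byte]).decode("cp1252")
    except UnicodeDecodeError:
        return True
    return False

def decode_cp1252_best_fit(data: bytes) -> str:
    """Decode Windows-1252, mapping the five unused bytes to C1 controls."""
    out = []
    for byte in data:
        if byte in UNUSED:
            # The C1 control has the same numeric value as the byte,
            # for example 0x81 becomes U+0081.
            out.append(chr(byte))
        else:
            out.append(bytes([byte]).decode("cp1252"))
    return "".join(out)

print([hex(b) for b in UNUSED if strict_decode_fails(b)])
print(repr(decode_cp1252_best_fit(b"\x80\x81")))
```

A per-byte loop is slow for large inputs but keeps the fallback rule explicit.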
The OS/2 operating system supports an encoding by the name of Code page 1004 (CCSID 1004) or «Windows Extended».[27][28] This mostly matches code page 1252, with the exception of certain C0 control characters being replaced by diacritic characters.
Code page 1004 (differing rows only)[29][30][31][32]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0_ | NUL | SOH | STX | ETX | ˉ 02C9 | ˘ 02D8 | ˙ 02D9 | BEL | ˚ 02DA | HT | ˝ 02DD | ˛ 02DB | ˇ 02C7 | CR | SO | SI |
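The differing row can be expressed as a small override table on top of an existing Windows-1252 decoder. This is a sketch only: `decode_cp1004` is a hypothetical helper, the overrides are read off the row above, and it assumes the remaining bytes match the current Windows-1252 mapping (one of the cited charts is based on the Windows 3.1 version instead).

```python
# Overrides for code page 1004: C0 byte value -> Unicode code point.
CP1004_OVERRIDES = {
    0x04: 0x02C9,  # MODIFIER LETTER MACRON
    0x05: 0x02D8,  # BREVE
    0x06: 0x02D9,  # DOT ABOVE
    0x08: 0x02DA,  # RING ABOVE
    0x0A: 0x02DD,  # DOUBLE ACUTE ACCENT
    0x0B: 0x02DB,  # OGONEK
    0x0C: 0x02C7,  # CARON
}

def decode_cp1004(data: bytes) -> str:
    """Decode bytes as code page 1004 using cp1252 plus the overrides."""
    return "".join(
        chr(CP1004_OVERRIDES[b]) if b in CP1004_OVERRIDES
        else bytes([b]).decode("cp1252")
        for b in data
    )

print(repr(decode_cp1004(b"\x0ca\x80")))
```

Because 0x0A is remapped, text in this code page cannot use a bare line feed as a line terminator.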
MS-DOS extensions (rare)
There is a rarely used, but useful, graphics extended code page 1252 in which codes 0x00 to 0x1F are used for box drawing, as in applications such as MS-DOS Edit and CodeView. One of the applications to use this code page was an Intel Corporation Install/Recovery disk image utility from mid/late 1995. These programs were written for its P6 User Test Program machines (US example[33]). It was used exclusively in its then EMEA region (Europe, Middle East & Africa). In time the programs were changed to use code page 850.
Graphics Extended Code Page 1252[citation needed]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0_ | ○ | ■ | ↑ | ↓ | → | ← | ║ | ═ | ╔ | ╗ | ╚ | ╝ | ░ | ▒ | ► | ◄ |
1_ | │ | ─ | ┌ | ┐ | └ | ┘ | ├ | ┤ | ┴ | ┬ | ♦ | ┼ | █ | ▄ | ▀ | ▬ |
- Latin script in Unicode
- Unicode
- Universal Coded Character Set
- European Unicode subset (DIN 91379)
- UTF-8
- Western Latin character sets (computing)
- Windows-1250
- Windows code pages
- ISO/IEC JTC 1/SC 2
- Extended ASCII
- ^ Excluding the narrow non-breaking space, which is preferred to the regular non-breaking space when spacing certain kinds of punctuation.
- ^ uppercase ẞ was not officially adopted until 2017
- ^ Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12
- ^ «Encoding. Living Standard». WHATWG. 13 June 2024. § 9. Legacy single-byte encodings. Retrieved 2024-06-28.
- ^ Karl-Bridge-Microsoft (2021-10-26). «Code Pages — Win32 apps». learn.microsoft.com. Retrieved 2024-10-09.
- ^ a b «Historical trends in the usage statistics of character encodings for websites, December 2024». w3techs.com. Retrieved 2024-12-16.
- ^ a b «Encoding». WHATWG. 27 January 2015. sec. 5.2 Names and labels. Archived from the original on 4 February 2015. Retrieved 4 February 2015.
- ^ «Frequently Asked Questions». w3techs.com.
- ^ «Distribution of Character Encodings among websites that use Brazil». W3Techs. Retrieved 2024-12-16.
- ^ «Distribution of Character Encodings among websites that use .de». W3Techs. Retrieved 2025-04-16.
- ^ «Distribution of Character Encodings among websites that use German». W3Techs. Archived from the original on 4 April 2024. Retrieved 2025-04-16.
- ^ Wissink, Cathy (5 April 2002). «Unicode and Windows XP» (PDF). Microsoft. p. 1. Archived from the original (PDF) on 4 February 2015. Retrieved 4 February 2015.
- ^ «LaTeX News, Issue 28» (PDF; 379 KB). The LaTeX Project. Apr 2018. Retrieved 2024-07-27.
- ^ «Inputenc – Accept different input encodings». The LaTeX Project. 2024-02-08. Retrieved 2024-07-27.
- ^ «Code page 1252 information document». IBM. 30 September 1997. Archived from the original on 2016-03-03.
- ^ «CCSID 1252 information document». IBM. Archived from the original on 2016-03-26.
- ^ «CCSID 5348 information document». IBM. Archived from the original on 2014-11-29.
- ^ «Database Client Installation Guide». Oracle. Retrieved 2021-02-14.
- ^ Texin, Tex. «Comparing Characters in Windows-1252, ISO-8859-1, ISO-8859-15». I18nQA.com.
- ^ van Emden, Eva (28 January 2011). «How to make typographers’ quotes in HTML». vancouvereditor.com. Retrieved 7 January 2024.
If you use typographers’ quotes without specifying the right character encoding for your HTML file, some of your viewers are going to see question marks, boxes, or other crazy symbols instead of the beautiful curly quotes you intended them to see.
- ^ «Smart quotes in Word». Microsoft Support. Microsoft. Retrieved 7 January 2024.
- ^ «NetWare Web Search: Understanding Character Set Encodings». Novell Documentation. Novell.
if a document does not contain a CHARSET encoding value, the default encoding for HTML documents is ISO-8859-1, also known as Latin1. The default encoding for plain text documents is US-ASCII.
- ^ Observed behavior in Chrome, this may be UTF-8 in some browsers.[original research?]
- ^ a b «Unicode mappings of Windows-1252 with ‘Best Fit’». Unicode. Archived from the original on 4 February 2015. Retrieved 4 February 2015.
- ^ Code Page 01252 (PDF), IBM, 1998, archived (PDF) from the original on 27 October 2023
- ^ Code Page (CPGID) 01252 (txt), IBM, 1998, archived from the original on 8 April 2023
- ^ International Components for Unicode (ICU), ibm-1252_P100-2000.ucm, 2002-12-03
- ^ International Components for Unicode (ICU), ibm-5348_P100-1997.ucm, 2002-12-03
- ^ «Code page 1004 information document». Archived from the original on 2015-06-25.
- ^ «CCSID 1004 information document». Archived from the original on 2016-03-26.
- ^ «Code Page 01004» (PDF). IBM. Archived (PDF) from the original on 2015-07-08. (version based on Windows 3.1 version of Windows-1252)
- ^ Code Page CPGID 01004 (pdf) (PDF), IBM
- ^ Code Page CPGID 01004 (txt), IBM
- ^ Borgendale, Ken (2001). «Codepage 1004 — Windows Extended». OS/2 codepages by number. Archived from the original on 2018-05-13. Retrieved 2018-05-13. (version based on current version of Windows-1252)
- ^ Storaasli, Olaf (1996). «Performance of the NASA equation solvers on computational mechanics applications» (PDF). Performance of NASA Equation Solvers on Computational Mechanics Applications. NASA. doi:10.2514/6.1996-1505. S2CID 15711051. Archived from the original (PDF) on 2019-05-03.
- Microsoft’s code charts for Windows-1252 («Code Page 1252 Windows Latin 1 (ANSI)»)
- Unicode mapping table and code page definition with best fit mappings for Windows-1252
Dec | Hex | Char | Name |
---|---|---|---|
32 | 20 | SP | SPACE |
33 | 21 | ! | EXCLAMATION MARK |
34 | 22 | " | QUOTATION MARK |
35 | 23 | # | NUMBER SIGN |
36 | 24 | $ | DOLLAR SIGN |
37 | 25 | % | PERCENT SIGN |
38 | 26 | & | AMPERSAND |
39 | 27 | ' | APOSTROPHE |
40 | 28 | ( | LEFT PARENTHESIS |
41 | 29 | ) | RIGHT PARENTHESIS |
42 | 2A | * | ASTERISK |
43 | 2B | + | PLUS SIGN |
44 | 2C | , | COMMA |
45 | 2D | - | HYPHEN-MINUS |
46 | 2E | . | FULL STOP |
47 | 2F | / | SOLIDUS |
48 | 30 | 0 | DIGIT ZERO |
49 | 31 | 1 | DIGIT ONE |
50 | 32 | 2 | DIGIT TWO |
51 | 33 | 3 | DIGIT THREE |
52 | 34 | 4 | DIGIT FOUR |
53 | 35 | 5 | DIGIT FIVE |
54 | 36 | 6 | DIGIT SIX |
55 | 37 | 7 | DIGIT SEVEN |
56 | 38 | 8 | DIGIT EIGHT |
57 | 39 | 9 | DIGIT NINE |
58 | 3A | : | COLON |
59 | 3B | ; | SEMICOLON |
60 | 3C | < | LESS-THAN SIGN |
61 | 3D | = | EQUALS SIGN |
62 | 3E | > | GREATER-THAN SIGN |
63 | 3F | ? | QUESTION MARK |
64 | 40 | @ | COMMERCIAL AT |
65 | 41 | A | LATIN CAPITAL LETTER A |
66 | 42 | B | LATIN CAPITAL LETTER B |
67 | 43 | C | LATIN CAPITAL LETTER C |
68 | 44 | D | LATIN CAPITAL LETTER D |
69 | 45 | E | LATIN CAPITAL LETTER E |
70 | 46 | F | LATIN CAPITAL LETTER F |
71 | 47 | G | LATIN CAPITAL LETTER G |
72 | 48 | H | LATIN CAPITAL LETTER H |
73 | 49 | I | LATIN CAPITAL LETTER I |
74 | 4A | J | LATIN CAPITAL LETTER J |
75 | 4B | K | LATIN CAPITAL LETTER K |
76 | 4C | L | LATIN CAPITAL LETTER L |
77 | 4D | M | LATIN CAPITAL LETTER M |
78 | 4E | N | LATIN CAPITAL LETTER N |
79 | 4F | O | LATIN CAPITAL LETTER O |
80 | 50 | P | LATIN CAPITAL LETTER P |
81 | 51 | Q | LATIN CAPITAL LETTER Q |
82 | 52 | R | LATIN CAPITAL LETTER R |
83 | 53 | S | LATIN CAPITAL LETTER S |
84 | 54 | T | LATIN CAPITAL LETTER T |
85 | 55 | U | LATIN CAPITAL LETTER U |
86 | 56 | V | LATIN CAPITAL LETTER V |
87 | 57 | W | LATIN CAPITAL LETTER W |
88 | 58 | X | LATIN CAPITAL LETTER X |
89 | 59 | Y | LATIN CAPITAL LETTER Y |
90 | 5A | Z | LATIN CAPITAL LETTER Z |
91 | 5B | [ | LEFT SQUARE BRACKET |
92 | 5C | \ | REVERSE SOLIDUS |
93 | 5D | ] | RIGHT SQUARE BRACKET |
94 | 5E | ^ | CIRCUMFLEX ACCENT |
95 | 5F | _ | LOW LINE |
96 | 60 | ` | GRAVE ACCENT |
97 | 61 | a | LATIN SMALL LETTER A |
98 | 62 | b | LATIN SMALL LETTER B |
99 | 63 | c | LATIN SMALL LETTER C |
100 | 64 | d | LATIN SMALL LETTER D |
101 | 65 | e | LATIN SMALL LETTER E |
102 | 66 | f | LATIN SMALL LETTER F |
103 | 67 | g | LATIN SMALL LETTER G |
104 | 68 | h | LATIN SMALL LETTER H |
105 | 69 | i | LATIN SMALL LETTER I |
106 | 6A | j | LATIN SMALL LETTER J |
107 | 6B | k | LATIN SMALL LETTER K |
108 | 6C | l | LATIN SMALL LETTER L |
109 | 6D | m | LATIN SMALL LETTER M |
110 | 6E | n | LATIN SMALL LETTER N |
111 | 6F | o | LATIN SMALL LETTER O |
112 | 70 | p | LATIN SMALL LETTER P |
113 | 71 | q | LATIN SMALL LETTER Q |
114 | 72 | r | LATIN SMALL LETTER R |
115 | 73 | s | LATIN SMALL LETTER S |
116 | 74 | t | LATIN SMALL LETTER T |
117 | 75 | u | LATIN SMALL LETTER U |
118 | 76 | v | LATIN SMALL LETTER V |
119 | 77 | w | LATIN SMALL LETTER W |
120 | 78 | x | LATIN SMALL LETTER X |
121 | 79 | y | LATIN SMALL LETTER Y |
122 | 7A | z | LATIN SMALL LETTER Z |
123 | 7B | { | LEFT CURLY BRACKET |
124 | 7C | | | VERTICAL LINE |
125 | 7D | } | RIGHT CURLY BRACKET |
126 | 7E | ~ | TILDE |
128 | 80 | € | EURO SIGN |
130 | 82 | ‚ | SINGLE LOW-9 QUOTATION MARK |
131 | 83 | ƒ | LATIN SMALL LETTER F WITH HOOK |
132 | 84 | „ | DOUBLE LOW-9 QUOTATION MARK |
133 | 85 | … | HORIZONTAL ELLIPSIS |
134 | 86 | † | DAGGER |
135 | 87 | ‡ | DOUBLE DAGGER |
136 | 88 | ˆ | MODIFIER LETTER CIRCUMFLEX ACCENT |
137 | 89 | ‰ | PER MILLE SIGN |
138 | 8A | Š | LATIN CAPITAL LETTER S WITH CARON |
139 | 8B | ‹ | SINGLE LEFT-POINTING ANGLE QUOTATION MARK |
140 | 8C | Œ | LATIN CAPITAL LIGATURE OE |
142 | 8E | Ž | LATIN CAPITAL LETTER Z WITH CARON |
145 | 91 | ‘ | LEFT SINGLE QUOTATION MARK |
146 | 92 | ’ | RIGHT SINGLE QUOTATION MARK |
147 | 93 | “ | LEFT DOUBLE QUOTATION MARK |
148 | 94 | ” | RIGHT DOUBLE QUOTATION MARK |
149 | 95 | • | BULLET |
150 | 96 | – | EN DASH |
151 | 97 | — | EM DASH |
152 | 98 | ˜ | SMALL TILDE |
153 | 99 | ™ | TRADE MARK SIGN |
154 | 9A | š | LATIN SMALL LETTER S WITH CARON |
155 | 9B | › | SINGLE RIGHT-POINTING ANGLE QUOTATION MARK |
156 | 9C | œ | LATIN SMALL LIGATURE OE |
158 | 9E | ž | LATIN SMALL LETTER Z WITH CARON |
159 | 9F | Ÿ | LATIN CAPITAL LETTER Y WITH DIAERESIS |
160 | A0 | NBSP | NO-BREAK SPACE |
161 | A1 | ¡ | INVERTED EXCLAMATION MARK |
162 | A2 | ¢ | CENT SIGN |
163 | A3 | £ | POUND SIGN |
164 | A4 | ¤ | CURRENCY SIGN |
165 | A5 | ¥ | YEN SIGN |
166 | A6 | ¦ | BROKEN BAR |
167 | A7 | § | SECTION SIGN |
168 | A8 | ¨ | DIAERESIS |
169 | A9 | © | COPYRIGHT SIGN |
170 | AA | ª | FEMININE ORDINAL INDICATOR |
171 | AB | « | LEFT-POINTING DOUBLE ANGLE QUOTATION MARK |
172 | AC | ¬ | NOT SIGN |
173 | AD | SHY | SOFT HYPHEN |
174 | AE | ® | REGISTERED SIGN |
175 | AF | ¯ | MACRON |
176 | B0 | ° | DEGREE SIGN |
177 | B1 | ± | PLUS-MINUS SIGN |
178 | B2 | ² | SUPERSCRIPT TWO |
179 | B3 | ³ | SUPERSCRIPT THREE |
180 | B4 | ´ | ACUTE ACCENT |
181 | B5 | µ | MICRO SIGN |
182 | B6 | ¶ | PILCROW SIGN |
183 | B7 | · | MIDDLE DOT |
184 | B8 | ¸ | CEDILLA |
185 | B9 | ¹ | SUPERSCRIPT ONE |
186 | BA | º | MASCULINE ORDINAL INDICATOR |
187 | BB | » | RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK |
188 | BC | ¼ | VULGAR FRACTION ONE QUARTER |
189 | BD | ½ | VULGAR FRACTION ONE HALF |
190 | BE | ¾ | VULGAR FRACTION THREE QUARTERS |
191 | BF | ¿ | INVERTED QUESTION MARK |
192 | C0 | À | LATIN CAPITAL LETTER A WITH GRAVE |
193 | C1 | Á | LATIN CAPITAL LETTER A WITH ACUTE |
194 | C2 | Â | LATIN CAPITAL LETTER A WITH CIRCUMFLEX |
195 | C3 | Ã | LATIN CAPITAL LETTER A WITH TILDE |
196 | C4 | Ä | LATIN CAPITAL LETTER A WITH DIAERESIS |
197 | C5 | Å | LATIN CAPITAL LETTER A WITH RING ABOVE |
198 | C6 | Æ | LATIN CAPITAL LETTER AE |
199 | C7 | Ç | LATIN CAPITAL LETTER C WITH CEDILLA |
200 | C8 | È | LATIN CAPITAL LETTER E WITH GRAVE |
201 | C9 | É | LATIN CAPITAL LETTER E WITH ACUTE |
202 | CA | Ê | LATIN CAPITAL LETTER E WITH CIRCUMFLEX |
203 | CB | Ë | LATIN CAPITAL LETTER E WITH DIAERESIS |
204 | CC | Ì | LATIN CAPITAL LETTER I WITH GRAVE |
205 | CD | Í | LATIN CAPITAL LETTER I WITH ACUTE |
206 | CE | Î | LATIN CAPITAL LETTER I WITH CIRCUMFLEX |
207 | CF | Ï | LATIN CAPITAL LETTER I WITH DIAERESIS |
208 | D0 | Ð | LATIN CAPITAL LETTER ETH |
209 | D1 | Ñ | LATIN CAPITAL LETTER N WITH TILDE |
210 | D2 | Ò | LATIN CAPITAL LETTER O WITH GRAVE |
211 | D3 | Ó | LATIN CAPITAL LETTER O WITH ACUTE |
212 | D4 | Ô | LATIN CAPITAL LETTER O WITH CIRCUMFLEX |
213 | D5 | Õ | LATIN CAPITAL LETTER O WITH TILDE |
214 | D6 | Ö | LATIN CAPITAL LETTER O WITH DIAERESIS |
215 | D7 | × | MULTIPLICATION SIGN |
216 | D8 | Ø | LATIN CAPITAL LETTER O WITH STROKE |
217 | D9 | Ù | LATIN CAPITAL LETTER U WITH GRAVE |
218 | DA | Ú | LATIN CAPITAL LETTER U WITH ACUTE |
219 | DB | Û | LATIN CAPITAL LETTER U WITH CIRCUMFLEX |
220 | DC | Ü | LATIN CAPITAL LETTER U WITH DIAERESIS |
221 | DD | Ý | LATIN CAPITAL LETTER Y WITH ACUTE |
222 | DE | Þ | LATIN CAPITAL LETTER THORN |
223 | DF | ß | LATIN SMALL LETTER SHARP S |
224 | E0 | à | LATIN SMALL LETTER A WITH GRAVE |
225 | E1 | á | LATIN SMALL LETTER A WITH ACUTE |
226 | E2 | â | LATIN SMALL LETTER A WITH CIRCUMFLEX |
227 | E3 | ã | LATIN SMALL LETTER A WITH TILDE |
228 | E4 | ä | LATIN SMALL LETTER A WITH DIAERESIS |
229 | E5 | å | LATIN SMALL LETTER A WITH RING ABOVE |
230 | E6 | æ | LATIN SMALL LETTER AE |
231 | E7 | ç | LATIN SMALL LETTER C WITH CEDILLA |
232 | E8 | è | LATIN SMALL LETTER E WITH GRAVE |
233 | E9 | é | LATIN SMALL LETTER E WITH ACUTE |
234 | EA | ê | LATIN SMALL LETTER E WITH CIRCUMFLEX |
235 | EB | ë | LATIN SMALL LETTER E WITH DIAERESIS |
236 | EC | ì | LATIN SMALL LETTER I WITH GRAVE |
237 | ED | í | LATIN SMALL LETTER I WITH ACUTE |
238 | EE | î | LATIN SMALL LETTER I WITH CIRCUMFLEX |
239 | EF | ï | LATIN SMALL LETTER I WITH DIAERESIS |
240 | F0 | ð | LATIN SMALL LETTER ETH |
241 | F1 | ñ | LATIN SMALL LETTER N WITH TILDE |
242 | F2 | ò | LATIN SMALL LETTER O WITH GRAVE |
243 | F3 | ó | LATIN SMALL LETTER O WITH ACUTE |
244 | F4 | ô | LATIN SMALL LETTER O WITH CIRCUMFLEX |
245 | F5 | õ | LATIN SMALL LETTER O WITH TILDE |
246 | F6 | ö | LATIN SMALL LETTER O WITH DIAERESIS |
247 | F7 | ÷ | DIVISION SIGN |
248 | F8 | ø | LATIN SMALL LETTER O WITH STROKE |
249 | F9 | ù | LATIN SMALL LETTER U WITH GRAVE |
250 | FA | ú | LATIN SMALL LETTER U WITH ACUTE |
251 | FB | û | LATIN SMALL LETTER U WITH CIRCUMFLEX |
252 | FC | ü | LATIN SMALL LETTER U WITH DIAERESIS |
253 | FD | ý | LATIN SMALL LETTER Y WITH ACUTE |
254 | FE | þ | LATIN SMALL LETTER THORN |
255 | FF | ÿ | LATIN SMALL LETTER Y WITH DIAERESIS |
ISO/IEC 8859-1 (also known as ISO 8859-1 and Latin-1) is a code page intended for Western European languages; it is based on the character set of terminals that were once popular.
ISO-8859-1 is an encoding registered in 1992. Unlike ISO/IEC 8859-1, it fills code positions 0–31 and 127–159 with control characters (most of which nobody uses anyway). In XHTML, however, the default encoding is UTF-8 rather than ISO-8859-1. The encoding is registered under the names ISO_8859-1:1987, ISO_8859-1, ISO-8859-1, iso-ir-100, csISOLatin1, latin1, l1, IBM819, and CP819.
Tables
The lower half (0–127) of each encoding table is not shown, because it is identical to the first 128 code points of Unicode.
ISO-8859-1
Hex | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
8_ | PAD | HOP | BPH | NBH | IND | NEL | SSA | ESA | HTS | HTJ | VTS | PLD | PLU | RI | SS2 | SS3 |
9_ | DCS | PU1 | PU2 | STS | CCH | MW | SPA | EPA | SOS | SGCI | SCI | CSI | ST | OSC | PM | APC |
A_ | NBSP | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | SHY | ® | ¯ |
B_ | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
C_ | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
D_ | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß |
E_ | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
F_ | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ |
Windows-1252
The original version of this encoding lacked the characters € (0x80), ˆ (0x88), ˜ (0x98), Ž (0x8E), and ž (0x9E).
Hex | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
8_ | € 20AC |  | ‚ 201A | ƒ 0192 | „ 201E | … 2026 | † 2020 | ‡ 2021 | ˆ 02C6 | ‰ 2030 | Š 0160 | ‹ 2039 | Œ 0152 |  | Ž 017D |  |
9_ |  | ‘ 2018 | ’ 2019 | “ 201C | ” 201D | • 2022 | – 2013 | — 2014 | ˜ 02DC | ™ 2122 | š 0161 | › 203A | œ 0153 |  | ž 017E | Ÿ 0178 |
A_ | NBSP | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | SHY | ® | ¯ |
B_ | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
C_ | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
D_ | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß |
E_ | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
F_ | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ |
Wikimedia Foundation.
2010.
Content
Overview
ASCII Control Characters (0-31 and 127)
ASCII Characters (32-126)
ANSI Characters (128-159)
ANSI Characters (160-255)
Overview
ASCII (American Standard Code for Information Interchange) is a 7-bit character set that contains characters from 0 to 127.
The generic term ANSI (American National Standards Institute) is used for 8-bit character sets. These character sets contain the unchanged ASCII character set. In addition, they contain further characters from 128 to 255, which differ in the various ANSI character sets. There are character sets for western special characters and umlauts, and for Arabic, Greek or Cyrillic characters.
The following table shows which characters are available in which (western) character set:
ASCII Control Characters (0-31 and 127)
These characters are part of ASCII, Windows-1252 and ISO-8859-1.
The characters with the ASCII codes 0 to 31 and 127 are control characters which are not intended for display.
The caret notation (in column «C») is often used in terminals to display control characters. These can usually be entered using the control key (Ctrl). For example, the notation «^C» corresponds to the key combination Ctrl+C.
The escape sequence (in column «E») is used e.g. in programming languages or search functions to be able to enter control characters as text.
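The caret notation follows a simple rule: the letter shown after the caret is the control code with 0x40 toggled. The function below is a minimal sketch of that rule; `caret_notation` is a hypothetical name, not part of any standard library.

```python
def caret_notation(code: int) -> str:
    """Return the caret notation for an ASCII control code (0-31 or 127)."""
    if not (0 <= code <= 31 or code == 127):
        raise ValueError("not an ASCII control character")
    # XOR with 0x40 maps 0..31 to '@'..'_' and 127 to '?'.
    return "^" + chr(code ^ 0x40)

print(caret_notation(3))    # Ctrl+C
print(caret_notation(127))  # DEL
```

The same XOR explains why Ctrl+C produces code 3: the letter C is 0x43, and clearing the 0x40 bit leaves 0x03.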
ASCII Characters (32-126)
These characters are part of ASCII, Windows-1252 and ISO-8859-1.
Characters with ASCII codes 32 to 126 are so-called printable characters intended for display or output on printers.
ANSI Characters (128-159)
These characters are part of Windows-1252. In ISO-8859-1 these characters are control characters.
ANSI Characters (160-255)
These characters are part of Windows-1252 and ISO-8859-1.
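The four ranges described in these sections can be compared byte by byte. This is a sketch under the assumption that Python's built-in `cp1252` and `latin-1` codecs represent Windows-1252 and ISO-8859-1 faithfully; `decodes_identically` is a hypothetical helper.

```python
def decodes_identically(byte: int) -> bool:
    """True if the byte decodes to the same character in both encodings."""
    raw = bytes([byte])
    try:
        return raw.decode("cp1252") == raw.decode("latin-1")
    except UnicodeDecodeError:
        # Python's cp1252 codec rejects the five unused bytes,
        # so they count as different here.
        return False

shared = [b for b in range(256) if decodes_identically(b)]
different = [b for b in range(256) if not decodes_identically(b)]

print(len(shared), "bytes decode identically")
print("differing range:", different[0], "to", different[-1])
```

Under these assumptions the two encodings disagree only in the 128 to 159 range.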
Windows-1252 code page
Windows-1252 (legacy, Western Europe) is an 8-bit single-byte coded character set.
This Windows code page is similar to ISO-8859-1.
More character sets
- US-ASCII (basic English)
- ISO-8859-1 (Western Europe)
- ISO-8859-2 (Central Europe)
- ISO-8859-3 (Southern Europe)
- ISO-8859-4 (Baltic)
- ISO-8859-5 (Cyrillic)
- ISO-8859-6 (Arabic)
- ISO-8859-7 (Greek)
- ISO-8859-8 (Hebrew)
- ISO-8859-9 (Turkish)
- ISO-8859-15 (Latin 9)
- SHIFT_JIS (Japanese, Win/Mac)
- Windows-1250 (legacy, Central Europe)
- Windows-1251 (legacy, Cyrillic)
- Windows-1252 (legacy, Western Europe)
- Windows-1253 (legacy, Greek)
- Windows-1254 (legacy, Turkish)
- Windows-1255 (legacy, Hebrew)
- Windows-1256 (legacy, Arabic)
- Windows-1257 (legacy, Baltic Rim)
- Windows-1258 (legacy, Vietnam)