Windows-1252 encoding table: your trusty helper with the Windows OS

From Wikipedia, the free encyclopedia

This article is about the character encoding commonly mislabeled as "ANSI". For the actual ANSI character encoding, see ASCII. For the actual "ANSI extended Latin" encoding, see ANSEL.

Windows-1252


MIME / IANA	windows-1252^[1]
Alias(es)	cp1252 (code page 1252)
Language(s)	All supported by ISO/IEC 8859-1 plus full support for French^[a] and Finnish and ligature forms for English; e.g. Danish (except for a rare exceptional letter), Irish, Italian, Norwegian, Portuguese, Spanish, Swedish, German (missing uppercase ẞ^[b]), Icelandic, Faroese, Luxembourgish, Albanian, Estonian, Swahili, Tswana, Catalan, Basque, Occitan, Rotokas, Toki Pona, Lojban, Romansh, Dutch (except the Ĳ/ĳ character, substituted by IJ/ij or ÿ), and Slovene (except the č character, substituted by ç). Some languages lack their standard quotation marks (such as German „quotes“).
Created by	Microsoft
Standard	WHATWG Encoding Standard
Classification	extended ASCII, Windows-125x
Extends	ISO 8859-1 (excluding C1 controls)
Transforms / Encodes	ISO 8859-15
Succeeded by	Unicode (UTF-8, UTF-16)

Windows-1252 or CP-1252 (Windows code page 1252) is a legacy single-byte character encoding^[2] that is used by default (as the "ANSI code page") in Microsoft Windows throughout the Americas, Western Europe, Oceania, and much of Africa.^[3]

Initially the same as ISO 8859-1, it began to diverge starting in Windows 2.0 by adding additional characters in the 0x80 to 0x9F (hex) range (the ISO standards reserve this range for C1 control codes). Notable additional characters include curly quotation marks and all printable characters from ISO 8859-15.
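As a rough illustration of that divergence, the sketch below walks the 0x80 to 0x9F range and lists the bytes that decode to a printable character under Windows-1252. It assumes Python's built-in `cp1252` codec as a stand-in for the code page; the helper name `printable_additions` is made up for this example.

```python
import unicodedata

def printable_additions():
    """Return {byte: char} for bytes 0x80-0x9F that the cp1252 codec defines."""
    additions = {}
    for byte in range(0x80, 0xA0):
        try:
            char = bytes([byte]).decode("cp1252")
        except UnicodeDecodeError:
            continue  # Python's cp1252 codec leaves a few positions undefined
        additions[byte] = char
    return additions

additions = printable_additions()
for byte, char in additions.items():
    # Under ISO 8859-1 (latin-1) the same byte is a C1 control code.
    control = bytes([byte]).decode("latin-1")
    assert unicodedata.category(control) == "Cc"
    print(f"0x{byte:02X} -> U+{ord(char):04X} {unicodedata.name(char)}")
```

Only code points and character names are printed, so the output stays ASCII even on consoles that cannot display the characters themselves.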

It is the most-used single-byte character encoding in the world. Although almost all websites now use the multi-byte character encoding UTF-8, as of April 2025, 1.1%^[4] of websites declared ISO 8859-1, which is treated as Windows-1252 by all modern browsers (as required by the HTML5 standard^[5]), plus 0.3% declared Windows-1252 directly,^[4]^[6] for a total of 1.4%. Some countries or languages show higher usage than the global average: in 2025, usage among websites in Brazil was 2.9%,^[7] and in Germany 2.4%^[8]^[9] (these are the sums of ISO-8859-1 and CP-1252 declarations).

It is known to Windows by the code page number 1252, and by the IANA-approved name "windows-1252".

Historically, the phrase "ANSI Code Page" was used in Windows to refer to non-DOS encodings; the intention was that most of these would be ANSI standards such as ISO-8859-1. Even though Windows-1252 was the first and by far most popular code page named so in Microsoft Windows parlance, the code page has never been an ANSI standard. Microsoft explains, "The term ANSI as used to signify Windows code pages is a historical reference, but is nowadays a misnomer that continues to persist in the Windows community."^[10]

LaTeX can input Windows-1252 by using inputenc.sty with parameter ansinew (and more recently cp1252).^[11]^[12]

IBM uses code page 1252 (CCSID 1252 and euro sign extended CCSID 5348) for Windows-1252.^[13]^[14]^[15]

It is called "WE8MSWIN1252" by Oracle Database.^[16]

The first version of the codepage was used in Microsoft Windows 1.0. It matched the ISO-8859-1 standard (including leaving code points 0xD7 and 0xF7 undefined, as they were not in the standard at that time).
The second version of the codepage was introduced in Microsoft Windows 2.0. In this version, code points 0xD7, 0xF7, 0x91, and 0x92 are defined.
The third version of the codepage was introduced in Microsoft Windows 3.1. It defined all code points used in the final version except the euro sign and the Z with caron character pair.
The final version (shown below) was introduced in Microsoft Windows 98.

Starting in the 1990s, many Microsoft products that could produce HTML included Windows-1252-exclusive characters, but marked the encoding as ISO-8859-1, ASCII, or undeclared.^{[citation needed]} Characters exclusive to Windows-1252 would render incorrectly on non-Windows operating systems (often as question marks).^[17]^[18] In particular, typographers’ quotes—curly variants of the standard straight apostrophes and quotation marks in US-ASCII—were commonly used in files produced in Windows applications such as Microsoft Word due to the smart quotes feature, which can automatically convert straight apostrophes and quotation marks to the curly variants.^[19] To fix this, by 2000 most web browsers and e-mail clients treated the charsets ISO-8859-1 and US-ASCII as Windows-1252^{[citation needed]}—this behavior is now required by the HTML5 specification.^[5] Undeclared charsets in HTML are also assumed to be Windows-1252.^[20]^[21]
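The mislabeling problem can be reproduced in a few lines. This is a minimal sketch using Python's built-in `latin-1` and `cp1252` codecs; the sample bytes are invented for illustration and stand for text saved by a Windows application with smart quotes enabled.

```python
# Bytes as a Windows application might write them: 0x92 is a curly
# apostrophe and 0x93/0x94 are curly double quotes in Windows-1252.
raw = b"It\x92s \x93quoted\x94"

# A reader that takes an ISO-8859-1 label literally gets C1 control codes.
as_labelled = raw.decode("latin-1")

# A reader that treats the label as Windows-1252 recovers the intended text.
as_intended = raw.decode("cp1252")

print(ascii(as_labelled))
print(ascii(as_intended))
```

`ascii()` is used for printing so the difference between the control codes and the curly quotes is visible regardless of the console encoding.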

Although Windows NT supported Unicode and attempted to encourage programs to use it, it only provided the 16-bit code units of UCS-2/UTF-16, despite the existing support for other multibyte character encodings such as Shift-JIS. As many applications preferred to use 8-bit strings, Windows-1252 remained the most popular encoding on Windows.^{[citation needed]} UTF-8 has been supported since Windows 10 so this is gradually changing.^{[citation needed]}

The following table shows Windows-1252. Differences from ISO-8859-1 are shown with their Unicode code point number, based on the Unicode.org mapping of Windows-1252 with "best fit".

Windows-1252 (CP1252)^[22]^[23]^[24]^[25]^[26]

Rows 0x and 1x hold the C0 control codes and 7F is DEL; A0 is NBSP and AD is SHY. Rows 8x and 9x, where Windows-1252 differs from ISO-8859-1, are:

	.0	.1	.2	.3	.4	.5	.6	.7	.8	.9	.A	.B	.C	.D	.E	.F
8.	€ 20AC		‚ 201A	ƒ 0192	„ 201E	… 2026	† 2020	‡ 2021	ˆ 02C6	‰ 2030	Š 0160	‹ 2039	Œ 0152		Ž 017D	
9.		‘ 2018	’ 2019	“ 201C	” 201D	• 2022	– 2013	— 2014	˜ 02DC	™ 2122	š 0161	› 203A	œ 0153		ž 017E	Ÿ 0178

According to the information on Microsoft’s and the Unicode Consortium’s websites, positions 81, 8D, 8F, 90, and 9D are unused; however, the Windows API MultiByteToWideChar maps these to the corresponding C1 control codes. The "best fit" mapping documents this behavior, too.^[22]
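The two treatments of the unused positions can be contrasted in code. The sketch below assumes Python's built-in `cp1252` codec, which rejects the five unused bytes, and adds a hypothetical helper, `decode_cp1252_best_fit`, that imitates the mapping to C1 control codes described above. It is an illustration of the idea, not a reimplementation of MultiByteToWideChar.

```python
# The five positions that the published Windows-1252 tables leave unused.
UNUSED = {0x81, 0x8D, 0x8F, 0x90, 0x9D}

def decode_cp1252_best_fit(data: bytes) -> str:
    """Decode cp1252, mapping unused bytes to the C1 control of the same value."""
    out = []
    for byte in data:
        if byte in UNUSED:
            out.append(chr(byte))  # e.g. 0x81 -> U+0081
        else:
            out.append(bytes([byte]).decode("cp1252"))
    return "".join(out)

sample = b"\x80 \x81"

try:
    sample.decode("cp1252")
    strict_failed = False
except UnicodeDecodeError:
    strict_failed = True  # the strict codec stops at byte 0x81

print(strict_failed)
print(ascii(decode_cp1252_best_fit(sample)))
```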

The OS/2 operating system supports an encoding by the name of Code page 1004 (CCSID 1004) or "Windows Extended".^[27]^[28] This mostly matches code page 1252, with the exception of certain C0 control characters being replaced by diacritic characters.

Code page 1004 (differing rows only)^[29]^[30]^[31]^[32]

NUL	SOH	STX	ETX	ˉ 02C9	˘ 02D8	˙ 02D9	BEL	˚ 02DA	˝ 02DD	˛ 02DB	ˇ 02C7

MS-DOS extensions (rare)

There is a rarely used, but useful, graphics extended code page 1252 where codes 0x00 to 0x1f allow for box drawing as used in applications such as MSDOS Edit and Codeview. One of the applications to use this code page was an Intel Corporation Install/Recovery disk image utility from mid/late 1995. These programs were written for its P6 User Test Program machines (US example^[33]). It was used exclusively in its then EMEA region (Europe, Middle East & Africa). In time the programs were changed to use code page 850.

Graphics Extended Code Page 1252^{[citation needed]}

	.0	.1	.2	.3	.4	.5	.6	.7	.8	.9	.A	.B	.C	.D	.E	.F
0.	○	■	↑	↓	→	←	║	═	╔	╗	╚	╝	░	▒	►	◄
1.	│	─	┌	┐	└	┘	├	┤	┴	┬	♦	┼	█	▄	▀	▬

Latin script in Unicode
Unicode
Universal Coded Character Set
- European Unicode subset (DIN 91379)
UTF-8
Western Latin character sets (computing)
Windows-1250
Windows code pages
ISO/IEC JTC 1/SC 2
Extended ASCII

^ Excluding the narrow non-breaking space, which is preferred to the regular non-breaking space when spacing certain kinds of punctuation.
^ uppercase ẞ was not officially adopted until 2017

^ Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12
^ "Encoding. Living Standard". WHATWG. 13 June 2024. § 9. Legacy single-byte encodings. Retrieved 2024-06-28.
^ Karl-Bridge-Microsoft (2021-10-26). "Code Pages - Win32 apps". learn.microsoft.com. Retrieved 2024-10-09.
^ ^a ^b "Historical trends in the usage statistics of character encodings for websites, December 2024". w3techs.com. Retrieved 2024-12-16.
^ ^a ^b "Encoding". WHATWG. 27 January 2015. sec. 5.2 Names and labels. Archived from the original on 4 February 2015. Retrieved 4 February 2015.
^ "Frequently Asked Questions". w3techs.com.
^ "Distribution of Character Encodings among websites that use Brazil". W3Techs. Retrieved 2024-12-16.
^ "Distribution of Character Encodings among websites that use .de". W3Techs. Retrieved 2025-04-16.
^ "Distribution of Character Encodings among websites that use German". W3Techs. Archived from the original on 4 April 2024. Retrieved 2025-04-16.
^ Wissink, Cathy (5 April 2002). "Unicode and Windows XP" (PDF). Microsoft. p. 1. Archived from the original (PDF) on 4 February 2015. Retrieved 4 February 2015.
^ "LaTeX News, Issue 28" (PDF; 379 KB). The LaTeX Project. Apr 2018. Retrieved 2024-07-27.
^ "Inputenc – Accept different input encodings". The LaTeX Project. 2024-02-08. Retrieved 2024-07-27.
^ "Code page 1252 information document". IBM. 30 September 1997. Archived from the original on 2016-03-03.
^ "CCSID 1252 information document". IBM. Archived from the original on 2016-03-26.
^ "CCSID 5348 information document". IBM. Archived from the original on 2014-11-29.
^ "Database Client Installation Guide". Oracle. Retrieved 2021-02-14.
^ Texin, Tex. "Comparing Characters in Windows-1252, ISO-8859-1, ISO-8859-15". I18nQA.com.
^ van Emden, Eva (28 January 2011). "How to make typographers’ quotes in HTML". vancouvereditor.com. Retrieved 7 January 2024. If you use typographers’ quotes without specifying the right character encoding for your HTML file, some of your viewers are going to see question marks, boxes, or other crazy symbols instead of the beautiful curly quotes you intended them to see.
^ "Smart quotes in Word". Microsoft Support. Microsoft. Retrieved 7 January 2024.
^ "NetWare Web Search: Understanding Character Set Encodings". Novell Documentation. Novell. if a document does not contain a CHARSET encoding value, the default encoding for HTML documents is ISO-8859-1, also known as Latin1. The default encoding for plain text documents is US-ASCII.
^ Observed behavior in Chrome, this may be UTF-8 in some browsers.^{[original research?]}
^ ^a ^b "Unicode mappings of Windows-1252 with 'Best Fit'". Unicode. Archived from the original on 4 February 2015. Retrieved 4 February 2015.
^ Code Page 01252 (PDF), IBM, 1998, archived (PDF) from the original on 27 October 2023
^ Code Page (CPGID) 01252 (txt), IBM, 1998, archived from the original on 8 April 2023
^ International Components for Unicode (ICU), ibm-1252_P100-2000.ucm, 2002-12-03
^ International Components for Unicode (ICU), ibm-5348_P100-1997.ucm, 2002-12-03
^ "Code page 1004 information document". Archived from the original on 2015-06-25.
^ "CCSID 1004 information document". Archived from the original on 2016-03-26.
^ "Code Page 01004" (PDF). IBM. Archived (PDF) from the original on 2015-07-08. (version based on Windows 3.1 version of Windows-1252)
^ Code Page CPGID 01004 (pdf) (PDF), IBM
^ Code Page CPGID 01004 (txt), IBM
^ Borgendale, Ken (2001). "Codepage 1004 - Windows Extended". OS/2 codepages by number. Archived from the original on 2018-05-13. Retrieved 2018-05-13. (version based on current version of Windows-1252)
^ Storaasli, Olaf (1996). "Performance of the NASA equation solvers on computational mechanics applications" (PDF). Performance of NASA Equation Solvers on Computational Mechanics Applications. NASA. doi:10.2514/6.1996-1505. S2CID 15711051. Archived from the original (PDF) on 2019-05-03.

Microsoft’s code charts for Windows-1252 ("Code Page 1252 Windows Latin 1 (ANSI)")
Unicode mapping table and code page definition with best fit mappings for Windows-1252

Dec	Hex	Char	Name
32	20	 	SPACE
33	21	!	EXCLAMATION MARK
34	22	"	QUOTATION MARK
35	23	#	NUMBER SIGN
36	24	$	DOLLAR SIGN
37	25	%	PERCENT SIGN
38	26	&	AMPERSAND
39	27	'	APOSTROPHE
40	28	(	LEFT PARENTHESIS
41	29	)	RIGHT PARENTHESIS
42	2A	*	ASTERISK
43	2B	+	PLUS SIGN
44	2C	,	COMMA
45	2D	-	HYPHEN-MINUS
46	2E	.	FULL STOP
47	2F	/	SOLIDUS
48	30	0	DIGIT ZERO
49	31	1	DIGIT ONE
50	32	2	DIGIT TWO
51	33	3	DIGIT THREE
52	34	4	DIGIT FOUR
53	35	5	DIGIT FIVE
54	36	6	DIGIT SIX
55	37	7	DIGIT SEVEN
56	38	8	DIGIT EIGHT
57	39	9	DIGIT NINE
58	3A	:	COLON
59	3B	;	SEMICOLON
60	3C	<	LESS-THAN SIGN
61	3D	=	EQUALS SIGN
62	3E	>	GREATER-THAN SIGN
63	3F	?	QUESTION MARK
64	40	@	COMMERCIAL AT
65	41	A	LATIN CAPITAL LETTER A
66	42	B	LATIN CAPITAL LETTER B
67	43	C	LATIN CAPITAL LETTER C
68	44	D	LATIN CAPITAL LETTER D
69	45	E	LATIN CAPITAL LETTER E
70	46	F	LATIN CAPITAL LETTER F
71	47	G	LATIN CAPITAL LETTER G
72	48	H	LATIN CAPITAL LETTER H
73	49	I	LATIN CAPITAL LETTER I
74	4A	J	LATIN CAPITAL LETTER J
75	4B	K	LATIN CAPITAL LETTER K
76	4C	L	LATIN CAPITAL LETTER L
77	4D	M	LATIN CAPITAL LETTER M
78	4E	N	LATIN CAPITAL LETTER N
79	4F	O	LATIN CAPITAL LETTER O
80	50	P	LATIN CAPITAL LETTER P
81	51	Q	LATIN CAPITAL LETTER Q
82	52	R	LATIN CAPITAL LETTER R
83	53	S	LATIN CAPITAL LETTER S
84	54	T	LATIN CAPITAL LETTER T
85	55	U	LATIN CAPITAL LETTER U
86	56	V	LATIN CAPITAL LETTER V
87	57	W	LATIN CAPITAL LETTER W
88	58	X	LATIN CAPITAL LETTER X
89	59	Y	LATIN CAPITAL LETTER Y
90	5A	Z	LATIN CAPITAL LETTER Z
91	5B	[	LEFT SQUARE BRACKET
92	5C	\	REVERSE SOLIDUS
93	5D	]	RIGHT SQUARE BRACKET
94	5E	^	CIRCUMFLEX ACCENT
95	5F	_	LOW LINE
96	60	`	GRAVE ACCENT
97	61	a	LATIN SMALL LETTER A
98	62	b	LATIN SMALL LETTER B
99	63	c	LATIN SMALL LETTER C
100	64	d	LATIN SMALL LETTER D
101	65	e	LATIN SMALL LETTER E
102	66	f	LATIN SMALL LETTER F
103	67	g	LATIN SMALL LETTER G
104	68	h	LATIN SMALL LETTER H
105	69	i	LATIN SMALL LETTER I
106	6A	j	LATIN SMALL LETTER J
107	6B	k	LATIN SMALL LETTER K
108	6C	l	LATIN SMALL LETTER L
109	6D	m	LATIN SMALL LETTER M
110	6E	n	LATIN SMALL LETTER N
111	6F	o	LATIN SMALL LETTER O
112	70	p	LATIN SMALL LETTER P
113	71	q	LATIN SMALL LETTER Q
114	72	r	LATIN SMALL LETTER R
115	73	s	LATIN SMALL LETTER S
116	74	t	LATIN SMALL LETTER T
117	75	u	LATIN SMALL LETTER U
118	76	v	LATIN SMALL LETTER V
119	77	w	LATIN SMALL LETTER W
120	78	x	LATIN SMALL LETTER X
121	79	y	LATIN SMALL LETTER Y
122	7A	z	LATIN SMALL LETTER Z
123	7B	{	LEFT CURLY BRACKET
124	7C	|	VERTICAL LINE
125	7D	}	RIGHT CURLY BRACKET
126	7E	~	TILDE
128	80	€	EURO SIGN
130	82	‚	SINGLE LOW-9 QUOTATION MARK
131	83	ƒ	LATIN SMALL LETTER F WITH HOOK
132	84	„	DOUBLE LOW-9 QUOTATION MARK
133	85	…	HORIZONTAL ELLIPSIS
134	86	†	DAGGER
135	87	‡	DOUBLE DAGGER
136	88	ˆ	MODIFIER LETTER CIRCUMFLEX ACCENT
137	89	‰	PER MILLE SIGN
138	8A	Š	LATIN CAPITAL LETTER S WITH CARON
139	8B	‹	SINGLE LEFT-POINTING ANGLE QUOTATION MARK
140	8C	Œ	LATIN CAPITAL LIGATURE OE
142	8E	Ž	LATIN CAPITAL LETTER Z WITH CARON
145	91	‘	LEFT SINGLE QUOTATION MARK
146	92	’	RIGHT SINGLE QUOTATION MARK
147	93	“	LEFT DOUBLE QUOTATION MARK
148	94	”	RIGHT DOUBLE QUOTATION MARK
149	95	•	BULLET
150	96	–	EN DASH
151	97	—	EM DASH
152	98	˜	SMALL TILDE
153	99	™	TRADE MARK SIGN
154	9A	š	LATIN SMALL LETTER S WITH CARON
155	9B	›	SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
156	9C	œ	LATIN SMALL LIGATURE OE
158	9E	ž	LATIN SMALL LETTER Z WITH CARON
159	9F	Ÿ	LATIN CAPITAL LETTER Y WITH DIAERESIS
160	A0		NO-BREAK SPACE
161	A1	¡	INVERTED EXCLAMATION MARK
162	A2	¢	CENT SIGN
163	A3	£	POUND SIGN
164	A4	¤	CURRENCY SIGN
165	A5	¥	YEN SIGN
166	A6	¦	BROKEN BAR
167	A7	§	SECTION SIGN
168	A8	¨	DIAERESIS
169	A9	©	COPYRIGHT SIGN
170	AA	ª	FEMININE ORDINAL INDICATOR
171	AB	«	LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
172	AC	¬	NOT SIGN
173	AD		SOFT HYPHEN
174	AE	®	REGISTERED SIGN
175	AF	¯	MACRON
176	B0	°	DEGREE SIGN
177	B1	±	PLUS-MINUS SIGN
178	B2	²	SUPERSCRIPT TWO
179	B3	³	SUPERSCRIPT THREE
180	B4	´	ACUTE ACCENT
181	B5	µ	MICRO SIGN
182	B6	¶	PILCROW SIGN
183	B7	·	MIDDLE DOT
184	B8	¸	CEDILLA
185	B9	¹	SUPERSCRIPT ONE
186	BA	º	MASCULINE ORDINAL INDICATOR
187	BB	»	RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
188	BC	¼	VULGAR FRACTION ONE QUARTER
189	BD	½	VULGAR FRACTION ONE HALF
190	BE	¾	VULGAR FRACTION THREE QUARTERS
191	BF	¿	INVERTED QUESTION MARK
192	C0	À	LATIN CAPITAL LETTER A WITH GRAVE
193	C1	Á	LATIN CAPITAL LETTER A WITH ACUTE
194	C2	Â	LATIN CAPITAL LETTER A WITH CIRCUMFLEX
195	C3	Ã	LATIN CAPITAL LETTER A WITH TILDE
196	C4	Ä	LATIN CAPITAL LETTER A WITH DIAERESIS
197	C5	Å	LATIN CAPITAL LETTER A WITH RING ABOVE
198	C6	Æ	LATIN CAPITAL LETTER AE
199	C7	Ç	LATIN CAPITAL LETTER C WITH CEDILLA
200	C8	È	LATIN CAPITAL LETTER E WITH GRAVE
201	C9	É	LATIN CAPITAL LETTER E WITH ACUTE
202	CA	Ê	LATIN CAPITAL LETTER E WITH CIRCUMFLEX
203	CB	Ë	LATIN CAPITAL LETTER E WITH DIAERESIS
204	CC	Ì	LATIN CAPITAL LETTER I WITH GRAVE
205	CD	Í	LATIN CAPITAL LETTER I WITH ACUTE
206	CE	Î	LATIN CAPITAL LETTER I WITH CIRCUMFLEX
207	CF	Ï	LATIN CAPITAL LETTER I WITH DIAERESIS
208	D0	Ð	LATIN CAPITAL LETTER ETH
209	D1	Ñ	LATIN CAPITAL LETTER N WITH TILDE
210	D2	Ò	LATIN CAPITAL LETTER O WITH GRAVE
211	D3	Ó	LATIN CAPITAL LETTER O WITH ACUTE
212	D4	Ô	LATIN CAPITAL LETTER O WITH CIRCUMFLEX
213	D5	Õ	LATIN CAPITAL LETTER O WITH TILDE
214	D6	Ö	LATIN CAPITAL LETTER O WITH DIAERESIS
215	D7	×	MULTIPLICATION SIGN
216	D8	Ø	LATIN CAPITAL LETTER O WITH STROKE
217	D9	Ù	LATIN CAPITAL LETTER U WITH GRAVE
218	DA	Ú	LATIN CAPITAL LETTER U WITH ACUTE
219	DB	Û	LATIN CAPITAL LETTER U WITH CIRCUMFLEX
220	DC	Ü	LATIN CAPITAL LETTER U WITH DIAERESIS
221	DD	Ý	LATIN CAPITAL LETTER Y WITH ACUTE
222	DE	Þ	LATIN CAPITAL LETTER THORN
223	DF	ß	LATIN SMALL LETTER SHARP S
224	E0	à	LATIN SMALL LETTER A WITH GRAVE
225	E1	á	LATIN SMALL LETTER A WITH ACUTE
226	E2	â	LATIN SMALL LETTER A WITH CIRCUMFLEX
227	E3	ã	LATIN SMALL LETTER A WITH TILDE
228	E4	ä	LATIN SMALL LETTER A WITH DIAERESIS
229	E5	å	LATIN SMALL LETTER A WITH RING ABOVE
230	E6	æ	LATIN SMALL LETTER AE
231	E7	ç	LATIN SMALL LETTER C WITH CEDILLA
232	E8	è	LATIN SMALL LETTER E WITH GRAVE
233	E9	é	LATIN SMALL LETTER E WITH ACUTE
234	EA	ê	LATIN SMALL LETTER E WITH CIRCUMFLEX
235	EB	ë	LATIN SMALL LETTER E WITH DIAERESIS
236	EC	ì	LATIN SMALL LETTER I WITH GRAVE
237	ED	í	LATIN SMALL LETTER I WITH ACUTE
238	EE	î	LATIN SMALL LETTER I WITH CIRCUMFLEX
239	EF	ï	LATIN SMALL LETTER I WITH DIAERESIS
240	F0	ð	LATIN SMALL LETTER ETH
241	F1	ñ	LATIN SMALL LETTER N WITH TILDE
242	F2	ò	LATIN SMALL LETTER O WITH GRAVE
243	F3	ó	LATIN SMALL LETTER O WITH ACUTE
244	F4	ô	LATIN SMALL LETTER O WITH CIRCUMFLEX
245	F5	õ	LATIN SMALL LETTER O WITH TILDE
246	F6	ö	LATIN SMALL LETTER O WITH DIAERESIS
247	F7	÷	DIVISION SIGN
248	F8	ø	LATIN SMALL LETTER O WITH STROKE
249	F9	ù	LATIN SMALL LETTER U WITH GRAVE
250	FA	ú	LATIN SMALL LETTER U WITH ACUTE
251	FB	û	LATIN SMALL LETTER U WITH CIRCUMFLEX
252	FC	ü	LATIN SMALL LETTER U WITH DIAERESIS
253	FD	ý	LATIN SMALL LETTER Y WITH ACUTE
254	FE	þ	LATIN SMALL LETTER THORN
255	FF	ÿ	LATIN SMALL LETTER Y WITH DIAERESIS

Windows-1252

ISO/IEC 8859-1 (also known as ISO 8859-1 and Latin-1) is a code page intended for Western European languages; it is based on the character set of terminals that were popular in the past.

ISO-8859-1 is an encoding registered in 1992. Unlike ISO/IEC 8859-1, code positions 0-31 and 127-159 are filled here with control characters (most of which nobody uses anyway). In XHTML, however, the default encoding is UTF-8. The registered aliases of ISO-8859-1 are ISO_8859-1:1987, ISO_8859-1, ISO-8859-1, iso-ir-100, csISOLatin1, latin1, l1, IBM819 and CP819.

Tables

The lower half (0-127) of the encoding tables is not shown, because it is identical to ASCII, that is, to the first 128 code points of Unicode.

ISO-8859-1

	.0	.1	.2	.3	.4	.5	.6	.7	.8	.9	.A	.B	.C	.D	.E	.F
8.	PAD 80	HOP 81	BPH 82	NBH 83	IND 84	NEL 85	SSA 86	ESA 87	HTS 88	HTJ 89	VTS 8A	PLD 8B	PLU 8C	RI 8D	SS2 8E	SS3 8F
9.	DCS 90	PU1 91	PU2 92	STS 93	CCH 94	MW 95	SPA 96	EPA 97	SOS 98	SGCI 99	SCI 9A	CSI 9B	ST 9C	OSC 9D	PM 9E	APC 9F
A.	A0	¡ A1	¢ A2	£ A3	¤ A4	¥ A5	¦ A6	§ A7	¨ A8	© A9	ª AA	« AB	¬ AC	AD	® AE	¯ AF
B.	° B0	± B1	² B2	³ B3	´ B4	µ B5	¶ B6	· B7	¸ B8	¹ B9	º BA	» BB	¼ BC	½ BD	¾ BE	¿ BF
C.	À C0	Á C1	Â C2	Ã C3	Ä C4	Å C5	Æ C6	Ç C7	È C8	É C9	Ê CA	Ë CB	Ì CC	Í CD	Î CE	Ï CF
D.	Ð D0	Ñ D1	Ò D2	Ó D3	Ô D4	Õ D5	Ö D6	× D7	Ø D8	Ù D9	Ú DA	Û DB	Ü DC	Ý DD	Þ DE	ß DF
E.	à E0	á E1	â E2	ã E3	ä E4	å E5	æ E6	ç E7	è E8	é E9	ê EA	ë EB	ì EC	í ED	î EE	ï EF
F.	ð F0	ñ F1	ò F2	ó F3	ô F4	õ F5	ö F6	÷ F7	ø F8	ù F9	ú FA	û FB	ü FC	ý FD	þ FE	ÿ FF

Windows-1252

The original version of this encoding lacked the characters € (0x80), ˆ (0x88), ˜ (0x98), Ž (0x8E) and ž (0x9E).

	.0	.1	.2	.3	.4	.5	.6	.7	.8	.9	.A	.B	.C	.D	.E	.F
8.	€ 20AC		‚ 201A	ƒ 192	„ 201E	… 2026	† 2020	‡ 2021	ˆ 2C6	‰ 2030	Š 160	‹ 2039	Œ 152		Ž 17D
9.		‘ 2018	’ 2019	“ 201C	” 201D	• 2022	– 2013	— 2014	˜ 2DC	™ 2122	š 161	› 203A	œ 153		ž 17E	Ÿ 178
A.	A0	¡ A1	¢ A2	£ A3	¤ A4	¥ A5	¦ A6	§ A7	¨ A8	© A9	ª AA	« AB	¬ AC	AD	® AE	¯ AF
B.	° B0	± B1	² B2	³ B3	´ B4	µ B5	¶ B6	· B7	¸ B8	¹ B9	º BA	» BB	¼ BC	½ BD	¾ BE	¿ BF
C.	À C0	Á C1	Â C2	Ã C3	Ä C4	Å C5	Æ C6	Ç C7	È C8	É C9	Ê CA	Ë CB	Ì CC	Í CD	Î CE	Ï CF
D.	Ð D0	Ñ D1	Ò D2	Ó D3	Ô D4	Õ D5	Ö D6	× D7	Ø D8	Ù D9	Ú DA	Û DB	Ü DC	Ý DD	Þ DE	ß DF
E.	à E0	á E1	â E2	ã E3	ä E4	å E5	æ E6	ç E7	è E8	é E9	ê EA	ë EB	ì EC	í ED	î EE	ï EF
F.	ð F0	ñ F1	ò F2	ó F3	ô F4	õ F5	ö F6	÷ F7	ø F8	ù F9	ú FA	û FB	ü FC	ý FD	þ FE	ÿ FF

Wikimedia Foundation.
2010.

Content

Overview
ASCII Control Characters (0-31 and 127)
ASCII Characters (32-126)
ANSI Characters (128-159)
ANSI Characters (160-255)

Overview

ASCII (American Standard Code for Information Interchange) is a 7-bit character set that contains characters from 0 to 127.

The generic term ANSI (American National Standards Institute) is used for 8-bit character sets. These character sets contain the unchanged ASCII character set. In addition, they contain further characters from 128 to 255, which differ in the various ANSI character sets. There are character sets for western special characters and umlauts, and for Arabic, Greek or Cyrillic characters.
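To make the point about the upper half concrete, the sketch below decodes one byte under several Windows code pages. It assumes Python's built-in codecs `cp1250` to `cp1253` as stand-ins for the "ANSI" character sets; the chosen byte 0xE4 is only an example.

```python
import unicodedata

CODE_PAGES = ["cp1250", "cp1251", "cp1252", "cp1253"]

# The same byte from the upper half (128-255) decoded under each code page.
upper = {name: b"\xe4".decode(name) for name in CODE_PAGES}

# A byte from the ASCII range decodes identically everywhere.
lower = {name: b"A".decode(name) for name in CODE_PAGES}

for name in CODE_PAGES:
    char = upper[name]
    print(name, f"U+{ord(char):04X}", unicodedata.name(char))
```

The Western and Central European pages agree on this byte (a with diaeresis), while the Cyrillic and Greek pages map it to a Cyrillic and a Greek letter.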

The following table shows which characters are available in which (western) character set:

ASCII Control Characters (0-31 and 127)

These characters are part of ASCII, Windows-1252 and ISO-8859-1.

The characters with the ASCII codes 0 to 31 and 127 are control characters which are not intended for display.

The caret notation (in column "C") is often used in terminals to display control characters. These can usually be entered using the control key (Ctrl). For example, the notation "^C" corresponds to the key combination Ctrl+C.

The escape sequence (in column "E") is used e.g. in programming languages or search functions to be able to enter control characters as text.
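Both notations follow simple rules that can be sketched in code. The helper `caret_notation` below is a hypothetical name for this example; it uses the common convention that a control code is shown as a caret followed by the character 64 positions higher, with DEL (127) written as "^?". The escape sequences shown are the ones Python itself accepts in string literals.

```python
def caret_notation(code: int) -> str:
    """Return the caret notation for an ASCII control code (0-31 or 127)."""
    if 0 <= code <= 31:
        return "^" + chr(code + 64)  # 3 -> "^C", 27 -> "^["
    if code == 127:
        return "^?"
    raise ValueError(f"{code} is not an ASCII control code")

# Escape sequences let control characters be written as ordinary text.
ESCAPES = {"\\t": "\t", "\\n": "\n", "\\r": "\r", "\\x1b": "\x1b"}

for text, char in ESCAPES.items():
    print(f"{text:5} code {ord(char):3}  caret {caret_notation(ord(char))}")
```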

ASCII Characters (32-126)

These characters are part of ASCII, Windows-1252 and ISO-8859-1.

Characters with ASCII codes 32 to 126 are so-called printable characters intended for display or output on printers.

ANSI Characters (128-159)

These characters are part of Windows-1252. In ISO-8859-1 these characters are control characters.

ANSI Characters (160-255)

These characters are part of Windows-1252 and ISO-8859-1.

Windows-1252 code page

Windows-1252 (legacy, Western Europe) is an 8-bit single-byte coded character set.

This Windows code page is similar to ISO-8859-1.

More character sets

US-ASCII (basic English)
ISO-8859-1 (Western Europe)
ISO-8859-2 (Central Europe)
ISO-8859-3 (Southern Europe)
ISO-8859-4 (Baltic)
ISO-8859-5 (Cyrillic)
ISO-8859-6 (Arabic)
ISO-8859-7 (Greek)
ISO-8859-8 (Hebrew)
ISO-8859-9 (Turkish)
ISO-8859-15 (Latin 9)
SHIFT_JIS (Japanese, Win/Mac)
Windows-1250 (legacy, Central Europe)
Windows-1251 (legacy, Cyrillic)
Windows-1252 (legacy, Western Europe)
Windows-1253 (legacy, Greek)
Windows-1254 (legacy, Turkish)
Windows-1255 (legacy, Hebrew)
Windows-1256 (legacy, Arabic)
Windows-1257 (legacy, Baltic Rim)
Windows-1258 (legacy, Vietnam)
