Character codes and smilies:

« » ‹ › ‘ ’ “ ” „ ‚ ❝ ❞ £ ¥ € $ ¢ ¬ ¶ @ § ® ©™ ° × π ± √ ‰ Ω ∞ ≈ ÷ ~ ≠ ¹ ² ³ ½ ¼ ¾ ‐ – — | ⁄ \ [ ] { } † ‡ … · • ● ⌘ ⌥ ⌃ ⇧ ↩ ¡ ※ ◊ ¿ ‽ ⁂ ∴ ∵ ◊ ※ ← → ↑ ↓ ☜ ☞ ☝ ☟ ✔ ★ ♺ ¤ ☼ ☂ ☺ ☹ ☃ ♠ ♦ ♣ ♥ ♪ ♫ ♯ ♀ ♂ α ß Á á À à Å å Ä ä Æ æ Ç ç É é È è Ê ê Í í Ì ì Î î Ñ ñ Ó ó Ò ò Ô ô Ö ö Ø ø Ú ú Ù ù Ü ü Ž ž

The tables below list the most common character codes for text delimiters, accents, punctuation, numbers, and symbols.

Text delimiters:

Character ASCII ISO/IEC
8859-1
(Decimal)
Unicode (Hex) HTML

 ’
 ′
    U+2018
U+2019
U+2032
‘
’
′

 ”
 ″
    U+201C
U+201D
U+2033
“
”
″
«
»
  0171
0187
U+00AB
U+00BB
«
»

    U+3008
U+3009
‹
›
    U+2020 †
    U+2021 ‡
" 34 034 U+0022 "
# 35 035 U+0023 #
$ 36 036 U+0024 $
% 37 037 U+0025 %
& 38 038 U+0026 &
' 39 039 U+0027 '
¢ (cent)   0162 U+00A2 ¢
£ (Pound Sterling)   0163 U+00A3 £
¤ (Gen currency)   0164 U+00A4 ¤
€ (Euro/EU)   0164 U+20AC €
¥ (Yen)   0165 U+00A5 ¥

* The Unicode standard explicitly states that U+2019 is the preferred punctuation apostrophe (not U+0027).

Accents and Symbols

Character ASCII ISO/IEC
8859-1
Unicode (hex) HTML
¦   0166 U+00A6 ¦
§ (Section)   0167 U+00A7 §
´ (Acute)
[resumé]
á Á
é É
í Í
ó Ó
ú Ú
ý Ý
  0180

0225 / 0193
0233 / 0201
0237 / 0205
0243 / 0211
0250 / 0218
0253 / 0221
U+00B4

U+00E1 U+00C1
U+00E9 U+00C9
U+00ED U+00CD
U+00F3 U+00D3
U+00FA U+00DA
U+00FD U+00DD
&acute

áÁ
éÉ
íÍ
óÓ
úÚ
ýÝ
^ ˆ (Circumflex)
[château]
â Â
ê Ê
î Î
ô Ô
û Û
94    






094

0226 / 0194
0234 / 0202
0238 / 0206
0244 / 0212
0251 / 0219
U+005E

U+00E2 U+00C2
U+00EA U+00CA
U+00EE U+00CE
U+00F4 U+00D4
U+00FB U+00DB
ˆ  

â â
ê Ê
î Î
ô Ô
û Û
¨ (Umlaut/Dieresis)
[naïve]
ä Ä
ë Ë
ï Ï
ö Ö
ü Ü
  0168

0228 / 0196
0235 / 0203
0239 / 0207
0246 / 0214
0252 / 0220
U+00A8

U+00C4 U+00C4
U+00EB U+00CB
U+00EF U+00CF
U+00F6 U+00D6
U+00FC U+00DC
¨ 

ä Ä
ë Ë
ï Ï
ö Ö
ü Ü
©   0169 U+00A9 ©
    U+2122 ™
®   0174 U+00AE ®
` (Grave)
[voilà]
à À
è È
ì Ì
ò Ò
ù Ù
96






096

0224 / 0192
0232 / 0200
0236 / 0204
0242 / 0210
0249 / 0217
U+0060

U+00E0 U+00C0
U+00E8 U+00C8
U+00EC U+00CC
U+00F2 U+00D2
U+00F9 U+00D9
(n.b. do not use the grave character as a left quote)

à À
è È
ì Ì
Ò Ò
Ù Ù
˜ ∼ (Tilde)
[mañana]
ã Ã
ñ Ñ
õ Õ
126 


 

0126

0227 / 0195
0241 / 0209
0245 / 0213
U+02DC U+007E

U+00E3 U+00C3
U+00F1 U+00D1
U+00F5 U+00D5
˜ ∼  

ã Ã
ñ Ñ
õ Õ
¸ (Cedilla)
[façade]
ç Ç
  0184

0231 / 0199
U+00B8

U+00E7 U+00C7
¸

ç Ç

On the Apple Macintosh, diacritical marks (Acute,Circumflex,Umlaut,Grave,Tilde,) can be entered with Option + <e,i,u,`,n> followed by the letter to modify, so for example Option+i, A  will produce Â. To enter a Cedilla just use Option+C

Punctuation

Character ASCII ISO/IEC
8859-1
Unicode HTML
SP (Space Bar) 32 032 U+0020 &#32;
! 33 033 U+0021 &#33;
( 40 040 U+0028 &#40;
) 41 041 U+0029 &#41;
* 42 042 U+002A &#42;
+ 43 043 U+002B &#43;
, (Comma) 44 044 U+002C &#44;
- (Hyphen/minus) 45 045 U+002D &#45;
. (Full Stop/period) 46 046 U+002E &#46;
    U+2026 &hellip;
/ 47 047 U+002F &#47;
: (Colon) 58 058 U+003A &#58;
; (Semicolon) 59 059 U+003B &#59;
< 60 060 U+003C &lt;
= 61 061 U+003D &equiv;
62 062 U+003E &gt;
≤ ≥     U+2264 / U+2265 &le; &ge;
? 63 063 U+003F &#63;
@ 64 064 U+0040 &#64;
[ 91 091 U+005B &#91;
\ 92 092 U+005C &#92;
] 93 093 U+005D &#93;
_ 95 095 U+005F &#95;
{ 123 0123 U+007B &#123;
| 124 0124 U+007C &#124;
} 125 0125 U+007D &#125;
DEL (Delete) 127 0127 U+007F  
non breaking Space   0160 U+00A0 &nbsp;
¡ (inverted Exclamation)   0161 U+00A1 &iexcl;
ª (feminine ordinal)   0170 U+00AA &ordf;
¬ (NOT sign)   0172 U+00AC &not;
­ SHY(soft hyphen)   0173 U+00AD &shy;
¯ (Macron accent)   0175 U+00AF &macr;
°   0176 U+00B0 &deg;
±   0177 U+00B1 &plusmn;
µ (Micro)   0181 U+00B5 &micro;
¶ (Paragraph)   0182 U+00B6 &para;
· (middle dot)   0183 U+00B7 &middot;
7   U+2022 &bull;
¸ (Cedilla)   0184 U+00B8 &cedil;
º (Masculine Ordinal)   0186 U+00BA &ordm;
¿   0191 U+00BF &iquest;
    U+2013 &ndash;
    U+2014 &mdash;
String Terminator     U+009C  
π (PI)     U+203B &pi;
← →
↑ ↓
    U+2190 U+2192
U+2191 U+2193 
&larr; &rarr;
&uarr; &darr;
    U+263A &#9786;

Numerics

Character ASCII ISO/IEC
8859-1
Unicode HTML
0 48 048 U+0030 &#48;
1 49 049 U+0031 &#49;
2 50 050 U+0032 &#50;
3 51 051 U+0033 &#51;
4 52 052 U+0034 &#52;
5 53 053 U+0035 &#53;
6 54 054 U+0036 &#54;
7 55 055 U+0037 &#55;
8 56 056 U+0038 &#56;
9 57 057 U+0039 &#57;
¼   0188 U+00BC &frac14;
½   0189 U+00BD &frac12;
¾   0190 U+00BE &frac34;
¹   0185 U+00B9 &sup1
²   0178 U+00B2 &sup2;
³   0179 U+00B3 &sup3;

Characters A - Z, a-z

Character ASCII ISO/IEC
8859-1
Unicode HTML   Character ASCII ISO/IEC
8859-1
Unicode HTML
A 65 065 U+0041 &#65;   a 97 097 U+0061 &#97;
B 66 066 U+0042 &#66;   b 98 098 U+0062 &#98;
C 67 067 U+0043 &#67;   c 99 099 U+0063 &#99;
D 68 068 U+0044 &#68;   d 100 0100 U+0064 &#100;
E 69 069 U+0045 &#69;   e 101 0101 U+0065 &#101;
F 70 070 U+0046 &#70;   f 102 0102 U+0066 &#102;
G 71 071 U+0047 &#71;   g 103 0103 U+0067 &#103;
H 72 072 U+0048 &#72;   h 104 0104 U+0068 &#104;
I 73 073 U+0049 &#73;   i 105 0105 U+0069 &#105;
J 74 074 U+004A &#74;   j 106 0106 U+006A &#106;
K 75 075 U+004B &#75;   k 107 0107 U+006B &#107;
L 76 076 U+004C &#76;   l 108 0108 U+006C &#108;
M 77 077 U+004D &#77;   m 109 0109 U+006D &#109;
N 78 078 U+004E &#78;   n 110 0110 U+006E &#110;
O 79 079 U+004F &#79;   o 111 0111 U+006F &#111;
P 80 080 U+0050 &#80;   p 112 0112 U+0070 &#112;
Q 81 081 U+0051 &#81;   q 113 0113 U+0071 &#113;
R 82 082 U+0052 &#82;   r 114 0114 U+0072 &#114;
S 83 083 U+0053 &#83;   s 115 0115 U+0073 &#115;
T 84 084 U+0054 &#84;   t 116 0116 U+0074 &#116;
U 85 085 U+0055 &#85;   u 117 0117 U+0075 &#117;
V 86 086 U+0056 &#86;   v 118 0118 U+0076 &#118;
W 87 087 U+0057 &#87;   w 119 0119 U+0077 &#119;
X 88 088 U+0058 &#88;   x 120 0120 U+0078 &#120;
Y 89 089 U+0059 &#89;   y 121 0121 U+0079 &#121;
Z 90 090 U+005A &#90;   z 122 0122 U+007A &#122;

Control codes

Character ASCII ISO/IEC
8859-1
Unicode HTML
NUL (Null) 0   U+0000  
SOH (Start of heading) 1   U+0001 <h1>
STX (Start of text) 2   U+0002 <body>
ETX (End of text) 3   U+0003 </body>
EOT (End of transmit) 4   U+0004  
ENQ (Enquiry) 5   U+0005  
ACK (Acknowledgment) 6   U+0006  
BEL (Audible bell) 7   U+0007  
BS (Backspace) 8   U+0008  
HT (Horizontal tab) 9   U+0009  
LF (Line feed) 10   U+000A  
VT (Vertical tab) 11   U+000B  
FF (Form feed) 12   U+000C  
CR (Carriage return) 13   U+000D <br>
SO (Shift out) 14   U+000E <blockquote>
SI (Shift in) 15   U+000F </blockquote>
DLE (Data link escape) 16   U+0010  
DC1 (Device control 1) 17   U+0011  
DC2 (Device control 2) 18   U+0012  
DC3 (Device control 3) 19   U+0013  
DC4 (Device control 4) 20   U+0014  
NAK (Neg. acknowledge) 21   U+0015  
" SYN (Synchronous idle) 22   U+0016  
ETB (End trans. block) 23   U+0017  
CAN (Cancel) 24   U+0018  
EM (End of medium) 25   U+0019  
SUB (Substitution) 26   U+001A  
ESC (Escape) 27   U+001B  
FS (File separator/Figure shift) 28   U+001C  
GS (Group separator) 29   U+001D <div>
RS (Record separator) 30   U+001E  
US (Unit separator) 31   U+001F  

n.b. The first 256 Unicode values represent the same characters as ISO 8859-1
The complete unicode character set runs to over 90,000 entries.

Entering character codes at the terminal:

In Windows to input a 4 digit decimal ISO/IEC code, hold down the ALT key, type e.g. 0164 on the numeric keypad, then release ALT.

In bash type echo $'\xNN' where NN is a two digit (hex) unicode, this covers the Hex values 20 to 7e as shown below:

<space>!"#$%&'()*+,-./0123456789 :;<=>?@ABCDEFGHIJKLMNOPQRSTUVWXYZ [\]^_` abcdefghijklmnopqrstuvwxyz{|}~

bash will also accept the following special characters with Alt-key and Alt-Shift-key combinations:

¡¢£¤¥¦§¨©ª«¬®¯°±²³´µ¶·¸¹º»¼½¾ ¿ÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓ ÔÕÖרÙÚÛ ÜÝ Þßàáâãäåæçèéêëìíîïð ñòóôõö÷ø ùúûüýþÿ
(equivalent to Hex A1-FF)

Extended Ascii Characters
At one time there were many versions of the extended ASCII set (128-255), used to provide (limited) foreign language and graphics support, this is the most popular set. These are not compatible with ISO or unicode and are no longer used in modern systems.

"Smart data structures and dumb code works a lot better than the other way around" - Eric S. Raymond

Related:
Decimal to Hex conversion
Ascii table in numeric order
DecodeUnicode - information on every character (Wiki)
Ishida's online unicode converter
Unicode characters - Wikipedia
Quotation mark glyphs - Wikipedia
Unicode characters - FileFormat.info
Pronunciation guide
Unicode in MS Office
ISO 8859-1 - Wikipedia
Unicode.org
Amazon book:- Unicode Explained by Jukka Korpela


Back to the Top

Simon Sheppard
SS64.com