ASCII
With American Standard Code for Information Interchange (ASCII), we can represent a character using a code that associates 7 bits binary digits.
A = 1000001 = 65 (decimal)
B = 1000010 = 66
…
a = 1100001
b = 1100010
space = 0100000
etc…
Keyboard symbols in ASCII
symbol | name | ASCII code (decimal) |
---|---|---|
! | Exclamation mark | 33 |
” | Double quotes | 34 |
# | Hashtag | 35 |
$ | Dollar | 36 |
% | Per cent sign | 37 |
& | Ampersand | 38 |
` | Single quote | 39 |
The maximum code value ASCII can represent is 127(7 bits) or 1111111, that is 128 (0-127) characters can be represented. However, 128 characters are far from enough to represent other languages with their characters (e.g. Chinese characters), therefore we need a new encoding standard (i.e. Unicode).
Unicode
Unicode using 32bits (4 bytes) for a character representation, which could theoretically represent 4 billion characters. Currently there are over 140,000 characters represented in unicode that cover more than 150 languages, including emojis.
Operating_system INFO1112 Character_representation Unicode ASCII