https://www.joelonsoftware.com/2003/10/08/the-absolute-minimum-every-software-developer-absolutely-positively-must-know-about-unicode-and-character-sets-no-excuses/

  • ASCII: uses the numbers 0 to 127 (2^7 = 128 values), which fit in 7 bits. Codes 32 to 126 are printable characters; 0 to 31 and 127 are control characters.
  • Code pages: The eighth bit of a byte adds the values 128 to 255 (2^8 = 256 values in total). Every language and system assigned its own characters to that upper range; each such assignment is a code page.
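  • Unicode: assigns every character a number called a code point, written in hex as U+ followed by the number, e.g. U+0639 (the Arabic letter ain).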
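  • UTF-8: an encoding that stores Unicode code points as bytes. Code points 0 to 127 take one byte each (so ASCII text is unchanged); higher code points take 2 to 4 bytes.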
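
The four points above can be checked with a short Python sketch. It is only an illustration, not taken from the article: the variable names are made up here, and latin-1 and cp1251 are picked as two example code pages.

```python
# Code point: U+0639 is the number 0x0639 assigned to the Arabic letter ain.
ain = "\u0639"
code_point = ord(ain)

# UTF-8 turns that code point into bytes; this one needs two bytes.
utf8_bytes = ain.encode("utf-8")

# An ASCII character (below 128) is a single byte in UTF-8, same value as in ASCII.
ascii_bytes = "A".encode("utf-8")

# Code pages: one byte above 127 decodes to different characters
# depending on which code page is assumed.
byte = b"\xe9"
as_latin1 = byte.decode("latin-1")  # Western European code page
as_cp1251 = byte.decode("cp1251")   # Cyrillic code page

print(hex(code_point), utf8_bytes, ascii_bytes, as_latin1, as_cp1251)
```

The same byte 0xE9 comes out as a Latin letter under one code page and a Cyrillic letter under the other, which is the ambiguity that Unicode code points plus an explicit encoding such as UTF-8 remove.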

Updated: