This document provides an overview of the Unicode standard for character encoding. It discusses the design goals of being universal, consistent, and able to encode all written languages. Unicode represents characters as abstract code points from 0 to 10FFFF in hexadecimal and defines UTF-8, UTF-16, and UTF-32 encoding forms to represent these code points with 8, 16, or 32-bit units. The encoding forms allow lossless transformation between representations. UTF-8 is commonly used in XML documents while UTF-16 and UTF-32 are used by different operating systems.