Why is each byte exactly eight bits?

Eight bits became the standard unit of memory decades ago because it holds 256 values, enough for a full character set of the era. UTF-8 builds on that history, so binary here is always shown in eight-bit groups.

Can I use plain ASCII instead of UTF-8?

For ordinary English text you already are. ASCII is the first 128 code points, and UTF-8 encodes each of them as one identical byte, so the binary you see matches an ASCII table character for character.

How text becomes binary with UTF-8

Follow one character from your keyboard to eight binary digits, see why UTF-8 uses one to four bytes, and learn how a binary string reads back as text.

Open the Binary Translator →

From a character to a code point

Every character has a number assigned by the Unicode standard, called a code point. The capital letter A is code point 65, a space is 32, and an emoji sits far higher in the tens of thousands. Software never stores the shape of a glyph, only this number, which is why the same text can render in any font. Turning text into binary starts by looking up each character's code point.

From a code point to UTF-8 bytes

UTF-8 packs a code point into one to four bytes depending on its size. Code points up to 127 fit in a single byte that begins with a 0, which is exactly the old ASCII range. Larger values split across two, three or four bytes that each begin with a marker pattern, so the letter e with an acute accent becomes the two bytes 11000011 10101001. This variable width keeps English text compact while still covering every symbol.

From bytes to eight-bit binary

A byte is eight bits, and each bit is a single 0 or 1 standing for a power of two. To print a byte the translator converts its value to base two and pads it on the left with zeros until it is eight digits wide, so byte 65 shows as 01000001 rather than 1000001. Padding keeps every byte the same width, which is what lets a reader split a long stream back into clean groups of eight.

Reading binary back into text

Decoding reverses the path. The tool removes any spaces, checks that only 0s and 1s remain, and confirms the length is a multiple of eight. It then slices the stream into eight-bit groups, reads each group as a byte value, and feeds the bytes through a UTF-8 decoder that reassembles the original characters, including any multi-byte ones. If a stray character or an odd length slips in, decoding stops with a clear error instead of guessing.

How text becomes binary with UTF-8

From a character to a code point

From a code point to UTF-8 bytes

From bytes to eight-bit binary

Reading binary back into text

Frequently asked questions