Boneyard Tools

Unicode Character Lookup

Paste text to break it down character by character. Each entry shows the Unicode code point, the U+ hex form, the decimal value, the numeric HTML entity and the UTF-8 bytes. An emoji that is a single code point, such as U+1F600, counts as one character.
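As a rough illustration of the breakdown described above, the sketch below builds one entry per code point. The function name describeText and the shape of the returned objects are assumptions made for this example, not the tool's actual code. It relies only on the standard TextEncoder, String.prototype.codePointAt and padStart.

```javascript
// Hypothetical helper (not the tool's real source): one entry per code point.
function describeText(text) {
  const encoder = new TextEncoder(); // TextEncoder always produces UTF-8
  const entries = [];
  // for...of walks a string by code point, not by UTF-16 code unit.
  for (const char of text) {
    const codePoint = char.codePointAt(0);
    const hex = codePoint.toString(16).toUpperCase().padStart(4, "0");
    // Format each UTF-8 byte as 0xNN.
    const bytes = Array.from(encoder.encode(char), (b) =>
      "0x" + b.toString(16).toUpperCase().padStart(2, "0")
    );
    entries.push({
      char,
      codePoint,                          // decimal value
      hex: "U+" + hex,                    // U+ notation
      htmlEntity: "&#" + codePoint + ";", // numeric HTML entity
      utf8: bytes.join(" "),
    });
  }
  return entries;
}

// "A" followed by the grinning face, written as an escape.
console.log(describeText("A\u{1F600}"));
```

Called on "A" followed by U+1F600, this returns two entries whose values match the examples further down the page.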

How to look up Unicode characters

  1. Paste or type your text into the box.
  2. Read each character's code point, hex, decimal, HTML entity and UTF-8 bytes.
  3. Copy any value you need for code or documentation.

Examples

Inspect a letter

A
Code point 65, hex U+0041, decimal 65, HTML &#65;, UTF-8 0x41

Inspect an emoji

😀
Code point 128512, hex U+1F600, decimal 128512, HTML &#128512;, UTF-8 0xF0 0x9F 0x98 0x80

Frequently asked questions

Why is an emoji counted as one character?

The tool iterates by Unicode code point, not by UTF-16 code unit, so an astral (supplementary-plane) character like an emoji is a single entry even though it occupies two code units, a surrogate pair, in JavaScript strings.
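The difference is easy to see in plain JavaScript. This is only a small sketch with illustrative variable names: it compares the string's length, which counts UTF-16 code units, with the number of items produced by the string iterator, which walks code points.

```javascript
const grin = "\u{1F600}"; // grinning face, U+1F600

// .length counts UTF-16 code units, so the surrogate pair counts as 2.
const codeUnits = grin.length;

// Spreading a string uses its iterator, which yields whole code points.
const codePoints = [...grin].length;

// The two code units that make up the surrogate pair.
const high = grin.charCodeAt(0).toString(16).toUpperCase();
const low = grin.charCodeAt(1).toString(16).toUpperCase();

console.log(codeUnits, codePoints); // 2 1
console.log(high, low);             // D83D DE00
```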

What is the difference between the code point and the UTF-8 bytes?

The code point is the abstract Unicode number for the character. The UTF-8 bytes are how that code point is stored on disk or sent over the network. UTF-8 uses one to four bytes per code point, with higher code points needing more bytes.
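To make the one-to-four-byte range concrete, the sketch below encodes four sample characters with the standard TextEncoder and reports how many bytes each takes. The helper name utf8Length and the choice of samples are assumptions for illustration.

```javascript
const encoder = new TextEncoder(); // TextEncoder always produces UTF-8

// Hypothetical helper: number of UTF-8 bytes needed for a string.
function utf8Length(text) {
  return encoder.encode(text).length;
}

console.log(utf8Length("A"));         // 1  (U+0041, ASCII)
console.log(utf8Length("\u00E9"));    // 2  (U+00E9, e with acute)
console.log(utf8Length("\u20AC"));    // 3  (U+20AC, euro sign)
console.log(utf8Length("\u{1F600}")); // 4  (U+1F600, grinning face)
```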

How are combining characters handled?

A base letter and a separate combining mark are shown as two entries because they are two code points. That reflects how the text is actually stored.
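A short sketch of what that looks like for an accented letter written as a base letter plus a combining mark. The variable names are illustrative, and the call to normalize is shown only for contrast. This page does not say that the tool normalizes input.

```javascript
// "e" followed by U+0301 COMBINING ACUTE ACCENT: renders as one glyph.
const decomposed = "e\u0301";

// Iterating by code point gives two entries.
const entries = Array.from(decomposed, (c) =>
  "U+" + c.codePointAt(0).toString(16).toUpperCase().padStart(4, "0")
);
console.log(entries); // [ 'U+0065', 'U+0301' ]

// For contrast only: NFC normalization composes the pair into U+00E9.
const composed = decomposed.normalize("NFC");
console.log([...composed].length); // 1
```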

What format is the hex value?

It uses the standard Unicode notation U+XXXX, padded to at least four digits, for example U+0041 for the letter A and U+1F600 for a grinning face.
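One way to produce that notation in JavaScript is sketched below. The function name toUPlus is an assumption for this example. padStart only adds zeros when the hex string is shorter than four digits, so longer values pass through unchanged.

```javascript
// Hypothetical helper: format a code point in U+XXXX notation.
function toUPlus(codePoint) {
  return "U+" + codePoint.toString(16).toUpperCase().padStart(4, "0");
}

console.log(toUPlus(0x41));     // U+0041
console.log(toUPlus(0x1F600));  // U+1F600
console.log(toUPlus(0x10FFFF)); // U+10FFFF (highest valid code point)
```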

Is my text private?

Yes. Everything is analyzed in your browser and nothing is uploaded.

Related tools