UT

Unicode Character Inspector

Inspect Unicode characters with code points, hex, UTF-8 bytes, and HTML entities.

What Is Unicode Character Inspector?

Unicode Character Inspector breaks down any text into its individual Unicode characters, showing the code point, decimal value, hexadecimal value, UTF-8 byte sequence, and HTML entity for each character. It's essential for debugging encoding issues, understanding multilingual text, and working with special characters.

Results update in real time as you type. This tool runs entirely in your browser — no text is sent to any server.

How to Use This Tool

1

Enter Your Text

Type directly into the input area, paste content with Ctrl+V, or upload/drag a .txt file to inspect.

2

Review the Table

Each character is shown with its Unicode code point, decimal, hex, UTF-8 bytes, and HTML entity.

3

Copy or Download

Use Copy to copy the full inspection table to clipboard, or Download to save it as a tab-separated file.

4

Clear and Start Over

Use the Clear button to reset the input and inspect new text.

Columns Explained

Code Point

The Unicode code point in U+XXXX format. This is the unique identifier assigned to every character in the Unicode standard, from U+0000 to U+10FFFF.

Decimal

The numeric value of the code point in base 10. Useful for programming languages that accept decimal character references and for database storage.

Hex

The hexadecimal representation of the code point prefixed with 0x. Commonly used in programming languages for character escape sequences like \x41 or \u0041.

UTF-8 Bytes

The actual byte sequence when the character is encoded in UTF-8. Single-byte characters use one value (00–FF), while multi-byte characters show multiple space-separated hex bytes.

HTML Entity

The numeric HTML entity (&#decimal;) that can be used in HTML documents to represent the character. Works in all browsers regardless of document encoding.

Who Is This Tool For?

Developers

Debug encoding issues, inspect special characters in source code, and find correct escape sequences for any character.

Web Developers

Get HTML entities for special characters, verify UTF-8 encoding, and troubleshoot rendering issues in web pages.

Data Engineers

Investigate encoding mismatches in data pipelines, identify problematic characters in CSV and JSON files.

Linguists

Analyze text in multiple scripts, identify code points for diacritics, combining marks, and special linguistic characters.

Security Researchers

Inspect suspicious text for homoglyph attacks, hidden characters, and Unicode-based obfuscation techniques.

Technical Writers

Find correct HTML entities and code points for special symbols, arrows, mathematical operators, and other characters.

Tips for Unicode Inspection

Spot look-alike characters

Characters from different scripts can look identical (e.g., Latin 'a' vs Cyrillic 'а'). Check code points to tell them apart.

Debug encoding issues

If text displays as garbled characters, inspect the bytes to determine if the encoding is UTF-8, Latin-1, or something else.

Find HTML entities

Copy the HTML entity column value to use special characters safely in HTML without worrying about encoding.

Check emoji code points

Emoji can be single code points or sequences of multiple code points joined with ZWJ. Inspect to see the full breakdown.

Identify combining characters

Accented characters may be a single precomposed code point or a base character plus combining marks. Inspection reveals which.

Verify string length

JavaScript string length counts UTF-16 code units, not characters. This tool shows the true character count using Unicode iteration.

Privacy & Security

This tool runs 100% in your browser. Your text is never uploaded to any server. It is stored only in your browser's local storage so it persists when you refresh the page.

You can clear it at any time using the “Clear” button. No cookies are used, no analytics track your text content, and no third-party services have access to what you type.