Text Encoding Detector
Analyze text encoding, view byte representation, and convert between encodings.
Runs entirely in your browser.
Detected Encoding: UTF-8
Characters: 32
UTF-8 Bytes: 45
UTF-16 Code Units: 33
Scripts Detected: Latin, Extended Latin, CJK, Emoji
ASCII Only: No
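The summary figures above can be reproduced with a few lines of Python. This is a minimal sketch, not the tool's actual implementation: the sample string is reconstructed from the Character Details table, and the `summarize` function and its dictionary keys are hypothetical names chosen for illustration. It assumes "Characters" means Unicode code points.

```python
# Sample text reconstructed from the Character Details table (an assumption).
# Non-ASCII characters are written as escapes so the code points are unambiguous.
SAMPLE = "Hello, World! H\u00e9llo W\u00f6rld \u4f60\u597d\u4e16\u754c \U0001F30D"


def summarize(text: str) -> dict:
    """Return size statistics for text (hypothetical helper)."""
    return {
        # len() on a Python str counts Unicode code points.
        "characters": len(text),
        # Number of bytes once the text is encoded as UTF-8.
        "utf8_bytes": len(text.encode("utf-8")),
        # UTF-16 uses 2 bytes per code unit; "-le" avoids a byte order mark.
        "utf16_code_units": len(text.encode("utf-16-le")) // 2,
        # True only if every code point is below U+0080.
        "ascii_only": text.isascii(),
    }


stats = summarize(SAMPLE)
print(stats["characters"])        # 32
print(stats["utf8_bytes"])        # 45
print(stats["utf16_code_units"])  # 33
print(stats["ascii_only"])        # False
```

The UTF-16 count is one higher than the character count because the emoji lies outside the Basic Multilingual Plane and needs a surrogate pair.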
Character Details
| Char | Unicode | UTF-8 Bytes | UTF-16 | HTML Entity | Category |
|---|---|---|---|---|---|
| H | U+0048 | 48 | 0048 | H | ASCII |
| e | U+0065 | 65 | 0065 | e | ASCII |
| l | U+006C | 6C | 006C | l | ASCII |
| l | U+006C | 6C | 006C | l | ASCII |
| o | U+006F | 6F | 006F | o | ASCII |
| , | U+002C | 2C | 002C | , | ASCII |
| (space) | U+0020 | 20 | 0020 | (space) | ASCII |
| W | U+0057 | 57 | 0057 | W | ASCII |
| o | U+006F | 6F | 006F | o | ASCII |
| r | U+0072 | 72 | 0072 | r | ASCII |
| l | U+006C | 6C | 006C | l | ASCII |
| d | U+0064 | 64 | 0064 | d | ASCII |
| ! | U+0021 | 21 | 0021 | ! | ASCII |
| (space) | U+0020 | 20 | 0020 | (space) | ASCII |
| H | U+0048 | 48 | 0048 | H | ASCII |
| é | U+00E9 | C3 A9 | 00E9 | é | Latin-1 |
| l | U+006C | 6C | 006C | l | ASCII |
| l | U+006C | 6C | 006C | l | ASCII |
| o | U+006F | 6F | 006F | o | ASCII |
| (space) | U+0020 | 20 | 0020 | (space) | ASCII |
| W | U+0057 | 57 | 0057 | W | ASCII |
| ö | U+00F6 | C3 B6 | 00F6 | ö | Latin-1 |
| r | U+0072 | 72 | 0072 | r | ASCII |
| l | U+006C | 6C | 006C | l | ASCII |
| d | U+0064 | 64 | 0064 | d | ASCII |
| (space) | U+0020 | 20 | 0020 | (space) | ASCII |
| 你 | U+4F60 | E4 BD A0 | 4F60 | 你 | CJK |
| 好 | U+597D | E5 A5 BD | 597D | 好 | CJK |
| 世 | U+4E16 | E4 B8 96 | 4E16 | 世 | CJK |
| 界 | U+754C | E7 95 8C | 754C | 界 | CJK |
| (space) | U+0020 | 20 | 0020 | (space) | ASCII |
| 🌍 | U+1F30D | F0 9F 8C 8D | D83C DF0D | 🌍 | Emoji/Symbol |
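The Unicode, UTF-8 Bytes, and UTF-16 columns in the table above can be derived for any single character as sketched below. The `describe` function is a hypothetical helper, not the tool's own code, and it leaves out the Category column because the tool's classification rules are not stated here.

```python
def describe(ch: str) -> dict:
    """Return code point and encoded forms for one character (hypothetical helper)."""
    code_point = ord(ch)
    # UTF-8 bytes as space-separated uppercase hex, e.g. "C3 A9".
    utf8_hex = ch.encode("utf-8").hex(" ").upper()
    # Big-endian UTF-16 without a byte order mark, grouped into 16-bit code units.
    raw16 = ch.encode("utf-16-be")
    utf16_hex = " ".join(
        raw16[i:i + 2].hex().upper() for i in range(0, len(raw16), 2)
    )
    return {
        "unicode": f"U+{code_point:04X}",
        "utf8": utf8_hex,
        "utf16": utf16_hex,
    }


print(describe("H"))           # 1 UTF-8 byte, 1 UTF-16 code unit
print(describe("\u00e9"))      # 2 UTF-8 bytes, 1 UTF-16 code unit
print(describe("\u4f60"))      # 3 UTF-8 bytes, 1 UTF-16 code unit
print(describe("\U0001F30D"))  # 4 UTF-8 bytes, 2 UTF-16 code units (surrogate pair)
```

Code points above U+FFFF, such as the emoji in the last row, are the only ones that take two UTF-16 code units.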
Encoded Representations
About Text Encoding
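Text encoding is the process of converting characters into a sequence of bytes that can be stored and transmitted. The same text produces different bytes under different encodings: "é" is C3 A9 in UTF-8 but 00E9 as a single UTF-16 code unit, so a reader must use the same encoding as the writer to recover the original text.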
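As a small illustration of why the encoding matters, the sketch below encodes a string to bytes, decodes it back correctly, and then decodes it with the wrong encoding. The variable names are illustrative only, and Latin-1 is chosen as the wrong encoding purely as an example.

```python
text = "H\u00e9llo"  # "Héllo", with é as the single code point U+00E9

# Encoding: characters -> bytes suitable for storage or transmission.
stored = text.encode("utf-8")
print(stored)  # b'H\xc3\xa9llo'

# Decoding with the same encoding recovers the original text.
round_trip = stored.decode("utf-8")
print(round_trip == text)  # True

# Decoding with a different encoding silently yields the wrong characters:
# the two UTF-8 bytes C3 A9 are read as two separate Latin-1 characters.
wrong = stored.decode("latin-1")
print(wrong)  # HÃ©llo
```

Garbled output of this kind is a common sign that text was decoded with a different encoding than the one used to write it.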
Learn more: Character Sets