CJK Unified Ideograph-8A00

U+8A00
BMP Unicode 1.1
Character
Decimal &#35328;
Hex &#x8A00;

Classification

Unicode properties assigned to this character by the Unicode Consortium. The codepoint is its unique numeric identifier. Category, block, and script determine how text systems render and process it.

Codepoint
U+8A00
Decimal
35328
Plane
BMP — Basic Multilingual Plane
Category
Other Letter (Lo)
Script
Han
Bidi class
L Left-to-Right
East Asian Width
W Wide
Properties
Alphabetic, ID Start, ID Continue
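
Most of the properties above can be checked with Python's standard `unicodedata` module. The following is a minimal sketch, not a complete property dump: the stdlib does not expose the Script property, and `str.isalpha()` and `str.isidentifier()` are only close stand-ins for the Alphabetic and ID Start/ID Continue properties. The dictionary name `info` is purely illustrative.

```python
import unicodedata

ch = "\u8a00"  # the character described on this page

info = {
    "codepoint": f"U+{ord(ch):04X}",
    "decimal": ord(ch),
    "name": unicodedata.name(ch),
    "category": unicodedata.category(ch),            # general category
    "bidi_class": unicodedata.bidirectional(ch),     # bidirectional class
    "east_asian_width": unicodedata.east_asian_width(ch),
    # Approximations: isalpha() tests for a letter category, and
    # isidentifier() tests whether the string is a valid Python identifier.
    "is_alphabetic": ch.isalpha(),
    "is_identifier": ch.isidentifier(),
}

for key, value in info.items():
    print(f"{key}: {value}")
```

The printed values should agree with the Codepoint, Decimal, Category, Bidi class, and East Asian Width entries listed above.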

Encodings & Escape Sequences

The same character can be written in several forms depending on context. HTML entities embed it safely in web pages. UTF-8 bytes are what is typically stored on disk and sent over the network. Escape sequences let you reference it in source code without pasting the raw glyph. All formats below refer to the same character, CJK Unified Ideograph-8A00.


Format Value
HTML Decimal
&#35328;
HTML Hex
&#x8A00;
UTF-8 Hex Bytes
E8 A8 80
UTF-16 Hex Bytes (big-endian)
8A 00
UTF-32 Hex
00008A00
CSS Escape
\8A00
JavaScript Escape
\u8A00
Python Escape
\u8A00
URL Encoded
%E8%A8%80
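
The values in the table above can be reproduced from the code point alone. The sketch below uses only the Python standard library and assumes Python 3.8 or later, which `bytes.hex(" ")` requires. The dictionary name `forms` and its keys are illustrative, not part of any API.

```python
import html
from urllib.parse import quote

ch = "\u8a00"
cp = ord(ch)  # 35328

forms = {
    "html_decimal": f"&#{cp};",
    "html_hex": f"&#x{cp:X};",
    "utf8_bytes": ch.encode("utf-8").hex(" ").upper(),
    "utf16_be_bytes": ch.encode("utf-16-be").hex(" ").upper(),
    "utf32_hex": f"{cp:08X}",
    "css_escape": f"\\{cp:X}",
    "js_python_escape": f"\\u{cp:04X}",
    "url_encoded": quote(ch),  # percent-encodes the UTF-8 bytes
}

for name, value in forms.items():
    print(f"{name}: {value}")

# Round trip: both HTML entities decode back to the same character.
assert html.unescape(forms["html_decimal"]) == ch
assert html.unescape(forms["html_hex"]) == ch
```

Note that the UTF-16 value is shown in big-endian byte order; encoding with `"utf-16-le"` would swap the two bytes.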

Unihan Data

Readings and dictionary data from the Unicode Han Database (Unihan).

Definition
words, speech; speak, say; Kangxi radical 149
Mandarin (Pinyin)
yán
Cantonese (Jyutping)
jin4
Japanese On
GEN GON GIN
Japanese Kun
KOTO IU KOTOBA
Korean
EN UN
Vietnamese
ngôn

Characters That Include This

These characters decompose to a sequence that includes CJK Unified Ideograph-8A00 as a component. They are effectively precomposed versions or compounds built on this base character.
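
As an illustration of such a decomposition, the sketch below uses U+2F94 KANGXI RADICAL SPEECH, a character chosen here as an example and not taken from this page's list. Its Unicode decomposition mapping is a compatibility mapping to U+8A00, so NFKC normalization folds it into the base ideograph.

```python
import unicodedata

base = "\u8a00"     # CJK Unified Ideograph-8A00
radical = "\u2f94"  # KANGXI RADICAL SPEECH (example chosen for this sketch)

# The decomposition mapping names U+8A00 as the target, tagged <compat>.
mapping = unicodedata.decomposition(radical)

# Compatibility normalization replaces the radical with the base ideograph.
folded = unicodedata.normalize("NFKC", radical)

# Canonical normalization leaves it untouched, since the mapping is
# a compatibility mapping and not a canonical one.
canonical = unicodedata.normalize("NFC", radical)

print(mapping)
print(folded == base, canonical == radical)
```

The base ideograph itself has no decomposition, so `unicodedata.decomposition(base)` returns an empty string.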