What is the HTML entity for CJK Compatibility Ideograph-2F840?

CJK Compatibility Ideograph-2F840 has no named HTML entity. Use a numeric character reference instead: &#194624; (decimal) or &#x2F840; (hex).

What is the Unicode codepoint for CJK Compatibility Ideograph-2F840?

CJK Compatibility Ideograph-2F840 has the Unicode codepoint U+2F840 (decimal: 194624).

What is the UTF-8 encoding of CJK Compatibility Ideograph-2F840?

The UTF-8 encoding of CJK Compatibility Ideograph-2F840 is the four-byte sequence F0 AF A1 80.
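
The four bytes follow from the standard UTF-8 bit layout for codepoints above U+FFFF. The sketch below derives them by hand; the helper name utf8_encode_4byte is made up for this illustration, and it deliberately handles only the four-byte case.

```python
def utf8_encode_4byte(codepoint: int) -> bytes:
    """Encode a supplementary-plane codepoint (U+10000..U+10FFFF) as 4 UTF-8 bytes."""
    if not 0x10000 <= codepoint <= 0x10FFFF:
        raise ValueError("only four-byte UTF-8 sequences are handled here")
    return bytes([
        0xF0 | (codepoint >> 18),           # 11110xxx: top 3 bits
        0x80 | ((codepoint >> 12) & 0x3F),  # 10xxxxxx: next 6 bits
        0x80 | ((codepoint >> 6) & 0x3F),   # 10xxxxxx: next 6 bits
        0x80 | (codepoint & 0x3F),          # 10xxxxxx: lowest 6 bits
    ])


encoded = utf8_encode_4byte(0x2F840)
print(encoded.hex(" ").upper())  # F0 AF A1 80
```

In practice chr(0x2F840).encode("utf-8") gives the same bytes; the manual version only shows where each byte comes from.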

How do I use CJK Compatibility Ideograph-2F840 in JavaScript?

In JavaScript, use the surrogate-pair escape \uD87E\uDC40, the code point escape \u{2F840} (ES2015 and later), or paste the character directly in a string.
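
The surrogate pair is plain arithmetic on the codepoint. As a rough sketch in Python (the function name to_surrogate_pair is an assumption for illustration, not a library call), the two UTF-16 code units can be computed like this:

```python
def to_surrogate_pair(codepoint: int) -> tuple[int, int]:
    """Split a codepoint above U+FFFF into its UTF-16 high and low surrogates."""
    if not 0x10000 <= codepoint <= 0x10FFFF:
        raise ValueError("codepoint is not in a supplementary plane")
    offset = codepoint - 0x10000       # 20-bit value
    high = 0xD800 + (offset >> 10)     # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)    # bottom 10 bits
    return high, low


high, low = to_surrogate_pair(0x2F840)
print(f"\\u{high:04X}\\u{low:04X}")  # \uD87E\uDC40
```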

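What is the CSS escape for CJK Compatibility Ideograph-2F840?

The CSS escape for CJK Compatibility Ideograph-2F840 is \2F840. Use it in CSS content properties or selectors. If the next character is a hex digit or a space, end the escape with a space (or pad it to six digits, \02F840) so the following character is not read as part of it.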

How do I write CJK Compatibility Ideograph-2F840 in Python?

In Python, use the escape sequence \U0002F840 inside a string literal. The eight-digit \U form is required because the codepoint is above U+FFFF.

What is the URL encoding for CJK Compatibility Ideograph-2F840?

The percent-encoded (URL-encoded) form of CJK Compatibility Ideograph-2F840 is %F0%AF%A1%80, which is its UTF-8 byte sequence with each byte prefixed by %.
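
As a quick check, Python's standard urllib.parse module produces and reverses this form. The variable names below are only illustrative.

```python
from urllib.parse import quote, unquote

char = "\U0002F840"

# quote() encodes the string as UTF-8 by default, then percent-encodes each byte.
encoded = quote(char)
print(encoded)  # %F0%AF%A1%80

# unquote() reverses the process and yields the original character.
decoded = unquote(encoded)
print(decoded == char)  # True
```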

What Unicode block does CJK Compatibility Ideograph-2F840 belong to?

CJK Compatibility Ideograph-2F840 belongs to the CJK Compatibility Ideographs Supplement Unicode block.

咢

CJK Compatibility Ideograph-2F840

U+2F840

SIP Unicode 3.1

Character 咢

Decimal &#194624;

Hex &#x2F840;

Classification

Unicode properties assigned to this character by the Unicode Consortium. The codepoint is its unique numeric identifier. Category, block, and script determine how text systems render and process it.

Codepoint: U+2F840
Decimal: 194624
Plane: SIP — Supplementary Ideographic Plane
Category: Other Letter (Lo)
Block: CJK Compatibility Ideographs Supplement
Script: Han
Bidi class: L (Left-to-Right)
East Asian Width: W (Wide)
Properties: Alphabetic, ID Start, ID Continue
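
Several of these properties can be read from Python's standard unicodedata module. The following is a minimal sketch; the helper name describe and its dictionary keys are assumptions for illustration, and block and script are not exposed by unicodedata, so they are left out.

```python
import unicodedata


def describe(codepoint: int) -> dict:
    """Collect a few Unicode properties for one codepoint."""
    char = chr(codepoint)
    return {
        "name": unicodedata.name(char, "<unnamed>"),
        "plane": codepoint >> 16,  # plane 2 is the Supplementary Ideographic Plane
        "category": unicodedata.category(char),
        "bidi": unicodedata.bidirectional(char),
        "east_asian_width": unicodedata.east_asian_width(char),
    }


info = describe(0x2F840)
for key, value in info.items():
    print(f"{key}: {value}")
```

The exact values depend on the Unicode version bundled with the Python interpreter, though properties of a character assigned since Unicode 3.1 are unlikely to differ.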

Looks Like (Confusables)

Characters that are visually similar — relevant for security, font design, and homoglyph detection.

咢 U+54A2

Encodings & Escape Sequences

Every Unicode character can be represented in multiple ways depending on context. HTML numeric character references let you embed it safely in web pages. UTF-8 bytes are what gets stored on disk and sent over the network. Escape sequences let you reference it in source code without pasting the raw glyph. All formats below refer to the same character, CJK Compatibility Ideograph-2F840.

Format	Value	Notes
HTML Decimal	`&#194624;`	Decimal numeric character reference (&#decimal;).
HTML Hex	`&#x2F840;`	Hexadecimal numeric character reference (&#xHex;).
UTF-8 Hex Bytes	`F0 AF A1 80`	UTF-8 encoding as hex byte values. Most common on the web.
UTF-16 Hex Bytes	`D8 7E DC 40`	Big-endian surrogate pair (D87E DC40). UTF-16 is used internally by JavaScript, Java, and Windows.
UTF-32 Hex	`0002F840`	Fixed-width: one codepoint = 4 bytes.
CSS Escape	`\2F840`	Use in the CSS content property: content: "\2F840".
JavaScript Escape	`\uD87E\uDC40`	Surrogate-pair escape for JS strings and template literals; "\u{2F840}" also works in ES2015 and later.
Python Escape	`\U0002F840`	Use in Python strings. Codepoints above U+FFFF need the eight-digit "\UXXXXXXXX" form.
URL Encoded	`%F0%AF%A1%80`	Percent-encoded UTF-8 bytes for use in URLs.
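
Most of these values can be regenerated from the codepoint alone. The sketch below uses only the Python standard library; the dictionary name forms and its keys simply mirror the table and are not part of any API.

```python
import html

char = "\U0002F840"
cp = ord(char)

forms = {
    "HTML Decimal": f"&#{cp};",
    "HTML Hex": f"&#x{cp:X};",
    "UTF-8 Hex Bytes": char.encode("utf-8").hex(" ").upper(),
    "UTF-16 Hex Bytes": char.encode("utf-16-be").hex(" ").upper(),  # big-endian, no BOM
    "UTF-32 Hex": char.encode("utf-32-be").hex().upper(),
    "CSS Escape": f"\\{cp:X}",
}

for label, value in forms.items():
    print(f"{label}: {value}")

# Both numeric character references decode back to the same character.
roundtrip_ok = all(
    html.unescape(forms[key]) == char for key in ("HTML Decimal", "HTML Hex")
)
print(roundtrip_ok)  # True
```

The "-be" codec names are chosen so that no byte order mark is prepended; the plain "utf-16" and "utf-32" codecs would add one.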

Unihan Data

Readings and dictionary data from the Unicode Han Database (Unihan).

Cantonese (Jyutping): ngok6

Normalization Forms

Unicode defines four normalization forms that affect how characters with diacritics, compatibility variants, and combining marks are represented. This character has a non-trivial normalization: all four forms replace U+2F840 with the unified ideograph U+54A2. Mismatched normalization is a common cause of failed string comparisons across systems.

NFC

咢 U+54A2

NFD

咢 U+54A2

NFKC

咢 U+54A2

NFKD

咢 U+54A2

NFC = Canonical Decomposition then Canonical Composition (preferred for storage) · NFD = Canonical Decomposition · NFKC/NFKD = Compatibility forms (fold variants like ﬁ → fi)
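
To see the effect on string comparison, here is a small sketch using unicodedata.normalize from the Python standard library. The variable names are illustrative, and the result assumes the interpreter's Unicode data carries the mapping listed above.

```python
import unicodedata

compat = "\U0002F840"   # CJK Compatibility Ideograph-2F840
unified = "\u54A2"      # the unified ideograph it normalizes to

# The raw strings are different codepoints, so a plain comparison fails.
print(compat == unified)  # False

# Every normalization form maps the compatibility ideograph to U+54A2.
results = {
    form: unicodedata.normalize(form, compat)
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}
for form, normalized in results.items():
    print(form, f"U+{ord(normalized):04X}")

# Normalizing both sides before comparing makes them equal.
print(unicodedata.normalize("NFC", compat) == unicodedata.normalize("NFC", unified))  # True
```

Note that normalization is lossy here: once the text is normalized, nothing records that the original was U+2F840.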

Decomposition

This character has a singleton decomposition: it maps to one codepoint, U+54A2, rather than to a sequence of several. The decomposition is canonical, so the two characters are treated as equivalent and every normalization form (NFC, NFD, NFKC, NFKD) replaces U+2F840 with U+54A2.

咢 U+54A2
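
The raw mapping can be inspected with unicodedata.decomposition, which returns the decomposition field as a string of hex codepoints, prefixed with a tag such as <compat> when the mapping is a compatibility one. The parser below is a rough sketch; the function name parse_decomposition and the "canonical"/"none" labels are assumptions made for this example.

```python
import unicodedata


def parse_decomposition(char: str) -> tuple[str, list[int]]:
    """Return (kind, codepoints) for a character's decomposition mapping."""
    raw = unicodedata.decomposition(char)
    if not raw:
        return ("none", [])
    parts = raw.split()
    if parts[0].startswith("<"):
        # A leading <tag> marks a compatibility decomposition.
        return (parts[0].strip("<>"), [int(p, 16) for p in parts[1:]])
    # No tag means the decomposition is canonical.
    return ("canonical", [int(p, 16) for p in parts])


kind, targets = parse_decomposition("\U0002F840")
print(kind, [f"U+{cp:04X}" for cp in targets])  # canonical ['U+54A2']
```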

Previous in block

周 CJK Compatibility Ideograph-2F83F

Next in block

哶 CJK Compatibility Ideograph-2F841

More in CJK Compatibility Ideographs Supplement

丽 U+2F800 丸 U+2F801 乁 U+2F802 𠄢 U+2F803 你 U+2F804 侮 U+2F805 侻 U+2F806 倂 U+2F807 偺 U+2F808 備 U+2F809 僧 U+2F80A 像 U+2F80B 㒞 U+2F80C 𠘺 U+2F80D 免 U+2F80E 兔 U+2F80F 兤 U+2F810 具 U+2F811 𠔜 U+2F812 㒹 U+2F813 內 U+2F814 再 U+2F815 𠕋 U+2F816 冗 U+2F817

More Han Script Characters

⺀ U+2E80 ⺁ U+2E81 ⺂ U+2E82 ⺃ U+2E83 ⺄ U+2E84 ⺅ U+2E85 ⺆ U+2E86 ⺇ U+2E87 ⺈ U+2E88 ⺉ U+2E89 ⺊ U+2E8A ⺋ U+2E8B ⺌ U+2E8C ⺍ U+2E8D ⺎ U+2E8E ⺏ U+2E8F