Ь

Cyrillic Capital Letter Soft Sign

U+042C
BMP Unicode 1.1
Character Ь
Decimal Ь
Hex Ь

Classification

Unicode properties assigned to this character by the Unicode Consortium. The codepoint is its unique numeric identifier. Category, block, and script determine how text systems render and process it.

Codepoint
U+042C
Decimal
1068
Plane
BMP — Basic Multilingual Plane
Category
Uppercase Letter (Lu)
Script
Cyrillic
Bidi class
L Left-to-Right
East Asian Width
A Ambiguous
Properties
Alphabetic ID Start ID Continue
Lowercase
ь U+044C

Looks Like (Confusables)

Characters that are visually similar — relevant for security, font design, and homoglyph detection.

Encodings & Escape Sequences

Every Unicode character can be represented in multiple ways depending on context. HTML entities let you embed it safely in web pages. UTF-8 bytes are what gets stored on disk and sent over the network. Escape sequences let you reference it in source code without pasting the raw glyph. All formats below refer to the same character — Cyrillic Capital Letter Soft Sign.

Click the copy icon to copy any value.

Format Value
HTML Decimal
Ь
HTML Hex
Ь
UTF-8 Hex Bytes
D0 AC
UTF-16 Hex Bytes
04 2C
UTF-32 Hex
0000042C
CSS Escape
\42C
JavaScript Escape
\u042C
Python Escape
\u042C
URL Encoded
%D0%AC
Have a string containing this character? Decode it to see every codepoint. UnicodeDecoder →