What is the HTML entity for Bengali Vowel Sign Au?

The HTML numeric character reference for Bengali Vowel Sign Au is &#2508; (decimal) or &#x9CC; (hex). Both render as ৌ.

What is the Unicode codepoint for Bengali Vowel Sign Au?

Bengali Vowel Sign Au has the Unicode codepoint U+09CC (decimal: 2508).

What is the UTF-8 encoding of Bengali Vowel Sign Au?

The UTF-8 encoding of Bengali Vowel Sign Au is E0 A7 8C.

How do I use Bengali Vowel Sign Au in JavaScript?

In JavaScript, use the escape sequence \u09CC or paste the character directly in a string.

What is the CSS escape for Bengali Vowel Sign Au?

The CSS escape for Bengali Vowel Sign Au is \9CC. Use it in CSS content properties or selectors.

How do I write Bengali Vowel Sign Au in Python?

In Python, use the escape sequence \u09CC inside a string literal.
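
As a minimal sketch, assuming Python 3 (where every str is Unicode), the escape can be written in several equivalent ways. The variable names below are illustrative only.

```python
import unicodedata

# Four equivalent spellings of U+09CC in Python 3 source code.
from_escape = "\u09CC"                   # 4-digit \u escape
from_long_escape = "\U000009CC"          # 8-digit \U escape
from_name = "\N{BENGALI VOWEL SIGN AU}"  # lookup by Unicode character name
from_codepoint = chr(0x09CC)             # built from the integer codepoint

# All four produce the same one-character string.
assert from_escape == from_long_escape == from_name == from_codepoint

print(hex(ord(from_escape)))          # 0x9cc
print(unicodedata.name(from_escape))  # BENGALI VOWEL SIGN AU
```

The escapes are not interpreted inside raw strings (r"..."), so there the literal backslash sequence is kept instead.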

What is the URL encoding for Bengali Vowel Sign Au?

The percent-encoded (URL encoded) form of Bengali Vowel Sign Au is %E0%A7%8C.

What Unicode block does Bengali Vowel Sign Au belong to?

Bengali Vowel Sign Au belongs to the Bengali Unicode block.

ৌ

Bengali Vowel Sign Au

U+09CC

BMP Unicode 1.1

Character ৌ

Decimal &#2508;

Hex &#x9CC;

Classification

Unicode properties assigned to this character by the Unicode Consortium. The codepoint is its unique numeric identifier. Category, block, and script determine how text systems render and process it.

Codepoint: U+09CC
Decimal: 2508
Plane: BMP — Basic Multilingual Plane
Category: Spacing Mark (Mc)
Block: Bengali
Script: Bengali
Bidi class: L Left-to-Right
East Asian Width: N Neutral
Properties: Alphabetic ID Continue
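
Several of the properties listed above can be checked with Python's standard unicodedata module. This is a sketch, not an exhaustive lookup: unicodedata does not expose block or script, so those two fields are not covered, and the dictionary keys are names chosen here for illustration.

```python
import unicodedata

ch = "\u09CC"  # Bengali Vowel Sign Au

# Query the properties that the standard library exposes directly.
properties = {
    "name": unicodedata.name(ch),
    "category": unicodedata.category(ch),                  # general category
    "bidi_class": unicodedata.bidirectional(ch),           # bidirectional class
    "east_asian_width": unicodedata.east_asian_width(ch),  # East Asian Width
    "combining_class": unicodedata.combining(ch),          # canonical combining class
}

for key, value in properties.items():
    print(f"{key}: {value}")
```

The reported values depend on the Unicode database version bundled with the interpreter, which is available as unicodedata.unidata_version.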

Encodings & Escape Sequences

Every Unicode character can be represented in multiple ways depending on context. HTML entities let you embed it safely in web pages. UTF-8 bytes are what gets stored on disk and sent over the network. Escape sequences let you reference it in source code without pasting the raw glyph. All formats below refer to the same character — Bengali Vowel Sign Au.


Format	Value
HTML Decimal	`&#2508;` Decimal numeric character reference (&#decimal;).
HTML Hex	`&#x9CC;` Hexadecimal numeric character reference (&#xHex;).
UTF-8 Hex Bytes	`E0 A7 8C` UTF-8 encoding as hex byte values. Most common on the web.
UTF-16 Hex Bytes	`09 CC` Big-endian byte order (UTF-16BE). UTF-16 is used internally by JavaScript, Java, and Windows.
UTF-32 Hex	`000009CC` Fixed-width: one codepoint = 4 bytes.
CSS Escape	`\9CC` Use in CSS content property: content: "\XXXX"
JavaScript Escape	`\u09CC` Use in JS strings: "\uXXXX" or template literals.
Python Escape	`\u09CC` Use in Python strings: "\uXXXX" or "\UXXXXXXXX".
URL Encoded	`%E0%A7%8C` Percent-encoded for use in URLs.
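
The values in the table can be derived from the codepoint itself. The following is a minimal sketch using only the Python standard library; it assumes Python 3.8 or later (for the separator argument of bytes.hex), and the variable names are illustrative.

```python
import html
from urllib.parse import quote, unquote

ch = "\u09CC"  # Bengali Vowel Sign Au
cp = ord(ch)   # integer codepoint, 2508

html_decimal = f"&#{cp};"                             # &#2508;
html_hex = f"&#x{cp:X};"                              # &#x9CC;
utf8_hex = ch.encode("utf-8").hex(" ").upper()        # E0 A7 8C
utf16_hex = ch.encode("utf-16-be").hex(" ").upper()   # 09 CC (big-endian, no BOM)
utf32_hex = ch.encode("utf-32-be").hex().upper()      # 000009CC
css_escape = f"\\{cp:X}"                              # \9CC
url_encoded = quote(ch)                               # %E0%A7%8C

# Round-trip checks: each encoded form decodes back to the same character.
assert html.unescape(html_decimal) == ch
assert html.unescape(html_hex) == ch
assert bytes.fromhex(utf8_hex).decode("utf-8") == ch
assert unquote(url_encoded) == ch

for label, value in [
    ("HTML Decimal", html_decimal),
    ("HTML Hex", html_hex),
    ("UTF-8", utf8_hex),
    ("UTF-16BE", utf16_hex),
    ("UTF-32BE", utf32_hex),
    ("CSS", css_escape),
    ("URL", url_encoded),
]:
    print(f"{label}: {value}")
```

The explicit "-be" codecs are used so that no byte order mark is prepended; the plain "utf-16" and "utf-32" codecs would add one.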


Normalization Forms

Unicode defines four normalization forms that affect how characters with diacritics, compatibility variants, and combining marks are represented. This character has a non-trivial normalization: the decomposed forms below (NFD and NFKD) differ from its single codepoint, while NFC and NFKC leave it unchanged. Mismatched normalization is a common cause of failed string comparisons across systems.

NFD

ে U+09C7 ৗ U+09D7

NFKD

ে U+09C7 ৗ U+09D7

NFC = Canonical Decomposition then Canonical Composition (preferred for storage) · NFD = Canonical Decomposition · NFKC/NFKD = Compatibility forms (fold variants like ﬁ → fi)
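
To illustrate why normalization matters for comparisons, here is a small sketch using unicodedata.normalize from the Python standard library. The helper same_text is a hypothetical name introduced for this example, and NFC is chosen as the comparison form only because it is the one described above as preferred for storage.

```python
import unicodedata

composed = "\u0995\u09CC"          # KA + VOWEL SIGN AU (2 codepoints)
decomposed = "\u0995\u09C7\u09D7"  # KA + VOWEL SIGN E + AU LENGTH MARK (3 codepoints)

# The two strings render identically but compare unequal codepoint by codepoint.
print(composed == decomposed)  # False

nfd = unicodedata.normalize("NFD", composed)    # splits U+09CC into U+09C7 U+09D7
nfc = unicodedata.normalize("NFC", decomposed)  # recombines the pair into U+09CC


def same_text(a: str, b: str) -> bool:
    """Compare two strings after normalizing both to NFC (hypothetical helper)."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


print(nfd == decomposed)                # True
print(nfc == composed)                  # True
print(same_text(composed, decomposed))  # True
```

Normalizing both sides to the same form before comparing or storing avoids this class of mismatch.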

Decomposition

This character can be broken down into a sequence of simpler Unicode codepoints. This is a canonical decomposition — the character and its components are semantically identical and interchangeable in NFC/NFD normalization.

ে U+09C7 ৗ U+09D7
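
The decomposition mapping can also be read directly from the Unicode Character Database. A minimal sketch with Python's unicodedata.decomposition follows; the variable names are illustrative. A mapping without a leading tag such as <compat> is canonical.

```python
import unicodedata

ch = "\u09CC"  # Bengali Vowel Sign Au

# Raw mapping as a string of space-separated hex codepoints.
mapping = unicodedata.decomposition(ch)  # '09C7 09D7'

# Compatibility mappings start with a tag like '<compat>'; canonical ones do not.
is_canonical = not mapping.startswith("<")

# Convert the hex fields into characters and look up their names.
parts = [chr(int(field, 16)) for field in mapping.split()]
names = [unicodedata.name(part) for part in parts]

print(mapping)       # 09C7 09D7
print(is_canonical)  # True
for part, name in zip(parts, names):
    print(f"U+{ord(part):04X} {name}")
```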

Previous in block

ো Bengali Vowel Sign O

Next in block

্ Bengali Sign Virama

More in Bengali


ঀ U+0980 ঁ U+0981 ং U+0982 ঃ U+0983 অ U+0985 আ U+0986 ই U+0987 ঈ U+0988 উ U+0989 ঊ U+098A ঋ U+098B ঌ U+098C এ U+098F ঐ U+0990 ও U+0993 ঔ U+0994 ক U+0995 খ U+0996 গ U+0997 ঘ U+0998 ঙ U+0999 চ U+099A ছ U+099B জ U+099C
