Policy for IDN Code Points - CC Policy from VeriSign, Inc.

You Are Here: US Home > Domain Name Services > Domain Name Services > IDNs > CC Policy

Policy for IDN Code Points



CC Policy

1. Part I: Permissible Code Points 
2. Part II: Restricted Code Points 
3. Part III: Further Restrictions on Commingling of Scripts

1. Part I: Permissible Code Points

The implementation of IDN registration for the .CC Top Level Domain strictly follows the guidelines set forth in the following RFC documents by the Internet Engineering Task Force (IETF):

  1. RFC 3490 describes IDNA which is the IETF standard for IDNs.
  2. RFC 3491 and RFC 3454 describe the algorithms for normalizing IDNs.
  3. RFC 3492 describes punycode, the ASCII encoding algorithm used to make IDNs compatible with the DNS.
  4. The .CC registries allow registration of code points included in Unicode 3.2 subject to the restrictions in the above RFC documents.

2. Part II: Restricted Code Points

The .CC registry implements certain restrictions on code points beyond that allowed by the RFC documents:

    • Code points introduced after Unicode 3.2 are prohibited.
    • The following special characters and symbols are prohibited 

Character/Symbol

Unicode Range

General Punctuation

U+2000 – U+206F

Currency Symbols

U+20A0 – U+20CF

Combining Diacriticals for Symbols

U+20D0 – 20FF

Arrows

U+2190 – 21FF

Mathematical Operators

U+2200 – U+22FF

Miscellaneous Technical

2300 – 23FF

Control Pictures

2400 – 243F

Optical Character Recognition

2440 – 245F

Box Drawing

2500 – 257F

Block Elements

2580 – 259F

Geometric Shapes

25A0 – 25D9

Miscellaneous Symbols

2600 – 26FF

Dingbats

2700 – 27BF

Misc Mathematical Symbols – A

27D0 – 27EB

Supplemental Arrows – A

27F0 – 27FF

Braille Patterns

2800 – 28FF

Supplemental Arrows – B

2900 – 297F

Misc Mathematical Symbols – B

2980 – 29FF

Supp Mathematical Operators

2A00 – 2AFF

Misc Symbols and Arrows

2B00 – 2BFF

CJK Symbols and Punctuation

3000 – 303F

Yijing Hexagram Symbols

4DC0 – 4DFF

Variation Selectors

FE00 – FE0F

CJK Compatability Forms

FE30 – FE4F

Small Form Variants

FE50 – FE6F

Specials

FFF0 – FFFF

Linear B Syllabary

10000 – 1007F

Linear B Ideograms

10080 – 100FF

Aegean Numbers

10100 – 1013F

Old Italic

10300 – 1032F

Gothic

10330 – 1034F

Ugaritic

10380 – 1039F

Deseret

10400 – 104FF

Shavian

10450 – 1047F

Osmanya

10480 – 104AF

Cypriot Syllabary

10800 – 1083F

Byzantine Musical Symbols

1D000 – 1D0FF

Musical Symbols

1D100 – 1D1FF

Tai Xuan Jing Symbols

1D300 – 1D35F

Mathematical Alphanumeric Symbol

1D400 – 1D7FF

    • IDN objects must be properly encoded in punycode, and start with the “xn--“ prefix as identifier indicating they are IDNs.
    • Domain names that start with 2 ASCII letters immediately followed by 2 consecutive hyphens other than “xn--“ are not allowed for registrations.

3. Part III: Further Restrictions on Commingling of Scripts

The Rule 
As a rule, the .CC registry rejects the commingling of code points from different Unicode blocks.  That is, if an IDN contains code points from any two or more Unicode blocks, then that IDN registration is rejected. For example, letter(s) from Latin alphabets cannot be used with letter(s) from Cyrillic in an IDN. All code points within an IDN must come from the same single Unicode block.

The Exception: 
An exception to this rule is made for well-defined languages which want to allow this commingling in certain circumstances.  For instance, symbols for the Korean language are contained in the Unicode block called “Hangul Syllables” (u+AC00 – u+D7AF).  However, Korean commonly uses basic Latin characters as well, so National Internet Development Agency of Korea (NIDA, the .kr ccTLD registry) developed a language table which includes the “Hangul Syllables” as well as lowercase Latin letters (a-z), numbers (0-9) and the hyphen character.  Normally these characters would not be allowed to commingle in a single domain name, but IDN objects marked with the “KOR” language tag for Korean can safely contain the code points above.

The .CC Top Level Domain maintains language definitions for the following languages:

Language Tag

Language

AZE

Azerbaijani

BEL

Belarusian

BUL

Bulgarian

CHI

Chinese

GRE

Greek

JPN

Japanese

KOR

Korean

KUR

Kurdish

MAC

Macedonian

MKD

Macedonian

MOL

Moldavian

POL

Polish

RUS

Russian

SCC

Serbian

SCR

Croatian

SRP

Serbian

UKR

Ukrainian

IDN registrations marked with one of the above language tags will be checked against that languages definition to determine validity. IDN registrations marked with a language not in the above table must meet the zero-commingling conditions of the original rule.

Failed to load XML file with Content ID '044409'.



Contact Us
Contact Us

Phone: (703) 925-6999
info@verisign-grs.com