 |
Policy for IDN Code Points |
 |
|
TV Policy
1. Part I: Permissible Code Points
2. Part II: Restricted
Code Points
3. Part III:
Further Restrictions on Commingling of Scripts
1. Part I: Permissible Code Points
The implementation of IDN registration for the .TV
Top Level Domain strictly follows the guidelines set forth in the following
RFC documents by the Internet Engineering Task Force (IETF):
- RFC
3490 describes IDNA which is the IETF standard for IDNs.
- RFC
3491 and RFC
3454 describe the algorithms for normalizing IDNs.
- RFC
3492 describes punycode, the ASCII encoding algorithm used
to make IDNs compatible with the DNS.
- The .TV registries
allow registration of code points included in Unicode
3.2 subject to the restrictions in the above RFC documents.
2. Part II: Restricted Code Points
The .TV registry implements certain restrictions on
code points beyond that allowed by the RFC documents:
- Code points introduced
after Unicode
3.2 are prohibited.
- The following special
characters and symbols are prohibited
|
|
General
Punctuation |
U+2000 U+206F |
Currency
Symbols |
U+20A0 U+20CF |
Combining
Diacriticals for Symbols |
U+20D0 20FF |
Arrows |
U+2190 21FF |
Mathematical
Operators |
U+2200 U+22FF |
Miscellaneous
Technical |
2300 23FF |
Control
Pictures |
2400 243F |
Optical
Character Recognition |
2440 245F |
Box
Drawing |
2500 257F |
Block
Elements |
2580 259F |
Geometric
Shapes |
25A0 25D9 |
Miscellaneous
Symbols |
2600 26FF |
Dingbats |
2700 27BF |
Misc
Mathematical Symbols A |
27D0 27EB |
Supplemental
Arrows A |
27F0 27FF |
Braille
Patterns |
2800 28FF |
Supplemental
Arrows B |
2900 297F |
Misc
Mathematical Symbols B |
2980 29FF |
Supp
Mathematical Operators |
2A00 2AFF |
Misc
Symbols and Arrows |
2B00 2BFF |
CJK
Symbols and Punctuation |
3000 303F |
Yijing
Hexagram Symbols |
4DC0 4DFF |
Variation
Selectors |
FE00 FE0F |
CJK
Compatability Forms |
FE30 FE4F |
Small
Form Variants |
FE50 FE6F |
Specials |
FFF0 FFFF |
Linear
B Syllabary |
10000 1007F |
Linear
B Ideograms |
10080 100FF |
Aegean
Numbers |
10100 1013F |
Old
Italic |
10300 1032F |
Gothic |
10330 1034F |
Ugaritic |
10380 1039F |
Deseret |
10400 104FF |
Shavian |
10450 1047F |
Osmanya |
10480 104AF |
Cypriot
Syllabary |
10800 1083F |
Byzantine
Musical Symbols |
1D000 1D0FF |
Musical
Symbols |
1D100 1D1FF |
Tai
Xuan Jing Symbols |
1D300 1D35F |
Mathematical
Alphanumeric Symbol |
1D400 1D7FF |
- IDN objects must be properly encoded in punycode, and start with the
xn-- prefix as identifier indicating they are IDNs.
- Domain names that
start with 2 ASCII letters immediately followed by 2 consecutive hyphens
other than xn-- are not allowed for registrations.
3. Part III: Further Restrictions on Commingling
of Scripts
The Rule
As a rule, the .TV registry rejects the commingling of code points
from different Unicode blocks. That is, if an IDN contains code
points from any two or more Unicode blocks, then that IDN registration
is rejected. For example, letter(s) from Latin alphabets cannot be used
with letter(s) from Cyrillic in an IDN. All code points within an IDN
must come from the same single Unicode block.
The Exception:
An exception to this rule is made for well-defined languages
which want to allow this commingling in certain circumstances.
For instance, symbols for the Korean language are contained in the Unicode
block called Hangul Syllables (u+AC00 u+D7AF). However, Korean
commonly uses basic Latin characters as well, so National Internet Development
Agency of Korea (NIDA, the .kr ccTLD registry) developed a language
table which includes the Hangul Syllables as well as lowercase Latin
letters (a-z), numbers (0-9) and the hyphen character. Normally
these characters would not be allowed to commingle in a single domain
name, but IDN objects marked with the KOR language tag for Korean
can safely contain the code points above.
The .TV Top Level Domain maintains language definitions
for the following languages:
IDN registrations marked
with one of the above language tags will be checked against that languages
definition to determine validity. IDN registrations marked with a language
not in the above table must meet the zero-commingling conditions of
the original rule.
Failed to load XML file with Content ID '044410'.
|