
Published:Thu, 04 Aug 2011 13:31:48 -0700
NEW HOSTEL... SINGAPORE: Foreign students of MDIS need not have to look far for accommodation -- its new S$80 million student hostel has opened its doors within the integrated cam......
Published:Fri, 19 Aug 2011 12:50:10 -0700
“Accountable care organizations” is the health wonk phrase du jour. Obamacare’s advocates point to its support for ACOs as one of the important cost-control initiatives in t......
Published:Tue, 23 Aug 2011 07:19:39 -0700
Shortly after the vote, Peter Dengate Thrush joined a New gTLD Group which stands to greatly benefit directly from this vote on the program he led on for nearly 3 years.......
Published:Mon, 08 Aug 2011 07:00:00 -0700
BOSTON, MASSACHUSETTS-- - AHRMM11 Annual Conference & Exhibition - TECSYS Inc. , an industry-leading supply chain management software company, announced today TECSYS Supply Manage......
Published:Sun, 07 Aug 2011 12:42:07 -0700
After a touching keynote address by ICANNs Kurt Pritz, industry leaders shared viewpoints and predictions at Master of Your Domain? New TLD Conference for the first time since his......
The conversions between ASCII and non-ASCII forms of a domain name are accomplished by algorithms called ToASCII and ToUnicode. These algorithms are not applied to the domain name as a whole, but rather to individual labels. For example, if the domain name is www.example.com, then the labels are www, example, and com. ToASCII or ToUnicode are applied to each of these three separately.
The details of these two algorithms are complex, and are specified in RFC 3490. The following gives an overview of their function.
ToASCII leaves unchanged any ASCII label, but will fail if the label is unsuitable for the Domain Name System. If given a label containing at least one non-ASCII character, ToASCII will apply the Nameprep algorithm, which converts the label to lowercase and performs other normalization, and will then translate the result to ASCII usingPunycode before prepending the four-character string "xn--". This four-character string is called the ASCII Compatible Encoding (ACE) prefix, and is used to distinguish Punycode encoded labels from ordinary ASCII labels. The ToASCII algorithm can fail in several ways; for example, the final string could exceed the 63-character limit of a DNS name. A label for which ToASCII fails cannot be used in an internationalized domain name.
The function ToUnicode reverses the action of ToASCII, stripping off the ACE prefix and applying the Punycode decode algorithm. It does not reverse the Nameprep processing, since that is merely a normalization and is by nature irreversible. Unlike ToASCII, ToUnicode always succeeds, because it simply returns the original string if decoding fails. In particular, this means that ToUnicode has no effect on a string that does not begin with the ACE prefix.

Digg
|
Reddit
|
Mixx
|
del.icio.us
|
Stumble it! |