{"title":"一种用于中文域名解析的Web重定向服务","authors":"Jeng-Wei Lin, L. Tseng, Jan-Ming Ho, F. Lai","doi":"10.1109/ICITA.2005.50","DOIUrl":null,"url":null,"abstract":"Many efforts in past years have been made to lower the linguistic barriers for non-native English speakers to access the Internet. IDNA (Faltstrom et al., 2003) focuses on access to internationalized domain names (IDN) in a range of scripts that is broader in scope than the original ASCII. However, the use of character variants that have similar appearances and/or interpretations could create confusion. A variant IDL (internationalized domain label), derived from an IDL by replacing some characters with their variants, should match the original IDL. In JET Guidelines (Konishi et al., 2004), it is suggested that zone administrators model this concept of equivalence as an atomic IDL package that contains the variant IDLs generated according to the language variant tables (LVT). In addition to the registered IDL, some of the variant IDLs are activated and stored in the zone files and thus become resolvable. However, an issue of scalability arises when the number of the activated variant IDLs is large. In this paper, we present a mechanism to resolve the variant IDLs in an IDL package into the registered IDL. Specifically, we target Han character variants. Two Han characters are said to be variants of each other if they have the same meaning and are pronounced the same. Furthermore, Han character variants usually have similar appearances. We introduce a new resource record (RR) type, denoted as VarIdx, to associate the variant IDLs with the registered IDL. Experiment results show that a small number of VarIdx RRs are sufficient for enumerating all variant IDLs in an IDL package. We then present a redirection service that uses VarIdx RRs to redirect user requests with variant IDLs to the URLs with the corresponding registered IDLs.","PeriodicalId":371528,"journal":{"name":"Third International Conference on Information Technology and Applications (ICITA'05)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Web redirection service for variant Chinese domain name resolution\",\"authors\":\"Jeng-Wei Lin, L. Tseng, Jan-Ming Ho, F. Lai\",\"doi\":\"10.1109/ICITA.2005.50\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many efforts in past years have been made to lower the linguistic barriers for non-native English speakers to access the Internet. IDNA (Faltstrom et al., 2003) focuses on access to internationalized domain names (IDN) in a range of scripts that is broader in scope than the original ASCII. However, the use of character variants that have similar appearances and/or interpretations could create confusion. A variant IDL (internationalized domain label), derived from an IDL by replacing some characters with their variants, should match the original IDL. In JET Guidelines (Konishi et al., 2004), it is suggested that zone administrators model this concept of equivalence as an atomic IDL package that contains the variant IDLs generated according to the language variant tables (LVT). In addition to the registered IDL, some of the variant IDLs are activated and stored in the zone files and thus become resolvable. However, an issue of scalability arises when the number of the activated variant IDLs is large. In this paper, we present a mechanism to resolve the variant IDLs in an IDL package into the registered IDL. Specifically, we target Han character variants. Two Han characters are said to be variants of each other if they have the same meaning and are pronounced the same. Furthermore, Han character variants usually have similar appearances. We introduce a new resource record (RR) type, denoted as VarIdx, to associate the variant IDLs with the registered IDL. Experiment results show that a small number of VarIdx RRs are sufficient for enumerating all variant IDLs in an IDL package. We then present a redirection service that uses VarIdx RRs to redirect user requests with variant IDLs to the URLs with the corresponding registered IDLs.\",\"PeriodicalId\":371528,\"journal\":{\"name\":\"Third International Conference on Information Technology and Applications (ICITA'05)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2005-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Third International Conference on Information Technology and Applications (ICITA'05)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICITA.2005.50\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Third International Conference on Information Technology and Applications (ICITA'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITA.2005.50","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在过去的几年里,为了降低非英语母语人士访问互联网的语言障碍,人们做出了许多努力。IDNA (Faltstrom et al., 2003)侧重于获取国际化域名(IDN)的一系列脚本,其范围比原始ASCII更广。然而,使用具有相似外观和/或解释的字符变体可能会造成混淆。通过用IDL的变体替换某些字符而从IDL派生出来的变体IDL(国际化域标签)应该与原始IDL匹配。在JET指南(Konishi et al., 2004)中,建议区域管理员将这种等效概念建模为包含根据语言变体表(LVT)生成的变体IDL的原子IDL包。除了注册的IDL之外,一些变体IDL被激活并存储在区域文件中,因此可以解析。但是,当激活的变体idl数量很大时,就会出现可伸缩性问题。在本文中,我们提出了一种将IDL包中的变体IDL解析为注册IDL的机制。具体来说,我们的目标是汉字变体。两个汉字如果意思相同,发音相同,就被称为彼此的变体。此外,汉字变体通常具有相似的外观。我们引入一个新的资源记录(RR)类型,表示为VarIdx,将变体IDL与注册的IDL关联起来。实验结果表明,少量的VarIdx RRs足以枚举一个IDL包中的所有变体IDL。然后,我们提供了一个重定向服务,该服务使用VarIdx rr将具有可变idl的用户请求重定向到具有相应注册idl的url。
A Web redirection service for variant Chinese domain name resolution
Many efforts in past years have been made to lower the linguistic barriers for non-native English speakers to access the Internet. IDNA (Faltstrom et al., 2003) focuses on access to internationalized domain names (IDN) in a range of scripts that is broader in scope than the original ASCII. However, the use of character variants that have similar appearances and/or interpretations could create confusion. A variant IDL (internationalized domain label), derived from an IDL by replacing some characters with their variants, should match the original IDL. In JET Guidelines (Konishi et al., 2004), it is suggested that zone administrators model this concept of equivalence as an atomic IDL package that contains the variant IDLs generated according to the language variant tables (LVT). In addition to the registered IDL, some of the variant IDLs are activated and stored in the zone files and thus become resolvable. However, an issue of scalability arises when the number of the activated variant IDLs is large. In this paper, we present a mechanism to resolve the variant IDLs in an IDL package into the registered IDL. Specifically, we target Han character variants. Two Han characters are said to be variants of each other if they have the same meaning and are pronounced the same. Furthermore, Han character variants usually have similar appearances. We introduce a new resource record (RR) type, denoted as VarIdx, to associate the variant IDLs with the registered IDL. Experiment results show that a small number of VarIdx RRs are sufficient for enumerating all variant IDLs in an IDL package. We then present a redirection service that uses VarIdx RRs to redirect user requests with variant IDLs to the URLs with the corresponding registered IDLs.