{"title":"“施加号”断句的分类与识别","authors":"Guonian Wang, Lin He","doi":"10.1109/IALP.2013.16","DOIUrl":null,"url":null,"abstract":"Studies on Chinese sentences with shi (ÊÇ) as predicate have been profoundly fruitful from the perspective of syntax, semantics and pragmatics. In a broader sense, however, a large number of sentences with shi functioning as other syntactic roles - adverb, conjunction, auxiliary and even interjection - are practically used, and stand as barriers to natural language processing (NLP) and machine translation (MT). The special fragments consisting of shi plus punctuation are divided into \"shi plus comma\" and \"comma plus shi\", which are examined and discussed with the instruments of corpora, illustrations and comparison. Two exceptional fragments are also briefed to improve the precision in computer identification of these shi-plus-punctuation fragments.","PeriodicalId":413833,"journal":{"name":"2013 International Conference on Asian Language Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Categorization and Identification of Fragments with Shi Plus Punctuation\",\"authors\":\"Guonian Wang, Lin He\",\"doi\":\"10.1109/IALP.2013.16\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Studies on Chinese sentences with shi (ÊÇ) as predicate have been profoundly fruitful from the perspective of syntax, semantics and pragmatics. In a broader sense, however, a large number of sentences with shi functioning as other syntactic roles - adverb, conjunction, auxiliary and even interjection - are practically used, and stand as barriers to natural language processing (NLP) and machine translation (MT). The special fragments consisting of shi plus punctuation are divided into \\\"shi plus comma\\\" and \\\"comma plus shi\\\", which are examined and discussed with the instruments of corpora, illustrations and comparison. Two exceptional fragments are also briefed to improve the precision in computer identification of these shi-plus-punctuation fragments.\",\"PeriodicalId\":413833,\"journal\":{\"name\":\"2013 International Conference on Asian Language Processing\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-08-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 International Conference on Asian Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IALP.2013.16\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Asian Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IALP.2013.16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Categorization and Identification of Fragments with Shi Plus Punctuation
Studies on Chinese sentences with shi (ÊÇ) as predicate have been profoundly fruitful from the perspective of syntax, semantics and pragmatics. In a broader sense, however, a large number of sentences with shi functioning as other syntactic roles - adverb, conjunction, auxiliary and even interjection - are practically used, and stand as barriers to natural language processing (NLP) and machine translation (MT). The special fragments consisting of shi plus punctuation are divided into "shi plus comma" and "comma plus shi", which are examined and discussed with the instruments of corpora, illustrations and comparison. Two exceptional fragments are also briefed to improve the precision in computer identification of these shi-plus-punctuation fragments.