Conformance of Chinese text to Zipf's law
J. Clark, K. Lua, J. McCallum
Proceedings. PARBASE-90: International Conference on Databases, Parallel Architectures, and Their Applications, 1990-03-07
DOI: 10.1109/PARBSE.1990.77200 (https://doi.org/10.1109/PARBSE.1990.77200)

An investigation was carried out to determine whether Chinese text conforms to Zipf's law. The corpus for this investigation contains 2,022,604 Chinese ideograms. It is shown that single Chinese characters do not conform to Zipf's law; compound words, however, conform well. In addition, regression analysis for compound words indicates a good degree of conformity.
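
The abstract does not say how the regression was carried out. One common way to test conformity to Zipf's law is to rank items by frequency and fit a straight line to log(frequency) against log(rank); a slope near -1 with a high R² suggests good conformity. The sketch below illustrates that generic approach only and is not the authors' procedure. The function name `zipf_fit` and the toy data are hypothetical, and the input is assumed to be already segmented into the units under study (single characters or compound words).

```python
import math
from collections import Counter


def zipf_fit(tokens):
    """Least-squares fit of log(frequency) = intercept + slope * log(rank).

    Returns (slope, intercept, r_squared). Under Zipf's law the slope is
    close to -1 and r_squared is close to 1.
    """
    # Frequencies sorted from most to least common; position + 1 is the rank.
    counts = sorted(Counter(tokens).values(), reverse=True)
    if len(counts) < 2:
        raise ValueError("need at least two distinct tokens")

    xs = [math.log(rank) for rank in range(1, len(counts) + 1)]
    ys = [math.log(count) for count in counts]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    syy = sum((y - mean_y) ** 2 for y in ys)

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # If every frequency is identical, the flat line fits exactly.
    r_squared = 1.0 if syy == 0 else (sxy * sxy) / (sxx * syy)
    return slope, intercept, r_squared


# Toy data (hypothetical): frequencies 60, 30, 20, 15, 12 are exactly 60 / rank.
toy_tokens = []
for rank, freq in enumerate([60, 30, 20, 15, 12], start=1):
    toy_tokens.extend([f"w{rank}"] * freq)

slope, intercept, r_squared = zipf_fit(toy_tokens)
```

Running the same function once on a list of single characters and once on a list of segmented compound words would allow the two fits to be compared, which is the kind of contrast the abstract reports.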