{"title":"运用判别分析选择含内容词","authors":"M. Dillon, Peggy Federhart","doi":"10.1002/asi.4630330409","DOIUrl":null,"url":null,"abstract":"This article presents a method for identifying good indexing terms from frequently occurring stems. The method uses discriminant analysis to distinguish terms that refer to topics from general terms that do not refer to topics. The steps in the method are the selection of discriminating variables, the calibration of predefined groups and the derivation of discriminant functions from them, and the classification of a second, unknown set of terms and its evaluation. The method is tested by applying it to the Harris Survey Question database, which covers 121 different surveys and includes the text of over 12, 000 Individual questions. The evaluation demonstrates the success of the method.","PeriodicalId":50013,"journal":{"name":"Journal of the American Society for Information Science and Technology","volume":"1 1","pages":"245-253"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"The Use of Discriminant Analysis to Select Content-Bearing Words\",\"authors\":\"M. Dillon, Peggy Federhart\",\"doi\":\"10.1002/asi.4630330409\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article presents a method for identifying good indexing terms from frequently occurring stems. The method uses discriminant analysis to distinguish terms that refer to topics from general terms that do not refer to topics. The steps in the method are the selection of discriminating variables, the calibration of predefined groups and the derivation of discriminant functions from them, and the classification of a second, unknown set of terms and its evaluation. The method is tested by applying it to the Harris Survey Question database, which covers 121 different surveys and includes the text of over 12, 000 Individual questions. The evaluation demonstrates the success of the method.\",\"PeriodicalId\":50013,\"journal\":{\"name\":\"Journal of the American Society for Information Science and Technology\",\"volume\":\"1 1\",\"pages\":\"245-253\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the American Society for Information Science and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/asi.4630330409\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Society for Information Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/asi.4630330409","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The Use of Discriminant Analysis to Select Content-Bearing Words
This article presents a method for identifying good indexing terms from frequently occurring stems. The method uses discriminant analysis to distinguish terms that refer to topics from general terms that do not refer to topics. The steps in the method are the selection of discriminating variables, the calibration of predefined groups and the derivation of discriminant functions from them, and the classification of a second, unknown set of terms and its evaluation. The method is tested by applying it to the Harris Survey Question database, which covers 121 different surveys and includes the text of over 12, 000 Individual questions. The evaluation demonstrates the success of the method.