{"title":"Person attributes extraction in profiles based on SVM and pattern","authors":"Zhen Zhu, Yuan Sun","doi":"10.1109/ICSESS.2015.7339037","DOIUrl":null,"url":null,"abstract":"This paper is an exploration to find a way to get the person attributes in profiles. Considering those attributes exists in large volume of unstructured data, and it is very difficult to gain in a short time. So, we use a method combing the pattern and SVM to extract the person attributes. Firstly, we collect many raw profiles in websites by our configurable crawler. Secondly, we use statistic methods to do pre-processing works include lexical analysis and name recognition. Thirdly, we build the patterns, which can use in model to extract the person attributes. Also we generalize those patterns to SVM features. Finally, we use SVM assisted with pattern-based method to predict the person attributes. The results prove the method is effective and the data we extracted is useful in building specific-areas' expert database and information retrieval.","PeriodicalId":335871,"journal":{"name":"2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS.2015.7339037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper is an exploration to find a way to get the person attributes in profiles. Considering those attributes exists in large volume of unstructured data, and it is very difficult to gain in a short time. So, we use a method combing the pattern and SVM to extract the person attributes. Firstly, we collect many raw profiles in websites by our configurable crawler. Secondly, we use statistic methods to do pre-processing works include lexical analysis and name recognition. Thirdly, we build the patterns, which can use in model to extract the person attributes. Also we generalize those patterns to SVM features. Finally, we use SVM assisted with pattern-based method to predict the person attributes. The results prove the method is effective and the data we extracted is useful in building specific-areas' expert database and information retrieval.