{"title":"PMML and UIMA based frameworks for deploying analytic applications and services","authors":"D. Ferrucci, R. Grossman, A. Levas","doi":"10.1145/1289612.1289614","DOIUrl":null,"url":null,"abstract":"It is convenient to divide data into structured data, semi-structured data and unstructured data. By structured data, we mean data that is organized into fields or attributes. Examples include database records. Semi-structured data has attributes but does not have the regularity of structured data. Data defined by HTML or XML tags are examples of semi-structured data. Unstructured data lacks attributes or fields and includes text data, signals, images, video, audio or similar data. Of course, data may be a combination of one or more of these types. For example, the content of a message can be unstructured text and the metadata semi-structured XML tags.","PeriodicalId":413380,"journal":{"name":"Proceedings of the 4th international workshop on Data mining standards, services and platforms","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th international workshop on Data mining standards, services and platforms","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1289612.1289614","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
It is convenient to divide data into structured data, semi-structured data and unstructured data. By structured data, we mean data that is organized into fields or attributes. Examples include database records. Semi-structured data has attributes but does not have the regularity of structured data. Data defined by HTML or XML tags are examples of semi-structured data. Unstructured data lacks attributes or fields and includes text data, signals, images, video, audio or similar data. Of course, data may be a combination of one or more of these types. For example, the content of a message can be unstructured text and the metadata semi-structured XML tags.