{"title":"数据科学的发展与沙特阿拉伯案例。13年来我们改变了多少?","authors":"Igor Barahona","doi":"arxiv-2310.14808","DOIUrl":null,"url":null,"abstract":"A comprehensive examination of data science vocabulary usage over the past 13\nyears in this work is conducted. The investigation commences with a dataset\ncomprising 16,018 abstracts that feature the term \"data science\" in either the\ntitle, abstract, or keywords. The study involves the identification of\ndocuments that introduce novel vocabulary and subsequently explores how this\nvocabulary has been incorporated into scientific literature. To achieve these\nobjectives, I employ techniques such as Exploratory Data Analysis, Latent\nSemantic Analysis, Latent Dirichlet Analysis, and N-grams Analysis. A\ncomparison of scientific publications between overall results and those\nspecific to Saudi Arabia is presented. Based on how the vocabulary is utilized,\nrepresentative articles are identified.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":"27 10","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The evolving of Data Science and the Saudi Arabia case. How much have we changed in 13 years?\",\"authors\":\"Igor Barahona\",\"doi\":\"arxiv-2310.14808\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A comprehensive examination of data science vocabulary usage over the past 13\\nyears in this work is conducted. The investigation commences with a dataset\\ncomprising 16,018 abstracts that feature the term \\\"data science\\\" in either the\\ntitle, abstract, or keywords. The study involves the identification of\\ndocuments that introduce novel vocabulary and subsequently explores how this\\nvocabulary has been incorporated into scientific literature. To achieve these\\nobjectives, I employ techniques such as Exploratory Data Analysis, Latent\\nSemantic Analysis, Latent Dirichlet Analysis, and N-grams Analysis. A\\ncomparison of scientific publications between overall results and those\\nspecific to Saudi Arabia is presented. Based on how the vocabulary is utilized,\\nrepresentative articles are identified.\",\"PeriodicalId\":501323,\"journal\":{\"name\":\"arXiv - STAT - Other Statistics\",\"volume\":\"27 10\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Other Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2310.14808\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Other Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2310.14808","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The evolving of Data Science and the Saudi Arabia case. How much have we changed in 13 years?
A comprehensive examination of data science vocabulary usage over the past 13
years in this work is conducted. The investigation commences with a dataset
comprising 16,018 abstracts that feature the term "data science" in either the
title, abstract, or keywords. The study involves the identification of
documents that introduce novel vocabulary and subsequently explores how this
vocabulary has been incorporated into scientific literature. To achieve these
objectives, I employ techniques such as Exploratory Data Analysis, Latent
Semantic Analysis, Latent Dirichlet Analysis, and N-grams Analysis. A
comparison of scientific publications between overall results and those
specific to Saudi Arabia is presented. Based on how the vocabulary is utilized,
representative articles are identified.