Christian M. Choy, M. Co, Matthew J. Fogel, Clarke D. Garrioch, C. Leung, Ekaterina Martchenko
{"title":"自然科学与社会科学:人口普查数据分析检测母语的变化","authors":"Christian M. Choy, M. Co, Matthew J. Fogel, Clarke D. Garrioch, C. Leung, Ekaterina Martchenko","doi":"10.1109/IMCOM51814.2021.9377412","DOIUrl":null,"url":null,"abstract":"As we are living in a global environment, it is not unusual to have more than one languages or dialects used in a country. Examples include Canada in the Americas, Singapore in Asia, and Switzerland in Europe. With the initiatives of globalization, many people immigrate or live in a country other than their birthplace. As a result, different people in the same country may have different home language (i.e., first language). For instance, as a nation composed of a highly diverse language population, Canada provides a unique opportunity to study the factors causing certain languages (or families of language) to be lost over subsequent generations among allophones (i.e., people whose mother tongue is neither English or French). In this paper, we focus on census data analytics. Specifically, we analyze census microdata by exploring machine learning and data mining techniques-such as decision tree induction, random forest, and categorical naive Bayes-to study the influence of various social and economic factors on the probability that allophones adopt official languages as their language spoken at home. This study is a showcase where natural sciences and engineering (NSE) meet social sciences, in which NSE solutions (e.g., census data analytics) are applicable for the study of social science related phenomena (e.g., successful detection of shifts in home languages).","PeriodicalId":275121,"journal":{"name":"2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Natural Sciences Meet Social Sciences: Census Data Analytics for Detecting Home Language Shifts\",\"authors\":\"Christian M. Choy, M. Co, Matthew J. Fogel, Clarke D. Garrioch, C. Leung, Ekaterina Martchenko\",\"doi\":\"10.1109/IMCOM51814.2021.9377412\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As we are living in a global environment, it is not unusual to have more than one languages or dialects used in a country. Examples include Canada in the Americas, Singapore in Asia, and Switzerland in Europe. With the initiatives of globalization, many people immigrate or live in a country other than their birthplace. As a result, different people in the same country may have different home language (i.e., first language). For instance, as a nation composed of a highly diverse language population, Canada provides a unique opportunity to study the factors causing certain languages (or families of language) to be lost over subsequent generations among allophones (i.e., people whose mother tongue is neither English or French). In this paper, we focus on census data analytics. Specifically, we analyze census microdata by exploring machine learning and data mining techniques-such as decision tree induction, random forest, and categorical naive Bayes-to study the influence of various social and economic factors on the probability that allophones adopt official languages as their language spoken at home. This study is a showcase where natural sciences and engineering (NSE) meet social sciences, in which NSE solutions (e.g., census data analytics) are applicable for the study of social science related phenomena (e.g., successful detection of shifts in home languages).\",\"PeriodicalId\":275121,\"journal\":{\"name\":\"2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IMCOM51814.2021.9377412\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMCOM51814.2021.9377412","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Natural Sciences Meet Social Sciences: Census Data Analytics for Detecting Home Language Shifts
As we are living in a global environment, it is not unusual to have more than one languages or dialects used in a country. Examples include Canada in the Americas, Singapore in Asia, and Switzerland in Europe. With the initiatives of globalization, many people immigrate or live in a country other than their birthplace. As a result, different people in the same country may have different home language (i.e., first language). For instance, as a nation composed of a highly diverse language population, Canada provides a unique opportunity to study the factors causing certain languages (or families of language) to be lost over subsequent generations among allophones (i.e., people whose mother tongue is neither English or French). In this paper, we focus on census data analytics. Specifically, we analyze census microdata by exploring machine learning and data mining techniques-such as decision tree induction, random forest, and categorical naive Bayes-to study the influence of various social and economic factors on the probability that allophones adopt official languages as their language spoken at home. This study is a showcase where natural sciences and engineering (NSE) meet social sciences, in which NSE solutions (e.g., census data analytics) are applicable for the study of social science related phenomena (e.g., successful detection of shifts in home languages).