Melaku Alelign, Tesfamariam M Abuhay, Adane Letta, Tizita Dereje
{"title":"使用监督机器学习技术识别风险因素和预测食品安全状况","authors":"Melaku Alelign, Tesfamariam M Abuhay, Adane Letta, Tizita Dereje","doi":"10.1109/ict4da53266.2021.9672241","DOIUrl":null,"url":null,"abstract":"In 2018, more than 821 million undernourished people were registered all over the world. Of these, 239 million were in Sub-Saharan Africa. The numbers are particularly high in Ethiopia, Kenya, Somalia, and South Sudan. The determinant factors of food insecurity in Ethiopia are multidimensional encompassing climate change, civil conflicts, natural disasters, and social norms. This study, hence, aims to identify risk factors and predict food security status at household level in North West Ethiopia using supervised machine learning techniques. To this end, a dataset was gathered from the Dabat Health and Demographic Surveillance and statistically interesting risk factors were identified using logistics regression at a threshold level of p<0.05. Three experiments were also conducted using random forest, support vector machine and decision tree (C4.5) to predict food security status at household level and the performance of each model was evaluated using accuracy, precision, recall, and f1- measure. As a result, the C4.5 algorithm is selected as the best appropriate supervised machine learning algorithm with 97.23% of recall, 91.58% of accuracy, 80.97% of f1-measure, and 69.38% of precision. Family size, level of education, age of the household head, number and types of communication media, numbers of livestock, cultivated land size, access to credit, and access to irrigation are some of the risk factors of food security.","PeriodicalId":371663,"journal":{"name":"2021 International Conference on Information and Communication Technology for Development for Africa (ICT4DA)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Identifying Risk Factors and Predicting Food Security Status using Supervised Machine Learning Techniques\",\"authors\":\"Melaku Alelign, Tesfamariam M Abuhay, Adane Letta, Tizita Dereje\",\"doi\":\"10.1109/ict4da53266.2021.9672241\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In 2018, more than 821 million undernourished people were registered all over the world. Of these, 239 million were in Sub-Saharan Africa. The numbers are particularly high in Ethiopia, Kenya, Somalia, and South Sudan. The determinant factors of food insecurity in Ethiopia are multidimensional encompassing climate change, civil conflicts, natural disasters, and social norms. This study, hence, aims to identify risk factors and predict food security status at household level in North West Ethiopia using supervised machine learning techniques. To this end, a dataset was gathered from the Dabat Health and Demographic Surveillance and statistically interesting risk factors were identified using logistics regression at a threshold level of p<0.05. Three experiments were also conducted using random forest, support vector machine and decision tree (C4.5) to predict food security status at household level and the performance of each model was evaluated using accuracy, precision, recall, and f1- measure. As a result, the C4.5 algorithm is selected as the best appropriate supervised machine learning algorithm with 97.23% of recall, 91.58% of accuracy, 80.97% of f1-measure, and 69.38% of precision. Family size, level of education, age of the household head, number and types of communication media, numbers of livestock, cultivated land size, access to credit, and access to irrigation are some of the risk factors of food security.\",\"PeriodicalId\":371663,\"journal\":{\"name\":\"2021 International Conference on Information and Communication Technology for Development for Africa (ICT4DA)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Information and Communication Technology for Development for Africa (ICT4DA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ict4da53266.2021.9672241\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Information and Communication Technology for Development for Africa (ICT4DA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ict4da53266.2021.9672241","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Identifying Risk Factors and Predicting Food Security Status using Supervised Machine Learning Techniques
In 2018, more than 821 million undernourished people were registered all over the world. Of these, 239 million were in Sub-Saharan Africa. The numbers are particularly high in Ethiopia, Kenya, Somalia, and South Sudan. The determinant factors of food insecurity in Ethiopia are multidimensional encompassing climate change, civil conflicts, natural disasters, and social norms. This study, hence, aims to identify risk factors and predict food security status at household level in North West Ethiopia using supervised machine learning techniques. To this end, a dataset was gathered from the Dabat Health and Demographic Surveillance and statistically interesting risk factors were identified using logistics regression at a threshold level of p<0.05. Three experiments were also conducted using random forest, support vector machine and decision tree (C4.5) to predict food security status at household level and the performance of each model was evaluated using accuracy, precision, recall, and f1- measure. As a result, the C4.5 algorithm is selected as the best appropriate supervised machine learning algorithm with 97.23% of recall, 91.58% of accuracy, 80.97% of f1-measure, and 69.38% of precision. Family size, level of education, age of the household head, number and types of communication media, numbers of livestock, cultivated land size, access to credit, and access to irrigation are some of the risk factors of food security.