检测 1606 名学龄前儿童简单贴纸艺术性别差异的机器学习试验。

IF 1 Q3 PEDIATRICS

Minerva Pediatrics Pub Date : 2024-06-01 DOI:10.23736/S2724-5276.21.06067-5

Keiko Matsubara, Yuko Ohgami, Koji Okamura, Saki Aoto, Maki Fukami, Yukiko Shimada

{"title":"检测 1606 名学龄前儿童简单贴纸艺术性别差异的机器学习试验。","authors":"Keiko Matsubara, Yuko Ohgami, Koji Okamura, Saki Aoto, Maki Fukami, Yukiko Shimada","doi":"10.23736/S2724-5276.21.06067-5","DOIUrl":null,"url":null,"abstract":"Background: Previous studies suggested that drawings made by preschool boys and girls show distinguishable differences. However, children's drawings on their own are too complexly determined and inherently ambiguous to be a reliable indicator. In the present study, we attempted to develop a machine learning algorithm for classification of sex of the subjects using children's artworks.Methods: We studied three types of simple sticker artworks from 1606 Japanese preschool children aged 51-83 months (803 boys and 803 girls). Those artworks were processed into digitalized data. Simulated data based on the original data were also generated. Logistic regression approach was applied to each dataset to make a classifier, and run on each dataset in a stratified ten-fold cross-validation with hyperparameter tuning. A probability score was calculated in each sample and utilized for sex classification. Prediction performance was evaluated using accuracy, recall, and precision scores, as well as learning curves.Results: Two models created from the original and simulated data showed comparably low metrics. The distributions of probability scores in the samples from boys and girls mostly overlapped and were indistinguishable. Learning curves of the models showed an extremely under-fitted pattern.Conclusions: Our machine learning algorithm was unable to distinguish simple sticker arts created by boys and girls. More complex tasks will enable to develop an accurate classifier.","PeriodicalId":56337,"journal":{"name":"Minerva Pediatrics","volume":"76 3","pages":"343-349"},"PeriodicalIF":1.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning trial to detect sex differences in simple sticker arts of 1606 preschool children.\",\"authors\":\"Keiko Matsubara, Yuko Ohgami, Koji Okamura, Saki Aoto, Maki Fukami, Yukiko Shimada\",\"doi\":\"10.23736/S2724-5276.21.06067-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Previous studies suggested that drawings made by preschool boys and girls show distinguishable differences. However, children's drawings on their own are too complexly determined and inherently ambiguous to be a reliable indicator. In the present study, we attempted to develop a machine learning algorithm for classification of sex of the subjects using children's artworks.Methods: We studied three types of simple sticker artworks from 1606 Japanese preschool children aged 51-83 months (803 boys and 803 girls). Those artworks were processed into digitalized data. Simulated data based on the original data were also generated. Logistic regression approach was applied to each dataset to make a classifier, and run on each dataset in a stratified ten-fold cross-validation with hyperparameter tuning. A probability score was calculated in each sample and utilized for sex classification. Prediction performance was evaluated using accuracy, recall, and precision scores, as well as learning curves.Results: Two models created from the original and simulated data showed comparably low metrics. The distributions of probability scores in the samples from boys and girls mostly overlapped and were indistinguishable. Learning curves of the models showed an extremely under-fitted pattern.Conclusions: Our machine learning algorithm was unable to distinguish simple sticker arts created by boys and girls. More complex tasks will enable to develop an accurate classifier.\",\"PeriodicalId\":56337,\"journal\":{\"name\":\"Minerva Pediatrics\",\"volume\":\"76 3\",\"pages\":\"343-349\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Minerva Pediatrics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23736/S2724-5276.21.06067-5\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"PEDIATRICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Minerva Pediatrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23736/S2724-5276.21.06067-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PEDIATRICS","Score":null,"Total":0}

引用次数: 0

摘要

研究背景以往的研究表明，学龄前男孩和女孩的绘画表现出明显的差异。然而，儿童绘画本身的决定因素过于复杂，而且本身具有模糊性，因此不能作为可靠的指标。在本研究中，我们尝试开发一种机器学习算法，利用儿童的美术作品对受试者的性别进行分类：我们研究了来自 1606 名 51-83 个月大的日本学龄前儿童（803 名男孩和 803 名女孩）的三种类型的简单贴纸作品。这些作品被处理成数字化数据。此外，还根据原始数据生成了模拟数据。对每个数据集采用逻辑回归方法制作分类器，并对每个数据集进行分层十倍交叉验证和超参数调整。在每个样本中计算出概率分数，并用于性别分类。使用准确率、召回率和精确度分数以及学习曲线对预测性能进行评估：结果：根据原始数据和模拟数据创建的两个模型显示出相当低的指标。男孩和女孩样本的概率分数分布大多重叠，无法区分。模型的学习曲线显示出一种拟合度极低的模式：我们的机器学习算法无法区分男生和女生创作的简单贴纸艺术。结论：我们的机器学习算法无法区分男生和女生创作的简单贴纸艺术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Machine learning trial to detect sex differences in simple sticker arts of 1606 preschool children.

Background: Previous studies suggested that drawings made by preschool boys and girls show distinguishable differences. However, children's drawings on their own are too complexly determined and inherently ambiguous to be a reliable indicator. In the present study, we attempted to develop a machine learning algorithm for classification of sex of the subjects using children's artworks.

Methods: We studied three types of simple sticker artworks from 1606 Japanese preschool children aged 51-83 months (803 boys and 803 girls). Those artworks were processed into digitalized data. Simulated data based on the original data were also generated. Logistic regression approach was applied to each dataset to make a classifier, and run on each dataset in a stratified ten-fold cross-validation with hyperparameter tuning. A probability score was calculated in each sample and utilized for sex classification. Prediction performance was evaluated using accuracy, recall, and precision scores, as well as learning curves.

Results: Two models created from the original and simulated data showed comparably low metrics. The distributions of probability scores in the samples from boys and girls mostly overlapped and were indistinguishable. Learning curves of the models showed an extremely under-fitted pattern.

Conclusions: Our machine learning algorithm was unable to distinguish simple sticker arts created by boys and girls. More complex tasks will enable to develop an accurate classifier.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Minerva Pediatrics

CiteScore

2.50

自引率

0.00%

发文量

294