Evaluating Expert Curation in a Baby Milestone Tracking App

Ayelet Ben-Sasson, Eli Ben-Sasson, Kayla Jacobs, Elisheva Rotman Argaman, Eden Saig

Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems
DOI: 10.1145/3290605.3300783
Published: 2019-05-02
Citations: 5
Abstract
Early childhood developmental screening is critical for timely detection and intervention. babyTRACKS (formerly Baby CROINC: CROwd INtelligence Curation) is a free, live, interactive developmental tracking mobile app with over 3,000 children's diaries. Parents write or select short milestone texts, like "began taking first steps," to record their babies' developmental achievements, and receive crowd-based percentiles to evaluate development and catch potential delays. Currently, an expert-based Curated Crowd Intelligence (CCI) process manually groups each incoming novel parent-authored milestone text according to its similarity to existing milestones in the database (for example, starting to walk), or determines that the milestone represents a new developmental concept not seen before in another child's diary. CCI cannot scale well, however, and babyTRACKS is mature enough, with a rich enough database of existing milestone texts, to now consider machine learning tools to replace or assist the human curators. Three new studies explore (1) the usefulness of automation, by analyzing the human cost of CCI and how the work is currently broken down; (2) the validity of automation, by testing the inter-rater reliability of curators; and (3) the value of automation, by appraising the "real world" clinical value of milestones when assessing child development. We conclude that automation can indeed be appropriate and helpful for a large percentage, though not all, of CCI work. We further establish realistic upper bounds for algorithm performance; confirm that the babyTRACKS milestones dataset is valid for training and testing purposes; and verify that it represents clinically meaningful developmental information.
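The core curation decision described above — match a new milestone text to an existing one, or declare a new concept — can be sketched as a thresholded text-similarity lookup. The sketch below is purely illustrative, not the paper's method: the `curate` function, the token-overlap (Jaccard) similarity, and the threshold value are all assumptions chosen for a minimal self-contained example.

```python
# Hypothetical sketch of automated milestone curation: match a new
# parent-authored milestone text against existing canonical milestones
# by token-overlap similarity, or flag it as a new concept (None).
# Function names and the threshold are illustrative assumptions only.

def tokens(text):
    """Lowercase word set for a milestone text."""
    return set(text.lower().replace(",", " ").replace(".", " ").split())

def jaccard(a, b):
    """Jaccard similarity between two token sets."""
    return len(a & b) / len(a | b) if a | b else 0.0

def curate(new_text, existing, threshold=0.3):
    """Return the best-matching existing milestone, or None (new concept)."""
    new_toks = tokens(new_text)
    best, best_score = None, 0.0
    for milestone in existing:
        score = jaccard(new_toks, tokens(milestone))
        if score > best_score:
            best, best_score = milestone, score
    return best if best_score >= threshold else None

existing = ["began taking first steps", "said first word", "rolled over"]
print(curate("took her first steps", existing))    # grouped with the walking milestone
print(curate("started clapping hands", existing))  # None: a new developmental concept
```

A production system would use richer representations (e.g. TF-IDF or learned sentence embeddings) than raw token overlap, but the decision structure — nearest existing milestone versus "new concept" fallback — is the part the human curators currently perform by hand.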