Atsushi Hashimoto, Shinsuke Mori, Tetsuro Sasada, M. Minoh, Yoko Yamakata
{"title":"KUSK dataset: toward a direct understanding of recipe text and human cooking activity","authors":"Atsushi Hashimoto, Shinsuke Mori, Tetsuro Sasada, M. Minoh, Yoko Yamakata","doi":"10.1145/2638728.2641338","DOIUrl":null,"url":null,"abstract":"In this paper, we provide a multimodal dataset for understanding cooking activities. To build the dataset, we instructed the subjects to perform cooking according to instructional texts shown on a display one by one. The instructional texts were generated from flow graphs, which were automatically extracted from recipes sampled from a Web site. The main identity of this dataset is the correspondence between the steps automatically extracted from recipes, and real human activities. Typical uses of our dataset are to construct classifiers for understanding human activities in the kitchen, text generation through observing the activities, and so on.","PeriodicalId":20496,"journal":{"name":"Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct Publication","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct Publication","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2638728.2641338","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19
Abstract
In this paper, we provide a multimodal dataset for understanding cooking activities. To build the dataset, we instructed the subjects to perform cooking according to instructional texts shown on a display one by one. The instructional texts were generated from flow graphs, which were automatically extracted from recipes sampled from a Web site. The main identity of this dataset is the correspondence between the steps automatically extracted from recipes, and real human activities. Typical uses of our dataset are to construct classifiers for understanding human activities in the kitchen, text generation through observing the activities, and so on.