Atsushi Hashimoto, Shinsuke Mori, Tetsuro Sasada, M. Minoh, Yoko Yamakata
{"title":"KUSK数据集:直接理解食谱文本和人类烹饪活动","authors":"Atsushi Hashimoto, Shinsuke Mori, Tetsuro Sasada, M. Minoh, Yoko Yamakata","doi":"10.1145/2638728.2641338","DOIUrl":null,"url":null,"abstract":"In this paper, we provide a multimodal dataset for understanding cooking activities. To build the dataset, we instructed the subjects to perform cooking according to instructional texts shown on a display one by one. The instructional texts were generated from flow graphs, which were automatically extracted from recipes sampled from a Web site. The main identity of this dataset is the correspondence between the steps automatically extracted from recipes, and real human activities. Typical uses of our dataset are to construct classifiers for understanding human activities in the kitchen, text generation through observing the activities, and so on.","PeriodicalId":20496,"journal":{"name":"Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct Publication","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"KUSK dataset: toward a direct understanding of recipe text and human cooking activity\",\"authors\":\"Atsushi Hashimoto, Shinsuke Mori, Tetsuro Sasada, M. Minoh, Yoko Yamakata\",\"doi\":\"10.1145/2638728.2641338\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we provide a multimodal dataset for understanding cooking activities. To build the dataset, we instructed the subjects to perform cooking according to instructional texts shown on a display one by one. The instructional texts were generated from flow graphs, which were automatically extracted from recipes sampled from a Web site. The main identity of this dataset is the correspondence between the steps automatically extracted from recipes, and real human activities. Typical uses of our dataset are to construct classifiers for understanding human activities in the kitchen, text generation through observing the activities, and so on.\",\"PeriodicalId\":20496,\"journal\":{\"name\":\"Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct Publication\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct Publication\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2638728.2641338\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct Publication","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2638728.2641338","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
KUSK dataset: toward a direct understanding of recipe text and human cooking activity
In this paper, we provide a multimodal dataset for understanding cooking activities. To build the dataset, we instructed the subjects to perform cooking according to instructional texts shown on a display one by one. The instructional texts were generated from flow graphs, which were automatically extracted from recipes sampled from a Web site. The main identity of this dataset is the correspondence between the steps automatically extracted from recipes, and real human activities. Typical uses of our dataset are to construct classifiers for understanding human activities in the kitchen, text generation through observing the activities, and so on.