{"title":"无袖长衫智人","authors":"Chaim Ash, Amelia Hans","doi":"arxiv-2310.08323","DOIUrl":null,"url":null,"abstract":"This paper proposes a new method of natural language acquisition for robots\nthat does not require the conversion of speech to text. Folks'Talks employs\nvoice2voice technology that enables a robot to understand the meaning of what\nit is told and to have the ability to learn and understand new languages -\ninclusive of accent, dialect, and physiological differences. To do this, sound\nprocessing and computer vision are incorporated to give the robot a sense of\nspatiotemporal causality. The \"language model\" we are proposing equips a robot\nto imitate a natural speaker's conversational behavior by thinking contextually\nand articulating its surroundings.","PeriodicalId":501310,"journal":{"name":"arXiv - CS - Other Computer Science","volume":"20 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robo Sapiens\",\"authors\":\"Chaim Ash, Amelia Hans\",\"doi\":\"arxiv-2310.08323\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a new method of natural language acquisition for robots\\nthat does not require the conversion of speech to text. Folks'Talks employs\\nvoice2voice technology that enables a robot to understand the meaning of what\\nit is told and to have the ability to learn and understand new languages -\\ninclusive of accent, dialect, and physiological differences. To do this, sound\\nprocessing and computer vision are incorporated to give the robot a sense of\\nspatiotemporal causality. The \\\"language model\\\" we are proposing equips a robot\\nto imitate a natural speaker's conversational behavior by thinking contextually\\nand articulating its surroundings.\",\"PeriodicalId\":501310,\"journal\":{\"name\":\"arXiv - CS - Other Computer Science\",\"volume\":\"20 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Other Computer Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2310.08323\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Other Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2310.08323","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This paper proposes a new method of natural language acquisition for robots
that does not require the conversion of speech to text. Folks'Talks employs
voice2voice technology that enables a robot to understand the meaning of what
it is told and to have the ability to learn and understand new languages -
inclusive of accent, dialect, and physiological differences. To do this, sound
processing and computer vision are incorporated to give the robot a sense of
spatiotemporal causality. The "language model" we are proposing equips a robot
to imitate a natural speaker's conversational behavior by thinking contextually
and articulating its surroundings.