A conversation with Dr. Edward Y. Chang

Edward Y. Chang
{"title":"A conversation with Dr. Edward Y. Chang","authors":"Edward Y. Chang","doi":"10.1145/2207243.2207256","DOIUrl":null,"url":null,"abstract":"1. Please share with us your view on the history and important milestones of the Chinese KDD research and application areas. Ample evidence shows that KDD has become a major topic of interest in both research and industry in China since 2006. In academia, professor Zhi-Hua Zhou at Nanjing University in 2006 chaired a National Machine Learning workshop, inviting researchers in the greater China area to share their experience. In 2009, the first Asian Conference on Machine learning was inaugurated in Nanjing. In industry, both Google and MSRA influenced China Internet leading companies such as Tencent, Baidu, Alibaba, and subsequently Renren and Shanda, to start their large-scale KDD operations. Three KDD engineers on my team were recruited to join Baidu knowledge, the primary KDD application of these Internet companies this far is monetization, improving their ad/offer relevance and hence revenue. Genome Institute (BGI) have made impressive progress in areas of computer vision, pattern recognition, and bio-genomics. Applications such as face, gesture, voice, handwriting, and license plate recognition have been widely deployed. In the bio-genomics area, a team at BGI reached a significant milestone in 2008 by sequencing the first Asian individual's diploid genome and published the result in Nature [1]. This sequencing effort took BGI one year to complete. Subsequently, speeding up genome sequencing has been among BGI's top R&D priorities. (One cannot imagine what one billion genomic sequences and their associated disease profiles can bring to advancing human health.) Researchers led by Ruiqiang Li from BGI and researchers from Google and universities at Canada and Hong Kong have met a couple of times to discuss large-scale data mining issues and solutions in hardware, algorithms, and data transportation. There is no doubt that KDD is thriving in China in several areas and its applications are rapidly growing, thanks to the increase of both data volume and demand for intelligent information analysis and trend prediction. 2. Please describe your expertise and contribution to KDD. In 2005, my team started working on developing parallel machine learning algorithms to mine large-scale datasets. My team were made publicly available through Apache foundation, and they have been downloaded more than 4,000 times. Several Google products also use these parallel algorithms. Prior to the large-scale machine learning work, my work with Simon Tong on using active learning to refine user query concepts published in 2001 [8] has been cited 850 times. Together with my works on …","PeriodicalId":90050,"journal":{"name":"SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining","volume":"30 1","pages":"73-74"},"PeriodicalIF":0.0000,"publicationDate":"2012-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2207243.2207256","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

1. Please share with us your view on the history and important milestones of the Chinese KDD research and application areas. Ample evidence shows that KDD has become a major topic of interest in both research and industry in China since 2006. In academia, professor Zhi-Hua Zhou at Nanjing University in 2006 chaired a National Machine Learning workshop, inviting researchers in the greater China area to share their experience. In 2009, the first Asian Conference on Machine learning was inaugurated in Nanjing. In industry, both Google and MSRA influenced China Internet leading companies such as Tencent, Baidu, Alibaba, and subsequently Renren and Shanda, to start their large-scale KDD operations. Three KDD engineers on my team were recruited to join Baidu knowledge, the primary KDD application of these Internet companies this far is monetization, improving their ad/offer relevance and hence revenue. Genome Institute (BGI) have made impressive progress in areas of computer vision, pattern recognition, and bio-genomics. Applications such as face, gesture, voice, handwriting, and license plate recognition have been widely deployed. In the bio-genomics area, a team at BGI reached a significant milestone in 2008 by sequencing the first Asian individual's diploid genome and published the result in Nature [1]. This sequencing effort took BGI one year to complete. Subsequently, speeding up genome sequencing has been among BGI's top R&D priorities. (One cannot imagine what one billion genomic sequences and their associated disease profiles can bring to advancing human health.) Researchers led by Ruiqiang Li from BGI and researchers from Google and universities at Canada and Hong Kong have met a couple of times to discuss large-scale data mining issues and solutions in hardware, algorithms, and data transportation. There is no doubt that KDD is thriving in China in several areas and its applications are rapidly growing, thanks to the increase of both data volume and demand for intelligent information analysis and trend prediction. 2. Please describe your expertise and contribution to KDD. In 2005, my team started working on developing parallel machine learning algorithms to mine large-scale datasets. My team were made publicly available through Apache foundation, and they have been downloaded more than 4,000 times. Several Google products also use these parallel algorithms. Prior to the large-scale machine learning work, my work with Simon Tong on using active learning to refine user query concepts published in 2001 [8] has been cited 850 times. Together with my works on …
与张德昌博士的对话
1. 请与我们分享您对中国KDD研究和应用领域的历史和重要里程碑的看法。大量证据表明,自2006年以来,KDD已成为中国研究和产业界感兴趣的主要话题。在学术界,南京大学的周志华教授于2006年主持了全国机器学习研讨会,邀请大中华地区的研究人员分享他们的经验。2009年,第一届亚洲机器学习会议在南京开幕。在工业领域,谷歌和MSRA都影响了中国互联网领先公司,如腾讯、百度、阿里巴巴,以及随后的人人和盛大,开始了大规模的KDD业务。我团队的三名KDD工程师被招募到百度知识,这些互联网公司目前主要的KDD应用是货币化,提高他们的广告/报价相关性,从而提高收入。华大基因研究所(BGI)在计算机视觉、模式识别和生物基因组学领域取得了令人瞩目的进展。人脸、手势、语音、手写和车牌识别等应用已得到广泛应用。在生物基因组学领域,华大基因的一个团队在2008年完成了第一个亚洲个体二倍体基因组测序,并将结果发表在《自然》杂志上,这是一个重要的里程碑[1]。这项测序工作花了华大基因一年的时间才完成。因此,加快基因组测序一直是华大基因的首要研发重点之一。(人们无法想象,10亿个基因组序列及其相关的疾病概况能为促进人类健康带来什么。)由华大基因李瑞强领导的研究人员和来自谷歌以及加拿大和香港大学的研究人员已经会面了几次,讨论了大规模数据挖掘问题以及硬件、算法和数据传输方面的解决方案。毫无疑问,由于数据量的增加以及对智能信息分析和趋势预测需求的增加,KDD在中国的一些领域正在蓬勃发展,其应用正在迅速增长。2. 请描述您的专业知识和对KDD的贡献。2005年,我的团队开始开发并行机器学习算法来挖掘大规模数据集。我的团队通过Apache基金会公开发布,它们已经被下载了4000多次。谷歌的一些产品也使用了这些并行算法。在大规模机器学习工作之前,我与Simon Tong在2001年发表的关于使用主动学习来改进用户查询概念的研究[8]被引用了850次。连同我的作品……
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信