{"title":"利用纳米孔数据进行信号处理的噬菌体DNA无参考鉴定","authors":"K. Kupkova, K. Sedlář, I. Provazník","doi":"10.1109/BIBE.2017.00-71","DOIUrl":null,"url":null,"abstract":"Nanopore sequencing has become an invaluable aid in small sequencing projects. Thanks to its compact size, the Oxford Nanopore MinION platform is often used in crisis situations, such as outbreaks of microbial infections, to determine the causes of the problem. As a platform that produces data in real-time, it requires bioinformatics techniques designed for fast data processing. In this paper, we demonstrate the possibility of the direct processing of nanopore current signals, the so-called squiggles, for fast reference-free identification of phage DNA. The proposed technique is based on the computation of Hjorth parameters and is suitable for fast visualization of the data, as well as for proper classification by many machine learning algorithms. The classification of the data also raises the possibility of applying adapted base calling algorithms for both groups separately, as phage and host DNA have different features.","PeriodicalId":262603,"journal":{"name":"2017 IEEE 17th International Conference on Bioinformatics and Bioengineering (BIBE)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Reference-free Identification of Phage DNA Using Signal Processing on Nanopore Data\",\"authors\":\"K. Kupkova, K. Sedlář, I. Provazník\",\"doi\":\"10.1109/BIBE.2017.00-71\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nanopore sequencing has become an invaluable aid in small sequencing projects. Thanks to its compact size, the Oxford Nanopore MinION platform is often used in crisis situations, such as outbreaks of microbial infections, to determine the causes of the problem. As a platform that produces data in real-time, it requires bioinformatics techniques designed for fast data processing. In this paper, we demonstrate the possibility of the direct processing of nanopore current signals, the so-called squiggles, for fast reference-free identification of phage DNA. The proposed technique is based on the computation of Hjorth parameters and is suitable for fast visualization of the data, as well as for proper classification by many machine learning algorithms. The classification of the data also raises the possibility of applying adapted base calling algorithms for both groups separately, as phage and host DNA have different features.\",\"PeriodicalId\":262603,\"journal\":{\"name\":\"2017 IEEE 17th International Conference on Bioinformatics and Bioengineering (BIBE)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 17th International Conference on Bioinformatics and Bioengineering (BIBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBE.2017.00-71\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 17th International Conference on Bioinformatics and Bioengineering (BIBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2017.00-71","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reference-free Identification of Phage DNA Using Signal Processing on Nanopore Data
Nanopore sequencing has become an invaluable aid in small sequencing projects. Thanks to its compact size, the Oxford Nanopore MinION platform is often used in crisis situations, such as outbreaks of microbial infections, to determine the causes of the problem. As a platform that produces data in real-time, it requires bioinformatics techniques designed for fast data processing. In this paper, we demonstrate the possibility of the direct processing of nanopore current signals, the so-called squiggles, for fast reference-free identification of phage DNA. The proposed technique is based on the computation of Hjorth parameters and is suitable for fast visualization of the data, as well as for proper classification by many machine learning algorithms. The classification of the data also raises the possibility of applying adapted base calling algorithms for both groups separately, as phage and host DNA have different features.