Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics最新文献
{"title":"Building Large Resources for Text Mining: The Leipzig Corpora Collection","authors":"U. Quasthoff, Dirk Goldhahn, Thomas Eckart","doi":"10.1007/978-3-319-12655-5_1","DOIUrl":"https://doi.org/10.1007/978-3-319-12655-5_1","url":null,"abstract":"","PeriodicalId":91598,"journal":{"name":"Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics","volume":"29 1","pages":"3-24"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86797492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Learning Textologies: Networks of Linked Word Clusters","authors":"Hristo Tanev","doi":"10.1007/978-3-319-12655-5_2","DOIUrl":"https://doi.org/10.1007/978-3-319-12655-5_2","url":null,"abstract":"","PeriodicalId":91598,"journal":{"name":"Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics","volume":"39 1","pages":"25-40"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86238869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Natural Language Processing Supporting Interoperability in Healthcare","authors":"F. Oemig, B. Blobel","doi":"10.1007/978-3-319-12655-5_7","DOIUrl":"https://doi.org/10.1007/978-3-319-12655-5_7","url":null,"abstract":"","PeriodicalId":91598,"journal":{"name":"Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics","volume":"51 1","pages":"137-156"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75636911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
P. Oesterling, Christian Heine, G. Weber, G. Scheuermann
{"title":"A Topology-Based Approach to Visualize the Thematic Composition of Document Collections","authors":"P. Oesterling, Christian Heine, G. Weber, G. Scheuermann","doi":"10.1007/978-3-319-12655-5_4","DOIUrl":"https://doi.org/10.1007/978-3-319-12655-5_4","url":null,"abstract":"","PeriodicalId":91598,"journal":{"name":"Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics","volume":"41 1","pages":"63-85"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77476802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Simple, Fast and Accurate Taxonomy Learning","authors":"Zornitsa Kozareva","doi":"10.1007/978-3-319-12655-5_3","DOIUrl":"https://doi.org/10.1007/978-3-319-12655-5_3","url":null,"abstract":"","PeriodicalId":91598,"journal":{"name":"Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics","volume":"28 1 1","pages":"41-62"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82839038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Marco Büchler, Philip R. Burns, Martin Müller, E. Franzini, G. Franzini
{"title":"Towards a Historical Text Re-use Detection","authors":"Marco Büchler, Philip R. Burns, Martin Müller, E. Franzini, G. Franzini","doi":"10.1007/978-3-319-12655-5_11","DOIUrl":"https://doi.org/10.1007/978-3-319-12655-5_11","url":null,"abstract":"","PeriodicalId":91598,"journal":{"name":"Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics","volume":"47 1","pages":"221-238"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78233877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Verónica Pérez-Rosas, Cristian-Sorin Bologa, Mihai Burzo, Rada Mihalcea
{"title":"Deception Detection Within and Across Cultures","authors":"Verónica Pérez-Rosas, Cristian-Sorin Bologa, Mihai Burzo, Rada Mihalcea","doi":"10.1007/978-3-319-12655-5_8","DOIUrl":"https://doi.org/10.1007/978-3-319-12655-5_8","url":null,"abstract":"","PeriodicalId":91598,"journal":{"name":"Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics","volume":"23 1","pages":"157-175"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73062944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sentiment Analysis: What's Your Opinion?","authors":"J. Sonntag, Manfred Stede","doi":"10.1007/978-3-319-12655-5_9","DOIUrl":"https://doi.org/10.1007/978-3-319-12655-5_9","url":null,"abstract":"","PeriodicalId":91598,"journal":{"name":"Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics","volume":"42 6 1","pages":"177-199"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85022353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-perspective Event Detection in Texts Documenting the 1944 Battle of Arnhem","authors":"M. Düring, Antal van den Bosch","doi":"10.1007/978-3-319-12655-5_10","DOIUrl":"https://doi.org/10.1007/978-3-319-12655-5_10","url":null,"abstract":"","PeriodicalId":91598,"journal":{"name":"Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics","volume":"75 1","pages":"201-219"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83819409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kristina Doing-Harris, Olga Patterson, Sean Igo, John Hurdle
{"title":"Document Sublanguage Clustering to Detect Medical Specialty in Cross-institutional Clinical Texts.","authors":"Kristina Doing-Harris, Olga Patterson, Sean Igo, John Hurdle","doi":"10.1145/2512089.2512101","DOIUrl":"https://doi.org/10.1145/2512089.2512101","url":null,"abstract":"<p><p>This paper reports on a set of studies designed to identify sublanguages in documents for domain-specific processing across institutions. Psychological evidence indicates that humans use context-specific linguistic information when they read. Natural Language Processing (NLP) pipelines are successful within specific domains (i.e., contexts). To limit the number of domain-specific NLP systems, a natural focus would be on sublanguages. Sublanguages are identified by shared lexical and semantic features.[1] Patterson and Hurdle[2] developed a sublanguage identification system that functioned well for 12 clinical specialties at the University of Utah. The current work compares sublanguages across institutions. Using a clinical NLP pipeline augmented by a new document corpus from the University of Pittsburg (UPitt), new documents were assigned to clusters based on the minimum cosine-distance to a Utah cluster centroid. The UPitt documents were divided into a nine-group specialty corpus. Across institutions, five of the specialty groups fell within the expected clusters. We find that clustering encounters difficulty due to documents with mixed sublanguages; naming convention differences across institutions; and document types used across specialties. The findings indicate that clinical specialty sublanguages can be identified across institutions.</p>","PeriodicalId":91598,"journal":{"name":"Proceedings of the ACM ... International Workshop on Data and Text Mining in Biomedical Informatics. ACM International Workshop on Data and Text Mining in Biomedical Informatics","volume":"2013 ","pages":"9-12"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/2512089.2512101","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"34459915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}