Author: Remo Appolloni
Journal: Linguistica Silesiana (published 2024-07-09)
DOI: https://doi.org/10.24425/linsi.2024.150390
Opportunities And Threats Of Historical Evidence And Data Via Corpus-Based Software: A Case Study On Gerard Malynes
This paper aims to exploit the informative nature of datasets that can be created with corpus-based software in order to explore specific phenomena in early modern specialized discourse, and to support the adoption of such software for historical analysis. Particular attention is devoted to the special nature of historical evidence, which has raised critical issues for the reliability of data collected for the historical investigation of English. Spelling variation is one of the most crucial problems of Early Modern English in this respect, and it has often affected the reliability of data collected via software, especially when statistical findings are involved. The normalisation of historical texts has contributed enormously to making texts more readable for historical corpus analysis and, consequently, to improving the accuracy and manipulation of data. Moreover, several tools used in corpus linguistics, e.g. part-of-speech taggers for historical varieties, have likewise benefited from the normalisation of spelling variants. This case study attempts to explore the data retrievable from corpus-based software such as VARD, #LancsBox and CQPweb, and uses them to corroborate a preliminary analysis of early modern economics discourse in two treatises written by Gerard Malynes in 1601 and 1623.
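The effect of spelling variation on frequency data can be illustrated with a minimal Python sketch. It is not the procedure used in the paper and does not reproduce how VARD works: the variant mapping, the helper names `tokenize` and `normalise`, and the sample sentence are all invented for illustration, and the sentence is not a quotation from Malynes.

```python
from collections import Counter

# Hypothetical variant-to-standard mapping, invented for this illustration.
# A real normaliser relies on much larger dictionaries and on manual checks.
VARIANTS = {
    "monie": "money",
    "mony": "money",
    "exchaunge": "exchange",
    "marchants": "merchants",
}


def tokenize(text):
    """Split on whitespace, strip basic punctuation, and lowercase."""
    stripped = (word.strip(".,;:").lower() for word in text.split())
    return [word for word in stripped if word]


def normalise(tokens, variants=VARIANTS):
    """Replace each known spelling variant with its standard form."""
    return [variants.get(token, token) for token in tokens]


# Invented sample sentence (assumption), mixing variant and standard spellings.
sample = "The monie of the marchants and the mony of exchaunge; money and exchange."

raw = Counter(tokenize(sample))
norm = Counter(normalise(tokenize(sample)))

# Before normalisation the occurrences of one word are split across spellings;
# afterwards they are counted under a single form.
print("raw counts:       ", raw["money"], raw["monie"], raw["mony"])
print("normalised counts:", norm["money"], norm["monie"], norm["mony"])
```

In this toy example a search for "money" in the raw text finds one of three occurrences. That is the kind of undercounting that makes frequency-based findings unreliable when spelling variants are left unnormalised.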