International Conference on Natural Language and Speech Processing最新文献

Supervised Acoustic Embeddings And Their Transferability Across Languages 监督声嵌入及其跨语言可移植性

International Conference on Natural Language and Speech Processing Pub Date : 2023-01-03 DOI: 10.48550/arXiv.2301.01020

Sreepratha Ram, Hanan Aldarmaki

引用次数: 2

Arguments to Key Points Mapping with Prompt-based Learning 论点到关键点的映射与基于提示的学习

International Conference on Natural Language and Speech Processing Pub Date : 2022-11-28 DOI: 10.48550/arXiv.2211.14995

Ahnaf Mozib Samin, Behrooz Nikandish, Jingyan Chen

引用次数: 0

International Conference on Natural Language and Speech Processing Pub Date : 2022-11-20 DOI: 10.48550/arXiv.2211.11057

Phillip Schneider, Markus Voggenreiter, Abdullah Gulraiz, F. Matthes

{"title":"Semantic Similarity-Based Clustering of Findings From Security Testing Tools","authors":"Phillip Schneider, Markus Voggenreiter, Abdullah Gulraiz, F. Matthes","doi":"10.48550/arXiv.2211.11057","DOIUrl":"https://doi.org/10.48550/arXiv.2211.11057","url":null,"abstract":"Over the last years, software development in domains with high security demands transitioned from traditional methodologies to uniting modern approaches from software development and operations (DevOps). Key principles of DevOps gained more importance and are now applied to security aspects of software development, resulting in the automation of security-enhancing activities. In particular, it is common practice to use automated security testing tools that generate reports after inspecting a software artifact from multiple perspectives. However, this raises the challenge of generating duplicate security findings. To identify these duplicate findings manually, a security expert has to invest resources like time, effort, and knowledge. A partial automation of this process could reduce the analysis effort, encourage DevOps principles, and diminish the chance of human error. In this study, we investigated the potential of applying Natural Language Processing for clustering semantically similar security findings to support the identification of problem-specific duplicate findings. Towards this goal, we developed a web application for annotating and assessing security testing tool reports and published a human-annotated corpus of clustered security findings. In addition, we performed a comparison of different semantic similarity techniques for automatically grouping security findings. Finally, we assess the resulting clusters using both quantitative and qualitative evaluation methods.","PeriodicalId":405017,"journal":{"name":"International Conference on Natural Language and Speech Processing","volume":"180 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121882114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Scaling Native Language Identification with Transformer Adapters 使用变压器适配器扩展本地语言标识

International Conference on Natural Language and Speech Processing Pub Date : 2022-11-18 DOI: 10.48550/arXiv.2211.10117

Ahmet Uluslu, G. Schneider

引用次数: 2

Efficient Task-Oriented Dialogue Systems with Response Selection as an Auxiliary Task 以响应选择为辅助任务的高效任务导向对话系统

International Conference on Natural Language and Speech Processing Pub Date : 2022-08-15 DOI: 10.48550/arXiv.2208.07097

Radostin Cholakov, T. Kolev

引用次数: 3

Constructing the Corpus of Chinese Textual ‘Run-on’ Sentences (CCTRS): Discourse Corpus Benchmark with Multi-layer Annotations 构建汉语文本“连续”句语料库:基于多层注释的篇章语料库基准

International Conference on Natural Language and Speech Processing Pub Date : 2021-11-12 DOI: 10.31234/osf.io/jua9g

Kun Sun, Rong Wang

{"title":"Constructing the Corpus of Chinese Textual ‘Run-on’ Sentences (CCTRS): Discourse Corpus Benchmark with Multi-layer Annotations","authors":"Kun Sun, Rong Wang","doi":"10.31234/osf.io/jua9g","DOIUrl":"https://doi.org/10.31234/osf.io/jua9g","url":null,"abstract":"Chinese is a discourse-oriented language. “Run-on” sentences (liushui ju) are a typical and prevalent form of discourse in Chinese. These sentences show the capacity of the Chinese language for organizing loose structures into an effective and coherent discourse. Despite their widespread use in Chinese, previous studies have only explored “run-on” sentences by using small-scale examples. In order to carry out a quantitative investigation of “run-on” sentences, we need to establish a corpus. The present study selects 500 “run-on” sentences and annotates them on the levels of discourse, syntax and semantics. We mainly adopt PDTB (Penn Discourse Treebank) styles in the discourse annotations but we also borrow some features from RST (rhetorical structure theory). We find that the distribution of the frequency of discourse relations in the data extracted from this corpus follows the power law. The preliminary results reveal that semantic leaps in “run-on” sentences are closely related to the use of the topic chain and the animacy and the span of discourse relations. This corpus can thus aid in carrying out further computational and cognitive studies of Chinese discourse.","PeriodicalId":405017,"journal":{"name":"International Conference on Natural Language and Speech Processing","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123520634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Evaluation of topic segmentation algorithms on Arabic texts 阿拉伯语文本主题分割算法的评价

International Conference on Natural Language and Speech Processing Pub Date : 1900-01-01 DOI: 10.1109/ICNLSP.2018.8374389

Fayçal Nouar, H. Belhadef

引用次数: 1