2019 International Conference on Bangla Speech and Language Processing (ICBSLP)最新文献

筛选
英文 中文
Real-time Bangladeshi Currency Detection System for Visually Impaired Person 孟加拉视障人士即时货币侦测系统
2019 International Conference on Bangla Speech and Language Processing (ICBSLP) Pub Date : 2019-09-01 DOI: 10.1109/ICBSLP47725.2019.201518
Md Ferdousur Rahman Sarker, Md. Israfil Mahmud Raju, A. Marouf, Rubaiya Hafiz, S. A. Hossain, Munim Hossain Khandker Protik
{"title":"Real-time Bangladeshi Currency Detection System for Visually Impaired Person","authors":"Md Ferdousur Rahman Sarker, Md. Israfil Mahmud Raju, A. Marouf, Rubaiya Hafiz, S. A. Hossain, Munim Hossain Khandker Protik","doi":"10.1109/ICBSLP47725.2019.201518","DOIUrl":"https://doi.org/10.1109/ICBSLP47725.2019.201518","url":null,"abstract":"This paper presents a real-time Bangladeshi currency detection system for visually impaired persons. The proposed system exploits the image processing algorithms to facilitate the visually impaired people to prosperously recognize banknotes. The recent banknotes of Bangladesh have blind embossing or blind dots, which could be effective to recognize the value of the bill by touching. As the embossing fades away in the long-term used notes, detecting right value of the banknote using image processing algorithms could be considered as a challenging task. Particularly in Bangladesh, each banknote seems similar using the direct exertion of simplified image processing algorithms. In this paper, a recognition system was implemented that can detect Bangladeshi banknote in different viewpoints and scales. The detection system is also able to detect currency those are rumpled, decrepit or even worn. The detection system includes image preprocessing, image analysis and image recognition. To enhance the determination of currency recognition, the descriptor of an individual input scene is matched with various training images of the same category. After that, by analyzing their matching result it recognizes the currency with higher confidence. For real-time recognition, we have deployed the system into a mobile application.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124338854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Handwritten Bengali Number Detection using Region Proposal Network 基于区域建议网络的手写体孟加拉数字检测
2019 International Conference on Bangla Speech and Language Processing (ICBSLP) Pub Date : 2019-09-01 DOI: 10.1109/ICBSLP47725.2019.202049
Shaharat Tajrean, Mohammad Abu Yousuf
{"title":"Handwritten Bengali Number Detection using Region Proposal Network","authors":"Shaharat Tajrean, Mohammad Abu Yousuf","doi":"10.1109/ICBSLP47725.2019.202049","DOIUrl":"https://doi.org/10.1109/ICBSLP47725.2019.202049","url":null,"abstract":"As a seventh most spoken native language, Bengali needs a robust and accurate optical character recognition (OCR) especially for number detection. As there is no publicly available well-organized dataset for Bangla number OCR, a synthesized dataset was generated to fill the lack of available data. The recent advancement in artificial intelligence using deep neural networks easily outperforms prior hand selected feature-based machine learning approaches. As the region proposal networks (RPN) in deep neural networks perform very well in detecting objects, it can be used for digit detection in an image. So, in this work a very robust Bengali handwritten number detection system is presented where with the help of deep neural networks and a very well-organized, unbiased generated dataset we achieved state of the art result in handwritten Bangla number detection. This system beats any related prior works by a large margin while considering a real world dataset for benchmarks. The overall detection accuracy was 97.8%. The processing can be done real-time with about 35 images per second using a GPU. Also, while implementing the solution is completely based on python, the framework used for deep learning is Google’s Tensorflow and the dependencies, all of which are publicly available.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115914424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Automatic Detection of Satire in Bangla Documents: A CNN Approach Based on Hybrid Feature Extraction Model 孟加拉文讽刺文章自动检测:一种基于混合特征提取模型的CNN方法
2019 International Conference on Bangla Speech and Language Processing (ICBSLP) Pub Date : 2019-09-01 DOI: 10.1109/ICBSLP47725.2019.201517
Arnab Sen Sharma, Maruf Ahmed Mridul, Md. Saiful Islam
{"title":"Automatic Detection of Satire in Bangla Documents: A CNN Approach Based on Hybrid Feature Extraction Model","authors":"Arnab Sen Sharma, Maruf Ahmed Mridul, Md. Saiful Islam","doi":"10.1109/ICBSLP47725.2019.201517","DOIUrl":"https://doi.org/10.1109/ICBSLP47725.2019.201517","url":null,"abstract":"Wide spread of satirical news in online communities is an ongoing trend. The nature of satires are so inherently ambiguous that sometimes it’s too hard even for humans to understand whether it’s actually satire or not. So, research interest has grown in this field. The purpose of this research is to detect Bangla satirical news spread in online news portals as well as social media. In this paper we propose a hybrid technique for extracting feature from text documents combining Word2Vec and TF-IDF. Using our proposed feature extraction technique, with standard CNN architecture we could detect whether a Bangla text document is satire or not with an accuracy of more than 96%.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124885818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Intrinsic Evaluation of Bangla Word Embeddings 孟加拉语词嵌入的内在评价
2019 International Conference on Bangla Speech and Language Processing (ICBSLP) Pub Date : 2019-09-01 DOI: 10.1109/ICBSLP47725.2019.201506
Nafiz Sadman, Akib Sadmanee, Md. Iftekhar Tanveer, Md. Ashraful Amin, A. Ali
{"title":"Intrinsic Evaluation of Bangla Word Embeddings","authors":"Nafiz Sadman, Akib Sadmanee, Md. Iftekhar Tanveer, Md. Ashraful Amin, A. Ali","doi":"10.1109/ICBSLP47725.2019.201506","DOIUrl":"https://doi.org/10.1109/ICBSLP47725.2019.201506","url":null,"abstract":"Word embeddings are vector representations of word that allow machines to learn semantic and syntactic meanings by performing computations on them. Two wellknown embedding models are CBOW and Skipgram. Different methods proposed to evaluate the quality of embeddings are categorized into extrinsic and intrinsic evaluation methods. This paper focuses on intrinsic evaluation - the evaluation of the models on tasks, such as analogy prediction, semantic relatedness, synonym detection, antonym detection and concept categorization. We present intrinsic evaluations on Bangla word embedding created using CBOW and Skipgram models on a Bangla corpus that we built. These are trained on more than 700,000 articles consisting of more than 1.3 million unique words with different embedding dimension sizes, e.g., 300, 100, 64, and 32. We created the evaluation datasets for the abovementioned tasks and performed a comprehensive evaluation. We observe, word vectors of dimension 300, produced using Skipgram models, achieves accuracy of 51.33% for analogy prediction, a correlation of 0.62 for semantic relatedness, and accuracy of 53.85% and 9.56% for synonym and antonym detection 9.56%. Finally, for concept categorization the accuracy is 91.02%. The corpus and evaluation datasets are made publicly available for further research.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"19 10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129070756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Development of a Bangla Sense Annotated Corpus for Word Sense Disambiguation 孟加拉语语义标注语料库的开发与应用
2019 International Conference on Bangla Speech and Language Processing (ICBSLP) Pub Date : 2019-09-01 DOI: 10.1109/ICBSLP47725.2019.201516
Monisha Biswas, M. M. Hoque
{"title":"Development of a Bangla Sense Annotated Corpus for Word Sense Disambiguation","authors":"Monisha Biswas, M. M. Hoque","doi":"10.1109/ICBSLP47725.2019.201516","DOIUrl":"https://doi.org/10.1109/ICBSLP47725.2019.201516","url":null,"abstract":"Sense annotated corpus can be treated as an essential resource for lexicon development, morphological processing and also for evaluating the performance of a word sense disambiguation (WSD) system. In this paper, a Bangla sense annotated corpus is generated from a raw collection of Bangla text, where only the sentences which contain at least one Bangla ambiguous word are retrieved from the raw corpus. All individual word forms of the sentences stored in our Bangla sense annotated corpus are tagged with their corresponding root word forms and POS types and the detected ambiguous words in the sentences are also tagged with their actual senses. The developed Bangla sense annotated corpus initially contains 5028 Bangla sentences with proper annotation and the overall performance of our Bangla sense annotated corpus creation system is 86.95%. Index Terms – Bangla language processing, Sense annotated corpus, Lexicon, Word sense disambiguation, Ambiguous word.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130269199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
End to End Parts of Speech Tagging and Named Entity Recognition in Bangla Language 孟加拉语端到端词性标注与命名实体识别
2019 International Conference on Bangla Speech and Language Processing (ICBSLP) Pub Date : 2019-09-01 DOI: 10.1109/ICBSLP47725.2019.201541
Jillur Rahman Saurav, Summit Haque, Farida Chowdhury
{"title":"End to End Parts of Speech Tagging and Named Entity Recognition in Bangla Language","authors":"Jillur Rahman Saurav, Summit Haque, Farida Chowdhury","doi":"10.1109/ICBSLP47725.2019.201541","DOIUrl":"https://doi.org/10.1109/ICBSLP47725.2019.201541","url":null,"abstract":"Automatic Parts of Speech(POS) tagging is one of the most fundamental tasks for a language in Natural Language Processing(NLP), which acts as a feature for solving advanced NLP tasks. Named Entity Recognition(NER) is another essential task of NLP for information retrieval. Researchers could not find up to the mark solution yet on these two tasks for Bangla language compared to other languages, for instance, English, Ger-man. Moreover, many solutions heavily depend on handcrafted features that require strong linguistic expertise. As these two sequence labeling tasks are similar, In this work, two different datasets of POS tagging and NER were prepared, and different deep neural network approaches studied for solving these two tasks separately. All of the approaches were end to end and did not need any handcrafted feature like word suffixes or affixes, gazetteers, dictionary. This study came up with an end to end solution using deep neural network-based model consisting of Bi-directional Long short-term memory(BLSTM), Convolutional Neural Network(CNN) and Conditional Random Field(CRF). The proposed model trained on respected datasets achieved an accuracy of 93.86% on POS tagging and a strict f1 score of 0.6285 on NER on prepared datasets, respectively.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133244670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A model of diphone duration for speech synthesis in Bangla 孟加拉语语音合成的双音持续时间模型
2019 International Conference on Bangla Speech and Language Processing (ICBSLP) Pub Date : 2019-09-01 DOI: 10.1109/ICBSLP47725.2019.201469
K. Swarna, M. S. Rahman
{"title":"A model of diphone duration for speech synthesis in Bangla","authors":"K. Swarna, M. S. Rahman","doi":"10.1109/ICBSLP47725.2019.201469","DOIUrl":"https://doi.org/10.1109/ICBSLP47725.2019.201469","url":null,"abstract":"The aim of this paper is to provide an improved duration model for diphones and a diphone selection technique in Bangla text to speech system Subachan for better speech quality. Segment duration is one of the most important factors for the intonation of the generated speech in concatenative synthesis. For this reason, a good duration model is indispensable. To achieve this, we observed diphone durations in a recorded corpus of words, determined the diphone durations for different positions in the words, observed the signals of consonants in the corpus and categorized them accordingly. The results show that the proper durations produce notably better intonation in the generated speech which improves the naturalness and intelligibility of Subachan.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130396263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analyzing Sentiment of Movie Reviews in Bangla by Applying Machine Learning Techniques 应用机器学习技术分析孟加拉语电影评论的情感
2019 International Conference on Bangla Speech and Language Processing (ICBSLP) Pub Date : 2019-09-01 DOI: 10.1109/ICBSLP47725.2019.201483
Rumman Rashid Chowdhury, M. Shahadat Hossain, S. Hossain, Karl Andersson
{"title":"Analyzing Sentiment of Movie Reviews in Bangla by Applying Machine Learning Techniques","authors":"Rumman Rashid Chowdhury, M. Shahadat Hossain, S. Hossain, Karl Andersson","doi":"10.1109/ICBSLP47725.2019.201483","DOIUrl":"https://doi.org/10.1109/ICBSLP47725.2019.201483","url":null,"abstract":"This paper proposes a process of sentiment analysis of movie reviews written in Bangla language. This process can automate the analysis of audience’s reaction towards a specific movie or TV show. With more and more people expressing their opinions openly in the social networking sites, analyzing the sentiment of comments made about a specific movie can indicate how well the movie is being accepted by the general public. The dataset used in this experiment was collected and labeled manually from publicly available comments and posts from social media websites. Using Support Vector Machine algorithm, this model achieves 88.90% accuracy on the test set and by using Long Short Term Memory network [1] the model manages to achieve 82.42% accuracy. Furthermore, a comparison with some other machine learning approaches is presented in this paper.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122893876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Sentimental Style Transfer in Text with Multigenerative Variational Auto-Encoder 基于多生成变分自动编码器的文本情感风格转换
2019 International Conference on Bangla Speech and Language Processing (ICBSLP) Pub Date : 2019-09-01 DOI: 10.1109/ICBSLP47725.2019.201508
Md. Palash, P. Das, Summit Haque
{"title":"Sentimental Style Transfer in Text with Multigenerative Variational Auto-Encoder","authors":"Md. Palash, P. Das, Summit Haque","doi":"10.1109/ICBSLP47725.2019.201508","DOIUrl":"https://doi.org/10.1109/ICBSLP47725.2019.201508","url":null,"abstract":"Style transfer is an emerging trend in the fields of deep learning’s applications, especially in images and audio data this is proven very useful and sometimes the results are astonishing. Gradually styles of textual data are also being changed in many novel works. This paper focuses on the transfer of the sentimental vibe of a sentence. Given a positive clause, the negative version of that clause or sentence is generated keeping the context same. The opposite is also done with negative sentences. Previously this was a very tough job because the go-to techniques for such tasks such as Recurrent Neural Networks(RNNs) [1] and Long Short-Term Memories(LSTMs) [2] can’t perform well with it. But since newer technologies like Generative Adversarial Network(GAN) and Variational AutoEncoder(VAE) are emerging, this work seem to become more and more possible and effective. In this paper, Multi-Genarative Variational Auto-Encoder is employed to transfer sentiment values. Inspite of working with a small dataset, this model proves to be promising.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126297285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Neural vs Statistical Machine Translation: Revisiting the Bangla-English Language Pair 神经机器翻译与统计机器翻译:重新审视孟加拉语-英语语言对
2019 International Conference on Bangla Speech and Language Processing (ICBSLP) Pub Date : 2019-09-01 DOI: 10.1109/ICBSLP47725.2019.201502
Md. Arid Hasan, Firoj Alam, S. A. Chowdhury, Naira Khan
{"title":"Neural vs Statistical Machine Translation: Revisiting the Bangla-English Language Pair","authors":"Md. Arid Hasan, Firoj Alam, S. A. Chowdhury, Naira Khan","doi":"10.1109/ICBSLP47725.2019.201502","DOIUrl":"https://doi.org/10.1109/ICBSLP47725.2019.201502","url":null,"abstract":"Machine translation systems facilitate our communication and access to information, taking down language barriers. It is a well-researched area of Natural Language Processing (NLP), especially for resource-rich languages (e.g., language pairs in Europarl Parallel corpus). Besides these languages, there is also work on other language pairs including the Bangla-English language pair. In the current study, we aim to revisit both Statistical Machine Translation (SMT) and Neural Machine Translation (NMT) approaches using well-known, publicly available corpora for the Bangla-English (Bangla to English) language pair. We reported how the performance of the models differ based on the data and modeling techniques; consequently, we also compared the results obtained with Google’s machine translation system. Our findings, across different corpora, indicates that NMT based approaches outperform SMT systems. Our results also outperform existing baselines by a large margin.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123710565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信