A Method of Accelerating LDA Program with GPU

2012 Third International Conference on Networking and Distributed Computing Pub Date : 2012-10-21 DOI:10.1109/ICNDC.2012.14

Yanjun Jiang, Hualong Wen, Zhanchun Gao

引用次数: 1

Abstract

LDA (Latent Dirichlet Allocation) is a text modeling algorithm based on a generative probabilistic model. It is widely used to discover latent topics among a set of documents. Mahout has implemented LDA algorithm, however, the execution time of the LDA program is very long when processing a large amount of documents, because the documents are processed in sequence. This paper introduces a method to modify this program with CUDA toolkit provided by NVIDIA, in order that a group of documents could be processed in parallel on GPU. Using this method, the LDA program could be accelerated greatly.

查看原文本刊更多论文

一种用GPU加速LDA程序的方法

LDA (Latent Dirichlet Allocation)是一种基于生成概率模型的文本建模算法。它被广泛用于发现一组文档中的潜在主题。Mahout实现了LDA算法，但是在处理大量文档时，由于文档是按顺序处理的，所以LDA程序的执行时间很长。本文介绍了一种利用NVIDIA提供的CUDA工具包对该程序进行修改的方法，使一组文档可以在GPU上并行处理。采用这种方法可以大大加快LDA程序的速度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 Third International Conference on Networking and Distributed Computing

自引率

0.00%

发文量