Mining Maximal Quasi-Bicliques to Co-Cluster Stocks and Financial Ratios for Value Investment

Sixth International Conference on Data Mining (ICDM'06), Pub Date: 2006-12-18, DOI: 10.1109/ICDM.2006.111

Kelvin Sim, Jinyan Li, V. Gopalkrishnan, Guimei Liu

Citations: 57

Abstract

We introduce an unsupervised process to co-cluster groups of stocks and financial ratios, so that investors can gain more insight into how they are correlated. Our approach to co-clustering is based on a graph concept called maximal quasi-bicliques, which can tolerate the erroneous and/or missing information that is common in stock and financial ratio data. Compared to previous work, our maximal quasi-bicliques require the errors to be evenly distributed, which enables us to capture more meaningful co-clusters. We develop a new algorithm that can efficiently enumerate maximal quasi-bicliques from an undirected graph. The concept of maximal quasi-bicliques is domain-independent; it can be extended to perform co-clustering on any data set that is modeled by a graph.
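To make the "evenly distributed errors" requirement concrete, here is a minimal Python sketch of a membership test. The abstract does not give the formal definition, so the sketch assumes one plausible reading: a pair of vertex sets forms a quasi-biclique if every vertex on one side is missing edges to at most `eps` vertices on the other side. The function name `is_quasi_biclique`, the parameter `eps`, and the stock and ratio labels are hypothetical and chosen only for illustration; this is a checker for a given pair of sets, not the enumeration algorithm the paper develops.

```python
def is_quasi_biclique(edges, left, right, eps):
    """Check an assumed per-vertex error tolerance.

    Returns True if every vertex in `left` lacks an edge to at most `eps`
    vertices of `right`, and every vertex in `right` lacks an edge to at
    most `eps` vertices of `left`. Edges are treated as undirected.
    """
    edge_set = {frozenset(e) for e in edges}

    def missing(v, others):
        # Number of vertices in `others` that v is NOT connected to.
        return sum(1 for u in others if frozenset((v, u)) not in edge_set)

    return (all(missing(v, right) <= eps for v in left)
            and all(missing(u, left) <= eps for u in right))


# Hypothetical toy data: three stocks and three financial ratios.
stocks = ["S1", "S2", "S3"]
ratios = ["R1", "R2", "R3"]
full = [(s, r) for s in stocks for r in ratios]

# Three missing edges spread evenly: each vertex misses exactly one neighbour.
even_gaps = {("S1", "R1"), ("S2", "R2"), ("S3", "R3")}
even_edges = [e for e in full if e not in even_gaps]

# Three missing edges concentrated on one stock: S1 has no edges at all.
skewed_gaps = {("S1", "R1"), ("S1", "R2"), ("S1", "R3")}
skewed_edges = [e for e in full if e not in skewed_gaps]

print(is_quasi_biclique(even_edges, stocks, ratios, eps=1))    # True
print(is_quasi_biclique(skewed_edges, stocks, ratios, eps=1))  # False
```

Both toy graphs lack the same number of edges, yet only the first passes. A tolerance on the total number of missing edges would accept both, including the case where one stock is unrelated to every ratio in the cluster; a per-vertex bound rules that case out.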

