2002, IEEE Transactions on Knowledge and Data …
This research presents a novel distributed learning algorithm for Bayesian inference networks, designed to alleviate the knowledge engineering bottleneck associated with manual network construction. Leveraging under-utilized computing resources, the approach employs the Minimum Description Length (MDL) principle to formulate a serial search algorithm, which is then parallelized through an asynchronous distributed search technique known as nagging. Empirical results demonstrate significant improvements in learning performance and computational efficiency, enabling the learning of large Bayesian networks with up to 150 nodes across multiple workstations.
Bayesian network parameter learning is one of the core issues in Bayesian network research. Estimating the parameters of a Bayesian network from a large incomplete dataset can be very compute-intensive. This paper presents a factor-graph-based Bayesian network parameter learning algorithm using MapReduce, which decomposes a Bayesian network into factors and obtains the network parameters by computing the conditional probability table of each factor independently with the Expectation Maximization (EM) algorithm within the MapReduce framework. Experimental results show that with 10^7 training samples, the parallel algorithm runs 2 to 6 times faster than sequential Expectation Maximization. The algorithm reduces training time significantly as the number of Hadoop nodes increases. Compared with the existing parallel EM method using MapReduce, this algorithm also achieves higher speed while avoiding the problem of load imbalance.
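The per-factor EM update described above can be sketched as follows. This is a minimal single-factor illustration with hypothetical names; the paper's MapReduce plumbing, in which count accumulation would be the map/combine step and normalisation the reduce step, is omitted.

```python
import numpy as np

def em_update_cpt(cpt, samples, n_parent_cfgs, n_states):
    """One EM iteration for a single factor's CPT.

    cpt: array of shape (n_parent_cfgs, n_states), rows sum to 1.
    samples: list of (parent_cfg, child_state) pairs; child_state is
             None when the child value is missing in the record.
    """
    counts = np.full((n_parent_cfgs, n_states), 1e-6)  # tiny prior avoids zero rows
    for parent_cfg, child_state in samples:
        if child_state is not None:                    # observed: hard count
            counts[parent_cfg, child_state] += 1.0
        else:                                          # missing: expected (soft) count
            counts[parent_cfg] += cpt[parent_cfg]      # E-step assignment
    return counts / counts.sum(axis=1, keepdims=True)  # M-step normalisation
```

In practice this update would be iterated until the CPT stops changing, with each data partition contributing its local counts before the global normalisation.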
Studies in health technology and informatics, 2004
Bayesian networks (BNs) are a knowledge representation formalism that has proven valuable in biomedicine for constructing decision support systems and for generating causal hypotheses from data. Given the emergence of datasets in medicine and biology with thousands of variables, and the fact that current algorithms do not scale beyond a few hundred variables in practical domains, new efficient and accurate algorithms are needed to learn high-quality BNs from data. We present a new algorithm called Max-Min Hill-Climbing (MMHC) that builds upon and improves the Sparse Candidate (SC) algorithm, a state-of-the-art algorithm that scales up to datasets involving hundreds of variables provided the generating networks are sparse. Compared to SC, on a number of datasets from medicine and biology, (a) MMHC discovers BNs that are structurally closer to the data-generating BN, (b) the discovered networks are more probable given the data, and (c) MMHC is computationally more efficient and scal...
2016
Ever increasing data quantity makes ever more urgent the need for highly scalable learners that have good classification performance. Therefore, an out-of-core learner with excellent time and space complexity, along with high expressivity (that is, capacity to learn very complex multivariate probability distributions) is extremely desirable. This paper presents such a learner. We propose an extension to the k-dependence Bayesian classifier (KDB) that discriminatively selects a sub-model of a full KDB classifier. It requires only one additional pass through the training data, making it a three-pass learner. Our extensive experimental evaluation on 16 large data sets reveals that this out-of-core algorithm achieves competitive classification performance, and substantially better training and classification time than state-of-the-art in-core learners such as random forest and linear and non-linear logistic regression.
2011
This paper gives an overview of theoretical principles relevant for the construction of distributed Bayesian inference systems for challenging real world applications. We address: (i) design principles for inference in distributed Bayesian networks; (ii) robust systems and (iii) systematic model validation. The principles are illustrated with the help of an example from the monitoring domain.
2002
Most existing algorithms for structural learning of Bayesian networks are suitable for constructing small networks of several tens of nodes. In this paper, we present a novel approach to the efficient and relatively precise induction of large-scale Bayesian networks with up to several hundred nodes. The approach is based on the concept of the Markov blanket and makes use of the divide-and-conquer principle. The proposed method has been evaluated on two benchmark datasets and real-life DNA microarray data, demonstrating its ability to learn large-scale Bayesian network structures efficiently.
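The Markov blanket concept underlying such divide-and-conquer decompositions can be stated in a few lines. This is an illustrative sketch with hypothetical names, not the paper's implementation:

```python
def markov_blanket(node, edges):
    """Markov blanket of `node` in a DAG given as a set of (parent, child)
    edges: its parents, its children, and its children's other parents
    (spouses). Conditioned on this set, `node` is independent of the rest
    of the network, which is what licenses learning each region separately.
    """
    parents = {a for a, b in edges if b == node}
    children = {b for a, b in edges if a == node}
    spouses = {a for a, b in edges if b in children and a != node}
    return parents | children | spouses
```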
Sensors, 2019
Discovering Bayesian network (BN) structure from big datasets containing rich causal relationships is becoming increasingly valuable for modeling and reasoning under uncertainty in many areas where big data are gathered from sensors at high volume and velocity. Most current BN structure learning algorithms have shortcomings when facing big data. First, learning a BN structure from an entire big dataset is an expensive task that often fails due to memory constraints. Second, it is difficult to select, from the numerous BN structure learning algorithms, one that consistently achieves good learning accuracy. Lastly, there is a lack of an intelligent method for merging separately learned BN structures into a well-structured network. To address these shortcomings, we introduce a novel parallel learning approach called PEnBayes (Parallel Ensemble-based Bayesian network learning). PEnBayes starts with an adaptive data preprocessing phase that calculates the Appropriate Learning Size and intelligently divides a big dataset for fast distributed local structure learning. Then, PEnBayes learns a collection of local BN structures in parallel using a two-layered weighted-adjacency-matrix-based structure ensemble method. Lastly, PEnBayes merges the local BN structures into a global network structure using the structure ensemble method at the global layer. For the experiments, we generate big datasets by simulating sensor data from the patient monitoring, transportation, and disease diagnosis domains. The experimental results show that PEnBayes achieves significantly improved execution performance with more consistent and stable results than three baseline learning algorithms.
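The weighted-adjacency-matrix edge vote at the heart of such a structure ensemble can be sketched as follows. This is a simplified single-layer illustration with hypothetical names; PEnBayes's actual two-layered method is more elaborate.

```python
import numpy as np

def merge_structures(adj_mats, weights, threshold=0.5):
    """Merge locally learned DAG adjacency matrices by weighted edge vote.

    adj_mats:  list of 0/1 numpy arrays (n x n), one per local learner.
    weights:   per-learner reliability weights (e.g. normalised local
               scores) that sum to 1.
    An edge enters the consensus structure when its weighted vote
    exceeds `threshold`.
    """
    vote = sum(w * a for w, a in zip(weights, adj_mats))
    return (vote > threshold).astype(int)
```

A consensus built this way can still contain cycles, so a real merger would follow the vote with a cycle-removal pass before returning a DAG.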
2012
Under the supervision of Professor Istvan Lauko. Statistics from the National Cancer Institute indicate that 1 in 8 women will develop breast cancer in their lifetime. Researchers have developed numerous statistical models to predict breast cancer risk; however, physicians are hesitant to use these models because of disparities in the predictions they produce. In an effort to reduce these disparities, we use Bayesian networks to capture the joint distribution of risk factors and simulate artificial patient populations (clinical avatars) for interrogating the existing risk prediction models. The challenge in this effort has been to produce a Bayesian network whose dependencies agree with the literature and are good estimates of the joint distribution of risk factors. In this work, we propose a methodology for learning Bayesian networks that uses prior knowledge to guide a collection of search algorithms in identifying an optimal structure. Using data from the Breast Cancer Surveillance Consortium, we have shown that our methodology produces a Bayesian network with consistent dependencies and a better estimate of the distribution of risk factors than existing methods.
Machine Learning, 1995
We describe algorithms for learning Bayesian networks from a combination of user knowledge and statistical data. The algorithms have two components: a scoring metric and a search procedure. The scoring metric takes a network structure, statistical data, and a user's prior knowledge, and returns a score proportional to the posterior probability of the network structure given the data. The search procedure generates networks for evaluation by the scoring metric. Our contributions are threefold. First, we identify two important properties of metrics, which we call score equivalence and parameter modularity. These properties have been mostly ignored, but when combined, they greatly simplify the encoding of a user's prior knowledge. In particular, a user can express his knowledge, for the most part, as a single prior Bayesian network for the domain. Second, we describe greedy hill-climbing and annealing search algorithms to be used in conjunction with scoring metrics. In the special case where each node has at most one parent, we show that heuristic search can be replaced with a polynomial algorithm to identify the networks with the highest score. Third, we describe a methodology for evaluating Bayesian-network learning algorithms. We apply this approach to a comparison of our metrics and search procedures.
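The score-plus-search decomposition described above can be sketched generically. All names are illustrative rather than the paper's API, the scoring metric is passed in as a black box, and acyclicity checking is omitted for brevity:

```python
def hill_climb(variables, score, initial_edges=frozenset(), max_iters=100):
    """Greedy hill-climbing over single-edge additions and deletions.

    score(edges) must return the network score (e.g. a Bayesian or MDL
    metric) for a frozenset of (parent, child) edges; higher is better.
    """
    edges = set(initial_edges)
    best = score(frozenset(edges))
    for _ in range(max_iters):
        candidates = []
        for a in variables:
            for b in variables:
                if a == b:
                    continue
                trial = edges ^ {(a, b)}     # toggle: add if absent, delete if present
                candidates.append((score(frozenset(trial)), trial))
        top_score, top_edges = max(candidates, key=lambda t: t[0])
        if top_score <= best:                # no improving neighbour: local optimum
            break
        best, edges = top_score, set(top_edges)
    return frozenset(edges), best
```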
The Journal of Machine …, 2004
In this paper, we provide new complexity results for algorithms that learn discrete-variable Bayesian networks from data. Our results apply whenever the learning algorithm uses a scoring criterion that favors the simplest structure for which the model is able to represent the ...
Information Processing and Management of Uncertainty in Knowledge-Based Systems, 2020
This paper considers the problem of learning a generalized credal network (a set of Bayesian networks) from a dataset. It is based on using the BDEu score and computes all the networks with score above a predetermined factor of the optimal one. To avoid the problem of determining the equivalent sample size (ESS), the approach also considers the possibility of an undetermined ESS. Even if the final result is a set of Bayesian networks, the paper also studies the problem of selecting a single network with some alternative procedures. Finally, some preliminary experiments are carried out with three small networks.
Knowledge Engineering Review, 1999
Bayesian Networks (BNs) model problems that involve uncertainty. A BN is a directed graph, whose nodes are the uncertain variables and whose edges are the causal or influential links between the variables. Associated with each node is a set of conditional probability functions that model the uncertain relationship between the node and its parents. The benefits of using BNs to
2017
We have previously proposed a constraint-based Bayesian network learning method using the Bayes factor. Since a conditional independence test using the Bayes factor has consistency, the learning method improves on the learning accuracy of traditional constraint-based learning methods. Additionally, the method is expected to learn larger network structures than traditional methods do, because it greatly improves computational efficiency. However, these expected benefits have not been demonstrated empirically. This report describes experiments on learning large network structures. Results show that the proposed method can learn surprisingly large networks with thousands of variables.
1998
A method for improving search-based inference techniques in Bayesian networks by obtaining a prior estimate of the error is presented. The method is based on a recently introduced algorithm for calculating the contribution of a given set of instantiations to the total probability mass. If a certain accuracy of the solution is desired, the method provides the number of replications (i.e., the sample size) needed to obtain the approximate values with the desired accuracy. In addition to providing a prior stopping rule, the method substantially reduces the structure of the search tree and, hence, the computation time required for the process. Important savings are obtained in the case of Bayesian networks with extreme probabilities, as shown by the examples reported in the paper. As an example of the method's many possible applications, the problem of finding a maximum a posteriori (MAP) instantiation of the Bayesian network variables, given a partial value assignment as an initial constraint, is presented.
2003
In this paper we describe how to learn Bayesian networks from a summary of complete data in the form of a dependency network rather than from data directly. This method allows us to gain the advantages of both representations: scalable algorithms for learning dependency networks and convenient inference with Bayesian networks. Our approach is to use a dependency network as an "oracle" for the statistics needed to learn a Bayesian network. We show that the general problem is NP-hard and develop a greedy search algorithm. We conduct a preliminary experimental evaluation and find that the prediction accuracy of the Bayesian networks constructed from our algorithm almost equals that of Bayesian networks learned directly from the data.
2017
Probabilistic graphical models are an attractive approach for modelling complex systems, as the nature of the network allows uncertainty in the system to be accounted for. In particular, Bayesian networks and their temporal extension, dynamic Bayesian networks, are investigated. Difficulties often arise in the learning procedure, as the computational complexity of the network increases exponentially with each new variable. This can lead to a restriction on the number of variables included in the network, resulting in a model that is not truly representative of the system. This research aims to demonstrate how these methods can benefit from advanced computing technologies for implementing various learning algorithms, yielding faster computation and the ability to handle larger data sets with more variables.
2004
In this paper, we propose two modifications to the original Minimum Description Length (MDL) score for learning Bayesian networks. The first is that the description of the network structure is proved to be unnecessary and can be omitted from the total MDL score. The second consists in reducing the description length of the conditional probability table (CPT). In particular, if a variable is fully deterministic given its parents, i.e., the variable takes a certain value with probability one for some configurations of its parents, we show that only the configurations with probability one need to be retained in the variable's CPT in the MDL score during learning. We name the MDL score with these two modifications the Improved MDL (IMDL) score. Experimental results on classic Bayesian networks, such as ALARM and ASIA, show that the same search algorithm with the IMDL score identifies more reasonable and accurate models than those obtained with the original MDL score.
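A sketch of the classic two-part MDL score helps locate the CPT-description term that the paper's second modification shrinks. Names are hypothetical; logs are base 2, and the sign convention here makes higher scores better:

```python
import math

def mdl_score(data, parents, n_states):
    """Classic MDL score (log-likelihood minus description length) for a
    discrete Bayesian network.

    data:     list of dicts {variable: state}.
    parents:  {variable: list of parent variables}.
    n_states: {variable: number of states}.
    The CPT parameter-cost term below is the one the IMDL variant reduces
    for variables that are deterministic given their parents.
    """
    n = len(data)
    log_lik, desc_len = 0.0, 0.0
    for v, ps in parents.items():
        q = math.prod(n_states[p] for p in ps)        # parent configurations
        r = n_states[v]
        desc_len += 0.5 * math.log2(n) * q * (r - 1)  # CPT parameter cost
        counts, totals = {}, {}
        for row in data:                              # empirical counts
            key = (tuple(row[p] for p in ps), row[v])
            counts[key] = counts.get(key, 0) + 1
        for (cfg, _), c in counts.items():
            totals[cfg] = totals.get(cfg, 0) + c
        for (cfg, _), c in counts.items():            # max-likelihood log-loss
            log_lik += c * math.log2(c / totals[cfg])
    return log_lik - desc_len
```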
Proceedings of the ... AAAI Conference on Artificial Intelligence, 2021
The creation of Bayesian networks often requires the specification of a large number of parameters, making it highly desirable to be able to learn these parameters from historical data. In many cases, such data has uncertainty associated with it, including cases in which the data comes from unstructured analysis or from sensors. When creating diagnosis networks, for example, unstructured analysis algorithms can be run on the historical text descriptions or images of previous cases to extract data for learning Bayesian network parameters, but such derived data has inherent uncertainty due to the nature of those algorithms. Because current Bayesian network parameter learning algorithms cannot incorporate such uncertainty, common approaches either ignore it, reducing the resulting accuracy, or discard the data entirely. We present an approach for learning Bayesian network parameters that explicitly incorporates such uncertainty, and which is a natural extension of the Bayesian network formalism. We present a generalization of the Expectation Maximization parameter learning algorithm that enables it to handle any historical data with likelihood-evidence-based uncertainty, as well as an empirical validation demonstrating the improved accuracy and convergence enabled by our approach. We also prove that our extended algorithm maintains the convergence and correctness properties of the original EM algorithm while explicitly incorporating data uncertainty in the learning process.
Progress in Artificial Intelligence, 2015
One of the main research topics in machine learning nowadays is the improvement of the inference and learning processes in probabilistic graphical models. Traditionally, inference and learning have been treated separately, but given that the structure of the model conditions the inference complexity, most learning methods will sometimes produce inefficient inference models. In this paper we propose a framework for learning low inference complexity Bayesian networks. For that, we use a representation of the network factorization that allows efficiently evaluating an upper bound in the inference complexity of each model during the learning process. Experimental results show that the proposed methods obtain tractable models that improve the accuracy of the predictions provided by approximate inference in models obtained with a well-known Bayesian network learner.