Note: This codebase is no longer being maintained. It will be sunset and archived soon. A new and re-written library (under the same name) is available here.
PyBBN is a Python library for exact inference in Bayesian Belief Networks (BBNs) using the junction tree algorithm, also known as Probability Propagation in Trees of Clusters (PPTC). The implementation is taken directly from C. Huang and A. Darwiche, "Inference in Belief Networks: A Procedural Guide," in International Journal of Approximate Reasoning, vol. 15, pp. 225--263, 1996. In this API, PPTC is applied to BBNs in which all variables are discrete. For a BBN in which all variables are Gaussian (a Gaussian Belief Network, or GBN), exact inference is conducted through an incremental algorithm that manipulates the means and covariance matrix. The library can also generate singly- and multi-connected graphs, following J. S. Ide and F. G. Cozman, "Random Generation of Bayesian Networks," in Advances in Artificial Intelligence, Lecture Notes in Computer Science, vol. 2507. You can also generate sample data from your BBN; summarizing this synthetic data yields estimates of the posterior marginal probabilities and so works as a form of approximate inference. Lastly, we have added Pearl's do-operator for causal inference.
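To illustrate how summarizing sampled data can stand in for exact inference, here is a minimal sketch in plain Python. It does not use PyBBN at all: the two-node network, its probabilities, and the helper names (`sample_once`, `estimate_marginal`) are hypothetical and chosen only for illustration.

```python
import random

# Hypothetical two-node network a -> b (values are 'on'/'off').
P_A_ON = 0.5                              # P(a=on)
P_B_ON_GIVEN_A = {'on': 0.5, 'off': 0.4}  # P(b=on | a)


def sample_once(rng):
    """Draw one joint sample (a, b) by sampling the parent before the child."""
    a = 'on' if rng.random() < P_A_ON else 'off'
    b = 'on' if rng.random() < P_B_ON_GIVEN_A[a] else 'off'
    return a, b


def estimate_marginal(n_samples, seed=37):
    """Estimate P(b=on) as the fraction of samples in which b is 'on'."""
    rng = random.Random(seed)
    hits = sum(1 for _ in range(n_samples) if sample_once(rng)[1] == 'on')
    return hits / n_samples


estimate = estimate_marginal(20000)
# Exact value for comparison: sum over a of P(a) * P(b=on | a).
exact = P_A_ON * P_B_ON_GIVEN_A['on'] + (1 - P_A_ON) * P_B_ON_GIVEN_A['off']
print(f"sampled P(b=on) ~ {estimate:.3f}, exact = {exact:.3f}")
```

The estimate approaches the exact value as the number of samples grows, which is the sense in which sampling acts as approximate inference.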
Rocket Vector | Autonosis | pyspark-bbn |
---|---|---|
If you like py-bbn, please inquire about our next-generation products below!
- Rocket Vector is a causal learning platform in the cloud!
- Autonosis is a GenAI + CausalAI capable platform.
- pyspark-bbn is a scalable, massively parallel processing (MPP) framework for learning the structures and parameters of Bayesian Belief Networks (BBNs) using Apache Spark.
Below is example code that creates a Bayesian Belief Network, transforms it into a join tree, and then sets observation evidence. The final loop prints the marginal probabilities for each node.
from pybbn.graph.dag import Bbn
from pybbn.graph.edge import Edge, EdgeType
from pybbn.graph.jointree import EvidenceBuilder
from pybbn.graph.node import BbnNode
from pybbn.graph.variable import Variable
from pybbn.pptc.inferencecontroller import InferenceController
# create the nodes
a = BbnNode(Variable(0, 'a', ['on', 'off']), [0.5, 0.5])
b = BbnNode(Variable(1, 'b', ['on', 'off']), [0.5, 0.5, 0.4, 0.6])
c = BbnNode(Variable(2, 'c', ['on', 'off']), [0.7, 0.3, 0.2, 0.8])
d = BbnNode(Variable(3, 'd', ['on', 'off']), [0.9, 0.1, 0.5, 0.5])
e = BbnNode(Variable(4, 'e', ['on', 'off']), [0.3, 0.7, 0.6, 0.4])
f = BbnNode(Variable(5, 'f', ['on', 'off']), [0.01, 0.99, 0.01, 0.99, 0.01, 0.99, 0.99, 0.01])
g = BbnNode(Variable(6, 'g', ['on', 'off']), [0.8, 0.2, 0.1, 0.9])
h = BbnNode(Variable(7, 'h', ['on', 'off']), [0.05, 0.95, 0.95, 0.05, 0.95, 0.05, 0.95, 0.05])
# create the network structure
bbn = Bbn() \
    .add_node(a) \
    .add_node(b) \
    .add_node(c) \
    .add_node(d) \
    .add_node(e) \
    .add_node(f) \
    .add_node(g) \
    .add_node(h) \
    .add_edge(Edge(a, b, EdgeType.DIRECTED)) \
    .add_edge(Edge(a, c, EdgeType.DIRECTED)) \
    .add_edge(Edge(b, d, EdgeType.DIRECTED)) \
    .add_edge(Edge(c, e, EdgeType.DIRECTED)) \
    .add_edge(Edge(d, f, EdgeType.DIRECTED)) \
    .add_edge(Edge(e, f, EdgeType.DIRECTED)) \
    .add_edge(Edge(c, g, EdgeType.DIRECTED)) \
    .add_edge(Edge(e, h, EdgeType.DIRECTED)) \
    .add_edge(Edge(g, h, EdgeType.DIRECTED))
# convert the BBN to a join tree
join_tree = InferenceController.apply(bbn)
# insert an observation evidence
ev = EvidenceBuilder() \
    .with_node(join_tree.get_bbn_node_by_name('a')) \
    .with_evidence('on', 1.0) \
    .build()
join_tree.set_observation(ev)
# print the marginal probabilities
for node in join_tree.get_bbn_nodes():
    potential = join_tree.get_bbn_potential(node)
    print(node)
    print(potential)
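The flat probability lists passed to `BbnNode` can be read as conditional probability tables. The sketch below assumes (this is an assumption about the layout, not something stated here) that each consecutive pair of numbers is the distribution over `['on', 'off']` for one configuration of the parents, with parent configurations enumerated in order. Under that assumption, it computes the marginal of `b` by hand for the sub-network `a -> b`, before and after observing `a = on`. It uses plain Python only; `rows` and `marginal_b` are hypothetical helper names, not part of PyBBN.

```python
VALUES = ['on', 'off']

# Probabilities copied from nodes `a` and `b` in the example above.
a_probs = [0.5, 0.5]            # P(a)
b_probs = [0.5, 0.5, 0.4, 0.6]  # assumed layout: P(b | a=on), then P(b | a=off)


def rows(flat, width=len(VALUES)):
    """Split a flat probability list into rows, one per parent configuration."""
    return [flat[i:i + width] for i in range(0, len(flat), width)]


def marginal_b(p_a):
    """Sum out `a`: P(b) = sum over a of P(a) * P(b | a)."""
    b_given_a = rows(b_probs)
    return {
        b_val: sum(p_a[i] * b_given_a[i][j] for i in range(len(VALUES)))
        for j, b_val in enumerate(VALUES)
    }


prior_b = marginal_b(a_probs)         # marginal of b with no evidence
posterior_b = marginal_b([1.0, 0.0])  # marginal of b after observing a = on
print(prior_b)
print(posterior_b)
```

If the layout assumption holds, the second result should agree with the potential the join tree prints for `b` once the evidence `a = on` has been set.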
The example below shows how to perform inference on multivariate Gaussian variables.
import numpy as np
from pybbn.gaussian.inference import GaussianInference
def get_cowell_data():
    """
    Gets Cowell data.

    :return: Data and headers.
    """
    n = 10000
    Y = np.random.normal(0, 1, n)
    X = np.random.normal(Y, 1, n)
    Z = np.random.normal(X, 1, n)
    D = np.vstack([Y, X, Z]).T
    return D, ['Y', 'X', 'Z']
# assume we have data and headers (variable names per column)
# X is the data (rows are observations, columns are variables)
# H is just a list of variable names
X, H = get_cowell_data()
# then we can compute the means and covariance matrix easily
M = X.mean(axis=0)
E = np.cov(X.T)
# the means and covariance matrix are all we need for gaussian inference
# notice how we keep `g` around?
# we'll use `g` over and over to do inference with evidence/observations
g = GaussianInference(H, M, E)
# {'Y': (0.00967, 0.98414), 'X': (0.01836, 2.02482), 'Z': (0.02373, 3.00646)}
print(g.P)
# we can make a single observation with do_inference()
g1 = g.do_inference('X', 1.5)
# {'X': (1.5, 0), 'Y': (0.76331, 0.49519), 'Z': (1.51893, 1.00406)}
print(g1.P)
# we can make multiple observations with do_inferences()
g2 = g.do_inferences([('Z', 1.5), ('X', 2.0)])
# {'Z': (1.5, 0), 'X': (2.0, 0), 'Y': (1.97926, 0.49509)}
print(g2.P)
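For intuition about what a single observation does to the means and variances, the following sketch applies the standard conditioning formulas for a jointly Gaussian pair. It does not use PyBBN and does not reproduce its incremental algorithm. Instead of sample estimates, it plugs in the theoretical moments implied by `get_cowell_data()`: all means are 0, Var(Y) = 1, Var(X) = 2, Var(Z) = 3, Cov(Y, X) = 1, and Cov(X, Z) = 2. The helper name `condition` is hypothetical.

```python
def condition(mean_t, var_t, mean_o, var_o, cov_to, observed):
    """Return (mean, variance) of target T given that O takes the observed value.

    Standard bivariate Gaussian conditioning:
      mean = mean_t + cov_to / var_o * (observed - mean_o)
      var  = var_t - cov_to ** 2 / var_o
    """
    gain = cov_to / var_o
    return mean_t + gain * (observed - mean_o), var_t - gain * cov_to


# Theoretical moments for Y ~ N(0, 1), X = Y + noise, Z = X + noise (unit-variance noise).
y_given_x = condition(0.0, 1.0, 0.0, 2.0, 1.0, observed=1.5)
z_given_x = condition(0.0, 3.0, 0.0, 2.0, 2.0, observed=1.5)

print(y_given_x)  # compare with the sample-based (0.76331, 0.49519) for Y above
print(z_given_x)  # compare with the sample-based (1.51893, 1.00406) for Z above
```

The small differences from the values in the example come from using exact moments here instead of moments estimated from 10,000 random draws.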
To build, you will need Python 3.7. Managing environments through Anaconda is highly recommended (though not required if you know what you are doing). Assuming you have installed Anaconda, you may create an environment as follows (make sure you cd into the root of this project's location).
To create the environment, use the following command.
conda env create -f environment.yml
If you want to use the environment with Jupyter, activate it and install the kernel.
conda activate pybbn37
python -m ipykernel install --user --name pybbn37 --display-name "pybbn37"
Then you may build the project as follows. Note that in Python 3.6 you will get some warnings.
make build
To build the documentation, go into the docs sub-directory and type in the following.
make html
You can do a fresh test with Docker as follows.
docker build -t pybbn-test:local -f Dockerfile.test .
Use pip to install the package, as it has been published to PyPI.
pip install pybbn
If you check out the source, do the following to replace any installed copy with a build from source.
pip list | grep pybbn
pip uninstall pybbn
python setup.py install
pip list | grep pybbn
Make sure you install GraphViz on your system.
- CentOS:
yum install graphviz*
- Ubuntu:
sudo apt-get install graphviz libgraphviz-dev
- Mac OSX:
brew install graphviz
and when you install pygraphviz:
pip install pygraphviz --install-option="--include-path=/usr/local/lib/graphviz/" --install-option="--library-path=/usr/local/lib/graphviz/"
- Windows: use the msi installer
- For Anaconda + Windows, install pygraphviz from this channel
conda install -c alubbock pygraphviz
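After installing, a quick way to confirm that the GraphViz binaries are reachable is to look for the `dot` executable on your `PATH`. This is a minimal sketch using only the standard library; `find_graphviz_dot` is a hypothetical helper, not part of PyBBN, and finding `dot` does not by itself guarantee that pygraphviz will build.

```python
import shutil


def find_graphviz_dot():
    """Return the full path to GraphViz's `dot` executable, or None if absent."""
    return shutil.which('dot')


dot_path = find_graphviz_dot()
if dot_path is None:
    print('GraphViz `dot` was not found on PATH; install GraphViz first.')
else:
    print(f'GraphViz `dot` found at {dot_path}')
```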
You should NOT be doing this operation, but if you do want to install from testpypi, then add the --extra-index-url as follows.
pip install -i https://test.pypi.org/simple/ --extra-index-url https://pypi.org/simple/ pybbn
Here is a list of other Python libraries for inference in Bayesian Belief Networks.
Library | Algorithm | Algorithm Type | License |
---|---|---|---|
BayesPy | variational message passing | approximate | MIT |
pomegranate | loopy belief | approximate | MIT |
pgmpy | multiple | approximate/exact | MIT |
libpgm | likelihood sampling | approximate | Proprietary |
bayesnetinference | variable elimination | exact | None |
I found other packages in PyPI too.
But I am coming from the Java mothership and I want to use Bayesian Belief Networks in Java. How do I perform probabilistic inference in Java?
This Python code base is a port of the original Java code.
@misc{vang_2017,
title={PyBBN},
url={https://github.com/vangj/py-bbn/},
journal={GitHub},
author={Vang, Jee},
year={2017},
month={Jan}}
I found these online articles using PyBBN.
- BBN: Bayesian Belief Networks — How to Build Them Effectively in Python
- Introduction to Bayesian Belief Networks
Copyright 2017 -- 2023 Jee Vang
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
Copyright 2020 Daytchia Vang