From: "Probabilistic Graphical Models" by Daphne Koller and Nir Friedman
Business decisions are made with incomplete information. Will the product launch succeed? Is this customer about to churn? Will the supply chain disruption last 2 weeks or 2 months?
Traditional models give you point estimates: "Expected revenue: $500K." But what you really need is a probability distribution: "60% chance of landing between $400K and $600K, 20% chance below $400K, 20% chance above $600K."
Bayesian networks provide that framework.
A Bayesian network is a directed acyclic graph (DAG) where:

- Variables: each node represents a quantity you care about (churn, feature usage, support tickets).
- Dependencies: each directed edge means the child variable's probability distribution depends on its parent.
The network encodes: users who log in frequently tend to use more features, and high feature usage reduces churn probability. Support tickets also directly influence churn risk.
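That structure can be written down directly. Here is a minimal sketch of the churn network in plain Python, with every probability value invented for illustration (a real network would learn these from data):

```python
# Toy churn network: Login -> FeatureUsage -> Churn <- Tickets
# All probability values below are hypothetical.

P_login = {"high": 0.7, "low": 0.3}                 # P(Login)
P_usage_given_login = {                             # P(FeatureUsage | Login)
    "high": {"high": 0.8, "low": 0.2},
    "low":  {"high": 0.3, "low": 0.7},
}
P_tickets = {"high": 0.2, "low": 0.8}               # P(Tickets)
P_churn_given = {                                   # P(Churn=yes | Usage, Tickets)
    ("high", "high"): 0.30,
    ("high", "low"):  0.05,
    ("low",  "high"): 0.60,
    ("low",  "low"):  0.20,
}

def joint(login, usage, tickets, churn):
    """P(Login, Usage, Tickets, Churn): the chain rule for DAGs —
    each variable is conditioned only on its parents."""
    p_yes = P_churn_given[(usage, tickets)]
    p_churn = p_yes if churn == "yes" else 1 - p_yes
    return (P_login[login]
            * P_usage_given_login[login][usage]
            * P_tickets[tickets]
            * p_churn)

print(joint("low", "low", "high", "yes"))  # 0.3 * 0.7 * 0.2 * 0.6 ≈ 0.0252
```

The point of the DAG: the full joint over four variables needs only these small local tables, because each variable is conditioned on its parents alone.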
Not all variables are observed. Maybe you know a customer has high support tickets but don't know their feature usage. The network can infer likely feature usage patterns based on observed variables and historical correlations.
Edges can represent causal relationships, not just correlations. If you know "rain causes wet ground" (not the other way around), the network structure reflects reality. This matters for intervention analysis: "If we increase login frequency, will churn decrease?"
As new evidence arrives, the network updates all probability distributions. You learn a customer opened 5 support tickets today—instantly, their churn probability updates based on the conditional probabilities encoded in the network.
Given observed variables (evidence), compute probability distributions over unobserved variables. Two main types:
P(Churn | LoginFrequency=Low) — If we observe low logins, what's the probability of churn? This flows forward through the network.
P(FeatureUsage | Churn=Yes) — If a customer churned, what was their likely feature usage? This flows backward, reasoning from effects to causes.
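Both query types can be answered by brute-force enumeration on a toy network: sum the joint over every assignment consistent with the evidence, then normalize. All numbers here are hypothetical, and enumeration is only viable for tiny networks; larger ones need the algorithms discussed later.

```python
from itertools import product

# Toy churn network (hypothetical numbers), same structure as before:
# Login -> FeatureUsage -> Churn <- Tickets
P_login = {"high": 0.7, "low": 0.3}
P_usage = {"high": {"high": 0.8, "low": 0.2},
           "low":  {"high": 0.3, "low": 0.7}}        # P(Usage | Login)
P_tickets = {"high": 0.2, "low": 0.8}
P_churn = {("high", "high"): 0.30, ("high", "low"): 0.05,
           ("low",  "high"): 0.60, ("low",  "low"): 0.20}  # P(Churn=yes | Usage, Tickets)

def joint(lo, us, ti, ch):
    p = P_churn[(us, ti)]
    return (P_login[lo] * P_usage[lo][us] * P_tickets[ti]
            * (p if ch == "yes" else 1 - p))

def query(target, evidence):
    """P(target | evidence) by summing the joint over all assignments."""
    dist = {}
    for lo, us, ti, ch in product(*[["high", "low"]] * 3, ["yes", "no"]):
        a = {"login": lo, "usage": us, "tickets": ti, "churn": ch}
        if all(a[k] == v for k, v in evidence.items()):
            dist[a[target]] = dist.get(a[target], 0.0) + joint(lo, us, ti, ch)
    z = sum(dist.values())          # normalize by P(evidence)
    return {k: v / z for k, v in dist.items()}

# Forward (predictive): effect given an observed cause
print(query("churn", {"login": "low"}))
# Backward (diagnostic): likely cause given an observed effect
print(query("usage", {"churn": "yes"}))
```

The same `query` function answers both directions; the network structure, not the code, determines which way information flows.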
"Bayesian networks make the implicit explicit. Your mental model of how things relate is now a queryable, testable structure."
Diseases → Symptoms, Diseases → Test Results: diseases cause both what the patient reports and what the tests show. Given observed symptoms, the network infers likely diagnoses and recommends the tests that would most reduce uncertainty.
Transaction patterns, account history, geolocation, device fingerprints—all feeding into fraud probability. As each new signal arrives, fraud score updates in real-time.
Project delays depend on: supplier reliability, team experience, technical complexity, external dependencies. Model these relationships, then ask: "If Supplier A delays shipment, what's the probability we miss the launch date?"
What factors influence your outcome of interest? Start broad, then prune. Too many variables make the network intractable; too few miss important dependencies.
Which variables directly influence others? Draw edges. Avoid cycles—if A causes B and B causes C, don't add C → A (that's feedback, requires dynamic Bayesian networks).
For each variable, define: P(Variable | Parents). This requires data, expert judgment, or both. Start with rough estimates, refine with learning algorithms as data accumulates.
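One simple way to blend a rough expert estimate with accumulating data is a Beta-Bernoulli posterior: treat the expert's number as pseudo-counts, then fold in observed counts. The function name and all numbers below are illustrative.

```python
# Refining one CPT entry, e.g. P(Churn=yes | Usage=low, Tickets=high),
# as data accumulates. The expert prior is expressed as pseudo-counts.

def refined_estimate(prior_p, prior_strength, churned, total):
    """Posterior mean of a Beta-Bernoulli model: the expert's prior_p,
    weighted like prior_strength observations, blended with real counts."""
    alpha = prior_p * prior_strength + churned
    beta = (1 - prior_p) * prior_strength + (total - churned)
    return alpha / (alpha + beta)

# Expert says 0.6, weighted like 10 observations; then 200 customers
# in this bucket are observed and 90 of them churn.
print(refined_estimate(0.6, 10, churned=90, total=200))  # (6+90)/(10+200) ≈ 0.457
```

With no data the estimate is exactly the expert's prior; as counts grow, the data dominates. This is the "start with rough estimates, refine as data accumulates" loop in miniature.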
Test the network on known scenarios. Does P(Outcome | Evidence) match reality? Are predictions calibrated? Adjust structure and probabilities iteratively.
Bayesian networks don't eliminate uncertainty—they quantify it. Instead of guessing "this customer might churn," you say "72% probability of churn given current behavior." That's actionable.
Bayesian networks exploit conditional independence: variables that are independent given their parents. But if you miss a dependency, the model will give wrong answers. Validate your independence assumptions.
Estimating P(X | Y, Z) requires observing all combinations of Y and Z. With many variables, you need exponentially more data. Solution: impose structure (e.g., noisy-OR models) or use expert priors.
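A sketch of the noisy-OR idea: each active parent independently "fires" the effect with its own strength, so a k-parent CPT needs k parameters instead of 2^k rows. Parent names and strengths below are hypothetical.

```python
# Noisy-OR: P(effect | active parents) = 1 - (1 - leak) * prod(1 - p_i)
# over the active parents, where p_i is each parent's individual strength.

def noisy_or(strengths, active, leak=0.0):
    """strengths: {parent: P(effect | only that parent active)}.
    leak: probability the effect occurs with no active parent."""
    p_none = 1 - leak
    for parent in active:
        p_none *= 1 - strengths[parent]
    return 1 - p_none

strengths = {"low_usage": 0.25, "many_tickets": 0.40, "price_hike": 0.30}
print(noisy_or(strengths, ["low_usage", "many_tickets"]))  # 1 - 0.75*0.6 = 0.55
```

Three parameters here stand in for what would otherwise be a 2^3-row table, which is exactly the data-requirement relief the text describes.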
Edges suggest directionality, but correlation doesn't imply causation without further justification. Observational data can't distinguish "A causes B" from "B causes A" or "hidden C causes both." Causal inference requires careful reasoning or experimental data.
Variable elimination, junction tree algorithms. Guaranteed correct probabilities but computationally expensive for large networks.
Monte Carlo sampling, loopy belief propagation. Faster, but the answers are estimates rather than exact probabilities. Good enough for most practical applications.
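The simplest Monte Carlo approach is rejection sampling, sketched here on the toy churn network (all numbers hypothetical): draw full samples from the joint, discard those inconsistent with the evidence, and read the answer off the survivors.

```python
import random

# Toy churn network: Login -> Usage -> Churn <- Tickets (numbers illustrative)
P_login = {"high": 0.7, "low": 0.3}
P_usage = {"high": 0.8, "low": 0.3}       # P(Usage=high | Login)
P_TICKETS_HIGH = 0.2
P_churn = {("high", "high"): 0.30, ("high", "low"): 0.05,
           ("low",  "high"): 0.60, ("low",  "low"): 0.20}

def sample():
    """Draw one full assignment by sampling each node given its parents."""
    lo = "high" if random.random() < P_login["high"] else "low"
    us = "high" if random.random() < P_usage[lo] else "low"
    ti = "high" if random.random() < P_TICKETS_HIGH else "low"
    ch = "yes" if random.random() < P_churn[(us, ti)] else "no"
    return lo, us, ti, ch

def estimate_churn_given_login(login, n=50_000):
    kept = churned = 0
    for _ in range(n):
        lo, us, ti, ch = sample()
        if lo != login:                   # reject: inconsistent with evidence
            continue
        kept += 1
        churned += (ch == "yes")
    return churned / kept

random.seed(0)
print(estimate_churn_given_login("low"))  # should land near the exact 0.226
```

Rejection sampling wastes every rejected sample, so it degrades badly as evidence gets rare; likelihood weighting and MCMC exist precisely to avoid that waste.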
Standard Bayesian networks are static snapshots. Dynamic Bayesian Networks (DBNs) model how variables evolve over time. Think Kalman filters or Hidden Markov Models, but more expressive.
Machine state at time t depends on state at t-1 and sensor readings. As new sensor data streams in, the DBN continuously updates failure probability. Maintenance is triggered when probability exceeds threshold.
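That predict-then-update loop can be sketched as a two-state filter (the simplest possible DBN, equivalent to an HMM forward pass). All transition, sensor, and threshold numbers below are hypothetical.

```python
# Hidden state: machine is "ok" or "failing". Each step: predict the state
# forward through the transition model, then update on the sensor reading.

TRANSITION = {"ok":      {"ok": 0.95, "failing": 0.05},
              "failing": {"ok": 0.00, "failing": 1.00}}   # failing is absorbing
EMISSION = {"ok":      {"normal": 0.9, "hot": 0.1},       # P(reading | state)
            "failing": {"normal": 0.3, "hot": 0.7}}

def filter_step(belief, reading):
    """One predict-then-update step of the forward algorithm."""
    predicted = {s: sum(belief[p] * TRANSITION[p][s] for p in belief)
                 for s in belief}
    unnorm = {s: predicted[s] * EMISSION[s][reading] for s in predicted}
    z = sum(unnorm.values())
    return {s: v / z for s, v in unnorm.items()}

belief = {"ok": 0.99, "failing": 0.01}
for t, reading in enumerate(["normal", "hot", "hot", "hot"], start=1):
    belief = filter_step(belief, reading)
    if belief["failing"] > 0.5:
        print(f"t={t}: schedule maintenance (risk {belief['failing']:.2f})")
```

A run of "hot" readings pushes the failure belief up step by step until it crosses the maintenance threshold, which is exactly the streaming behavior described above.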
✅ Good fit when:

- You (or your experts) can articulate how variables depend on each other.
- Important variables are unobserved and must be inferred from the ones you can see.
- You need calibrated probabilities that update as new evidence arrives.
- You want to reason about interventions, not just make predictions.

❌ Consider alternatives when:

- You only need point predictions and have abundant labeled data; standard supervised ML is simpler.
- Relationships involve feedback loops, which a plain DAG cannot express; consider dynamic models.
- The variable count makes inference intractable even with approximation.