Semantics of NPBNs#
In previous section, you have seen how to assign (un)conditional rank correlations to a DAG and the rank correlation matrix as a representation of the NPBN. If you want to read more about NPBN and their semantics, we refer you to Pearl (1988)[1].
In this section, you will see the meaning of the DAG in terms of (un)conditional dependence or independence and how that translates to the rank correlation matrix. To do so, we will analyze the three basic cases of NPBN on 3 nodes shown in the Figure below.
data:image/s3,"s3://crabby-images/8484d/8484d5bb28e0c731267f0f8d80df47c4fead87dd" alt="../_images/DAGs.png"
Fig. 12 Cases of DAGs on 3 nodes.#
Case (a): 1 \(\rightarrow\) 2 \(\rightarrow\) 3#
data:image/s3,"s3://crabby-images/d492e/d492e27c270cfbe4f6957f49eac7f3ae2e0251af" alt="../_images/case_a.png"
Fig. 13 Case (a).#
In this first case, \(X_1\) is the parent of \(X_2\) and \(X_2\) is the parent of \(X_3\). The first thing that we can derive from the DAG is that \(X_1 \cancel{\perp} X_2\) and \(X_2 \cancel{\perp} X_3\) as there are direct statements of the dependence between the variables. Moreover, due to the direction of the arrows in this case as a sequence, \(X_1 \cancel{\perp} X_3\). This is, the dependence between \(X_1\) and \(X_3\) is inferred through the DAG. However, \(X_1\) and \(X_3\) become independent once I know the value of \(X_2\) (mathematically \(X_1 \perp X_3|X_2\)). Therefore, the rank correlation matrix that would be derived from this NPBN would be complete as
Let’s see it with examples!#
The first example is with a discrete Bayesian Network (DBN) and, thus, with discrete variables. In the Figure below, you have a DBN with the following nodes: (1) the probability that it snows, (2) the probability that the train is late, and (3) the probability that you, as train used, are late. If it snows, it is much more likely that the train is late due to the metheorological conditions and, thus, that you are also late. Therefore, (1) and (3) are not independent, as we saw before. However, once I know that the train is late, the probability of being late is not affected any more about the fact that it snows or not. This is, (1) and (3) become independent once I know (2), as we saw before
data:image/s3,"s3://crabby-images/4fbc6/4fbc6b0ea984d5b7369710cd2436d551cfba42ec" alt="../_images/discrete_case_a.png"
Fig. 14 Discrete case (a).#
Let’s make the variables continuous now. In the Figure below, you have a NPBN with the following random variables in the nodes: (1) accumulated milimeters of snow a day, (2) minutes of delay of a train, and (3) minutes of delay to my destination. They are, thus, continuous variables quantified with continuous distribution functions.
data:image/s3,"s3://crabby-images/104cf/104cfee9d34cd6a5b01fa4c89571e0d9acb93cce" alt="../_images/continuous_case_a.png"
Fig. 15 Continuous case (a).#
If the probability of high values of snow increase for a day, the probabilitiy of a higher delay of the train also increases and, together with it, also the probability of higher delay for me. However, once I know how many minutes the train is late, the amount of snow does not have an influence on my expected delay.
Case (b): 1 \(\leftarrow\) 2 \(\rightarrow\) 3#
data:image/s3,"s3://crabby-images/ea01a/ea01ab6467f9178f602ab83a95a8d3aa176842ad" alt="../_images/case_b.png"
Fig. 16 Case (b).#
In this second case, \(X_2\) is the parent of both \(X_1\) and \(X_3\). The first thing that we can derive from the DAG is that \(X_1 \cancel{\perp} X_2\) and \(X_2 \cancel{\perp} X_3\) as there are direct statements of the dependence between the variables. Moreover, since both \(X_1\) and \(X_3\) depend on \(X_2\), they are not independent (\(X_1 \cancel{\perp} X_3\)). This is, the dependence between \(X_1\) and \(X_3\) is inferred through the DAG. However, \(X_1\) and \(X_3\) become independent once I know the value of \(X_2\) (mathematically \(X_1 \perp X_3|X_2\)). Therefore, the rank correlation matrix that would be derived from this NPBN would be complete as
Let’s see it with examples!#
The first example is with discrete variables. In the Figure below, you have a DBN with the following nodes: (1) the probability of having lung cancer, (2) the probability of smoking, and (3) the probability of having bronchitis. According to several studies, smokers are more prone to develing lung conditions such as bronchitis and lung cancer. Thus, having bronchitis makes more likely to be a smoker and, thus, develop lung cancer (\(X_1 \cancel{\perp} X_3\)). However, once I know that somebody smokes, the fact of having bronchitis does not provide me with more information to determine what is the probability of that person having lung cancer (\(X_1 \perp X_3| X_2\))
data:image/s3,"s3://crabby-images/71a0e/71a0e5147838f50ece612d0904d679b33d7eec18" alt="../_images/discrete_case_b.png"
Fig. 17 Discrete case (b).#
Let’s make the variables continuous now. In the Figure below, you have a NPBN with the following random variables in the nodes: (1) forces on buildings due to wind, (2) wind speed, and (3) wave height of the wind-generated waves. They are, thus, continuous variables quantified with continuous distribution functions.
data:image/s3,"s3://crabby-images/2c26a/2c26aab5ba9880616ec5ffa4498e54d3bb804be6" alt="../_images/continuous_case_b.png"
Fig. 18 Continuous case (b).#
When the wind blows along a coastal city, it generates waves near the shore and also impacts the building in the city generating forces on them. The higher the wind speed, the more probable become high forces on the buildings and high wave heights. High wind forces on the buildings make more likely high wind speeds and, thus, high wave heights. Therefore, \(X_1 \cancel{\perp} X_3\). However, once I know the value of the wind speed, observing high values of the wave height does not provide me with more information about the wind forces on the buildings (\(X_1 \perp X_3|X_2\)).
Case (c): 1 \(\rightarrow\) 2 \(\leftarrow\) 3#
data:image/s3,"s3://crabby-images/dd23f/dd23f2f56207717fd410125e04b93966b19d2470" alt="../_images/case_c.png"
Fig. 19 Case (c).#
In the third case, \(X_2\) is the child of both \(X_1\) and \(X_3\). The first thing that we can derive from the DAG is that \(X_1 \cancel{\perp} X_2\) and \(X_2 \cancel{\perp} X_3\) as there are direct statements of the dependence between the variables. \(X_2\) depends on \(X_1\) and \(X_3\), but not the other way around. Therefore, \(X_1 \perp X_3\). Nevertheless, \(X_1\) and \(X_3\) become dependent once I know the value of \(X_2\) (mathematically \(X_1 \cancel{\perp} X_3|X_2\)). Therefore, the rank correlation matrix that would be derived from this NPBN would be complete as
Let’s see it with examples!#
The first example is with discrete variables. In the Figure below, you have a DBN with the following nodes: (1) the probability of burglary, (2) the probability of the alarm of the house ringing, and (3) the probability of an earthquake happening. The fact that the alarm rings can be then cause by either an earthquake or burglary. However, the fact that there is an earthquake has nothing to do with the probability of somebody breaking in the house (\(X_1 \perp X_3\)). This changes once the alarm rings: if there is an earthquake, it is potentially the cause of making the alarm ring and, thus, burglary is less likely (\(X_1 \cancel{\perp} X_3|X_2\))
data:image/s3,"s3://crabby-images/89e9c/89e9c868b573d47f89bce239fc07ce5aaed2121a" alt="../_images/discrete_case_c.png"
Fig. 20 Discrete case (c).#
Let’s make the variables continuous now. In the Figure below, you have a NPBN with the following random variables in the nodes: (1) axle loads of trucks over a bridge, (2) quantitative damage that the elements of the bridge suffer, and (3) the intensity of the earthquake. They are, thus, continuous variables quantified with continuous distribution functions.
data:image/s3,"s3://crabby-images/824f3/824f332560f5e4f52ebd4d4691ea6f432f0b21a7" alt="../_images/continuous_case_c.png"
Fig. 21 Example of a NPBN for case (c) with the two possible combinations of (un)conditional rank correlations in the arcs.#
When the trucks go over the bridge, the loads induced through their axles go deteriorating over time the bridge and, thus, gradually increasing the damage. The higher the axle loads, more likely to observe higher values of deterioration in the bridge. The same holds for the earthquake intensity: the higher the intensity, the more likely to observe higher values of damage in the bridge. However, the intensity of the earthquake does not have anything to do with the axle loads of the trucks. Nevertheless, if I observe a high value of damage in the bridge, this change. If I also observe a high intensity of an earthquake, it is likely that the cause of the damage is the earthquake and, thus, it is less likely to observe high values of the axle loads.
It’s your turn now!#
Define the rank correlation matrix for the following graph.
Solution