Occurrences Report

Context, Background, and Motivation

  • Occurrences refer to the number of observations in a cell of a Probability Table or a Conditional Probability Table.

  • The number of cells in a Conditional Probability Table is a function of the following parameters:

    • The number of Parent Nodes.

    • The number of Node States of the Parent Nodes.

    • The number of Node States of the Child Nodes.

  • Here, Age is discretized into 4 states and BMI into 6 for a total of 48 cells in the table associated with BMI.

  • The numbers in each cell are counts of observations or Occurrences. In our case, each Occurrence represents one person from the sample of 200 individuals.

  • For instance, the Occurrence table associated with BMI states that Count(BMIā‰¤20 | Ageā‰¤30)=2. So, we have only two Occurrences of that particular condition, i.e., only two individuals who are 30 years old or younger have a BMI of 20 or lower.

  • To create a Bayesian network, BayesiaLab needs to translate the Occurrences in each cell into probabilities.

  • However, with a small number of Occurrences, that can become an issue.

  • We have repeatedly referenced a rule of thumb, which says that we should have a minimum of 5 Occurrences per cell to estimate a Probability Table or Conditional Probability Table reliably.

  • In our example, several cells fall below the recommended minimum.

  • Such deficiencies are easy to recognize in a small example, but in more complex networks, it can be difficult to spot such weaknesses.

  • That is the motivation for the Occurrence Report. It displays all tables in a network and visually highlights potentially problematic cells with low Occurrences.

Usage

  • Select the nodes you want to include in the Occurrences Report. I none are selected, the analysis will be performed on all nodes.

  • Select Main Menu > Network > Reports > Reports> Occurrences to create the Occurrences Report.

  • The Occurrence Report opens up and shows all Probability Tables and Conditional Probability Tables.

  • The fields in the report are color-coded to highlight potential issues:

    • Cells with 0 Occurrences are marked in red.

    • Cells with 5 Occurrences are marked in yellow. This is generally considered the minimum number of Occurrences.

    • Cells with 40 or more Occurrences are marked in green.

  • Furthermore, the Occurrence Report calculates the mean number of Occurrences for each row in all Probability Tables and Conditional Probability Tables.

  • If the mean value of any row in any of the nodes drops below the threshold of 5, the corresponding nodes are called out at the top of the report.

Last updated

Logo

Bayesia USA

info@bayesia.us

Bayesia S.A.S.

info@bayesia.com

Bayesia Singapore

info@bayesia.com.sg

Copyright Ā© 2024 Bayesia S.A.S., Bayesia USA, LLC, and Bayesia Singapore Pte. Ltd. All Rights Reserved.