bivariate_missingness dataset

Systematic Analysis of Bivariate Missing Data Problems

Systematic Analysis of Bivariate Missing Data Problems

This data set contains the results of a systematic analysis of all missing data problems of two variables. Each problem is associated with a graph containing two vertices, XX and YY, and their response indicators, RXR_X and RYR_Y. data

Format

A data frame with 6144 rows and 8 variables:

  • graph: the graph of the instance.

  • nedges: number of edges in the graph (directed and bidirected).

  • arrowXtoY: whether the graph contains an arrow from XX to YY or not.

  • jointXY: identifiability of the joint distribution of XX and YY

  • marginX: identifiability of the marginal distribution of XX.

  • marginY: identifiability of the marginal distribution of YY.

  • YcondX: identifiability of the conditional distribution of YY

     given $X$.
    
  • YdoX: identifiability of the causal effect of XX on YY.

Source

Tikka et al. https://arxiv.org/abs/1902.01073

data(bivariate_missingness)
  • Maintainer: Santtu Tikka
  • License: GPL (>= 3)
  • Last published: 2024-07-16

About the dataset

  • Number of rows: 6144
  • Number of columns: 8
  • Class: data.frame

Column names and types

  • graph:factor
  • nedges:integer
  • arrowXtoY:logical
  • jointXY:logical
  • marginX:logical
  • marginY:logical
  • YcondX:logical
  • YdoX:logical