BGGM: Bayesian Gaussian Graphical Models
The R
package BGGM provides tools for making Bayesian inference in Gaussian graphical models (GGM). The methods are organized around two general approaches for Bayesian inference: (1) estimation \insertCite Williams2019BGGM and (2) hypothesis testing \insertCite Williams2019_bfBGGM. The key distinction is that the former focuses on either the posterior or posterior predictive distribution, whereas the latter focuses on model comparison with the Bayes factor.
The methods in BGGM build upon existing algorithms that are well-known in the literature. The central contribution of BGGM is to extend those approaches:
Bayesian estimation with the novel matrix-F prior distribution \insertCite Mulder2018BGGM.
estimate
.Bayesian hypothesis testing with the novel matrix-F prior distribution \insertCite Mulder2018BGGM.
explore
.confirm
.Comparing GGMs \insertCite williams2020comparingBGGM
ggm_compare_estimate
.ggm_compare_ppc
.ggm_compare_explore
.ggm_compare_confirm
.Extending inference beyond the conditional (in)dependence structure
Predictability with Bayesian variance explained \insertCite gelman_r2_2019BGGM
predictability
.
Posterior uncertainty in the partial correlations estimate
.
Custom Network Statistics roll_your_own
.
Furthermore, the computationally intensive tasks are written in c++
via the R
package Rcpp \insertCite eddelbuettel2011rcppBGGM and the c++
library Armadillo \insertCite sanderson2016armadilloBGGM, there are plotting functions for each method, control variables can be included in the model, and there is support for missing values bggm_missing
.
Supported Data Types :
Additional Features :
The primary focus of BGGM
is Gaussian graphical modeling (the inverse covariance matrix).
The residue is a suite of useful methods not explicitly for GGMs:
zero_order_cors
.estimate
).coef.estimate
).Note on Conditional (In)dependence Models for Latent Data :
All of the data types (besides continuous) model latent data. That is, unobserved (latent) data is assumed to be Gaussian. For example, a tetrachoric correlation (binary data) is a special case of a polychoric correlation (ordinal data). Both capture relations between "theorized normally distributed continuous latent variables" (Wikipedia). In both instances, the corresponding partial correlation between observed variables is conditioned on the remaining variables in the latent space. This implies that interpretation is similar to continuous data, but with respect to latent variables. We refer interested users to \insertCite @page 2364, section 2.2, in @webb2008bayesian;textualBGGM.
High Dimensional Data?
BGGM was built specifically for social-behavioral scientists. Of course, the methods can be used by all researchers. However, there is currently not support for high-dimensional data (i.e., more variables than observations) that are common place in the genetics literature. These data are rare in the social-behavioral sciences. In the future, support for high-dimensional data may be added to BGGM .
The R
package BGGM provides tools for making Bayesian inference in Gaussian graphical models (GGM). The methods are organized around two general approaches for Bayesian inference: (1) estimation \insertCite Williams2019BGGM and (2) hypothesis testing \insertCite Williams2019_bfBGGM. The key distinction is that the former focuses on either the posterior or posterior predictive distribution, whereas the latter focuses on model comparison with the Bayes factor.
The methods in BGGM build upon existing algorithms that are well-known in the literature. The central contribution of BGGM is to extend those approaches:
Bayesian estimation with the novel matrix-F prior distribution \insertCite Mulder2018BGGM.
estimate
.Bayesian hypothesis testing with the novel matrix-F prior distribution \insertCite Mulder2018BGGM.
explore
.confirm
.Comparing GGMs \insertCite williams2020comparingBGGM
ggm_compare_estimate
.ggm_compare_ppc
.ggm_compare_explore
.ggm_compare_confirm
.Extending inference beyond the conditional (in)dependence structure
Predictability with Bayesian variance explained \insertCite gelman_r2_2019BGGM
predictability
.
Posterior uncertainty in the partial correlations estimate
.
Custom Network Statistics roll_your_own
.
Furthermore, the computationally intensive tasks are written in c++
via the R
package Rcpp \insertCite eddelbuettel2011rcppBGGM and the c++
library Armadillo \insertCite sanderson2016armadilloBGGM, there are plotting functions for each method, control variables can be included in the model, and there is support for missing values bggm_missing
.
Supported Data Types :
Additional Features :
The primary focus of BGGM
is Gaussian graphical modeling (the inverse covariance matrix).
The residue is a suite of useful methods not explicitly for GGMs:
zero_order_cors
.estimate
).coef.estimate
).Note on Conditional (In)dependence Models for Latent Data :
All of the data types (besides continuous) model latent data. That is, unobserved (latent) data is assumed to be Gaussian. For example, a tetrachoric correlation (binary data) is a special case of a polychoric correlation (ordinal data). Both capture relations between "theorized normally distributed continuous latent variables" (Wikipedia). In both instances, the corresponding partial correlation between observed variables is conditioned on the remaining variables in the latent space. This implies that interpretation is similar to continuous data, but with respect to latent variables. We refer interested users to \insertCite @page 2364, section 2.2, in @webb2008bayesian;textualBGGM.
High Dimensional Data?
BGGM was built specifically for social-behavioral scientists. Of course, the methods can be used by all researchers. However, there is currently not support for high-dimensional data (i.e., more variables than observations) that are common place in the genetics literature. These data are rare in the social-behavioral sciences. In the future, support for high-dimensional data may be added to BGGM .
\insertAllCited