mr_cML function

Constrained maximum likelihood (cML) method

Constrained maximum likelihood (cML) method

Constrained maximum likelihood (cML) based Mendelian Randomization method robust to both correlated and uncorrelated pleiotropy. methods

mr_cML( object, MA = TRUE, DP = TRUE, K_vec = 0:(length(object@betaX) - 2), random_start = 0, num_pert = 200, random_start_pert = 0, maxit = 100, random_seed = 314, n, Alpha = 0.05 ) ## S4 method for signature 'MRInput' mr_cML( object, MA = TRUE, DP = TRUE, K_vec = 0:(length(object@betaX) - 2), random_start = 0, num_pert = 200, random_start_pert = 0, maxit = 100, random_seed = 314, n, Alpha = 0.05 )

Arguments

  • object: An MRInput object.
  • MA: Whether model average is applied or not. Default is TRUE.
  • DP: Whether data perturbation is applied or not. Default is TRUE.
  • K_vec: Set of candidate K's, the constraint parameter representing number of invalid IVs. Default is from 0 to (#IV - 2).
  • random_start: Number of random starting points for cML, default is 0.
  • num_pert: Number of perturbation when DP is TRUE, default is 200.
  • random_start_pert: Number of random start points for cML with data perturbation, default is 0.
  • maxit: Maximum number of iterations for each optimization. Default is 100.
  • random_seed: Random seed, default is 314. When random_seed=NULL, no random seed will be used and the results may not be reproducible.
  • n: Sample size. When sample sizes of GWAS for exposure and outcome are different, and/or when sample sizes of different SNPs are different, the smallest sample size is recommended to get conservative result and avoid type-I error. See reference for more discussions.
  • Alpha: Significance level for the confidence interval for estimate, default is 0.05.

Returns

The output from the function is an MRcML object containing:

  • Exposure: A character string giving the name given to the exposure.

  • Outcome: A character string giving the name given to the outcome.

  • Estimate: Estimate of theta.

  • StdError: Standard error of estimate.

  • Pvalue: p-value of estimate.

  • BIC_invalid: Set of selected invalid IVs if cML-BIC is performed, i.e. without MA or DP.

  • GOF1_p: p-value of the first goodness-of-fit test.

  • GOF2_p: p-value of the second goodness-of-fit test.

  • SNPs: The number of SNPs that were used in the calculation.

  • Alpha: Significance level for the confidence interval for estimate, default is 0.05.

  • CILower: Lower bound of the confidence interval for estimate.

  • CIUpper: Upper bound of the confidence interval for estimate.

  • MA: Indicator of whether model average is applied.

  • DP: Indicator of whether data perturbation is applied.

Details

The MRcML method selects invalid IVs with correlated and/or uncorrelated peliotropic effects using constrained maximum likelihood. cML-BIC gives results of the selected model with original data, while cML-MA-BIC averages over all candidate models. cML-BIC-DP and cML-MA-BIC-DP are the versions with data-perturbation to account for selection uncertainty when many invalid IVs have weak pleiotropic effects.

When DP is performed, two goodness-of-fit (GOF) tests are developed to check whether the model-based and DP- based variance estimates converge to the same estimate. Small p-values of GOF tests indicate selection uncertainty is not ignorable, and results from DP is more reliable. See reference for more details.

As the constrained maximum likelihood function is non-convex, multiple random starting points could be used to find a global minimum. For some starting points the algorithm may not converge and a warning message will be prompted, typically this will not affect the results.

Examples

# Perform cML-MA-BIC-DP: mr_cML(mr_input(bx = ldlc, bxse = ldlcse, by = chdlodds, byse = chdloddsse), num_pert=5, MA = TRUE, DP = TRUE, n = 17723) # num_pert is set to 5 to reduce computational time # the default value of 200 is recommended in practice # Perform cML-BIC-DP: mr_cML(mr_input(bx = ldlc, bxse = ldlcse, by = chdlodds, byse = chdloddsse), MA = TRUE, DP = FALSE,, n = 17723)

References

Xue, H., Shen, X., & Pan, W. (2021). Constrained maximum likelihood-based Mendelian randomization robust to both correlated and uncorrelated pleiotropic effects. The American Journal of Human Genetics, 108(7), 1251-1269.

  • Maintainer: Stephen Burgess
  • License: GPL-2 | GPL-3
  • Last published: 2024-04-12

Useful links