Predictive Value Weighting Estimation of the Binary Mediator Misclassification Model
Predictive Value Weighting Estimation of the Binary Mediator Misclassification Model
Estimate β, γ, and θ parameters from the true mediator, observed mediator, and outcome mechanisms, respectively, in a binary mediator misclassification model using a predictive value weighting approach.
Mstar: A numeric vector of indicator variables (1, 2) for the observed mediator M*. There should be no NA terms. The reference category is 2.
outcome: A vector containing the outcome variables of interest. There should be no NA terms.
outcome_distribution: A character string specifying the distribution of the outcome variable. Options are "Bernoulli", "Poisson", or "Normal".
interaction_indicator: A logical value indicating if an interaction between x and m should be used to generate the outcome variable, y.
x_matrix: A numeric matrix of predictors in the true mediator and outcome mechanisms. x_matrix should not contain an intercept and no values should be NA.
z_matrix: A numeric matrix of covariates in the observation mechanism. z_matrix should not contain an intercept and no values should be NA.
c_matrix: A numeric matrix of covariates in the true mediator and outcome mechanisms. c_matrix should not contain an intercept and no values should be NA.
beta_start: A numeric vector or column matrix of starting values for the β
parameters in the true mediator mechanism. The number of elements in beta_start
should be equal to the number of columns of x_matrix and c_matrix plus 1. Starting values should be provided in the following order: intercept, slope coefficient for the x_matrix term, slope coefficient for first column of the c_matrix, ..., slope coefficient for the final column of the c_matrix.
gamma_start: A numeric vector or matrix of starting values for the γ
parameters in the observation mechanism. In matrix form, the gamma_start matrix rows correspond to parameters for the M* = 1
observed mediator, with the dimensions of z_matrix plus 1, and the gamma parameter matrix columns correspond to the true mediator categories M∈{1,2}. A numeric vector for gamma_start is obtained by concatenating the gamma matrix, i.e. gamma_start <- c(gamma_matrix). Starting values should be provided in the following order within each column: intercept, slope coefficient for first column of the z_matrix, ..., slope coefficient for the final column of the z_matrix.
theta_start: A numeric vector or column matrix of starting values for the θ
parameters in the outcome mechanism. The number of elements in theta_start
should be equal to the number of columns of x_matrix and c_matrix plus 2 (if interaction_indicator is FALSE) or 3 (if interaction_indicator is TRUE). Starting values should be provided in the following order: intercept, slope coefficient for the x_matrix term, slope coefficient for the mediator m term, slope coefficient for first column of the c_matrix, ..., slope coefficient for the final column of the c_matrix, and, optionally, slope coefficient for xm).
tolerance: A numeric value specifying when to stop estimation, based on the difference of subsequent log-likelihood estimates. The default is 1e-7.
max_em_iterations: A numeric value specifying when to stop estimation, based on the difference of subsequent log-likelihood estimates. The default is 1e-7.
em_method: A character string specifying which EM algorithm will be applied. Options are "em", "squarem", or "pem". The default and recommended option is "squarem".
Returns
COMMA_PVW returns a data frame containing four columns. The first column, Parameter, represents a unique parameter value for each row. The next column contains the parameter Estimates. The third column, Convergence, reports whether or not the algorithm converged for a given parameter estimate. The final column, Method, reports that the estimates are obtained from the "PVW" procedure.
Details
Note that this method can only be used for binary outcome models.