estimate function

Estimate/variance of policy evaluation via non-contextual weighting.

Estimate/variance of policy evaluation via non-contextual weighting.

Computes the estimate and variance of a policy evaluation based on non-contextual weights, AIPW scores, and a policy matrix.

estimate(w, gammahat, policy)

Arguments

  • w: Numeric vector. Non-contextual weights, length A. Must not contain NA values.
  • gammahat: Numeric matrix. AIPW scores, shape [A, K]. Must not contain NA values.
  • policy: Numeric matrix. Policy matrix π(Xt,w)\pi(X_t, w), shape [A, K]. Must have the same shape as gammahat and must not contain NA values.

Returns

Named numeric vector with elements estimate and var, representing the estimated policy value and the variance of the estimate, respectively.

Examples

w <- c(0.5, 1, 0.5, 1.5) scores <- matrix(c(0.5, 0.8, 0.6, 0.3, 0.9, 0.2, 0.5, 0.7, 0.4, 0.8, 0.2, 0.6), ncol = 3, byrow = TRUE) policy <- matrix(c(0.2, 0.3, 0.5, 0.6, 0.1, 0.3, 0.4, 0.5, 0.1, 0.2, 0.7, 0.1), ncol = 3, byrow = TRUE) gammahat <- scores - policy estimate(w = w, gammahat = gammahat, policy = policy)