Estimate policy value via non-contextual adaptive weighting.
Estimates the value of a policy from AIPW scores and a policy matrix using non-contextual adaptive weighting. If evalwts is NULL (the default), all observations receive equal weight.
aw_estimate(scores, policy, evalwts = NULL)
Arguments
scores: Numeric matrix. AIPW scores, shape [A, K], where A is the number of observations and K is the number of arms. Must not contain NA values.
policy: Numeric matrix. Policy matrix π(X_t, w), shape [A, K]. Must have the same shape as scores and must not contain NA values.
evalwts: Optional numeric vector. Non-contextual adaptive weights h_t, length A, or NULL. Default is NULL.
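Examples
The following is a minimal sketch of how the arguments fit together. It assumes the estimate is a weighted average of the per-observation policy scores (each row of scores multiplied by the matching row of policy and summed over arms); the page above does not state the formula, so treat that as an assumption. aw_estimate_sketch is a hypothetical stand-in written in base R, not the package source, and the toy matrices are made up for illustration.

```r
# Hypothetical re-implementation for illustration only; not the package source.
# Assumption: the estimate is sum(h_t * sum_w scores[t, w] * policy[t, w]) / sum(h_t).
aw_estimate_sketch <- function(scores, policy, evalwts = NULL) {
  # Mirror the documented input requirements: same shape, no NA values
  stopifnot(is.matrix(scores), is.matrix(policy),
            identical(dim(scores), dim(policy)),
            !anyNA(scores), !anyNA(policy))
  if (is.null(evalwts)) {
    evalwts <- rep(1, nrow(scores))  # equal weights when none are supplied
  }
  stopifnot(length(evalwts) == nrow(scores))
  # Per-observation policy score: sum over arms of score times policy probability
  per_obs <- rowSums(scores * policy)
  # Weighted average across observations
  sum(evalwts * per_obs) / sum(evalwts)
}

# Toy inputs: A = 3 observations, K = 2 arms
scores <- matrix(c(0.5, 1.0,
                   0.2, 0.8,
                   0.9, 0.1), ncol = 2, byrow = TRUE)
policy <- matrix(c(1.0, 0.0,
                   0.0, 1.0,
                   0.5, 0.5), ncol = 2, byrow = TRUE)

est_equal    <- aw_estimate_sketch(scores, policy)                       # evalwts = NULL
est_weighted <- aw_estimate_sketch(scores, policy, evalwts = c(1, 2, 1)) # custom weights
```

With the package loaded, the same inputs would be passed as aw_estimate(scores, policy) or aw_estimate(scores, policy, evalwts = c(1, 2, 1)), following the usage line above.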