aw_estimate function

Estimate policy value via non-contextual adaptive weighting.