cram_bandit_est function

Cram Bandit Policy Value Estimate