cram_policy function

Cram Policy: Efficient Simultaneous Policy Learning and Evaluation