cram_learning function

Cram Policy Learning