plus_states function

Finds the states that have a positive reward.
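
The description above does not give a signature or input format, so the following is only a minimal sketch of how such a lookup might work. It assumes that rewards are supplied as a dictionary mapping each state to a numeric reward, and that "positive" means strictly greater than zero. The parameter name `rewards` and the example data are hypothetical.

```python
from typing import Dict, Hashable, List


def plus_states(rewards: Dict[Hashable, float]) -> List[Hashable]:
    """Return the states whose reward is strictly positive.

    Assumption: `rewards` maps state -> reward. The real function may
    take a different input (for example, an environment or a matrix).
    """
    # Dictionaries preserve insertion order, so the result keeps input order.
    return [state for state, reward in rewards.items() if reward > 0]


# Hypothetical example data: state name -> reward.
rewards = {"s0": 0.0, "s1": 1.0, "s2": -1.0, "s3": 0.5}
positive = plus_states(rewards)
print(positive)  # prints ['s1', 's3']
```

In this sketch, states with a reward of exactly zero are excluded. If zero-reward states should count, the comparison would need to be `>=` instead.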