n_pos function

Count the states whose reward is positive.
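
The source gives only the function name and its purpose, so the following is a minimal sketch rather than the original implementation. It assumes the rewards arrive as an iterable with one number per state, and that "positive" means strictly greater than zero (a reward of exactly 0 is not counted). The parameter name `rewards` and the sample data are hypothetical.

```python
from typing import Iterable


def n_pos(rewards: Iterable[float]) -> int:
    """Return how many states have a strictly positive reward.

    Assumption: `rewards` holds one reward value per state.
    """
    # Add 1 for every reward greater than zero; zeros and negatives are skipped.
    return sum(1 for r in rewards if r > 0)


# Hypothetical example: five states, two of which have a positive reward.
example_rewards = [0.0, 1.5, -2.0, 3.0, 0.0]
print(n_pos(example_rewards))  # prints 2
```

If the rewards are instead stored as a mapping from state to reward, the same function could be called as `n_pos(mapping.values())`.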