optimal_tree_policy_minimizer function

Learner for training Optimal Policy Trees where the policy should aim to minimize outcomes