optimal_tree_policy_maximizer function

Learner for training Optimal Policy Trees where the policy should aim to maximize outcomes