Example data generating process from Offline Multi-Action Policy Learning: Generalization and Optimization
Useful links