Determines the optimal action for a policy (solved POMDP) for a given belief at a given epoch.
optimal_action(model, belief =NULL, epoch =1)
Arguments
model: a solved POMDP .
belief: The belief (probability distribution over the states) as a vector or a matrix with multiple belief states as rows. If NULL, then the initial belief of the model is used.
epoch: what epoch of the policy should be used. Use 1 for converged policies.
Returns
The name of the optimal action.
Examples
data("Tiger")Tiger
sol <- solve_POMDP(model = Tiger)# these are the statessol$states
# belief that tiger is to the leftoptimal_action(sol, c(1,0))optimal_action(sol,"tiger-left")# belief that tiger is to the rightoptimal_action(sol, c(0,1))optimal_action(sol,"tiger-right")# belief is 50/50optimal_action(sol, c(.5,.5))optimal_action(sol,"uniform")# the POMDP is converged, so all epoch give the same result.optimal_action(sol,"tiger-right", epoch =10)