Function to retrieve and log Q values as reinforcement learning (RL) progresses.
The function records Q values during the model update. These values are used by another function. It was developed to help estimate the best control parameters during the optimisation process.
rl_log_progress(x, states, actions, control)
Arguments
x: dataframe containing trading results
states: selected states of the system
actions: selected actions executed in the environment
control: control parameters as defined in the ReinforcementLearning package
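A minimal sketch of how the arguments might be assembled. The state and action labels and the numeric values are illustrative assumptions. The control list uses the alpha, gamma and epsilon names from the ReinforcementLearning package, and data_trades is assumed to be a sample trade table shipped with the lazytrade package.

```r
# Illustrative labels only; use the states and actions of your own system.
states  <- c("tradewin", "tradeloss")
actions <- c("ON", "OFF")

# Control parameters named as in the ReinforcementLearning package:
# alpha = learning rate, gamma = discount factor, epsilon = exploration rate.
control <- list(alpha = 0.7, gamma = 0.3, epsilon = 0.1)

# The call runs only when lazytrade is installed.
# data_trades is assumed to be the sample data set bundled with that package.
if (requireNamespace("lazytrade", quietly = TRUE)) {
  library(lazytrade)
  data(data_trades)
  q_log <- rl_log_progress(x = data_trades, states = states,
                           actions = actions, control = control)
  print(head(q_log))
}
```

Varying alpha, gamma and epsilon across several calls and comparing the resulting logs is one way to use the output when searching for suitable control parameters.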
Returns
A dataframe logging the RL model's reward sequences during the model update.
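To make the idea of logging Q values during the model update concrete, here is a small base-R sketch. It is not the package's implementation: log_q_updates, the transitions table and all column names are hypothetical, and the update rule is the standard one-step Q-learning update.

```r
# Hypothetical helper: apply one-step Q-learning updates over a sequence of
# transitions and record a snapshot of the full Q table after every update.
log_q_updates <- function(transitions, states, actions, control) {
  Q <- matrix(0, nrow = length(states), ncol = length(actions),
              dimnames = list(states, actions))
  logs <- vector("list", nrow(transitions))
  for (i in seq_len(nrow(transitions))) {
    s  <- transitions$State[i]
    a  <- transitions$Action[i]
    r  <- transitions$Reward[i]
    s2 <- transitions$NextState[i]
    # Q(s, a) <- Q(s, a) + alpha * (r + gamma * max Q(s2, .) - Q(s, a))
    Q[s, a] <- Q[s, a] +
      control$alpha * (r + control$gamma * max(Q[s2, ]) - Q[s, a])
    # Flatten the Q table to long format and tag it with the update step.
    snap <- as.data.frame(as.table(Q), stringsAsFactors = FALSE)
    names(snap) <- c("State", "Action", "Q")
    snap$step <- i
    logs[[i]] <- snap
  }
  do.call(rbind, logs)
}

# Made-up transitions, for illustration only.
transitions <- data.frame(
  State     = c("tradewin", "tradeloss", "tradewin"),
  Action    = c("ON", "ON", "OFF"),
  Reward    = c(10, -5, 0),
  NextState = c("tradeloss", "tradewin", "tradewin"),
  stringsAsFactors = FALSE
)

q_log <- log_q_updates(transitions,
                       states  = c("tradewin", "tradeloss"),
                       actions = c("ON", "OFF"),
                       control = list(alpha = 0.7, gamma = 0.3))
```

Each row of q_log holds one state-action pair, its Q value and the update step at which it was recorded, so the evolution of every Q value can be plotted or compared across different control settings.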