Generates a comprehensive evaluation of the performance of a given Coxmos evaluation object from eval_Coxmos_models(), offering both statistical tests and visual plots for assessment.
subtitle_size_text: Numeric. Text size for subtitle (default: 12).
legend.position: Character. Legend position. Must be one of the following: "top", "bottom", "right" or "left (default: "right").
legend_title: Character. Legend title (default: "Method").
legend_size_text: Numeric. Text size for legend title (default: 12).
x_axis_size_text: Numeric. Text size for x axis (default: 10).
y_axis_size_text: Numeric. Text size for y axis (default: 10).
label_x_axis_size: Numeric. Text size for x label axis (default: 10).
label_y_axis_size: Numeric. Text size for y label axis (default: 10).
txt.x.angle: Numeric. Angle of X text (default: 0).
Returns
A list of lst_eval_results length. Each element is a list of three elements. lst_plots: A list of two plots. The evaluation over the time, and the extension adding the mean or median on the right. lst_plot_comparisons: A list of comparative boxplots by t.test, anova, wilcoxon, kruscal. df: Data.frame of evaluation result.
Details
The plot_evaluation function is designed to facilitate a rigorous evaluation of the performance of models, specifically in the context of survival analysis. This function is tailored to work with a Coxmos evaluation object, which encapsulates the results of survival models. The primary objective is to provide both statistical and visual insights into the model's performance.
The function offers flexibility in the evaluation metric, allowing users to choose between the Area Under the Curve (AUC) and the Brier score. The chosen metric is then evaluated based on either its mean or median value, as specified by the "pred.attr" parameter. The resulting plots can be tailored to display continuous performance over time or aggregated mean performance, based on the "type" parameter.
A salient feature of this function is its ability to conduct statistical tests to compare the performance across different methods. Supported tests include the t-test, ANOVA, Wilcoxon rank-sum test, and Kruskal-Wallis test. These tests provide a quantitative measure of the differences in performance, aiding in the objective assessment of the models.
The visual outputs are generated using the 'ggplot2' package, ensuring high-quality and interpretable plots. The function also offers extensive customization options for the plots, including axis labels, title, and text sizes, ensuring that the outputs align with the user's preferences and the intended audience's expectations.