data: a data frame containing at least the input variables.
sample: a character or factor column in the data data frame that contains the sample names.
grouping: a character column in the data data frame that contains either precursor or peptide identifiers.
intensity: a numeric column in the data data frame that contains any intensity intensity values that missingness should be determined for.
digestion: optional, a character column in the data data frame that indicates the mode of digestion (limited proteolysis or tryptic digest). Alternatively, any other variable by which the data should be split can be provided.
plot: a logical value that indicates whether the result should be plotted.
interactive: a logical value that specifies whether the plot should be interactive (default is FALSE).
Returns
A bar plot that displays the percentage of data completeness over all samples. If plot = FALSE a data frame is returned. If interactive = TRUE, the plot is interactive.
Examples
set.seed(123)# Makes example reproducible# Create example datadata <- create_synthetic_data( n_proteins =100, frac_change =0.05, n_replicates =3, n_conditions =2, method ="effect_random")# Determine data completenessqc_data_completeness( data = data, sample = sample, grouping = peptide, intensity = peptide_intensity_missing, plot =FALSE)# Plot data completenessqc_data_completeness( data = data, sample = sample, grouping = peptide, intensity = peptide_intensity_missing, plot =TRUE)