dlookr0.6.3 package

Tools for Data Diagnosis, Exploration, Transformation

binning

Binning the Numeric Data

binning_by

Optimal Binning for Scoring Modeling

binning_rgr

Binning by recursive information gain ratio maximization

compare_category.data.frame

Compare categorical variables

compare_numeric.data.frame

Compare numerical variables

correlate.data.frame

Compute the correlation coefficient between two variable

cramer

Cramer's V statistic

describe.data.frame

Compute descriptive statistic

describe.tbl_dbi

Compute descriptive statistic

diagnose.data.frame

Diagnose data quality of variables

diagnose.tbl_dbi

Diagnose data quality of variables in the DBMS

diagnose_category.data.frame

Diagnose data quality of categorical variables

diagnose_category.tbl_dbi

Diagnose data quality of categorical variables in the DBMS

diagnose_numeric.data.frame

Diagnose data quality of numerical variables

diagnose_numeric.tbl_dbi

Diagnose data quality of numerical variables in the DBMS

diagnose_outlier.data.frame

Diagnose outlier of numerical variables

diagnose_outlier.tbl_dbi

Diagnose outlier of numerical variables in the DBMS

diagnose_paged_report.data.frame

Reporting the information of data diagnosis

diagnose_paged_report.tbl_dbi

Reporting the information of data diagnosis for table of the DBMS

diagnose_report.data.frame

Reporting the information of data diagnosis

diagnose_report.tbl_dbi

Reporting the information of data diagnosis for table of the DBMS

diagnose_sparese.data.frame

Diagnosis of level combinations of categorical variables

diagnose_web_report.data.frame

Reporting the information of data diagnosis with html

diagnose_web_report.tbl_dbi

Reporting the information of data diagnosis for table of the DBMS with...

dlookr-deprecated

Deprecated functions in package dlookr

dlookr-package

dlookr: Tools for Data Diagnosis, Exploration, Transformation

dlookr_orange_paged

Generate paged HTML document

dlookr_templ_html

dlookr HTML template

eda_paged_report.data.frame

Reporting the information of EDA

eda_paged_report.tbl_dbi

Reporting the information of EDA for table of the DBMS

eda_report.data.frame

Reporting the information of EDA

eda_report.tbl_dbi

Reporting the information of EDA for table of the DBMS

eda_web_report.data.frame

Reporting the information of EDA with html

eda_web_report.tbl_dbi

Reporting the information of EDA for table of the DBMS with html

entropy

Calculate the entropy

extract.bins

Extract bins from "bins"

find_class

Extract variable names or indices of a specific class

find_na

Finding variables including missing values

find_outliers

Finding variables including outliers

find_skewness

Finding skewed variables

get_class

Extracting a class of variables

get_column_info

Describe column of table in the DBMS

get_os

Finding Users Machine's OS

get_percentile

Finding percentile

get_transform

Transform a numeric vector

import_google_font

Import Google Fonts

imputate_na

Impute Missing Values

imputate_outlier

Impute Outliers

jsd

Jensen-Shannon Divergence

kld

Kullback-Leibler Divergence

kurtosis

Kurtosis of the data

normality.data.frame

Performs the Shapiro-Wilk test of normality

normality.tbl_dbi

Performs the Shapiro-Wilk test of normality

overview

Describe overview of data

performance_bin

Diagnose Performance Binned Variable

plot.bins

Visualize Distribution for a "bins" object

plot.compare_category

Visualize Information for an "compare_category" Object

plot.compare_numeric

Visualize Information for an "compare_numeric" Object

plot.correlate

Visualize Information for an "correlate" Object

plot.imputation

Visualize Information for an "imputation" Object

plot.infogain_bins

Visualize Distribution for an "infogain_bins" Object

plot.optimal_bins

Visualize Distribution for an "optimal_bins" Object

plot.overview

Visualize Information for an "overview" Object

plot.performance_bin

Visualize Performance for an "performance_bin" Object

plot.pps

Visualize Information for an "pps" Object

plot.relate

Visualize Information for an "relate" Object

plot.transform

Visualize Information for an "transform" Object

plot.univar_category

Visualize Information for an "univar_category" Object

plot.univar_numeric

Visualize Information for an "univar_numeric" Object

plot_bar_category.data.frame

Plot bar chart of categorical variables

plot_box_numeric.data.frame

Plot Box-Plot of numerical variables

plot_correlate.data.frame

Visualize correlation plot of numerical data

plot_correlate.tbl_dbi

Visualize correlation plot of numerical data

plot_hist_numeric.data.frame

Plot histogram of numerical variables

plot_na_hclust

Combination chart for missing value

plot_na_intersect

Plot the combination variables that is include missing value

plot_na_pareto

Pareto chart for missing value

plot_normality.data.frame

Plot distribution information of numerical data

plot_normality.tbl_dbi

Plot distribution information of numerical data

plot_outlier.data.frame

Plot outlier information of numerical data diagnosis

plot_outlier.target_df

Plot outlier information of target_df

plot_outlier.tbl_dbi

Plot outlier information of numerical data diagnosis in the DBMS

plot_qq_numeric.data.frame

Plot Q-Q plot of numerical variables

pps

Compute Predictive Power Score

print.relate

Summarizing relate information

relate

Relationship between target variable and variable of interest

skewness

Skewness of the data

summary.bins

Summarizing Binned Variable

summary.compare_category

Summarizing compare_category information

summary.compare_numeric

Summarizing compare_numeric information

summary.correlate

Summarizing Correlation Coefficient

summary.imputation

Summarizing imputation information

summary.optimal_bins

Summarizing Performance for Optimal Bins

summary.overview

Summarizing overview information

summary.performance_bin

Summarizing Performance for Binned Variable

summary.pps

Summarizing Predictive Power Score

summary.transform

Summarizing transformation information

summary.univar_category

Summarizing univar_category information

summary.univar_numeric

Summarizing univar_numeric information

target_by.data.frame

Target by one variables

target_by.tbl_dbi

Target by one column in the DBMS

theil

Theil's U statistic

transform

Data Transformations

transformation_paged_report

Reporting the information of transformation

transformation_report

Reporting the information of transformation

transformation_web_report

Reporting the information of transformation with html

univar_category.data.frame

Statistic of univariate categorical variables

univar_numeric.data.frame

Statistic of univariate numerical variables

A collection of tools that support data diagnosis, exploration, and transformation. Data diagnostics provides information and visualization of missing values, outliers, and unique and negative values to help you understand the distribution and quality of your data. Data exploration provides information and visualization of the descriptive statistics of univariate variables, normality tests and outliers, correlation of two variables, and the relationship between the target variable and predictor. Data transformation supports binning for categorizing continuous variables, imputes missing values and outliers, and resolves skewness. And it creates automated reports that support these three tasks.

  • Maintainer: Choonghyun Ryu
  • License: GPL-2
  • Last published: 2024-02-07