Calculate the Standardized Difference for Numeric, Binary and Categorical Variables in Apache Spark
Check if group variable has exactly 2 levels
Map Spark to base R types
Compute Standardized Differences for Binary Variables (Spark)
Compute Standardized Differences for Categorical Variables (Spark)
Compute Standardized Differences for Numeric Variables (Spark)
Validate inputs for stddiff functions
Provides functions to compute standardized differences for numeric, binary, and categorical variables on Apache Spark DataFrames using 'sparklyr'. The implementation mirrors the methods used in the 'stddiff' package but operates on distributed data. See Zhicheng Du, Yuantao Hao (2022) <doi:10.32614/CRAN.package.stddiff> for reference.
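For orientation, the conventional definitions of the standardized difference between two groups (indexed 1 and 2) are sketched below. These are the textbook forms commonly used for this measure and are given here as an assumption; the source does not spell out the exact formulas, and this package may differ in details such as reporting the absolute value. The symbols (group means, sample variances, proportions, and the vectors T and C) are notation introduced for this sketch only.

```latex
% Numeric variable: difference in means over the pooled standard deviation
d_{\text{numeric}} = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{\dfrac{s_1^2 + s_2^2}{2}}}

% Binary variable: difference in proportions over the pooled standard deviation
d_{\text{binary}} = \frac{p_1 - p_2}{\sqrt{\dfrac{p_1(1 - p_1) + p_2(1 - p_2)}{2}}}

% Categorical variable with K levels: Mahalanobis-type distance, where
% T = (p_{1,2}, \dots, p_{1,K})^\top and C = (p_{2,2}, \dots, p_{2,K})^\top
% hold the level proportions in each group (first level omitted)
d_{\text{categorical}} = \sqrt{(T - C)^\top S^{-1} (T - C)},
\qquad
S_{kl} =
\begin{cases}
  \dfrac{p_{1,k}(1 - p_{1,k}) + p_{2,k}(1 - p_{2,k})}{2}, & k = l \\[2ex]
  -\dfrac{p_{1,k}\,p_{1,l} + p_{2,k}\,p_{2,l}}{2}, & k \neq l
\end{cases}
```

As a quick check of the binary form: with p_1 = 0.5 and p_2 = 0.3, the numerator is 0.2 and the denominator is sqrt((0.25 + 0.21) / 2) = sqrt(0.23) ≈ 0.480, giving a standardized difference of about 0.417.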