stddiff.spark1.0 package

Calculate the Standardized Difference for Numeric, Binary and Category Variables in Apache Spark

Provides functions to compute standardized differences for numeric, binary, and categorical variables on Apache Spark DataFrames using 'sparklyr'. The implementation mirrors the methods used in the 'stddiff' package but operates on distributed data. See Zhicheng Du, Yuantao Hao (2022) <doi:10.32614/CRAN.package.stddiff> for reference.

  • Maintainer: Alicja Januszkiewicz
  • License: GPL (>= 3)
  • Last published: 2026-01-15