AI-Driven Anomaly Detection for Data Quality
Calculate Benchmarking Metrics
Calculate Feature Importance for Anomalies
Create R Markdown Template
Extract Benchmark Metrics from Scored Data
Flag Top Anomalies Based on Score Threshold
Generate Automated Data Quality Audit Report
Get Top Anomalous Records
Prepare Data for Anomaly Detection
Score Anomalies Using Unsupervised Machine Learning
Score anomalies using Isolation Forest
Score anomalies using Local Outlier Factor
Automated data quality auditing using unsupervised machine learning. Provides AI-driven anomaly detection for data quality assessment, primarily designed for Electronic Health Records (EHR) data, with benchmarking capabilities for validation and publication. Methods based on: Liu et al. (2008) <doi:10.1109/ICDM.2008.17>, Breunig et al. (2000) <doi:10.1145/342009.335388>.
Useful links