sdcMicro5.7.8 package

Statistical Disclosure Control Methods for Anonymization of Data and Risk Estimation

addGhostVars

addGhostVars

addNoise

Adding noise to perturb data

argus_microaggregation

argus_microaggregation

argus_rankswap

argus_rankswap

calcRisks

Recompute Risk and Frequencies for a sdcMicroObj

createDat

Dummy Dataset for Record Swapping

createNewID

Creates new randomized IDs

dataGen

Fast generation of synthetic data

distributeDraws_cpp

Distribute number of swaps

distributeRandom_cpp

Distribute

dRisk

overal disclosure risk

dRiskRMD

RMD based disclosure risk

dUtility

Data-Utility measures

extractManipData

Remove certain variables from the data set inside a sdc object.

freq

Freq

freqCalc

Frequencies calculation for risk estimation

generateStrata

Generate one strata variable from multiple factors

get.sdcMicroObj

get.sdcMicroObj

globalRecode

Global Recoding

groupAndRename

Join levels of a variables in an object of class sdcMicroObj-class o...

il_additional

Additional Information-Loss measures

importProblem

importProblem

indivRisk

Individual Risk computation

infoLoss

Calculate information loss after targeted record swapping

kAnon_violations

kAnon_violations

LocalRecProg

Local recoding via Edmond's maximum weighted matching algorithm

localSupp

Local Suppression

localSuppression

Local Suppression to obtain k-anonymity

mafast

Fast and Simple Microaggregation

measure_risk

Disclosure Risk for Categorical Variables

mergeHouseholdData

Replaces the raw household-level data with the anonymized household-le...

microaggregation

Microaggregation

microaggrGower

Microaggregation for numerical and categorical key variables based on ...

modRisk

Global risk using log-linear models.

mvTopCoding

Detection and winsorization of multivariate outliers

nextSdcObj

nextSdcObj

orderData_cpp

Reorder data

plot.localSuppression

Plots for localSuppression objects

plot.sdcMicroObj

Plotfunctions for objects of class sdcMicroObj

plotMicro

Comparison plots

pram

Post Randomization

print.freqCalc

Print method for objects from class freqCalc.

print.indivRisk

Print method for objects from class indivRisk

print.localSuppression

Print method for objects from class localSuppression

print.micro

Print method for objects from class micro

print.modrisk

Print method for objects from class modrisk

print.pram

Print method for objects from class pram

print.sdcMicroObj

Print and Extractor Functions for objects of class sdcMicroObj-class

print.suda2

Print method for objects from class suda2

randSample_cpp

Random Sampling

rankSwap

Rank Swapping

readMicrodata

readMicrodata

recordSwap

Targeted Record Swapping

recordSwap_cpp

Targeted Record Swapping

removeDirectID

Remove certain variables from the data set inside a sdc object.

report

Generate an Html-report from an sdcMicroObj

riskyCells

riskyCells

sampleDonor_cpp

Random sample for donor records

sdcApp

sdcApp

sdcMicro-package

sdcMicro: Statistical Disclosure Control Methods for Anonymization of ...

sdcMicroObj-class

Class "sdcMicroObj"

selectHouseholdData

Creates a household level file from a dataset with a household structu...

set.sdcMicroObj

set.sdcMicroObj

setLevels_cpp

Define Swap-Levels

setRisk_cpp

Calculate Risk

show-sdcMicroObj-method

Show

shuffle

Shuffling and EGADP

subsetMicrodata

subsetMicrodata

suda2

Suda2: Detecting Special Uniques

summary.freqCalc

Summary method for objects from class freqCalc

summary.micro

Summary method for objects from class micro

summary.pram

Summary method for objects from class pram

topBotCoding

Top and Bottom Coding

valTable

Comparison of different microaggregation methods

varToFactor

Change the a keyVariable of an object of class sdcMicroObj-class fro...

writeSafeFile

writeSafeFile

Data from statistical agencies and other institutions are mostly confidential. This package, introduced in Templ, Kowarik and Meindl (2017) <doi:10.18637/jss.v067.i04>, can be used for the generation of anonymized (micro)data, i.e. for the creation of public- and scientific-use files. The theoretical basis for the methods implemented can be found in Templ (2017) <doi:10.1007/978-3-319-50272-4>. Various risk estimation and anonymization methods are included. Note that the package includes a graphical user interface published in Meindl and Templ (2019) <doi:10.3390/a12090191> that allows to use various methods of this package.