Simulate properties based on the empirical distribution of the original data and new words with frequency one
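As a rough illustration of the idea above, here is a minimal sketch that assumes the original data is available as word counts and that the simulated "property" is vocabulary size. The function names (`simulate_tokens`, `vocabulary_size`), the `<new_i>` placeholder labels, and the toy sentence are all hypothetical and are not taken from this project.

```python
import random
from collections import Counter


def simulate_tokens(counts, n_new_words, n_tokens, seed=0):
    """Sample tokens from the empirical distribution, extended with
    n_new_words unseen words that each get frequency one."""
    extended = Counter(counts)  # copy so the caller's counts are not modified
    for i in range(n_new_words):
        # Hypothetical placeholder label for a word absent from the original data.
        extended[f"<new_{i}>"] = 1
    words = list(extended)
    weights = [extended[w] for w in words]
    rng = random.Random(seed)  # seeded for reproducible simulations
    # Draw with replacement, with probability proportional to frequency.
    return rng.choices(words, weights=weights, k=n_tokens)


def vocabulary_size(tokens):
    """Example property: number of distinct word types in the sample."""
    return len(set(tokens))


# Toy data, assumed purely for illustration.
original = Counter("the cat sat on the mat the end".split())
sample = simulate_tokens(original, n_new_words=3, n_tokens=1000)
print(vocabulary_size(sample))
```

Other properties could be computed from the same sample by swapping out `vocabulary_size`; the sampling step does not depend on which property is measured.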
Useful links