kde2d_faster function

Based on the MASS kde2d() function, but heavily simplified; it's just tcrossprod() now.