Extract bigrams instead of words (currently not taking utterance boundaries into account)
Useful links