Fast n-Gram tokenization
