ML & NLP home

Estimating n-gram probabilities

Let’s start by estimating bi-gram probabilities

Estimating bi-gram probabilities

Bi-gram probabilities of sentences

Practical Issues

Public Language modelling toolkits:

- SRILM
- Google N-gram corpus (Released in 2006). Contains 13m unique words.
- Google Books N-gram corpus