In this paper we describe a language recognition algorithm for multilingual documents that is based on mixed-order n-grams, Markov chains, maximum likelihood, and dynamic programming. We present the results of an experimental study showing that the algorithm performs well enough to be of practical value.
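The abstract names the ingredients of the algorithm but not how they are combined. As an illustration only, here is a minimal Python sketch of one way such pieces could fit together. It is not the authors' method. Everything in it is an assumption made for the example: the function names (`train_model`, `log_prob`, `word_score`, `label_words`), the interpolation weight `lam`, the `switch_penalty`, the add-one smoothing, and the toy training strings. Character unigram and bigram estimates are interpolated (a simple stand-in for mixed-order n-grams), each word is scored as a Markov chain over its characters, and dynamic programming picks the per-word language sequence with the highest total log-likelihood.

```python
import math
from collections import Counter

def train_model(text):
    """Collect character unigram and bigram counts for one language."""
    padded = " " + text.lower()  # leading space marks a word boundary
    return {
        "uni": Counter(padded),
        "bi": Counter(zip(padded, padded[1:])),
        "total": len(padded),
    }

def log_prob(model, prev, ch, vocab_size=256, lam=0.7):
    """Interpolated bigram/unigram log-probability with add-one smoothing.
    vocab_size and lam are illustrative values, not taken from the paper."""
    uni = (model["uni"][ch] + 1) / (model["total"] + vocab_size)
    bi = (model["bi"][(prev, ch)] + 1) / (model["uni"][prev] + vocab_size)
    return math.log(lam * bi + (1 - lam) * uni)

def word_score(model, word):
    """Log-likelihood of a word as a first-order Markov chain over characters."""
    prev, total = " ", 0.0
    for ch in word.lower():
        total += log_prob(model, prev, ch)
        prev = ch
    return total

def label_words(words, models, switch_penalty=2.0):
    """Dynamic programming over words: choose the language sequence with the
    highest total score, paying switch_penalty for each change of language."""
    if not words:
        return []
    langs = list(models)
    best = {l: word_score(models[l], words[0]) for l in langs}
    back = []  # back[i][l] = best previous language if word i+1 has language l
    for w in words[1:]:
        new_best, pointers = {}, {}
        for l in langs:
            def cost(p, l=l):
                return best[p] - (0.0 if p == l else switch_penalty)
            prev_l = max(langs, key=cost)
            new_best[l] = cost(prev_l) + word_score(models[l], w)
            pointers[l] = prev_l
        best = new_best
        back.append(pointers)
    # Trace the best path backwards from the best final language.
    lang = max(langs, key=best.get)
    path = [lang]
    for pointers in reversed(back):
        lang = pointers[lang]
        path.append(lang)
    return path[::-1]

# Toy usage with two artificial "languages" (hypothetical training data).
models = {
    "A": train_model("aaaa aaaa aaa"),
    "B": train_model("bbbb bbb bbbb"),
}
labels = label_words(["aaa", "aa", "bbb", "bb"], models)
print(labels)
```

The switch penalty plays the role of a transition cost between languages: raising it favours longer single-language stretches, while lowering it lets the labelling change language more readily. A real system would train on substantial corpora and likely use higher-order n-grams.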
Ludovik, Yevgeny and Ron Zacharski. 1999. Multilingual document language recognition. Proceedings of the Machine Translation Summit VII, 317-323. Singapore, September 13-17, 1999.