Language Detector
Pangea Language Detector can be successfully used:

Our Language Detector combines statistical and neural technologies to achieve the highest recognition accuracy. Our proprietary language detection algorithm is built on a solid mathematical foundation, the vector space model. We scan document contents to build a multidimensional vector space, using N-grams to calculate frequencies. The algorithm then analyzes the positions of the relevant vectors in that space to determine their similarity. Finally, the combined results are corrected using special linguistic rules developed by our language team.
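The general idea of comparing N-gram frequency vectors can be illustrated with a minimal Python sketch. This is not Pangea's proprietary algorithm: it assumes character trigrams, cosine similarity as the similarity measure, and three tiny hand-written sample texts as reference profiles. It leaves out the neural component and the linguistic correction rules, and all function names are hypothetical.

```python
import math
from collections import Counter


def ngram_profile(text: str, n: int = 3) -> Counter:
    """Count overlapping character n-grams (a sparse frequency vector)."""
    # Lowercase, collapse whitespace, and pad so word edges form n-grams too.
    padded = f" {' '.join(text.lower().split())} "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two sparse frequency vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Toy reference texts (an assumption for illustration only); a real system
# would build its profiles from large corpora for each language.
SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog and the cat sleeps in the house",
    "es": "el rápido zorro marrón salta sobre el perro perezoso y el gato duerme en la casa",
    "de": "der schnelle braune fuchs springt über den faulen hund und die katze schläft im haus",
}
PROFILES = {lang: ngram_profile(text) for lang, text in SAMPLES.items()}


def detect_language(text: str) -> str:
    """Return the language whose profile is most similar to the input."""
    query = ngram_profile(text)
    return max(PROFILES, key=lambda lang: cosine_similarity(query, PROFILES[lang]))


print(detect_language("el gato duerme en la casa"))
```

With profiles this small the sketch only separates inputs that share many trigrams with one sample text. Realistic accuracy would require far larger reference corpora, which is also where corrections such as linguistic rules come in.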
For evaluation purposes, we have created a demo page that detects the most popular languages, achieving language identification accuracy of 95% to 99% (typical competitors' results: 86% to 96%). The average processing speed was over 8000 KB/s.