Trilang is a free project written mainly in Python.
trilang -- Statistical language detector
trilang is a statistical language detector. To detect the language of a text, it splits the text into trigrams (blocks of three letters) and compares their frequencies with reference values stored in its database. The database is initially empty and must be filled by training it on texts whose language is known.
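The learn-then-detect cycle described above can be sketched roughly as follows. This is a minimal illustration, not trilang's actual code: the names trigrams, frequencies and Detector, the in-memory dictionary standing in for the database, and the overlap score used for comparison are all assumptions made for this example.

```python
from collections import Counter


def trigrams(text):
    """Return the overlapping three-character blocks of the lowercased text."""
    cleaned = " ".join(text.lower().split())  # normalise whitespace
    return [cleaned[i:i + 3] for i in range(len(cleaned) - 2)]


def frequencies(text):
    """Map each trigram to its relative frequency in the text."""
    counts = Counter(trigrams(text))
    total = sum(counts.values())
    return {gram: n / total for gram, n in counts.items()} if total else {}


class Detector:
    """Hypothetical in-memory stand-in for the reference database."""

    def __init__(self):
        self.counts = {}  # language name -> Counter of trigram counts

    def learn(self, language, text):
        """Add the trigrams of a text with a known language to the database."""
        self.counts.setdefault(language, Counter()).update(trigrams(text))

    def detect(self, text):
        """Return the best-matching learned language, or None if there is none."""
        sample = frequencies(text)
        best, best_score = None, -1.0
        for language, counts in self.counts.items():
            total = sum(counts.values())
            if total == 0:
                continue
            # Assumed similarity measure: overlap of the two frequency
            # distributions, a value between 0 and 1.
            score = sum(min(freq, counts[gram] / total)
                        for gram, freq in sample.items())
            if score > best_score:
                best, best_score = language, score
        return best
```

A detector built this way knows nothing until learn() has been called at least once per language, which mirrors the initially empty database. With only a few short training texts the result is unreliable; longer reference texts give more stable frequencies.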
The statistical approach has been described in the article "A Statistical Approach to the Spam Problem" by Gary Robinson, 1 Mar 2003, Linux Journal, http://www.linuxjournal.com/article/6467 .
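One element of the cited article is a smoothing formula for tokens that have been seen only a few times, f(w) = (s*x + n*p(w)) / (s + n). The sketch below shows that formula only. Which parts of the article trilang actually uses is not stated here, and applying the formula to trigrams instead of words is an assumption; the function name and the default values are illustrative.

```python
def smoothed_probability(p, n, s=1.0, x=0.5):
    """Robinson's degree-of-belief estimate f(w) = (s*x + n*p) / (s + n).

    p -- raw probability estimated from the training data
    n -- number of times the token (here, assumed: a trigram) was seen
    s -- strength given to the background assumption (assumed default)
    x -- assumed probability for a token that was never seen
    """
    return (s * x + n * p) / (s + n)
```

With n = 0 the result is simply x, and as n grows the estimate approaches the raw value p, so rarely seen trigrams cannot dominate the decision.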
Contact
Please email me with any comments or questions you have: Hermann Schwarting [email protected]