Overview
lidc is a command-line application that detects the language and character set (charset) of text input. It is available for a variety of Unix systems (Linux, Solaris, FreeBSD). You can use it to identify the language and encoding of your text data, and to automate that task.
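As a rough illustration of the automation mentioned above, the following Python sketch runs a detector command over every file in a directory and collects whatever it prints. The helper names `run_detector` and `detect_all` are hypothetical, and the calling convention (file path passed as the last argument, result printed to standard output) is an assumption rather than the documented lidc interface, so the argument list should be adapted to the actual lidc manual.

```python
import subprocess
from pathlib import Path


def run_detector(command, path):
    """Run a detector command on one file and return its trimmed stdout.

    `command` is the argument list to execute, e.g. ["lidc"]. The file path
    is appended as the last argument. ASSUMPTION: this calling convention is
    illustrative only and is not taken from the lidc documentation.
    """
    result = subprocess.run(
        [*command, str(path)],
        capture_output=True,  # collect stdout/stderr instead of printing them
        text=True,            # decode output as text
        check=True,           # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout.strip()


def detect_all(command, directory, pattern="*.txt"):
    """Map each matching file name in `directory` to the detector's output."""
    return {
        p.name: run_detector(command, p)
        for p in sorted(Path(directory).glob(pattern))
    }
```

With lidc installed, a call might look like `detect_all(["lidc"], "corpus/")`, with any required options added to the argument list. Passing the command in as a parameter keeps the sketch independent of the exact options lidc expects.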
lidc detects the language and charset of text data with high accuracy. Its detection has been optimized using linguistic research, so even very short sentences can be processed successfully.
lidc supports a variety of input formats. In addition to plain text, its integrated parsers can process email, HTML, and XML files.
In addition to this application, we offer other solutions for language detection as well.
Go to the overview on language detection


