Product Information
lid identifies the language and character encoding of textual input fast and reliably. Due to optimized algorithms the library does not need much disk space and has no further dependencies. Therefore lid is a high-quality software, that fits into complex applications perfectly.
Features
- lid identifies language and character encoding with high accuracy. We reach accuracy by enhancing statistical methods by linguistic knowledge engineering.
- lid does not only identify languages, but is able to identify languages in their transliterated form. This feature allows to process the greatest possible number of texts, found in practice.
- lid identifies a great variety of character encodings and supports all common Unicode encodings. Therefore there is not only no need to know the character encoding before processing a text - this information is provided for you afterwards.
- Even very short input of about five words is in most cases sufficient to identify the language correctly.
Read more about lid in the product information for developers and for decision-makers and get a first impression of lid in the online demonstration.
Supported Platforms
The C/C++ library is provided for several Unix operating systems and in their native package format.
| Operating System | Distribution/Version | Architecture |
|---|---|---|
| Linux | Debian Etch (4.0) | x86/IA-32 |
| Linux | Debian Lenny (5.0) | x86/IA-32 |
| Linux | Ubuntu LTS (10.04) | x86/IA-32 |
| Solaris | 10 | Sparc |
| FreeBSD | 6 | x86/IA-32 |
| FreeBSD | 7 | x86/IA-32 |
| FreeBSD | 8 | x86/IA-32 |
| Windows | XP | x86/IA-32 |
| Windows | Server 2003 | x86/IA-32 |
| Windows | Vista | x86/IA-32 |
| Windows | Server 2008 | x86/IA-32 |
| Windows | 7 | x86/IA-32 & x86_64/IA-64 |
| Windows | Server 2008 R2 | x86/IA-32 & x86_64/IA-64 |
If you need the software for another operating system or distribution do not hesitate to contact us.
Interfaces
- C/C++
- Perl (via Lingua::Lid)
Requirements
There are very little requirements as lid does not need much resources and only depends on the native C and thread library of the respective operating system.
- C and thread library of the respective system
- 300 KiB RAM
- 1.5 MB disk space
All technical details are summed up in the software specification.


