Product Information

lid identifies the language and character encoding of textual input quickly and reliably. Thanks to optimized algorithms, the library needs little disk space and has no further dependencies. This makes lid high-quality software that fits well into complex applications.

Features

  • lid identifies language and character encoding with high accuracy. We achieve this accuracy by combining statistical methods with linguistic knowledge engineering.
  • lid identifies not only languages in their native script but also languages in transliterated form. This allows it to process the greatest possible number of texts found in practice.
  • lid identifies a great variety of character encodings and supports all common Unicode encodings. You therefore do not need to know the character encoding before processing a text; lid reports it as part of the result.
  • Even very short input of about five words is usually sufficient to identify the language correctly.
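
This page does not describe how lid detects encodings. As a rough illustration only, the simplest signal for the Unicode encodings is a byte order mark (BOM) at the start of the input. The C sketch below checks for one. The function name `bom_encoding` is hypothetical and not part of lid's interface, and a real detector has to handle the common case of input without a BOM by other means.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, NOT part of lid's API.
   Returns the name of the Unicode encoding signalled by a byte order
   mark at the start of buf, or NULL if no BOM is present. */
const char *bom_encoding(const unsigned char *buf, size_t len)
{
    static const unsigned char utf32be[] = { 0x00, 0x00, 0xFE, 0xFF };
    static const unsigned char utf32le[] = { 0xFF, 0xFE, 0x00, 0x00 };
    static const unsigned char utf8[]    = { 0xEF, 0xBB, 0xBF };
    static const unsigned char utf16be[] = { 0xFE, 0xFF };
    static const unsigned char utf16le[] = { 0xFF, 0xFE };

    /* Test the 4-byte marks first: the UTF-32LE mark begins with the
       same two bytes as the UTF-16LE mark. */
    if (len >= 4 && memcmp(buf, utf32be, 4) == 0) return "UTF-32BE";
    if (len >= 4 && memcmp(buf, utf32le, 4) == 0) return "UTF-32LE";
    if (len >= 3 && memcmp(buf, utf8, 3) == 0)    return "UTF-8";
    if (len >= 2 && memcmp(buf, utf16be, 2) == 0) return "UTF-16BE";
    if (len >= 2 && memcmp(buf, utf16le, 2) == 0) return "UTF-16LE";
    return NULL; /* no BOM: the encoding must be inferred from content */
}
```

A caller would pass the first few bytes of a file or buffer and fall back to content-based detection when the result is NULL. Note that UTF-16LE text whose first character after the BOM is U+0000 is indistinguishable from a UTF-32LE BOM by this check alone.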

Read more about lid in the product information for developers and for decision-makers, and get a first impression of lid in the online demonstration.

Supported Platforms

The C/C++ library is available for several operating systems, each in its native package format.

Operating System   Distribution/Version    Architecture
Linux              Debian Lenny (5.0)      x86, x86_64
Linux              Debian Squeeze (6.0)    x86, x86_64
Linux              Ubuntu LTS (10.04)      x86, x86_64
Linux              Red Hat Enterprise 5    x86, x86_64
FreeBSD            7                       x86
FreeBSD            8                       x86
FreeBSD            9                       x86
Windows            XP                      x86
Windows            Server 2003             x86
Windows            Server 2008             x86
Windows            7                       x86, x86_64
Windows            Server 2008 R2          x86, x86_64

If you need the software for another operating system or distribution, do not hesitate to contact us.

Interfaces

Requirements

lid has very few requirements: it needs few resources and depends only on the native C and thread libraries of the respective operating system.

  • C and thread library of the respective system
  • 300 KiB RAM
  • 1.5 MB disk space

All technical details are summarized in the software specification.