Man page of tw_optimize_db(3)

NAME

tw_optimize_db - optimize Textweiser database

SYNOPSIS

 #include <tw.h>

 tw_errno_t tw_optimize_db(tw_t *tw);

DESCRIPTION

tw_optimize_db() optimizes a Textweiser database to improve both classification accuracy and performance.

PARAMETERS

tw (tw_t *)

Pointer to an initialized Textweiser object.

RETURN VALUE

tw_optimize_db() returns an error indicator of type tw_errno_t. A return value of TW_OK indicates success; any other value identifies the error that occurred.

The function tw_strerror(3) can be used to obtain a natural language error message.
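
The following minimal sketch shows one way to check the return value and report a failure. It is an illustration, not code taken from the library. The helper optimize_and_report() and the HAVE_TW_H macro are hypothetical names, and the signature of tw_strerror() (taking a tw_errno_t and returning a string) is an assumption; see tw_strerror(3) for the authoritative declaration. The stand-in definitions in the #else branch exist only so the sketch compiles without the library installed.

```c
#include <assert.h>
#include <stdio.h>

#ifdef HAVE_TW_H              /* hypothetical macro: define it to build against the real library */
#include <tw.h>
#else
/* Stand-ins so the sketch compiles on its own. These are NOT the real definitions. */
typedef struct { int unused; } tw_t;
typedef enum { TW_OK = 0, TW_STUB_FAILURE = 1 } tw_errno_t;

/* Stub: succeeds for any non-NULL object, fails otherwise. */
static tw_errno_t tw_optimize_db(tw_t *tw)
{
    return tw != NULL ? TW_OK : TW_STUB_FAILURE;
}

/* Stub with an ASSUMED signature; consult tw_strerror(3) for the real one. */
static const char *tw_strerror(tw_errno_t err)
{
    return err == TW_OK ? "no error" : "stub failure";
}
#endif

/*
 * Hypothetical helper: optimize the database and, on failure, write the
 * error message to `log`. Returns 0 on success and -1 on failure.
 */
int optimize_and_report(tw_t *tw, FILE *log)
{
    tw_errno_t err;

    assert(log != NULL);

    err = tw_optimize_db(tw);
    if (err != TW_OK) {
        fprintf(log, "tw_optimize_db: %s\n", tw_strerror(err));
        return -1;
    }
    return 0;
}
```

A caller would typically invoke optimize_and_report(tw, stderr) on an initialized Textweiser object and treat a return value of -1 as a failed optimization run.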

NOTES

Depending on the number of categories and the amount of learned classification data stored in the Textweiser database, optimization may take a long time and consume considerable system resources. In return, Textweiser can often classify unknown documents a couple of hundred percent faster when using an optimized database, and the accuracy of classification results is likely to improve as well.

We therefore recommend optimizing the database manually after learning any substantial number of new documents, and periodically during regular system maintenance.
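
One possible way to apply this recommendation inside an application is to count learned documents and trigger optimization once a threshold is reached. The sketch below is only an illustration under that assumption; opt_policy_t, opt_policy_note_learned() and count_calls() are hypothetical names and not part of Textweiser. The optimization step is passed in as a callback, which in a real program would wrap tw_optimize_db().

```c
#include <assert.h>
#include <stddef.h>

/* Callback that performs the optimization; returns 0 on success. */
typedef int (*optimize_cb)(void *ctx);

/* Hypothetical bookkeeping for "optimize after N learned documents". */
typedef struct {
    size_t      learned_since_opt;  /* documents learned since last optimization */
    size_t      threshold;          /* what counts as a "substantial" number */
    optimize_cb optimize;           /* would wrap tw_optimize_db() in practice */
    void       *ctx;                /* passed through to the callback */
} opt_policy_t;

/*
 * Record that `n` documents were learned. Returns 1 if an optimization
 * ran successfully, 0 if the threshold was not reached, -1 if the
 * callback failed (the counter is kept so the next call retries).
 */
int opt_policy_note_learned(opt_policy_t *p, size_t n)
{
    assert(p != NULL && p->optimize != NULL);

    p->learned_since_opt += n;
    if (p->learned_since_opt < p->threshold)
        return 0;

    if (p->optimize(p->ctx) != 0)
        return -1;

    p->learned_since_opt = 0;
    return 1;
}

/* Demonstration callback: counts invocations instead of optimizing. */
int count_calls(void *ctx)
{
    ++*(int *)ctx;
    return 0;
}
```

Because optimization may run for a long time, an application using such a policy would probably call it from a maintenance path rather than from latency-sensitive classification code.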

Database optimization can be automated with tw-admin(1), for example by running it from cron(8).

SEE ALSO

tw-admin(1), tw_strerror(3)

Textweiser User Manual

http://www.lingua-systems.com/text-classifier/textweiser-library/