Man page of tw_optimize_db(3)
NAME
tw_optimize_db - optimize Textweiser database
SYNOPSIS
#include <tw.h>

tw_errno_t tw_optimize_db(tw_t *tw);
DESCRIPTION
tw_optimize_db() optimizes a Textweiser database to improve both classification accuracy and performance.
PARAMETERS
- tw (tw_t *)
  Pointer to an initialized Textweiser object.
RETURN VALUE
tw_optimize_db() returns an error indicator (tw_errno_t).
A return value of TW_OK indicates success; any other value
identifies the error that occurred.
The function tw_strerror(3) can be used to obtain a human-readable error message.
NOTES
Depending on the number of categories and the amount of learned classification data stored in the Textweiser database, optimization may take a long time and use a lot of system resources. In return, an optimized database often speeds up classification of unknown documents by a couple of hundred percent, and the accuracy of classification results is likely to improve.
We therefore recommend optimizing the database manually after learning any substantial number of new documents, and periodically during regular system maintenance.
To automate database optimization, tw-admin(1) can be run on a schedule, for example from cron(8).
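EXAMPLE
The following is a minimal sketch of the call-and-check pattern described under RETURN VALUE, not an excerpt from the library. So that it compiles without Textweiser installed, it declares stand-ins for tw_t, tw_errno_t, tw_optimize_db() and tw_strerror(); in real code these come from <tw.h>. The stand-in bodies, the value TW_OK = 0, the TW_EXAMPLE_FAILURE code, the assumed signature of tw_strerror() (taking a tw_errno_t and returning a string) and the helper optimize_or_report() are all hypothetical and should be checked against tw_strerror(3) and the Textweiser User Manual.

```c
#include <stdio.h>

/* ---- Stand-ins (ASSUMPTIONS): replace this section with #include <tw.h> ---- */
struct tw { int unused; };                 /* hypothetical placeholder layout */
typedef struct tw tw_t;
typedef enum {
    TW_OK = 0,                             /* numeric value is an assumption */
    TW_EXAMPLE_FAILURE = 1                 /* hypothetical error code */
} tw_errno_t;

/* Stub: succeeds for any non-NULL object, fails otherwise. */
static tw_errno_t tw_optimize_db(tw_t *tw)
{
    return tw != NULL ? TW_OK : TW_EXAMPLE_FAILURE;
}

/* Stub with an ASSUMED signature; see tw_strerror(3) for the real one. */
static const char *tw_strerror(tw_errno_t err)
{
    return err == TW_OK ? "success" : "example failure";
}
/* ---- End of stand-ins ---- */

/* Hypothetical helper: optimize the database and report any error on stderr.
 * Returns 0 on success and -1 on failure. */
static int optimize_or_report(tw_t *tw)
{
    tw_errno_t err = tw_optimize_db(tw);

    if (err != TW_OK) {
        fprintf(stderr, "tw_optimize_db: %s\n", tw_strerror(err));
        return -1;
    }
    return 0;
}
```

Because optimization may run for a long time, a caller would typically invoke such a helper from a maintenance task rather than from the classification path.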
SEE ALSO
tw-admin(1), tw_strerror(3)
Textweiser User Manual
http://www.lingua-systems.com/text-classifier/textweiser-library/
COPYRIGHT
Copyright (c) 2010-2011 Lingua-Systems Software GmbH


