Configuring full-text search#
sysman sysadmin
Allegra can search items and attachments for text. Supported formats include PDF, OpenOffice, Word, Excel, PowerPoint, HTML, RTF, and XML. To do this, full-text search must be configured.
Go to Administration > Global settings > Full-text search
to configure full-text search.
Enabling/disabling full-text search#
You can enable or disable full-text search. Note that full-text search requires storage space, especially when many or large attachments are indexed. The computational load can also be high, particularly when a complete rebuild of the index is forced at system startup.
Enabling/disabling full-text search in attachments#
You can enable or disable the indexing of attachments separately. Disabling it is useful when little storage space is available or when the server load should be reduced.
We recommend enabling the indexing of attachments.
Re-indexing at system startup#
When the checkbox for a complete rebuild is enabled, the index is rebuilt from scratch at system startup. Depending on the number of items and attachments, this can take a few minutes to several hours. During this time, full-text search is not available.
A complete rebuild of the index should not occur at every server startup. As a rule, rebuilding the index about every six months is sufficient. During the rebuild, access to the system should be disabled if possible.
Selecting an analyzer#
Allegra uses Lucene for full-text indexing and search. When building the index, it makes sense to ignore frequent words that are not normally searched for. In German, these include, for example, “ein”, “der”, “die”, “das”, “auch”, or “aber”. Such words should not be indexed.
Various analyzers are available for this purpose. They focus the indexing on the text components that are relevant to the search. Which analyzer is suitable depends on the language of the texts to be indexed.
Choose the analyzer that best fits your requirements. The standard analyzer is less efficient than a language-specific analyzer, but it works in all cases. If your texts and attachments contain multiple languages, the standard analyzer is usually the best choice.
Index directory#
By default, the directory for the full-text search index is created in $ALLEGRA_HOME/index. If attachments are included in full-text search, the space requirement can become considerable. You specify a different location for the index in the application.properties file:
allegra.index-path=abs-path