summaryrefslogtreecommitdiff
path: root/docs/reference/index-modules/analysis.asciidoc
blob: 1cf33e84db959ab9dddafed6abe6cfde79c59875 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
[[index-modules-analysis]]
== Analysis

The index analysis module acts as a configurable registry of Analyzers
that can be used in order to both break indexed (analyzed) fields when a
document is indexed and process query strings. It maps to the Lucene
`Analyzer`.

Analyzers are (generally) composed of a single `Tokenizer` and zero or
more `TokenFilters`. A set of `CharFilters` can be associated with an
analyzer to process the characters prior to other analysis steps. The
analysis module allows one to register `TokenFilters`, `Tokenizers` and
`Analyzers` under logical names that can then be referenced either in
mapping definitions or in certain APIs. The Analysis module
automatically registers (*if not explicitly defined*) built in
analyzers, token filters, and tokenizers.

See <<analysis>> for configuration details.