@@ -9,20 +9,6 @@ Input text is lowercased, normalized to remove extended characters, sorted,
9
9
deduplicated and concatenated into a single token. If a stopword list is
10
10
configured, stop words will also be removed.
11
11
12
- [float]
13
- === Definition
14
-
15
- It consists of:
16
-
17
- Tokenizer::
18
- * <<analysis-standard-tokenizer,Standard Tokenizer>>
19
-
20
- Token Filters (in order)::
21
- 1. <<analysis-lowercase-tokenfilter,Lower Case Token Filter>>
22
- 2. <<analysis-asciifolding-tokenfilter>>
23
- 3. <<analysis-stop-tokenfilter,Stop Token Filter>> (disabled by default)
24
- 4. <<analysis-fingerprint-tokenfilter>>
25
-
26
12
[float]
27
13
=== Example output
28
14
@@ -149,3 +135,46 @@ The above example produces the following term:
149
135
---------------------------
150
136
[ consistent godel said sentence yes ]
151
137
---------------------------
138
+
139
+ [float]
140
+ === Definition
141
+
142
+ The `fingerprint` tokenizer consists of:
143
+
144
+ Tokenizer::
145
+ * <<analysis-standard-tokenizer,Standard Tokenizer>>
146
+
147
+ Token Filters (in order)::
148
+ * <<analysis-lowercase-tokenfilter,Lower Case Token Filter>>
149
+ * <<analysis-asciifolding-tokenfilter>>
150
+ * <<analysis-stop-tokenfilter,Stop Token Filter>> (disabled by default)
151
+ * <<analysis-fingerprint-tokenfilter>>
152
+
153
+ If you need to customize the `fingerprint` analyzer beyond the configuration
154
+ parameters then you need to recreate it as a `custom` analyzer and modify
155
+ it, usually by adding token filters. This would recreate the built-in
156
+ `fingerprint` analyzer and you can use it as a starting point for further
157
+ customization:
158
+
159
+ [source,js]
160
+ ----------------------------------------------------
161
+ PUT /fingerprint_example
162
+ {
163
+ "settings": {
164
+ "analysis": {
165
+ "analyzer": {
166
+ "rebuilt_fingerprint": {
167
+ "tokenizer": "standard",
168
+ "filter": [
169
+ "lowercase",
170
+ "asciifolding",
171
+ "fingerprint"
172
+ ]
173
+ }
174
+ }
175
+ }
176
+ }
177
+ }
178
+ ----------------------------------------------------
179
+ // CONSOLE
180
+ // TEST[s/\n$/\nstartyaml\n - compare_analyzers: {index: fingerprint_example, first: fingerprint, second: rebuilt_fingerprint}\nendyaml\n/]
0 commit comments