Commit abbe853

Add limits for ngram and shingle settings (#27211) (#27318)
Relates to #25887
1 parent 6810aa8 commit abbe853

1 file changed: +18 -0 lines changed
docs/reference/migration/migrate_7_0/indices.asciidoc

Lines changed: 18 additions & 0 deletions
@@ -19,3 +19,21 @@ had the undocumented side-effect of refreshing the index which made new document
 visible to searches and non-realtime GET operations. From now on these operations
 don't have this side-effect anymore. To make documents visible an explicit `_refresh`
 call is needed unless the index is refreshed by the internal scheduler.
+
+
+==== Limit to the difference between max_size and min_size in NGramTokenFilter and NGramTokenizer
+
+To safeguard against creating too many index terms, the difference between `max_gram` and
+`min_gram` in `NGramTokenFilter` and `NGramTokenizer` has been limited to 1. This default
+limit can be changed with the index setting `index.max_ngram_diff`. Note that if the limit is
+exceeded, an error is thrown only for new indices. For existing pre-7.0 indices, a deprecation
+warning is logged.
+
+
+==== Limit to the difference between max_size and min_size in ShingleTokenFilter
+
+To safeguard against creating too many tokens, the difference between `max_shingle_size` and
+`min_shingle_size` in `ShingleTokenFilter` has been limited to 3. This default
+limit can be changed with the index setting `index.max_shingle_diff`. Note that if the limit is
+exceeded, an error is thrown only for new indices. For existing pre-7.0 indices, a deprecation
+warning is logged.
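
For readers of this change, the rule described in the two added sections can be sketched roughly as follows. This is an illustration only, not the Elasticsearch implementation: the helper `check_size_diff`, its `created_before_7` flag, the error message wording, and the analyzer names `my_ngram` and `my_shingle` are all hypothetical. The setting names `index.max_ngram_diff` and `index.max_shingle_diff` and the defaults of 1 and 3 come from the text above.

```python
import warnings

# Default limits as stated in the migration note.
DEFAULT_MAX_NGRAM_DIFF = 1
DEFAULT_MAX_SHINGLE_DIFF = 3


def check_size_diff(min_size, max_size, limit, setting_name, created_before_7=False):
    """Hypothetical helper mirroring the documented rule (not Elasticsearch code).

    Returns True when the difference is within the limit. Otherwise it raises
    ValueError for a new index, or emits a DeprecationWarning and returns False
    for an index created before 7.0.
    """
    diff = max_size - min_size
    if diff <= limit:
        return True
    message = (
        f"difference between max and min size is {diff}, "
        f"but the limit is {limit}; see [{setting_name}]"
    )
    if created_before_7:
        warnings.warn(message, DeprecationWarning)
        return False
    raise ValueError(message)


# Assumed shape of an index settings body that raises both limits so that
# wider ranges are accepted. Analyzer component names are made up.
settings = {
    "index": {
        "max_ngram_diff": 2,
        "max_shingle_diff": 4,
        "analysis": {
            "tokenizer": {
                "my_ngram": {"type": "ngram", "min_gram": 1, "max_gram": 3}
            },
            "filter": {
                "my_shingle": {
                    "type": "shingle",
                    "min_shingle_size": 2,
                    "max_shingle_size": 6,
                }
            },
        },
    }
}

tokenizer = settings["index"]["analysis"]["tokenizer"]["my_ngram"]
shingle = settings["index"]["analysis"]["filter"]["my_shingle"]

# Both checks pass because the limits were raised in the settings above.
ngram_ok = check_size_diff(
    tokenizer["min_gram"],
    tokenizer["max_gram"],
    settings["index"]["max_ngram_diff"],
    "index.max_ngram_diff",
)
shingle_ok = check_size_diff(
    shingle["min_shingle_size"],
    shingle["max_shingle_size"],
    settings["index"]["max_shingle_diff"],
    "index.max_shingle_diff",
)
```

With the default limits, the same `my_ngram` range (1 to 3) would be rejected on a new index and would only produce a warning on a pre-7.0 index, which is the split in behavior the note describes.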
