Skip to content

Commit de476d5

Browse files
author
Christoph Büscher
committed
[Docs] Add clarification to analysis example (#31826)
There have been at least two PRs trying to fix the spelling of "lazi" because it isn't very clear from the example that the english analyzer will stem each token in the example. This adds a short description of the analysis process to make this clearer. Relates to #31797
1 parent 06fabe6 commit de476d5

File tree

1 file changed

+6
-3
lines changed

1 file changed

+6
-3
lines changed

docs/reference/analysis.asciidoc

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,15 +13,18 @@ defined per index.
1313
[float]
1414
== Index time analysis
1515

16-
For instance at index time, the built-in <<english-analyzer,`english`>> _analyzer_ would
17-
convert this sentence:
16+
For instance, at index time the built-in <<english-analyzer,`english`>> _analyzer_
17+
will first convert the sentence:
1818

1919
[source,text]
2020
------
2121
"The QUICK brown foxes jumped over the lazy dog!"
2222
------
2323

24-
into these terms, which would be added to the inverted index.
24+
into distinct tokens. It will then lowercase each token, remove frequent
25+
stopwords ("the") and reduce the terms to their word stems (foxes -> fox,
26+
jumped -> jump, lazy -> lazi). In the end, the following terms will be added
27+
to the inverted index:
2528

2629
[source,text]
2730
------

0 commit comments

Comments
 (0)