Commit aec7f77

Expose the Lucene Korean analyzer module in a plugin (#30397)
This change adds a new plugin called `analysis-nori` that exposes Korean text analysis in Elasticsearch using the new Lucene Korean analyzer module (`nori`). The plugin adds:

* a Korean analyzer: `nori`
* a Korean tokenizer: `nori_tokenizer`
* a part-of-speech stop filter: `nori_part_of_speech`
* a filter that can replace Hanja characters with their Hangul transcription: `nori_readingform`
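The components named in the commit message can be combined into a custom analyzer in an index's settings. The following is a minimal sketch, not taken from this commit: the names `my_nori_tokenizer` and `my_korean` are hypothetical, and the `decompound_mode` and `user_dictionary` options are assumed from the plugin's documentation rather than from the diff shown here. `userdict_ko.txt` is the dictionary file this commit registers for the docs test cluster.

```python
import json


def build_nori_settings(user_dictionary="userdict_ko.txt"):
    """Return index settings that wire the plugin's components into a custom analyzer.

    The tokenizer and analyzer names are hypothetical; the option names
    `decompound_mode` and `user_dictionary` are assumptions based on the
    plugin documentation, not on this commit's diff.
    """
    return {
        "settings": {
            "analysis": {
                "tokenizer": {
                    "my_nori_tokenizer": {
                        "type": "nori_tokenizer",
                        # Assumed option: how compound nouns are split.
                        "decompound_mode": "mixed",
                        # Assumed option: path relative to the config directory.
                        "user_dictionary": user_dictionary,
                    }
                },
                "analyzer": {
                    "my_korean": {
                        "type": "custom",
                        "tokenizer": "my_nori_tokenizer",
                        # The two token filters added by the plugin.
                        "filter": ["nori_part_of_speech", "nori_readingform"],
                    }
                },
            }
        }
    }


if __name__ == "__main__":
    # Print the JSON body that would be sent when creating an index.
    print(json.dumps(build_nori_settings(), indent=2))
```

The printed JSON would serve as the body of an index-creation request. For the default behaviour, referencing the built-in `nori` analyzer directly avoids the custom definition altogether.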
1 parent 39e0031 commit aec7f77

23 files changed, +1731 -1 lines changed

docs/CHANGELOG.asciidoc

+3
@@ -285,6 +285,9 @@ are used.
 //[float]
 //=== New Features
 
+A new analysis plugin called `analysis_nori` that exposes the Lucene Korean
+analysis module. ({pull}30397[#30397])
+
 [float]
 === Enhancements

docs/build.gradle

+1
@@ -32,6 +32,7 @@ integTestCluster {
 configFile 'analysis/synonym.txt'
 configFile 'analysis/stemmer_override.txt'
 configFile 'userdict_ja.txt'
+configFile 'userdict_ko.txt'
 configFile 'KeywordTokenizer.rbbi'
 extraConfigFile 'hunspell/en_US/en_US.aff', '../server/src/test/resources/indices/analyze/conf_dir/hunspell/en_US/en_US.aff'
 extraConfigFile 'hunspell/en_US/en_US.dic', '../server/src/test/resources/indices/analyze/conf_dir/hunspell/en_US/en_US.dic'
