Commit 891d3bd

Expose the Lucene Korean analyzer module in a plugin (#30397)
This change adds a new plugin called `analysis-nori` that exposes Korean text analysis in Elasticsearch using the new Lucene Korean analyzer module (`nori`). The plugin adds:
* a Korean analyzer: `nori`
* a Korean tokenizer: `nori_tokenizer`
* a part-of-speech stop filter: `nori_part_of_speech`
* a filter that replaces Hanja characters with their Hangul transcription: `nori_readingform`
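As a rough illustration of how the components listed above might be combined, the following sketch builds an index-settings body that wires `nori_tokenizer`, `nori_part_of_speech`, and `nori_readingform` into a custom analyzer. The names `korean_custom`, `my_nori_tokenizer`, and `my_posfilter` are hypothetical. The parameters `decompound_mode`, `user_dictionary`, and `stoptags` do not appear in this commit message and are assumed here from the plugin's documentation, so check them against the version in use. Only the settings body is constructed; sending it to a cluster is left out.

```python
import json


def build_nori_settings(user_dictionary="userdict_ko.txt"):
    """Return a hypothetical index-settings body that uses the nori components.

    Assumptions (not stated in the commit message): the parameter names
    `decompound_mode`, `user_dictionary`, and `stoptags`, and the tag values
    passed to `stoptags`.
    """
    return {
        "settings": {
            "index": {
                "analysis": {
                    "tokenizer": {
                        # Hypothetical name for a configured nori tokenizer.
                        "my_nori_tokenizer": {
                            "type": "nori_tokenizer",
                            "decompound_mode": "mixed",
                            "user_dictionary": user_dictionary,
                        }
                    },
                    "filter": {
                        # Hypothetical name for a part-of-speech stop filter.
                        "my_posfilter": {
                            "type": "nori_part_of_speech",
                            "stoptags": ["E", "IC", "J"],
                        }
                    },
                    "analyzer": {
                        # Custom analyzer chaining the tokenizer and filters.
                        "korean_custom": {
                            "type": "custom",
                            "tokenizer": "my_nori_tokenizer",
                            "filter": ["my_posfilter", "nori_readingform"],
                        }
                    },
                }
            }
        }
    }


if __name__ == "__main__":
    # Print the body so it can be pasted into an index-creation request.
    print(json.dumps(build_nori_settings(), indent=2, ensure_ascii=False))
```

If no custom tokenizer or filter configuration is needed, the built-in `nori` analyzer can presumably be referenced by name in a field mapping instead.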
1 parent 13e2223 commit 891d3bd

23 files changed: +1731 -1 lines changed

docs/CHANGELOG.asciidoc

Lines changed: 3 additions & 0 deletions
@@ -146,6 +146,9 @@ The new <<mapping-ignored-field,`_ignored`>> field allows to know which fields
 got ignored at index time because of the <<ignore-malformed,`ignore_malformed`>>
 option. ({pull}30140[#29658])
 
+A new analysis plugin called `analysis_nori` that exposes the Lucene Korean
+analysis module. ({pull}30397[#30397])
+
 [float]
 === Enhancements

docs/build.gradle

Lines changed: 1 addition & 0 deletions
@@ -32,6 +32,7 @@ integTestCluster {
 configFile 'analysis/synonym.txt'
 configFile 'analysis/stemmer_override.txt'
 configFile 'userdict_ja.txt'
+configFile 'userdict_ko.txt'
 configFile 'KeywordTokenizer.rbbi'
 extraConfigFile 'hunspell/en_US/en_US.aff', '../server/src/test/resources/indices/analyze/conf_dir/hunspell/en_US/en_US.aff'
 extraConfigFile 'hunspell/en_US/en_US.dic', '../server/src/test/resources/indices/analyze/conf_dir/hunspell/en_US/en_US.dic'
