1
- up to: 1a3e41130ee6321d3e649fabb26c8b60b354f8be
1
+ up to: cd61f01a4e04c7a79b91a3cc2a42b6ffa144a99b
2
2
3
3
Snowball 3.0.0 (2025-05-??)
4
4
===========================
@@ -413,7 +413,11 @@ Behavioural changes to existing algorithms
413
413
text was written using the visually similar "cedilla" diacritic on these
414
414
letters instead ("ş" and "ţ"). Previously our stemmer only recognised the
415
415
latter. Now it maps the cedilla forms to "comma below" as a first step.
416
- Patch from Rober Muir.
416
+ Patch from Robert Muir.
417
+
418
+ * Spanish: Handle -acion like -ación and -ucion like -ución. It's apparently
419
+ common to miss off accents in Spanish, and there are examples in our test
420
+ vocabulary that these change helps. Proposed by Damian Janowski.
417
421
418
422
* Swedish: Replace suffix "öst" with "ös" when preceded by any of 'iklnprtuv'
419
423
rather than just 'l'. The new rule only requires the "öst" to be in R1
@@ -459,6 +463,8 @@ Optimisations to existing algorithms
459
463
460
464
* Lithuanian: Remove redundant R1 check.
461
465
466
+ * Nepali: Eliminate redundant check_category_2 routine.
467
+
462
468
* Tamil: Optimise by using `among` instead of long `or` chains. The generated
463
469
C version now takes 43% less time to processes the test vocabulary.
464
470
@@ -469,6 +475,8 @@ Optimisations to existing algorithms
469
475
470
476
* Tamil: Make `fix_va_start` simpler and faster.
471
477
478
+ * Tamil: Localise use of `found_a_match` flag.
479
+
472
480
* Tamil: Eliminate pointless flag changes.
473
481
474
482
* Turkish: Minor optimisations.
0 commit comments