Skip to content

Releases: PyThaiNLP/pythainlp

PyThaiNLP v2.3.0-beta1

23 Mar 11:21
Compare
Choose a tag to compare
Pre-release

PyThaiNLP v2.3.0-beta1 is The first beta release of PyThaiNLP 2.3

Documentation: https://pythainlp.github.io/dev-docs/index.html
Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 2.3 change log #445

Deprecation and other API changes

Tokenizer

  • #484 Add: model option for attacut.tokenize()
  • #502 Add: corpus.util.revise_wordset() to revise tokenization dictionary
  • #503 Add: NERCut tokenization engine

Corpus

POS Tagger

  • #464 Add: LST20 language model for part-of-speech tagging
  • #468 Add: port PerceptronTagger from NTLK. POS tagging no longer needs NLTK for dependency.
  • #478 Update: ORCHID POS tags documentation

Name Entity Tagging

  • #526 Update ThaiNER 1.4 to ThaiNER 1.5
  • #538 Add ThaiNameTagger version and add ThaiNER 1.4 support

Transliterate

  • #485 Fixed Romanize failed in some examples
  • #511 Add Thai W2P (Thai Word-to-Phoneme converter)

Text Summarize

  • #523 Add mT5 text summarize to pythainlp.summarize

Chunk parser

  • #524 Add pythainlp.tag.chunk

Util

  • #481 Fix: remove_repeat_vowels() bug that remove spaces between different vowels
  • #483 Add: add remove() method to remove a word from a trie -- thanks @korakot
  • #490 Fix: thai_strftime() - normalize output for unsupported directive (running in glibc and musl should produce the same output)
  • #512 Add: emoji_to_thai() to convert emoji to Thai description -- thanks @ppirch for the development
  • #513 Add: thai_keyboard_dist() to calculate euclidean distance between two characters according to their location on a Thai keyboard layout -- thanks @ppirch for the development

Links

Thanks all the contributors. (Image made with contributors-img)

We build Thai NLP.

PyThaiNLP

v2.3.0-dev0

16 Mar 07:02
2d912bd
Compare
Choose a tag to compare
v2.3.0-dev0 Pre-release
Pre-release

PyThaiNLP v2.3.0-dev0 is The first development release of PyThaiNLP 2.3 (For development only)

Documentation: https://pythainlp.github.io/dev-docs/index.html
Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 2.3 change log #445

Deprecation and other API changes

Tokenizer

  • #484 Add: model option for attacut.tokenize()
  • #502 Add: corpus.util.revise_wordset() to revise tokenization dictionary
  • #503 Add: NERCut tokenization engine

Corpus

POS Tagger

  • #464 Add: LST20 language model for part-of-speech tagging
  • #468 Add: port PerceptronTagger from NTLK. POS tagging no longer needs NLTK for dependency.
  • #478 Update: ORCHID POS tags documentation

Name Entity Tagging

  • #526 Update ThaiNER 1.4 to ThaiNER 1.5
  • #538 Add ThaiNameTagger version and add ThaiNER 1.4 support

Transliterate

  • #485 Fixed Romanize failed in some examples
  • #511 Add Thai W2P (Thai Word-to-Phoneme converter)

Text Summarize

  • #523 Add mT5 text summarize to pythainlp.summarize

Chunk parser

  • #524 Add pythainlp.tag.chunk

Util

  • #481 Fix: remove_repeat_vowels() bug that remove spaces between different vowels
  • #483 Add: add remove() method to remove a word from a trie -- thanks @korakot
  • #490 Fix: thai_strftime() - normalize output for unsupported directive (running in glibc and musl should produce the same output)
  • #512 Add: emoji_to_thai() to convert emoji to Thai description -- thanks @ppirch for the development
  • #513 Add: thai_keyboard_dist() to calculate euclidean distance between two characters according to their location on a Thai keyboard layout -- thanks @ppirch for the development

PyThaiNLP 2.2.6

13 Dec 12:20
590e24d
Compare
Choose a tag to compare

PyThaiNLP 2.2.6 Released!
This release is a bug fix release.

  • Update pythainlp.tag docs #492
  • thai_strftime: Normalize output for unsupported directive #490
  • port pickle to json and add lst20 postag model to pythainlp.corpus #488

Thanks to the following contributors to 2.2.6: @c4n

Thanks to other contributors listed here: https://github.com/PyThaiNLP/pythainlp/blob/dev/CONTRIBUTING.md

You can install or upgrade using pip install -U pythainlp

We build Thai NLP
PyThaiNLP Team

PyThaiNLP 2.2.5

16 Nov 13:00
Compare
Choose a tag to compare

PyThaiNLP 2.2.5 Released!
This release is a bug fix release.

  • Fix: not found file for pythainlp.corpus #486

https://github.com/PyThaiNLP/pythainlp/releases/tag/v2.2.5

You can install or upgrade using pip install -U pythainlp
Documentation: https://www.thainlp.org/pythainlp/docs/2.2/
Tutorials: https://thainlp.org/pythainlp/tutorials/
GitHub: https://github.com/PyThaiNLP/pythainlp
We build Thai NLP
PyThaiNLP Team

PyThaiNLP 2.2.4

17 Sep 08:20
Compare
Choose a tag to compare
  • #481 Fix: remove_repeat_vowels() bug that remove spaces between different vowels

PyThaiNLP 2.2.3

01 Aug 19:17
8db45b3
Compare
Choose a tag to compare

This release is a bug fix release.

  • fix crfcut last segment not included if not predicted as end-of-sentence #459

Installation

  • You can install or upgrade using pip install -U pythainlp

More information

We build Thai NLP

PyThaiNLP Team

PyThaiNLP 2.2.2

09 Jul 22:37
Compare
Choose a tag to compare

This release is a bug fix release.

Installation

  • You can install or upgrade using pip install -U pythainlp

More information

We build Thai NLP

PyThaiNLP Team

PyThaiNLP 2.2.1

27 Jun 14:01
c534140
Compare
Choose a tag to compare

This release is a bug fix release.

  • Fix %O modifier for thai_strftime() #441
  • Fix db.json #442

Installation

  • You can install or upgrade using pip install -U pythainlp

More information

We build Thai NLP

PyThaiNLP Team

PyThaiNLP 2.2.0

24 Jun 14:07
980d104
Compare
Choose a tag to compare

English

Hello World. Today, we're happy to announce the availability of PyThaiNLP. It has been four years since PyThaiNLP's the first release. Thank you very much for supporting PyThaiNLP.

Summary – Release Highlights

New Features

Tokenizer

  • Fix longest engine, last character is now consumed
  • Add CRFCut sentence segmentation

Transliteration

  • Add Thai Grapheme-to-Phoneme (Thai G2P) deep learning sequence-to-sequence model

Normalization

  • Add more normalize functions, like remove zero-width characters, remove duplicate spaces, etc.

Utilities

  • Add thaiword_to_date() and thaiword_to_time()
  • Fix countthai() to handle a case where the text has only numbers and symbols

Command line

Others

  • Code improvement: Move non-init code out of init.py files, etc.
  • Remove dependency: Unigram POS tagger no longer need NLTK module

Installation

You can install or upgrade using pip install -U pythainlp

Change log: #330

Documentation: https://www.thainlp.org/pythainlp/docs/2.2/

Tutorials: https://thainlp.org/pythainlp/tutorials/

GitHub: https://github.com/PyThaiNLP/pythainlp

We build Thai NLP

PyThaiNLP Team

ภาษาไทย

สวัสดีชาวโลก วันนี้ 24 มิถุนายน 2563 พวกเราได้ปล่อย PyThaiNLP 2.2 ตอนนี้ PyThaiNLP อายุครบ 4 ปี ขอบคุณที่ใช้ PyThaiNLP :)

สรุป – สิ่งที่สำคัญ

คุณลักษณะใหม่

ตัวตัดข้อความ

  • แก้ไขตัวตัดคำ longest
  • เพิ่มตัวตัดประโยค CRFCut

ถอดเสียง

  • เพิ่มการถอดเสียงภาษาไทยเป็น IPA ด้วย Thai Grapheme-to-Phoneme (Thai G2P)

Normalization

  • เพิ่มเติมความสามารถให้กับฟังก์ชัน normalize เช่น ลบช่องว่างซ้ำกัน เป็นต้น

เครื่องมือ

  • เพิ่ม thaiword_to_date() และ thaiword_to_time()
  • ปรับปรุง countthai()

Command line

  • ปรับปรุงคำสั่ง command และไวยากรณ์ sub-command - ดูเพิ่มเติมได้ที่ command line docs

อื่น ๆ

  • ปรับปรุงโค้ด: ย้ายโค้ดออกจากไฟล์ init.py เป็นต้น
  • ลดความต้องการไลบรารีภายนอก: Unigram POS tagger สามารถทำงานได้โดยไม่ต้องการ NLTK

การติดตั้ง

สามารถติดตั้งหรือปรับรุ่นได้ด้วยคำสั่ง pip install -U pythainlp

Change log: #330

Documentation: https://www.thainlp.org/pythainlp/docs/2.2/

Tutorials https://thainlp.org/pythainlp/tutorials/

GitHub: https://github.com/PyThaiNLP/pythainlp

พวกเราสร้าง Thai NLP

ทีม PyThaiNLP

PyThaiNLP 2.2.0-beta1

15 Jun 13:29
8512bf8
Compare
Choose a tag to compare
PyThaiNLP 2.2.0-beta1 Pre-release
Pre-release

This the first beta version of PyThaiNLP 2.2.

Installation

pip install --pre pythainlp

PyThaiNLP 2.2 change log #330

Documentation : https://www.thainlp.org/pythainlp/docs/dev/

Report bug : https://github.com/PyThaiNLP/pythainlp/issues

We build Thai NLP.

PyThaiNLP Team