Skip to content

PyThaiNLP 1.7 #117

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 154 commits into from
Sep 22, 2018
Merged

PyThaiNLP 1.7 #117

merged 154 commits into from
Sep 22, 2018

Conversation

wannaphong
Copy link
Member

@wannaphong wannaphong commented Sep 22, 2018

PyThaiNLP is a Python library for natural language processing (NLP) of Thai language.

What's new in PyThaiNLP 1.7 ?

  • Deprecate Python 2 support
  • Refactor pythainlp.tokenize.pyicu for readability
  • Add Thai NER model to pythainlp.ner
  • thai2vec v0.2 - larger vocab, benchmarking results on Wongnai dataset
  • Sentiment classifier based on ULMFit and various product review datasets
  • Add ULMFit utility to PyThaiNLP
  • Add Thai romanization model thai2rom
  • Retrain POS-tagging model
  • Improve word tokenize (newmm,mm) and dict_word_tokenize
  • Documentation added

wannaphong and others added 30 commits February 25, 2018 23:51
…ew method of using custom dict to tokenize words
if no have engine in word_tokenize then it show error.
เดติดโค้ด pyicu จากคุณ @korakot

Co-Authored-By: Korakot Chaovavanich <[email protected]>
del เccอะ

Co-Authored-By: Korakot Chaovavanich <[email protected]>
@coveralls
Copy link

Coverage Status

Coverage decreased (-3.03%) to 59.537% when pulling 22ef79a on dev into 007e644 on master.

@wannaphong wannaphong added this to the 1.7 milestone Sep 22, 2018
@wannaphong wannaphong merged commit 7abc2ef into master Sep 22, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants