You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+27-8
Original file line number
Diff line number
Diff line change
@@ -12,9 +12,9 @@ Thai Natural Language Processing in Python.
12
12
13
13
PyThaiNLP is a Python package for text processing and linguistic analysis, similar to `nltk` but with focus on Thai language.
14
14
15
-
PyThaiNLP supports Python 3.4+.
16
-
Since version 1.7, PyThaiNLP deprecates its support for Python 2. The future PyThaiNLP 1.8 will completely drop all supports for Python 2.
17
-
Python 2 users can still use PyThaiNLP 1.6.
15
+
PyThaiNLP 1.8 supports Python 3.6+. Some functions may work with older version of Python 3, but it is not well-tested and will not be supported. See [PyThaiNLP 1.8 change log](https://github.com/PyThaiNLP/pythainlp/issues/118).
16
+
17
+
Python 2 users can use PyThaiNLP 1.6, our latest released that tested with Python 2.7.
18
18
19
19
**This is a document for development branch (post 1.7.x). Things will break. For a document for stable branch, see [master](https://github.com/PyThaiNLP/pythainlp/tree/master).**
20
20
@@ -34,21 +34,40 @@ Python 2 users can still use PyThaiNLP 1.6.
34
34
35
35
## Installation
36
36
37
-
**Using pip**
37
+
PyThaiNLP uses PyPI as its main distribution channel, see https://pypi.org/project/pythainlp/
38
+
39
+
### Stable release
38
40
39
-
Stable release
41
+
Standard installation:
40
42
41
43
```sh
42
44
$ pip install pythainlp
43
45
```
44
46
45
-
Development release
47
+
For some advanced functionalities, like word vector, extra packages may be needed. Install them with these options during pip install:
Note: PyTorch is required for ulmfit sentiment analyser. ```pip install torch``` is needed for the feature. gensim and keras packages may also needed for other modules that rely on these machine learning libraries.
53
+
where ```extras``` can be
54
+
-```artagger``` (to support artagger part-of-speech tagger)
55
+
-```deepcut``` (to support deepcut machine-learnt tokenizer)
56
+
-```icu``` (for ICU support in transliteration and tokenization)
57
+
-```ipa``` (for International Phonetic Alphabet support in transliteration)
58
+
-```ml``` (to support ULMFit models, like one for sentiment analyser)
59
+
-```ner``` (for named-entity recognizer)
60
+
-```thai2rom``` (for machine-learnt romanization)
61
+
-```thai2vec``` (for Thai word vector)
62
+
-```full``` (install everything)
63
+
64
+
see ```extras``` and ```extras_require``` in [```setup.py```](https://github.com/PyThaiNLP/pythainlp/blob/dev/setup.py) for details.
0 commit comments