Hi! Thanks for your interest in contributing to PyThaiNLP. In this document we'll try to summarize everything that you need to know to do a good job.
We use Git as our version control system, so the best way to contribute is to learn how to use it and put your changes on a Git repository. There's a plenty of documentation about Git -- you can start with the Pro Git book.
We use the famous gitflow to manage our branches.
- Use PEP8;
- Write tests for your new features (please see "Tests" topic below);
- Always remember that commented code is dead code;
- Name identifiers (variables, classes, functions, module names) with meaningful
and pronounceable names (
x
is always wrong); - When manipulating strings, use Python's new-style
formatting
(
'{} = {}'.format(a, b)
instead of'%s = %s' % (a, b)
); - All
#TODO
comments should be turned into issues (use our GitHub issue system); - Run all tests before pushing (just execute
tox
) so you will know if your changes broke something; - All source code and all text files should be ended with one empty line. This is to please git and also to keep up with POSIX standard.
- Facebook group: https://www.facebook.com/groups/thainlp
- GitHub issues: https://github.com/PyThaiNLP/pythainlp/issues
Happy hacking! (;
- Wannaphong Phatthiyaphaibun [email protected]
- Korakot Chaovavanich
- Charin Polpanumas
- Peeradej Tanruangporn
- Arthit Suriyawongkul
- Korakot Chaovavanich
- Charin Polpanumas
- Peeradej Tanruangporn
- See more contributions here https://github.com/PyThaiNLP/pythainlp/graphs/contributors
- [Maximum Matching] -- Manabu Sassano. Deterministic Word Segmentation Using Maximum Matching with Fully Lexicalized Rules. Retrieved from http://www.aclweb.org/anthology/E14-4016
- [MetaSound] -- Snae & Brückner. (2009). Novel Phonetic Name Matching Algorithm with a Statistical Ontology for Analysing Names Given in Accordance with Thai Astrology. Retrieved from https://pdfs.semanticscholar.org/3983/963e87ddc6dfdbb291099aa3927a0e3e4ea6.pdf
- [Thai Character Cluster] -- T. Teeramunkong, V. Sornlertlamvanich, T. Tanhermhong and W. Chinnan, “Character cluster based Thai information retrieval,” in IRAL '00 Proceedings of the fifth international workshop on on Information retrieval with Asian languages, 2000.
- เพ็ญศิริ ลี้ตระกูล. การเลือกประโยคสำคัญในการสรุปความภาษาไทย โดยใช้แบบจำลองแบบลำดับชั้น (Selection of Important Sentences in Thai Text Summarization Using a Hierarchical Model). Retrieved from http://digi.library.tu.ac.th/thesis/st/0192/