.. -*- mode: rst -*- Changelog ========= Version 0.3 ----------- Code name: ``Packaging`` * type checking and remove support for Py2 * add tox.ini, requirements.txt, .travis.yml, LICENSE, MANIFEST, .pypirc * doc v0.3 [OK 50%] - changing doc to API style of yellowbrick[OK] - include data for examples[OK] - include datasets rst * test v0.3 * separate aligning and helpers in utils.aling.py [OK] * doc ngrams.py + Basic docstring of sn_gram [OK] * separated pack for ngrams, because sngrams is to long [OK] * embedding notebook examples to the sphinx doc [OK] * translate to English the notebooks involved. * include the test data from NLP course for Text Normalization Jupyter ntb * define if the hypenation functions must be in normalization [OK] * Write the Quickstart [OK, 10%] * Finish @Appender to link docstrings to .py with close to 200 LOC Version 0.2.x ------------- Code name: ``Testing in Reality`` - str input restriction management with only one decorator (if: isinstance(input,str) do, else: raise error)[NO, replaced by type checking en release 0.3] - Revisar los init y comprobar que no se cargan variables y funciones que no deben. - Poner privadas todas las variables y funciones que deben estarlo. - Put the normalization funcs in shallow module[NO] - Set punctuation in shallow module - add collocations with nltk - made some changes to regular expressions in normalize.py after PSTS experiments - add first version of pdftotext - test configurations for documentation with Sphinx - ES support for Regular Expressions y los text normalization methods. - Revisar sí las funciones viejas donde se especificaba la codificación han sido cambiadas a Py3. Version 1.0 ----------- Code name: ``preproc-tech`` - first version of pipeline - lemmatization, stemming - ngrams added, also stowords_ngrams, contextual_ngrams, and syntactic_ngrams (as sngrams) - Notebook of POS - Syntactic Dependency Parsing con Stanford - Syntactic Tree Parsing con Stanford - multioutputs for every tagger func - NER con Stanford - POS con Stanford[OK]