Changelog

Version 0.3

Code name: Packaging

  • type checking and remove support for Py2

  • add tox.ini, requirements.txt, .travis.yml, LICENSE, MANIFEST, .pypirc

  • doc v0.3 [OK 50%]
    • changing doc to API style of yellowbrick[OK]

    • include data for examples[OK]

    • include datasets rst

  • test v0.3

  • separate aligning and helpers in utils.aling.py [OK]

  • doc ngrams.py + Basic docstring of sn_gram [OK]

  • separated pack for ngrams, because sngrams is to long [OK]

  • embedding notebook examples to the sphinx doc [OK]

  • translate to English the notebooks involved.

  • include the test data from NLP course for Text Normalization Jupyter ntb

  • define if the hypenation functions must be in normalization [OK]

  • Write the Quickstart [OK, 10%]

  • Finish @Appender to link docstrings to .py with close to 200 LOC

Version 0.2.x

Code name: Testing in Reality

  • str input restriction management with only one decorator (if: isinstance(input,str) do, else: raise error)[NO, replaced by type checking en release 0.3]

  • Revisar los init y comprobar que no se cargan variables y funciones que no deben.

  • Poner privadas todas las variables y funciones que deben estarlo.

  • Put the normalization funcs in shallow module[NO]

  • Set punctuation in shallow module

  • add collocations with nltk

  • made some changes to regular expressions in normalize.py after PSTS experiments

  • add first version of pdftotext

  • test configurations for documentation with Sphinx

  • ES support for Regular Expressions y los text normalization methods.

  • Revisar sí las funciones viejas donde se especificaba la codificación han sido cambiadas a Py3.

Version 1.0

Code name: preproc-tech

  • first version of pipeline

  • lemmatization, stemming

  • ngrams added, also stowords_ngrams, contextual_ngrams, and syntactic_ngrams (as sngrams)

  • Notebook of POS

  • Syntactic Dependency Parsing con Stanford

  • Syntactic Tree Parsing con Stanford

  • multioutputs for every tagger func

  • NER con Stanford

  • POS con Stanford[OK]